The proliferation of edge-based wireless speech applications calls for resource-efficient, low-latency speech communication systems that work across diverse channel conditions. Ensuring intelligible speech communication under tight resource and latency constraints is a challenging problem in speech transmission. In this paper, we introduce a very low-latency, configurable speech transmission system based on joint source-channel coding and deep neural networks (DNNs). The proposed system is a single unified DNN designed to operate effectively across a wide range of wireless channel scenarios. It comprises a joint source-channel encoder and a joint source-channel decoder, each with access to channel state information (CSI). In this context, CSI denotes the type of fading in the wireless channel. Notably, the system has a total latency of 2 ms. Through extensive simulations, we show empirically that the proposed configurable system closely approximates the performance of ideal systems tailored to individual wireless channel scenarios. Our evaluation uses instrumental measures of speech quality and intelligibility and confirms the efficacy of the system in diverse, resource-constrained communication contexts.
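
To make the role of CSI concrete, the following minimal sketch shows one way a fading-type label could be encoded as a one-hot vector and used to select the channel applied to transmitted symbols. It is an illustration under stated assumptions, not the system described in the abstract: the label set (`awgn`, `rayleigh`, `rician`), the Rician K-factor, the function names, and the SNR handling are all hypothetical, and the neural encoder and decoder themselves are not modelled.

```python
import numpy as np

# Hypothetical set of fading types; the abstract only says that CSI
# denotes the type of fading, not which types are covered.
FADING_TYPES = ("awgn", "rayleigh", "rician")


def csi_one_hot(fading):
    """Encode the fading type as a one-hot vector that an encoder and a
    decoder could both receive as side information (assumed encoding)."""
    vec = np.zeros(len(FADING_TYPES))
    vec[FADING_TYPES.index(fading)] = 1.0
    return vec


def fading_gain(fading, n, rng, k_factor=4.0):
    """Draw n complex channel gains with unit average power."""
    if fading == "awgn":
        return np.ones(n, dtype=complex)  # no fading
    scatter = (rng.standard_normal(n) + 1j * rng.standard_normal(n)) / np.sqrt(2)
    if fading == "rayleigh":
        return scatter
    if fading == "rician":
        # Line-of-sight component plus scattered component (assumed K-factor).
        los = np.sqrt(k_factor / (k_factor + 1.0))
        return los + np.sqrt(1.0 / (k_factor + 1.0)) * scatter
    raise ValueError(f"unknown fading type: {fading}")


def transmit(symbols, fading, snr_db, rng):
    """Pass complex symbols through the selected fading channel plus noise.

    Returns the received symbols and the channel gains that were applied.
    """
    x = np.asarray(symbols, dtype=complex)
    x = x / np.sqrt(np.mean(np.abs(x) ** 2))  # normalise to unit power
    h = fading_gain(fading, x.size, rng)
    noise_std = np.sqrt(10.0 ** (-snr_db / 10.0) / 2.0)
    noise = noise_std * (rng.standard_normal(x.size)
                         + 1j * rng.standard_normal(x.size))
    return h * x + noise, h
```

In a configurable system of the kind the abstract describes, a vector such as `csi_one_hot("rayleigh")` would be supplied to both the encoder and the decoder, while a function like `transmit` stands in for the channel during simulation.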

Keywords

applications, assessment, channel, channel conditions, channel scenarios, channel state information, communication, communication channel conditions, communication context, communication systems, conditions, configuration system, constrained resources, context, decoding, deep neural network system, deep neural networks, development, development of resource-efficient, domain, efficacy, encoding, evaluation, extensive simulations, fading, ideal system, information, intelligence, joint source-channel decoding, latency, low-latency, measures of speech quality, network, network system, neural network, neural network system, performance, problem, proliferation, quality, resource-efficient, resources, scenarios, simulation, source-channel decoding, speech applications, speech communication, speech communication systems, speech quality, speech transmission, speech transmission system, state information, system, transmission, transmission system, wireless channel

Funders

European Commission

Channel-Configurable Deep Wireless Speech Transmission


Data Provider: Digital Science
