Saved in:
| Main Authors: | Büthe, Jan, Mustafa, Ahmed, Valin, Jean-Marc, Helwani, Karim, Goodwin, Michael M. |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.14521 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A lightweight and robust method for blind wideband-to-fullband extension of speech
by: Büthe, Jan, et al.
Published: (2024)
by: Büthe, Jan, et al.
Published: (2024)
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction
by: Valin, Jean-Marc, et al.
Published: (2024)
by: Valin, Jean-Marc, et al.
Published: (2024)
RADE: A Neural Codec for Transmitting Speech over HF Radio Channels
by: Rowe, David, et al.
Published: (2025)
by: Rowe, David, et al.
Published: (2025)
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
by: Subramani, Krishna, et al.
Published: (2023)
by: Subramani, Krishna, et al.
Published: (2023)
DRED: Deep REDundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
by: Valin, Jean-Marc, et al.
Published: (2022)
by: Valin, Jean-Marc, et al.
Published: (2022)
Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure
by: Togami, Masahito, et al.
Published: (2024)
by: Togami, Masahito, et al.
Published: (2024)
Sound Source Separation Using Latent Variational Block-Wise Disentanglement
by: Helwani, Karim, et al.
Published: (2024)
by: Helwani, Karim, et al.
Published: (2024)
BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec
by: Xin, Detai, et al.
Published: (2024)
by: Xin, Detai, et al.
Published: (2024)
CodecSlime: Temporal Redundancy Compression of Neural Speech Codec via Dynamic Frame Rate
by: Wang, Hankun, et al.
Published: (2025)
by: Wang, Hankun, et al.
Published: (2025)
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
by: Chiang, Hsin-Tien, et al.
Published: (2024)
by: Chiang, Hsin-Tien, et al.
Published: (2024)
DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation
by: Li, Jiaqi, et al.
Published: (2025)
by: Li, Jiaqi, et al.
Published: (2025)
Personalized Neural Speech Codec
by: Jang, Inseon, et al.
Published: (2024)
by: Jang, Inseon, et al.
Published: (2024)
Language-Codec: Bridging Discrete Codec Representations and Speech Language Models
by: Ji, Shengpeng, et al.
Published: (2024)
by: Ji, Shengpeng, et al.
Published: (2024)
PURE Codec: Progressive Unfolding of Residual Entropy for Speech Codec Learning
by: Shi, Jiatong, et al.
Published: (2025)
by: Shi, Jiatong, et al.
Published: (2025)
MuCodec: Ultra Low-Bitrate Music Codec
by: Xu, Yaoxun, et al.
Published: (2024)
by: Xu, Yaoxun, et al.
Published: (2024)
PhoenixCodec: Taming Neural Speech Coding for Extreme Low-Resource Scenarios
by: Wan, Zixiang, et al.
Published: (2025)
by: Wan, Zixiang, et al.
Published: (2025)
SuperCodec: A Neural Speech Codec with Selective Back-Projection Network
by: Zheng, Youqiang, et al.
Published: (2024)
by: Zheng, Youqiang, et al.
Published: (2024)
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech
by: Shi, Jiatong, et al.
Published: (2024)
by: Shi, Jiatong, et al.
Published: (2024)
A Semantic Information-based Hierarchical Speech Enhancement Method Using Factorized Codec and Diffusion Model
by: Xiang, Yang, et al.
Published: (2025)
by: Xiang, Yang, et al.
Published: (2025)
Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding
by: Yang, Peiji, et al.
Published: (2024)
by: Yang, Peiji, et al.
Published: (2024)
A Neural Speech Codec for Noise Robust Speech Coding
by: Huang, Jiayi, et al.
Published: (2023)
by: Huang, Jiayi, et al.
Published: (2023)
UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook
by: Jiang, Yidi, et al.
Published: (2025)
by: Jiang, Yidi, et al.
Published: (2025)
RepCodec: A Speech Representation Codec for Speech Tokenization
by: Huang, Zhichao, et al.
Published: (2023)
by: Huang, Zhichao, et al.
Published: (2023)
Probing the Robustness Properties of Neural Speech Codecs
by: Tseng, Wei-Cheng, et al.
Published: (2025)
by: Tseng, Wei-Cheng, et al.
Published: (2025)
SpatialCodec: Neural Spatial Speech Coding
by: Xu, Zhongweiyang, et al.
Published: (2023)
by: Xu, Zhongweiyang, et al.
Published: (2023)
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis
by: Guo, Haohan, et al.
Published: (2024)
by: Guo, Haohan, et al.
Published: (2024)
DS-Codec: Dual-Stage Training with Mirror-to-NonMirror Architecture Switching for Speech Codec
by: Chen, Peijie, et al.
Published: (2025)
by: Chen, Peijie, et al.
Published: (2025)
CodecFake+: A Large-Scale Neural Audio Codec-Based Deepfake Speech Dataset
by: Chen, Xuanjun, et al.
Published: (2025)
by: Chen, Xuanjun, et al.
Published: (2025)
Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference
by: Casanova, Edresson, et al.
Published: (2024)
by: Casanova, Edresson, et al.
Published: (2024)
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech
by: Kim, Jaehyeon, et al.
Published: (2024)
by: Kim, Jaehyeon, et al.
Published: (2024)
Advancing Electrolaryngeal Speech Enhancement Through Speech-Text Representation Learning
by: Ma, Ding, et al.
Published: (2026)
by: Ma, Ding, et al.
Published: (2026)
Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation
by: Li, Jiaqi, et al.
Published: (2024)
by: Li, Jiaqi, et al.
Published: (2024)
Evaluating Speech Enhancement Systems Through Listening Effort
by: Gelderblom, Femke B., et al.
Published: (2024)
by: Gelderblom, Femke B., et al.
Published: (2024)
Unlocking Temporal Flexibility: Neural Speech Codec with Variable Frame Rate
by: Zhang, Hanglei, et al.
Published: (2025)
by: Zhang, Hanglei, et al.
Published: (2025)
LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec
by: Guo, Yiwei, et al.
Published: (2024)
by: Guo, Yiwei, et al.
Published: (2024)
Towards General Discrete Speech Codec for Complex Acoustic Environments: A Study of Reconstruction and Downstream Task Consistency
by: Wang, Haoran, et al.
Published: (2025)
by: Wang, Haoran, et al.
Published: (2025)
Fewer-token Neural Speech Codec with Time-invariant Codes
by: Ren, Yong, et al.
Published: (2023)
by: Ren, Yong, et al.
Published: (2023)
SECodec: Structural Entropy-based Compressive Speech Representation Codec for Speech Language Models
by: Wang, Linqin, et al.
Published: (2024)
by: Wang, Linqin, et al.
Published: (2024)
Adaptive Convolution for CNN-based Speech Enhancement Models
by: Wang, Dahan, et al.
Published: (2025)
by: Wang, Dahan, et al.
Published: (2025)
Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement
by: Yuan, Xihao, et al.
Published: (2025)
by: Yuan, Xihao, et al.
Published: (2025)
Similar Items
-
A lightweight and robust method for blind wideband-to-fullband extension of speech
by: Büthe, Jan, et al.
Published: (2024) -
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction
by: Valin, Jean-Marc, et al.
Published: (2024) -
RADE: A Neural Codec for Transmitting Speech over HF Radio Channels
by: Rowe, David, et al.
Published: (2025) -
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
by: Subramani, Krishna, et al.
Published: (2023) -
DRED: Deep REDundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
by: Valin, Jean-Marc, et al.
Published: (2022)