| Main Authors: | Fiedler, Tobias, Hermann, Leon, Müller, Florian, Cohen, Sarel, Chin, Peter, Friedrich, Tobias, Vaadia, Eilon |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.09459 |
Similar Items
Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network
by: Alexey, Protopopov
Published: (2026)
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
by: Shankar, Ravi, et al.
Published: (2024)
Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context
by: Nguyen, Tuan, et al.
Published: (2024)
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
by: Stuhlmann, Linus, et al.
Published: (2025)
Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT
by: Jafarzadeh, Pourya, et al.
Published: (2024)
Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART
by: Tathe, Aniket, et al.
Published: (2024)
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2
by: Tathe, Aniket, et al.
Published: (2024)
Inference-Time Backdoors via Chat Templates: From LLM Supply Chains to Agentic System Compromise
by: Fogel, Ariel, et al.
Published: (2026)
Exploring and Exploiting Stability in Latent Flow Matching
by: Briq, Rania, et al.
Published: (2026)
WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation
by: Della Libera, Luca, et al.
Published: (2026)
VecAug: Unveiling Camouflaged Frauds with Cohort Augmentation for Enhanced Detection
by: Xiao, Fei, et al.
Published: (2024)
Robust Partial-Label Learning by Leveraging Class Activation Values
by: Fuchs, Tobias, et al.
Published: (2025)
Partial-Label Learning with Conformal Candidate Cleaning
by: Fuchs, Tobias, et al.
Published: (2025)
Automated Tone Transcription and Clustering with Tone2Vec
by: Yang, Yi, et al.
Published: (2024)
Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)
RGNMR: A Gauss-Newton method for robust matrix completion with theoretical guarantees
by: Laufer, Eilon Vaknin, et al.
Published: (2025)
Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition
by: Kounadis-Bastian, Dionyssos, et al.
Published: (2024)
Transformer Encoder and Multi-features Time2Vec for Financial Prediction
by: Bui, Nguyen Kim Hai, et al.
Published: (2025)
Boulder2Vec: Modeling Climber Performances in Professional Bouldering Competitions
by: Baron, Ethan, et al.
Published: (2024)
QDSB: Quantized Diffusion Schrödinger Bridges
by: Fuchs, Tobias, et al.
Published: (2026)
Partial-Label Learning with a Reject Option
by: Fuchs, Tobias, et al.
Published: (2024)
Demo2Vec: Learning Region Embedding with Demographic Information
by: Wen, Ya, et al.
Published: (2024)
Know2Vec: A Black-Box Proxy for Neural Network Retrieval
by: Shang, Zhuoyi, et al.
Published: (2024)
Bike2Vec: Vector Embedding Representations of Road Cycling Riders and Races
by: Baron, Ethan, et al.
Published: (2023)
Refined Policy Distillation: From VLA Generalists to RL Experts
by: Jülg, Tobias, et al.
Published: (2025)
WavInWav: Time-domain Speech Hiding via Invertible Neural Network
by: Fan, Wei, et al.
Published: (2025)
Robust Parameter Fitting to Realistic Network Models via Iterative Stochastic Approximation
by: Bläsius, Thomas, et al.
Published: (2024)
Efficient Fault-Tolerant Search by Fast Indexing of Subnetworks
by: Bilò, Davide, et al.
Published: (2024)
LineFlow: A Framework to Learn Active Control of Production Lines
by: Müller, Kai, et al.
Published: (2025)
TS2Vec-Ensemble: An Enhanced Self-Supervised Framework for Time Series Forecasting
by: Niroshan, Ganeshan, et al.
Published: (2025)
Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing
by: Guo, Yongxin, et al.
Published: (2024)
Phylo2Vec: a vector representation for binary trees
by: Penn, Matthew J, et al.
Published: (2023)
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
by: Wolter, Axel Friedrich, et al.
Published: (2025)
Wav-KAN: Wavelet Kolmogorov-Arnold Networks
by: Bozorgasl, Zavareh, et al.
Published: (2024)
Adapting WavLM for Speech Emotion Recognition
by: Diatlova, Daria, et al.
Published: (2024)
Word2VecGD: Neural Graph Drawing with Cosine-Stress Optimization
by: Yang, Minglai, et al.
Published: (2025)
Monte Carlo Stochastic Depth for Uncertainty Estimation in Deep Learning
by: Müller, Adam T., et al.
Published: (2026)
Similarity of Neural Network Models: A Survey of Functional and Representational Measures
by: Klabunde, Max, et al.
Published: (2023)
Community Detection Guarantees Using Embeddings Learned by Node2Vec
by: Davison, Andrew, et al.
Published: (2023)
VEXIR2Vec: An Architecture-Neutral Embedding Framework for Binary Similarity
by: VenkataKeerthy, S., et al.
Published: (2023)