Saved in:
| Main Authors: | Bhattacharya, Debasmita, van Schijndel, Marten |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.04596 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Does Dependency Locality Predict Non-canonical Word Order in Hindi?
by: Ranjan, Sidharth, et al.
Published: (2024)
by: Ranjan, Sidharth, et al.
Published: (2024)
Measuring Entrainment in Spontaneous Code-switched Speech
by: Bhattacharya, Debasmita, et al.
Published: (2023)
by: Bhattacharya, Debasmita, et al.
Published: (2023)
Semantics or spelling? Probing contextual word embeddings with orthographic noise
by: Matthews, Jacob A., et al.
Published: (2024)
by: Matthews, Jacob A., et al.
Published: (2024)
Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations
by: Mohamed, Mukhtar, et al.
Published: (2024)
by: Mohamed, Mukhtar, et al.
Published: (2024)
A stylometric analysis of speaker attribution from speech transcripts
by: Aggazzotti, Cristina, et al.
Published: (2025)
by: Aggazzotti, Cristina, et al.
Published: (2025)
Analyzing the relationships between pretraining language, phonetic, tonal, and speaker information in self-supervised speech models
by: Gubian, Michele, et al.
Published: (2025)
by: Gubian, Michele, et al.
Published: (2025)
AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
by: Shingi, Geet, et al.
Published: (2021)
by: Shingi, Geet, et al.
Published: (2021)
Addressing speaker gender bias in large scale speech translation systems
by: Bansal, Shubham, et al.
Published: (2025)
by: Bansal, Shubham, et al.
Published: (2025)
Classifying populist language in American presidential and governor speeches using automatic text analysis
by: van der Veen, Olaf, et al.
Published: (2024)
by: van der Veen, Olaf, et al.
Published: (2024)
1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis
by: Ogun, Sewade, et al.
Published: (2024)
by: Ogun, Sewade, et al.
Published: (2024)
End-to-end Speech Recognition with similar length speech and text
by: Fan, Peng, et al.
Published: (2025)
by: Fan, Peng, et al.
Published: (2025)
IndRegBias: A Dataset for Studying Indian Regional Biases in English and Code-Mixed Social Media Comments
by: Panda, Debasmita, et al.
Published: (2026)
by: Panda, Debasmita, et al.
Published: (2026)
Losing Phonotactic Distinctions in Context
by: John R. Starr, et al.
Published: (2025)
by: John R. Starr, et al.
Published: (2025)
An efficient text augmentation approach for contextualized Mandarin speech recognition
by: Zheng, Naijun, et al.
Published: (2024)
by: Zheng, Naijun, et al.
Published: (2024)
Transferable speech-to-text large language model alignment module
by: Wu, Boyong, et al.
Published: (2024)
by: Wu, Boyong, et al.
Published: (2024)
Rethinking stance detection: A theoretically-informed research agenda for user-level inference using language models
by: Bhattacharya, Prasanta, et al.
Published: (2025)
by: Bhattacharya, Prasanta, et al.
Published: (2025)
Code-mixed Sentiment and Hate-speech Prediction
by: Yadav, Anjali, et al.
Published: (2024)
by: Yadav, Anjali, et al.
Published: (2024)
Target speaker anonymization in multi-speaker recordings
by: Tomashenko, Natalia, et al.
Published: (2025)
by: Tomashenko, Natalia, et al.
Published: (2025)
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
by: Lyth, Dan, et al.
Published: (2024)
by: Lyth, Dan, et al.
Published: (2024)
Exploring the topics, sentiments and hate speech in the Spanish information environment
by: LOPEZ, ALEJANDRO BUITRAGO, et al.
Published: (2024)
by: LOPEZ, ALEJANDRO BUITRAGO, et al.
Published: (2024)
PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech
by: Rahman, Hanif
Published: (2026)
by: Rahman, Hanif
Published: (2026)
Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
by: Fujita, Kenichi, et al.
Published: (2024)
by: Fujita, Kenichi, et al.
Published: (2024)
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
by: Garg, Abhinav, et al.
Published: (2024)
by: Garg, Abhinav, et al.
Published: (2024)
A unified front-end framework for English text-to-speech synthesis
by: Ying, Zelin, et al.
Published: (2023)
by: Ying, Zelin, et al.
Published: (2023)
Are LLMs good pragmatic speakers?
by: Jian, Mingyue, et al.
Published: (2024)
by: Jian, Mingyue, et al.
Published: (2024)
The evaluation of a code-switched Sepedi-English automatic speech recognition system
by: Phaladi, Amanda, et al.
Published: (2024)
by: Phaladi, Amanda, et al.
Published: (2024)
Advancing Chinese biomedical text mining with community challenges
by: Zong, Hui, et al.
Published: (2024)
by: Zong, Hui, et al.
Published: (2024)
ROSA: Addressing text understanding challenges in photographs via ROtated SAmpling
by: Maina, Hernán, et al.
Published: (2025)
by: Maina, Hernán, et al.
Published: (2025)
Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech
by: Pahar, Madhurananda, et al.
Published: (2026)
by: Pahar, Madhurananda, et al.
Published: (2026)
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge
by: Wang, Xiaopeng, et al.
Published: (2024)
by: Wang, Xiaopeng, et al.
Published: (2024)
Moshi: a speech-text foundation model for real-time dialogue
by: Défossez, Alexandre, et al.
Published: (2024)
by: Défossez, Alexandre, et al.
Published: (2024)
SPGISpeech 2.0: Transcribed multi-speaker financial audio for speaker-tagged transcription
by: Grossman, Raymond, et al.
Published: (2025)
by: Grossman, Raymond, et al.
Published: (2025)
Strategies of Code-switching in Human-Machine Dialogs
by: Geckt, Dean, et al.
Published: (2025)
by: Geckt, Dean, et al.
Published: (2025)
Strategies for improving low resource speech to text translation relying on pre-trained ASR models
by: Kesiraju, Santosh, et al.
Published: (2023)
by: Kesiraju, Santosh, et al.
Published: (2023)
Domain-specific long text classification from sparse relevant information
by: D'Cruz, Célia, et al.
Published: (2024)
by: D'Cruz, Célia, et al.
Published: (2024)
An information-theoretic model of shallow and deep language comprehension
by: Li, Jiaxuan, et al.
Published: (2024)
by: Li, Jiaxuan, et al.
Published: (2024)
Improving Cross-lingual Representation for Semantic Retrieval with Code-switching
by: Maimaiti, Mieradilijiang, et al.
Published: (2024)
by: Maimaiti, Mieradilijiang, et al.
Published: (2024)
More than words: Advancements and challenges in speech recognition for singing
by: Kruspe, Anna
Published: (2024)
by: Kruspe, Anna
Published: (2024)
Articulatory strategy in vowel production as a basis for speaker discrimination
by: Lo, Justin J. H., et al.
Published: (2025)
by: Lo, Justin J. H., et al.
Published: (2025)
A Benchmark for Multi-speaker Anonymization
by: Miao, Xiaoxiao, et al.
Published: (2024)
by: Miao, Xiaoxiao, et al.
Published: (2024)
Similar Items
-
Does Dependency Locality Predict Non-canonical Word Order in Hindi?
by: Ranjan, Sidharth, et al.
Published: (2024) -
Measuring Entrainment in Spontaneous Code-switched Speech
by: Bhattacharya, Debasmita, et al.
Published: (2023) -
Semantics or spelling? Probing contextual word embeddings with orthographic noise
by: Matthews, Jacob A., et al.
Published: (2024) -
Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations
by: Mohamed, Mukhtar, et al.
Published: (2024) -
A stylometric analysis of speaker attribution from speech transcripts
by: Aggazzotti, Cristina, et al.
Published: (2025)