:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bhattacharya, Debasmita, van Schijndel, Marten
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2408.04596
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Does Dependency Locality Predict Non-canonical Word Order in Hindi?
by: Ranjan, Sidharth, et al.
Published: (2024)

Measuring Entrainment in Spontaneous Code-switched Speech
by: Bhattacharya, Debasmita, et al.
Published: (2023)

Semantics or spelling? Probing contextual word embeddings with orthographic noise
by: Matthews, Jacob A., et al.
Published: (2024)

Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations
by: Mohamed, Mukhtar, et al.
Published: (2024)

A stylometric analysis of speaker attribution from speech transcripts
by: Aggazzotti, Cristina, et al.
Published: (2025)

Analyzing the relationships between pretraining language, phonetic, tonal, and speaker information in self-supervised speech models
by: Gubian, Michele, et al.
Published: (2025)

AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
by: Shingi, Geet, et al.
Published: (2021)

Addressing speaker gender bias in large scale speech translation systems
by: Bansal, Shubham, et al.
Published: (2025)

Classifying populist language in American presidential and governor speeches using automatic text analysis
by: van der Veen, Olaf, et al.
Published: (2024)

1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis
by: Ogun, Sewade, et al.
Published: (2024)

End-to-end Speech Recognition with similar length speech and text
by: Fan, Peng, et al.
Published: (2025)

IndRegBias: A Dataset for Studying Indian Regional Biases in English and Code-Mixed Social Media Comments
by: Panda, Debasmita, et al.
Published: (2026)

Losing Phonotactic Distinctions in Context
by: John R. Starr, et al.
Published: (2025)

An efficient text augmentation approach for contextualized Mandarin speech recognition
by: Zheng, Naijun, et al.
Published: (2024)

Transferable speech-to-text large language model alignment module
by: Wu, Boyong, et al.
Published: (2024)

Rethinking stance detection: A theoretically-informed research agenda for user-level inference using language models
by: Bhattacharya, Prasanta, et al.
Published: (2025)

Code-mixed Sentiment and Hate-speech Prediction
by: Yadav, Anjali, et al.
Published: (2024)

Target speaker anonymization in multi-speaker recordings
by: Tomashenko, Natalia, et al.
Published: (2025)

Natural language guidance of high-fidelity text-to-speech with synthetic annotations
by: Lyth, Dan, et al.
Published: (2024)

Exploring the topics, sentiments and hate speech in the Spanish information environment
by: LOPEZ, ALEJANDRO BUITRAGO, et al.
Published: (2024)

PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech
by: Rahman, Hanif
Published: (2026)

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
by: Fujita, Kenichi, et al.
Published: (2024)

Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
by: Garg, Abhinav, et al.
Published: (2024)

A unified front-end framework for English text-to-speech synthesis
by: Ying, Zelin, et al.
Published: (2023)

Are LLMs good pragmatic speakers?
by: Jian, Mingyue, et al.
Published: (2024)

The evaluation of a code-switched Sepedi-English automatic speech recognition system
by: Phaladi, Amanda, et al.
Published: (2024)

Advancing Chinese biomedical text mining with community challenges
by: Zong, Hui, et al.
Published: (2024)

ROSA: Addressing text understanding challenges in photographs via ROtated SAmpling
by: Maina, Hernán, et al.
Published: (2025)

Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech
by: Pahar, Madhurananda, et al.
Published: (2026)

A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge
by: Wang, Xiaopeng, et al.
Published: (2024)

Moshi: a speech-text foundation model for real-time dialogue
by: Défossez, Alexandre, et al.
Published: (2024)

SPGISpeech 2.0: Transcribed multi-speaker financial audio for speaker-tagged transcription
by: Grossman, Raymond, et al.
Published: (2025)

Strategies of Code-switching in Human-Machine Dialogs
by: Geckt, Dean, et al.
Published: (2025)

Strategies for improving low resource speech to text translation relying on pre-trained ASR models
by: Kesiraju, Santosh, et al.
Published: (2023)

Domain-specific long text classification from sparse relevant information
by: D'Cruz, Célia, et al.
Published: (2024)

An information-theoretic model of shallow and deep language comprehension
by: Li, Jiaxuan, et al.
Published: (2024)

Improving Cross-lingual Representation for Semantic Retrieval with Code-switching
by: Maimaiti, Mieradilijiang, et al.
Published: (2024)

More than words: Advancements and challenges in speech recognition for singing
by: Kruspe, Anna
Published: (2024)

Articulatory strategy in vowel production as a basis for speaker discrimination
by: Lo, Justin J. H., et al.
Published: (2025)

A Benchmark for Multi-speaker Anonymization
by: Miao, Xiaoxiao, et al.
Published: (2024)