Saved in:
| Main Authors: | Kesiraju, Santosh, Sagar, Sangeet, Glembek, Ondřej, Burget, Lukáš, Černocký, Ján, Gangashetty, Suryakanth V |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2007.01359 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets
by: Pešán, Jan, et al.
Published: (2024)
by: Pešán, Jan, et al.
Published: (2024)
Aligning Pre-trained Models for Spoken Language Translation
by: Sedláček, Šimon, et al.
Published: (2024)
by: Sedláček, Šimon, et al.
Published: (2024)
Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
by: Polok, Alexander, et al.
Published: (2024)
by: Polok, Alexander, et al.
Published: (2024)
DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
by: Polok, Alexander, et al.
Published: (2025)
by: Polok, Alexander, et al.
Published: (2025)
Joint Speech and Text Training for LLM-Based End-to-End Spoken Dialogue State Tracking
by: Vendrame, Katia, et al.
Published: (2025)
by: Vendrame, Katia, et al.
Published: (2025)
Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
by: Sedláček, Šimon, et al.
Published: (2025)
by: Sedláček, Šimon, et al.
Published: (2025)
Factors affecting the in-context learning abilities of LLMs for dialogue state tracking
by: Hegde, Pradyoth, et al.
Published: (2025)
by: Hegde, Pradyoth, et al.
Published: (2025)
Adapting Diarization-Conditioned Whisper for End-to-End Multi-Talker Speech Recognition
by: Kocour, Martin, et al.
Published: (2025)
by: Kocour, Martin, et al.
Published: (2025)
Robustness assessment of large audio language models in multiple-choice evaluation
by: López, Fernando, et al.
Published: (2025)
by: López, Fernando, et al.
Published: (2025)
IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation
by: Akkiraju, Bhavana, et al.
Published: (2025)
by: Akkiraju, Bhavana, et al.
Published: (2025)
FLiP: Towards understanding and interpreting multimodal multilingual sentence embeddings
by: Kesiraju, Santosh, et al.
Published: (2026)
by: Kesiraju, Santosh, et al.
Published: (2026)
ORCA: Open-ended Response Correctness Assessment for Audio Question Answering
by: Sedláček, Šimon, et al.
Published: (2025)
by: Sedláček, Šimon, et al.
Published: (2025)
Zero-shot Audio Topic Reranking using Large Language Models
by: Qian, Mengjie, et al.
Published: (2023)
by: Qian, Mengjie, et al.
Published: (2023)
Strategies for improving low resource speech to text translation relying on pre-trained ASR models
by: Kesiraju, Santosh, et al.
Published: (2023)
by: Kesiraju, Santosh, et al.
Published: (2023)
Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units
by: Yusuf, Bolaji, et al.
Published: (2024)
by: Yusuf, Bolaji, et al.
Published: (2024)
End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data
by: Pothula, Aishwarya, et al.
Published: (2025)
by: Pothula, Aishwarya, et al.
Published: (2025)
Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios?
by: Vandenbulcke, Zeno, et al.
Published: (2024)
by: Vandenbulcke, Zeno, et al.
Published: (2024)
CzechTopic: A Benchmark for Zero-Shot Topic Localization in Historical Czech Documents
by: Kostelník, Martin, et al.
Published: (2026)
by: Kostelník, Martin, et al.
Published: (2026)
TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
by: Koto, Fajri, et al.
Published: (2024)
by: Koto, Fajri, et al.
Published: (2024)
Unsupervised Speech Enhancement using Data-defined Priors
by: Klement, Dominik, et al.
Published: (2025)
by: Klement, Dominik, et al.
Published: (2025)
Multilingual Topic Classification in X: Dataset and Analysis
by: Antypas, Dimosthenis, et al.
Published: (2024)
by: Antypas, Dimosthenis, et al.
Published: (2024)
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling
by: Li, Zekun, et al.
Published: (2024)
by: Li, Zekun, et al.
Published: (2024)
Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions
by: Moraes, Daniel de S., et al.
Published: (2024)
by: Moraes, Daniel de S., et al.
Published: (2024)
Enhancing Zero-shot Chain of Thought Prompting via Uncertainty-Guided Strategy Selection
by: Kumar, Shanu, et al.
Published: (2024)
by: Kumar, Shanu, et al.
Published: (2024)
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
by: Liu, Zhengliang, et al.
Published: (2023)
by: Liu, Zhengliang, et al.
Published: (2023)
Evaluating Cross-Lingual Classification Approaches Enabling Topic Discovery for Multilingual Social Media Data
by: Uniyal, Deepak, et al.
Published: (2026)
by: Uniyal, Deepak, et al.
Published: (2026)
Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics
by: Tawara, Naohiro, et al.
Published: (2026)
by: Tawara, Naohiro, et al.
Published: (2026)
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
by: Zhuo, Le, et al.
Published: (2023)
by: Zhuo, Le, et al.
Published: (2023)
HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling
by: Sakai, Hajar, et al.
Published: (2025)
by: Sakai, Hajar, et al.
Published: (2025)
SciTopic: Enhancing Topic Discovery in Scientific Literature through Advanced LLM
by: Li, Pengjiang, et al.
Published: (2025)
by: Li, Pengjiang, et al.
Published: (2025)
Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers
by: Dhakal, Prakash, et al.
Published: (2024)
by: Dhakal, Prakash, et al.
Published: (2024)
Modeling Overlapped Speech with Shuffles
by: Wiesner, Matthew, et al.
Published: (2026)
by: Wiesner, Matthew, et al.
Published: (2026)
Multilingual Text Style Transfer: Datasets & Models for Indian Languages
by: Mukherjee, Sourabrata, et al.
Published: (2024)
by: Mukherjee, Sourabrata, et al.
Published: (2024)
Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD
by: Pražák, Ondřej, et al.
Published: (2024)
by: Pražák, Ondřej, et al.
Published: (2024)
Isolating Culture Neurons in Multilingual Large Language Models
by: Namazifard, Danial, et al.
Published: (2025)
by: Namazifard, Danial, et al.
Published: (2025)
Zero-shot Large Language Models for Automatic Readability Assessment
by: Grossman, Riley, et al.
Published: (2026)
by: Grossman, Riley, et al.
Published: (2026)
Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic?
by: Nedumpozhimana, Vasudevan, et al.
Published: (2024)
by: Nedumpozhimana, Vasudevan, et al.
Published: (2024)
IntentGPT: Few-shot Intent Discovery with Large Language Models
by: Rodriguez, Juan A., et al.
Published: (2024)
by: Rodriguez, Juan A., et al.
Published: (2024)
GeniL: A Multilingual Dataset on Generalizing Language
by: Davani, Aida Mostafazadeh, et al.
Published: (2024)
by: Davani, Aida Mostafazadeh, et al.
Published: (2024)
Similar Items
-
Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets
by: Pešán, Jan, et al.
Published: (2024) -
Aligning Pre-trained Models for Spoken Language Translation
by: Sedláček, Šimon, et al.
Published: (2024) -
Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
by: Polok, Alexander, et al.
Published: (2024) -
DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
by: Polok, Alexander, et al.
Published: (2025) -
Joint Speech and Text Training for LLM-Based End-to-End Spoken Dialogue State Tracking
by: Vendrame, Katia, et al.
Published: (2025)