:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Yutian, Kang, Hao, Zhai, Vivian, Li, Liangze, Singh, Rita, Raj, Bhiksha
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2311.08723
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DELULU: Discriminative Embedding Learning Using Latent Units for Speaker-Aware Self-Trained Speech Foundational Model
by: Baali, Massa, et al.
Published: (2025)

On the Robust Approximation of ASR Metrics
by: Waheed, Abdul, et al.
Published: (2025)

What and When to Learn: CURriculum Ranking Loss for Large-Scale Speaker Verification
by: Baali, Massa, et al.
Published: (2026)

What Do Speech Foundation Models Not Learn About Speech?
by: Waheed, Abdul, et al.
Published: (2024)

PhoniTale: Phonologically Grounded Mnemonic Generation for Typologically Distant Language Pairs
by: Kang, Sana, et al.
Published: (2025)

CAARMA: Class Augmentation with Adversarial Mixup Regularization
by: Baali, Massa, et al.
Published: (2025)

Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models
by: Atwany, Hanin, et al.
Published: (2025)

PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification
by: Baali, Massa, et al.
Published: (2024)

SVeritas: Benchmark for Robust Speaker Verification under Diverse Conditions
by: Baali, Massa, et al.
Published: (2025)

Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
by: Sharma, Roshan, et al.
Published: (2024)

Less is More Tokens: Efficient Math Reasoning via Difficulty-Aware Chain-of-Thought Distillation
by: Waheed, Abdul, et al.
Published: (2025)

NITP: Next Implicit Token Prediction for LLM Pre-training
by: Zhang, Xiangdong, et al.
Published: (2026)

Revisiting Acoustic Features for Robust ASR
by: Shah, Muhammad A., et al.
Published: (2024)

On Fairness of Unified Multimodal Large Language Model for Image Generation
by: Liu, Ming, et al.
Published: (2025)

CoLMbo: Speaker Language Model for Descriptive Profiling
by: Baali, Massa, et al.
Published: (2025)

Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction
by: Qian, Junlang, et al.
Published: (2025)

On the Diversity of Synthetic Data and its Impact on Training Large Language Models
by: Chen, Hao, et al.
Published: (2024)

WhisperRT -- Turning Whisper into a Causal Streaming Model
by: Krichli, Tomer, et al.
Published: (2025)

Human Voice is Unique
by: Singh, Rita, et al.
Published: (2025)

Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text
by: Agrahari, Shifali, et al.
Published: (2025)

OleSpeech-IV: A Large-Scale Multispeaker and Multilingual Conversational Speech Dataset with Diverse Topics
by: Chu, Wei, et al.
Published: (2025)

Explainability-Based Token Replacement on LLM-Generated Text
by: Mohammadi, Hadi, et al.
Published: (2025)

Text Generation Beyond Discrete Token Sampling
by: Zhuang, Yufan, et al.
Published: (2025)

AugSumm: towards generalizable speech summarization using synthetic labels from large language model
by: Jung, Jee-weon, et al.
Published: (2024)

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition
by: Chen, Zhaorun, et al.
Published: (2024)

Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness
by: Ma, Shixuan, et al.
Published: (2024)

VideoJudge: Bootstrapping Enables Scalable Supervision of MLLM-as-a-Judge for Video Understanding
by: Waheed, Abdul, et al.
Published: (2025)

ImageFolder: Autoregressive Image Generation with Folded Tokens
by: Li, Xiang, et al.
Published: (2024)

Learning to Rewrite: Generalized LLM-Generated Text Detection
by: Li, Ran, et al.
Published: (2024)

TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
by: Lu, Songshuo, et al.
Published: (2024)

Exons-Detect: Identifying and Amplifying Exonic Tokens via Hidden-State Discrepancy for Robust AI-Generated Text Detection
by: Zhu, Xiaowei, et al.
Published: (2026)

uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes
by: Waheed, Abdul, et al.
Published: (2024)

Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling
by: Yu, Yao-Ching, et al.
Published: (2024)

Alternatives To Next Token Prediction In Text Generation -- A Survey
by: Wyatt, Charlie, et al.
Published: (2025)

Text-to-Distribution Prediction with Quantile Tokens and Neighbor Context
by: Zhu, Yilun, et al.
Published: (2026)

Implicit Optimization Bias of Next-Token Prediction in Linear Models
by: Thrampoulidis, Christos
Published: (2024)

Text2Token: Unsupervised Text Representation Learning with Token Target Prediction
by: An, Ruize, et al.
Published: (2025)

Zero-Shot Detection of LLM-Generated Text via Implicit Reward Model
by: Liu, Runheng, et al.
Published: (2026)

Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Output Prefilling
by: Cappelletti, Silvia, et al.
Published: (2025)

Evaluating and Improving Continual Learning in Spoken Language Understanding
by: Yang, Muqiao, et al.
Published: (2024)