| Main Authors: | Kargupta, Priyanka, Li, Shuyue Stella, Wang, Haocheng, Lee, Jinu, Chen, Shan, Ahia, Orevaoghene, Light, Dean, Griffiths, Thomas L., Kleiman-Weiner, Max, Han, Jiawei, Celikyilmaz, Asli, Tsvetkov, Yulia |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.16660 |
Similar Items
Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
by: Xie, Roy, et al.
Published: (2024)
Teaching LLMs to Abstain across Languages via Multilingual Feedback
by: Feng, Shangbin, et al.
Published: (2024)
BASS: Benchmarking Audio LMs for Musical Structure and Semantic Reasoning
by: Jang, Min, et al.
Published: (2026)
Deep Reasoning in General Purpose Agents via Structured Meta-Cognition
by: Light, Dean, et al.
Published: (2026)
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
by: Faisal, Fahim, et al.
Published: (2024)
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
by: Ahia, Orevaoghene, et al.
Published: (2024)
Cold-Start Personalization via Training-Free Priors from Structured World Models
by: Bose, Avinandan, et al.
Published: (2026)
IDIOLEX: Unified and Continuous Representations for Idiolectal and Stylistic Variation
by: Kantharuban, Anjali, et al.
Published: (2026)
Evaluating LLMs in Open-Source Games
by: Sistla, Swadesh, et al.
Published: (2025)
PrefPalette: Personalized Preference Modeling with Latent Attributes
by: Li, Shuyue Stella, et al.
Published: (2025)
Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
by: Wu, Addison J., et al.
Published: (2026)
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
by: Ahia, Orevaoghene, et al.
Published: (2024)
MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning
by: Li, Shuyue Stella, et al.
Published: (2024)
FLEXITOKENS: Flexible Tokenization for Evolving Language Models
by: Owodunni, Abraham Toluwase, et al.
Published: (2025)
HorizonBench: Long-Horizon Personalization with Evolving Preferences
by: Li, Shuyue Stella, et al.
Published: (2026)
Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning
by: Sclar, Melanie, et al.
Published: (2024)
Beyond True or False: Retrieval-Augmented Hierarchical Analysis of Nuanced Claims
by: Kargupta, Priyanka, et al.
Published: (2025)
Value Internalization: Learning and Generalizing from Social Reward
by: Rong, Frieda, et al.
Published: (2024)
Synergizing Unsupervised Episode Detection with LLMs for Large-Scale News Events
by: Kargupta, Priyanka, et al.
Published: (2024)
BLAB: Brutally Long Audio Bench
by: Ahia, Orevaoghene, et al.
Published: (2025)
An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents
by: Jin, Bowen, et al.
Published: (2025)
PrefDisco: Benchmarking Proactive Personalized Reasoning
by: Li, Shuyue Stella, et al.
Published: (2025)
Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis
by: Kargupta, Priyanka, et al.
Published: (2025)
InfoGatherer: Principled Information Seeking via Evidence Retrieval and Strategic Questioning
by: Taranukhin, Maksym, et al.
Published: (2026)
Frame-Level Internal Tool Use for Temporal Grounding in Audio LMs
by: An, Joesph, et al.
Published: (2026)
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling
by: Limisiewicz, Tomasz, et al.
Published: (2024)
ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
by: Li, Shuyue Stella, et al.
Published: (2025)
Boundedly Rational Meta-Learning in Sequential Consumer Choice
by: Khosravi, Mehrzad, et al.
Published: (2026)
Estimating the Empowerment of Language Model Agents
by: Song, Jinyeop, et al.
Published: (2025)
When Empowerment Disempowers
by: Yang, Claire, et al.
Published: (2025)
Instruct, Not Assist: LLM-based Multi-Turn Planning and Hierarchical Questioning for Socratic Code Debugging
by: Kargupta, Priyanka, et al.
Published: (2024)
Instructor-Aligned Knowledge Graphs for Personalized Learning
by: AlRabah, Abdulrahman, et al.
Published: (2026)
Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration
by: Kargupta, Priyanka, et al.
Published: (2026)
ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
by: Park, Chan Young, et al.
Published: (2024)
Preserving Sense of Agency: User Preferences for Robot Autonomy and User Control across Household Tasks
by: Yang, Claire, et al.
Published: (2025)
Finding Flawed Fictions: Evaluating Complex Reasoning in Language Models via Plot Hole Detection
by: Ahuja, Kabir, et al.
Published: (2025)
Are Language Models Consequentialist or Deontological Moral Reasoners?
by: Samway, Keenan, et al.
Published: (2025)
How LLMs Distort Our Written Language
by: Abdulhai, Marwa, et al.
Published: (2026)
TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora
by: Kargupta, Priyanka, et al.
Published: (2025)
Grounding Agent Memory in Contextual Intent
by: Yang, Ruozhen, et al.
Published: (2026)