Saved in:
| Main Authors: | Sathyanathan, Rose, Vasisht, Kinshuk, Pruthi, Danish |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.03050 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Knowledge Graph Guided Evaluation of Abstention Techniques
by: Vasisht, Kinshuk, et al.
Published: (2024)
by: Vasisht, Kinshuk, et al.
Published: (2024)
Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations
by: Bhagat, Kirti, et al.
Published: (2024)
by: Bhagat, Kirti, et al.
Published: (2024)
Evaluating Large Language Models for Health-related Queries with Presuppositions
by: Kaur, Navreet, et al.
Published: (2023)
by: Kaur, Navreet, et al.
Published: (2023)
Infusing Knowledge into Large Language Models with Contextual Prompts
by: Vasisht, Kinshuk, et al.
Published: (2024)
by: Vasisht, Kinshuk, et al.
Published: (2024)
All That Glitters is Not Novel: Plagiarism in AI Generated Research
by: Gupta, Tarun, et al.
Published: (2025)
by: Gupta, Tarun, et al.
Published: (2025)
Revisiting the Robustness of Watermarking to Paraphrasing Attacks
by: Rastogi, Saksham, et al.
Published: (2024)
by: Rastogi, Saksham, et al.
Published: (2024)
STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
by: Rastogi, Saksham, et al.
Published: (2025)
by: Rastogi, Saksham, et al.
Published: (2025)
Downstream Trade-offs of a Family of Text Watermarks
by: Ajith, Anirudh, et al.
Published: (2023)
by: Ajith, Anirudh, et al.
Published: (2023)
Presupposition and Reasoning in Conditionals: A Theory-Based Study of Humans and LLMs
by: Azin, Tara, et al.
Published: (2026)
by: Azin, Tara, et al.
Published: (2026)
Cancer-Myth: Evaluating Large Language Models on Patient Questions with False Presuppositions
by: Zhu, Wang Bill, et al.
Published: (2025)
by: Zhu, Wang Bill, et al.
Published: (2025)
The Presupposition Problem in Representation Genesis
by: Wu, Yiling
Published: (2026)
by: Wu, Yiling
Published: (2026)
Safer in Translation? Presupposition Robustness in Indic Languages
by: Palnitkar, Aadi, et al.
Published: (2025)
by: Palnitkar, Aadi, et al.
Published: (2025)
Let's CONFER: A Dataset for Evaluating Natural Language Inference Models on CONditional InFERence and Presupposition
by: Azin, Tara, et al.
Published: (2025)
by: Azin, Tara, et al.
Published: (2025)
FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes
by: Nawale, Janki Atul, et al.
Published: (2025)
by: Nawale, Janki Atul, et al.
Published: (2025)
TALES: A Taxonomy and Analysis of Cultural Representations in LLM-generated Stories
by: Bhagat, Kirti, et al.
Published: (2025)
by: Bhagat, Kirti, et al.
Published: (2025)
Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch
by: Shukla, Prarabdh, et al.
Published: (2025)
by: Shukla, Prarabdh, et al.
Published: (2025)
LLMs Struggle to Reject False Presuppositions when Misinformation Stakes are High
by: Sieker, Judith, et al.
Published: (2025)
by: Sieker, Judith, et al.
Published: (2025)
Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable
by: Saha, Rounak, et al.
Published: (2026)
by: Saha, Rounak, et al.
Published: (2026)
If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition
by: Dipta, Shubhashis Roy, et al.
Published: (2025)
by: Dipta, Shubhashis Roy, et al.
Published: (2025)
Audio-visual training for improved grounding in video-text LLMs
by: Sagare, Shivprasad, et al.
Published: (2024)
by: Sagare, Shivprasad, et al.
Published: (2024)
Beyond World Models: Rethinking Understanding in AI Models
by: Gupta, Tarun, et al.
Published: (2025)
by: Gupta, Tarun, et al.
Published: (2025)
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
by: Lucy, Li, et al.
Published: (2024)
by: Lucy, Li, et al.
Published: (2024)
Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models
by: Rajeev, Meghana, et al.
Published: (2025)
by: Rajeev, Meghana, et al.
Published: (2025)
PATCH: Mitigating PII Leakage in Language Models with Privacy-Aware Targeted Circuit PatcHing
by: Hughes, Anthony, et al.
Published: (2025)
by: Hughes, Anthony, et al.
Published: (2025)
Backtracing: Retrieving the Cause of the Query
by: Wang, Rose E., et al.
Published: (2024)
by: Wang, Rose E., et al.
Published: (2024)
On Code-Induced Reasoning in LLMs
by: Waheed, Abdul, et al.
Published: (2025)
by: Waheed, Abdul, et al.
Published: (2025)
Contextualized Evaluations: Judging Language Model Responses to Underspecified Queries
by: Malaviya, Chaitanya, et al.
Published: (2024)
by: Malaviya, Chaitanya, et al.
Published: (2024)
QUIETT: Query-Independent Table Transformation for Robust Reasoning
by: Najpande, Gaurav, et al.
Published: (2026)
by: Najpande, Gaurav, et al.
Published: (2026)
Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety
by: Melton, Chad, et al.
Published: (2025)
by: Melton, Chad, et al.
Published: (2025)
Evaluating Large Language Models' Responses to Sexual and Reproductive Health Queries in Nepali
by: Sharma, Medha, et al.
Published: (2026)
by: Sharma, Medha, et al.
Published: (2026)
Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization
by: Abrar, Ajwad, et al.
Published: (2025)
by: Abrar, Ajwad, et al.
Published: (2025)
Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework
by: Ali, Danish, et al.
Published: (2026)
by: Ali, Danish, et al.
Published: (2026)
Towards Evaluating Large Language Models for Graph Query Generation
by: Munir, Siraj, et al.
Published: (2024)
by: Munir, Siraj, et al.
Published: (2024)
Scaling Evaluation-time Compute with Reasoning Models as Evaluators
by: Kim, Seungone, et al.
Published: (2025)
by: Kim, Seungone, et al.
Published: (2025)
Reducing the Scope of Language Models
by: Yunis, David, et al.
Published: (2024)
by: Yunis, David, et al.
Published: (2024)
Sandwich Reasoning: An Answer-Reasoning-Answer Approach for Low-Latency Query Correction
by: Zhang, Chen, et al.
Published: (2026)
by: Zhang, Chen, et al.
Published: (2026)
Enrich-on-Graph: Query-Graph Alignment for Complex Reasoning with LLM Enriching
by: Li, Songze, et al.
Published: (2025)
by: Li, Songze, et al.
Published: (2025)
Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning
by: Xu, Mufan, et al.
Published: (2025)
by: Xu, Mufan, et al.
Published: (2025)
Reasoning-Aware Query-Focused Summarization over Multi-Table Data
by: Lin, Xiaochuan, et al.
Published: (2024)
by: Lin, Xiaochuan, et al.
Published: (2024)
Toward Multi-Database Query Reasoning for Text2Cypher
by: Ozsoy, Makbule Gulcin
Published: (2026)
by: Ozsoy, Makbule Gulcin
Published: (2026)
Similar Items
-
Knowledge Graph Guided Evaluation of Abstention Techniques
by: Vasisht, Kinshuk, et al.
Published: (2024) -
Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations
by: Bhagat, Kirti, et al.
Published: (2024) -
Evaluating Large Language Models for Health-related Queries with Presuppositions
by: Kaur, Navreet, et al.
Published: (2023) -
Infusing Knowledge into Large Language Models with Contextual Prompts
by: Vasisht, Kinshuk, et al.
Published: (2024) -
All That Glitters is Not Novel: Plagiarism in AI Generated Research
by: Gupta, Tarun, et al.
Published: (2025)