Saved in:
| Main Authors: | Sundaresan, Sai, Chopra, Harshita, Sinha, Atanu R., Goswami, Koustava, Naidu, Nagasai Saketh, Karan, Raghav, Anushka, N |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.15474 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment
by: Fazili, Barah, et al.
Published: (2026)
by: Fazili, Barah, et al.
Published: (2026)
Feedback-Aware Monte Carlo Tree Search for Efficient Information Seeking in Goal-Oriented Conversations
by: Chopra, Harshita, et al.
Published: (2025)
by: Chopra, Harshita, et al.
Published: (2025)
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
by: Nag, Sayan, et al.
Published: (2024)
by: Nag, Sayan, et al.
Published: (2024)
Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents
by: Chopra, Harshita, et al.
Published: (2026)
by: Chopra, Harshita, et al.
Published: (2026)
Semantically Cohesive Word Grouping in Indian Languages
by: Karthika, N J, et al.
Published: (2025)
by: Karthika, N J, et al.
Published: (2025)
Thinking Ahead: Prospection-Guided Retrieval of Memory with Language Models
by: Chopra, Harshita, et al.
Published: (2026)
by: Chopra, Harshita, et al.
Published: (2026)
Delivery Optimized Discovery in Behavioral User Segmentation under Budget Constraint
by: Chopra, Harshita, et al.
Published: (2024)
by: Chopra, Harshita, et al.
Published: (2024)
Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications
by: Kapu, Nirmal Joshua, et al.
Published: (2024)
by: Kapu, Nirmal Joshua, et al.
Published: (2024)
SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG
by: Zhang, Xuechen, et al.
Published: (2025)
by: Zhang, Xuechen, et al.
Published: (2025)
keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection
by: Vemula, Saketh Reddy, et al.
Published: (2025)
by: Vemula, Saketh Reddy, et al.
Published: (2025)
CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios
by: Garg, Raghav, et al.
Published: (2025)
by: Garg, Raghav, et al.
Published: (2025)
An Audit on the Perspectives and Challenges of Hallucinations in NLP
by: Venkit, Pranav Narayanan, et al.
Published: (2024)
by: Venkit, Pranav Narayanan, et al.
Published: (2024)
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents
by: Thind, Raghav, et al.
Published: (2025)
by: Thind, Raghav, et al.
Published: (2025)
RLHF: A comprehensive Survey for Cultural, Multimodal and Low Latency Alignment Methods
by: Sharma, Raghav, et al.
Published: (2025)
by: Sharma, Raghav, et al.
Published: (2025)
Inductive Linguistic Reasoning with Large Language Models
by: Ramji, Raghav, et al.
Published: (2024)
by: Ramji, Raghav, et al.
Published: (2024)
Chasing Random: Instruction Selection Strategies Fail to Generalize
by: Diddee, Harshita, et al.
Published: (2024)
by: Diddee, Harshita, et al.
Published: (2024)
Knowledge Navigator: LLM-guided Browsing Framework for Exploratory Search in Scientific Literature
by: Katz, Uri, et al.
Published: (2024)
by: Katz, Uri, et al.
Published: (2024)
CIE: Controlling Language Model Text Generations Using Continuous Signals
by: Samuel, Vinay, et al.
Published: (2025)
by: Samuel, Vinay, et al.
Published: (2025)
HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis
by: Barke, Shraddha, et al.
Published: (2024)
by: Barke, Shraddha, et al.
Published: (2024)
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
by: Li, Shilong, et al.
Published: (2025)
by: Li, Shilong, et al.
Published: (2025)
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
by: Chopra, Tanush, et al.
Published: (2024)
by: Chopra, Tanush, et al.
Published: (2024)
Aligning Large Language Model Behavior with Human Citation Preferences
by: Ando, Kenichiro, et al.
Published: (2026)
by: Ando, Kenichiro, et al.
Published: (2026)
Evaluating Consistency and Reasoning Capabilities of Large Language Models
by: Saxena, Yash, et al.
Published: (2024)
by: Saxena, Yash, et al.
Published: (2024)
How does Misinformation Affect Large Language Model Behaviors and Preferences?
by: Peng, Miao, et al.
Published: (2025)
by: Peng, Miao, et al.
Published: (2025)
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures
by: Ying, Shuangshuang, et al.
Published: (2025)
by: Ying, Shuangshuang, et al.
Published: (2025)
BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents
by: Ou, Litu, et al.
Published: (2025)
by: Ou, Litu, et al.
Published: (2025)
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive
by: Pal, Arka, et al.
Published: (2024)
by: Pal, Arka, et al.
Published: (2024)
Steering Risk Preferences in Large Language Models by Aligning Behavioral and Neural Representations
by: Zhu, Jian-Qiao, et al.
Published: (2025)
by: Zhu, Jian-Qiao, et al.
Published: (2025)
Forecasting Live Chat Intent from Browsing History
by: Yoon, Se-eun, et al.
Published: (2024)
by: Yoon, Se-eun, et al.
Published: (2024)
Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models
by: Gatla, Praveen, et al.
Published: (2025)
by: Gatla, Praveen, et al.
Published: (2025)
Beacon: Single-Turn Diagnosis and Mitigation of Latent Sycophancy in Large Language Models
by: Pandey, Sanskar, et al.
Published: (2025)
by: Pandey, Sanskar, et al.
Published: (2025)
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
by: Yu, Tao, et al.
Published: (2025)
by: Yu, Tao, et al.
Published: (2025)
ReplicatorBench: Benchmarking LLM Agents for Replicability in Social and Behavioral Sciences
by: Nguyen, Bang, et al.
Published: (2026)
by: Nguyen, Bang, et al.
Published: (2026)
Dissecting Human and LLM Preferences
by: Li, Junlong, et al.
Published: (2024)
by: Li, Junlong, et al.
Published: (2024)
How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation
by: Chopra, Muskaan, et al.
Published: (2025)
by: Chopra, Muskaan, et al.
Published: (2025)
Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment
by: Vemula, Saketh Reddy, et al.
Published: (2025)
by: Vemula, Saketh Reddy, et al.
Published: (2025)
Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation
by: Austin, David Eric, et al.
Published: (2024)
by: Austin, David Eric, et al.
Published: (2024)
Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference
by: Vashishtha, Aniket, et al.
Published: (2023)
by: Vashishtha, Aniket, et al.
Published: (2023)
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
by: Kharyal, Chaitanya, et al.
Published: (2024)
by: Kharyal, Chaitanya, et al.
Published: (2024)
PhonologyBench: Evaluating Phonological Skills of Large Language Models
by: Suvarna, Ashima, et al.
Published: (2024)
by: Suvarna, Ashima, et al.
Published: (2024)
Similar Items
-
Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment
by: Fazili, Barah, et al.
Published: (2026) -
Feedback-Aware Monte Carlo Tree Search for Efficient Information Seeking in Goal-Oriented Conversations
by: Chopra, Harshita, et al.
Published: (2025) -
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
by: Nag, Sayan, et al.
Published: (2024) -
Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents
by: Chopra, Harshita, et al.
Published: (2026) -
Semantically Cohesive Word Grouping in Indian Languages
by: Karthika, N J, et al.
Published: (2025)