:: Library Catalog

Bibliographic Details
Main Authors:	Sundaresan, Sai; Chopra, Harshita; Sinha, Atanu R.; Goswami, Koustava; Naidu, Nagasai Saketh; Karan, Raghav; Anushka, N
Format:	Preprint
Published:	2025
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.15474

Similar Items

Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment
by: Fazili, Barah, et al.
Published: (2026)

Feedback-Aware Monte Carlo Tree Search for Efficient Information Seeking in Goal-Oriented Conversations
by: Chopra, Harshita, et al.
Published: (2025)

SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
by: Nag, Sayan, et al.
Published: (2024)

Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents
by: Chopra, Harshita, et al.
Published: (2026)

Semantically Cohesive Word Grouping in Indian Languages
by: Karthika, N J, et al.
Published: (2025)

Thinking Ahead: Prospection-Guided Retrieval of Memory with Language Models
by: Chopra, Harshita, et al.
Published: (2026)

Delivery Optimized Discovery in Behavioral User Segmentation under Budget Constraint
by: Chopra, Harshita, et al.
Published: (2024)

Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications
by: Kapu, Nirmal Joshua, et al.
Published: (2024)

SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG
by: Zhang, Xuechen, et al.
Published: (2025)

keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection
by: Vemula, Saketh Reddy, et al.
Published: (2025)

CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios
by: Garg, Raghav, et al.
Published: (2025)

An Audit on the Perspectives and Challenges of Hallucinations in NLP
by: Venkit, Pranav Narayanan, et al.
Published: (2024)

OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents
by: Thind, Raghav, et al.
Published: (2025)

RLHF: A comprehensive Survey for Cultural, Multimodal and Low Latency Alignment Methods
by: Sharma, Raghav, et al.
Published: (2025)

Inductive Linguistic Reasoning with Large Language Models
by: Ramji, Raghav, et al.
Published: (2024)

Chasing Random: Instruction Selection Strategies Fail to Generalize
by: Diddee, Harshita, et al.
Published: (2024)

Knowledge Navigator: LLM-guided Browsing Framework for Exploratory Search in Scientific Literature
by: Katz, Uri, et al.
Published: (2024)

CIE: Controlling Language Model Text Generations Using Continuous Signals
by: Samuel, Vinay, et al.
Published: (2025)

HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis
by: Barke, Shraddha, et al.
Published: (2024)

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
by: Li, Shilong, et al.
Published: (2025)

View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
by: Chopra, Tanush, et al.
Published: (2024)

Aligning Large Language Model Behavior with Human Citation Preferences
by: Ando, Kenichiro, et al.
Published: (2026)

Evaluating Consistency and Reasoning Capabilities of Large Language Models
by: Saxena, Yash, et al.
Published: (2024)

How does Misinformation Affect Large Language Model Behaviors and Preferences?
by: Peng, Miao, et al.
Published: (2025)

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures
by: Ying, Shuangshuang, et al.
Published: (2025)

BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents
by: Ou, Litu, et al.
Published: (2025)

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive
by: Pal, Arka, et al.
Published: (2024)

Steering Risk Preferences in Large Language Models by Aligning Behavioral and Neural Representations
by: Zhu, Jian-Qiao, et al.
Published: (2025)

Forecasting Live Chat Intent from Browsing History
by: Yoon, Se-eun, et al.
Published: (2024)

Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models
by: Gatla, Praveen, et al.
Published: (2025)

Beacon: Single-Turn Diagnosis and Mitigation of Latent Sycophancy in Large Language Models
by: Pandey, Sanskar, et al.
Published: (2025)

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
by: Yu, Tao, et al.
Published: (2025)

ReplicatorBench: Benchmarking LLM Agents for Replicability in Social and Behavioral Sciences
by: Nguyen, Bang, et al.
Published: (2026)

Dissecting Human and LLM Preferences
by: Li, Junlong, et al.
Published: (2024)

How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation
by: Chopra, Muskaan, et al.
Published: (2025)

Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment
by: Vemula, Saketh Reddy, et al.
Published: (2025)

Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation
by: Austin, David Eric, et al.
Published: (2024)

Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference
by: Vashishtha, Aniket, et al.
Published: (2023)

GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
by: Kharyal, Chaitanya, et al.
Published: (2024)

PhonologyBench: Evaluating Phonological Skills of Large Language Models
by: Suvarna, Ashima, et al.
Published: (2024)