:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xu, Yifei, Potje, Guilherme, Shandilya, Shivam, Yuan, Tiancheng, Nunes, Leonardo de Oliveira, Agarwal, Rakshanda, Asgari, Saeid, Atkinson, Adam, Kıcıman, Emre, Lu, Songwu, Chandra, Ranveer, Chakraborty, Tusher
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2602.20751
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Diagnosing Capability Gaps in Fine-Tuning Data
by: Taghanaki, Saeid Asgari, et al.
Published: (2026)

Direct Reasoning Optimization: Token-Level Reasoning Reflectivity Meets Rubric Gates for Unverifiable Tasks
by: Xu, Yifei, et al.
Published: (2025)

RLTHF: Targeted Human Feedback for LLM Alignment
by: Xu, Yifei, et al.
Published: (2025)

Orchestration for Domain-specific Edge-Cloud Language Models
by: Patidar, Prasoon, et al.
Published: (2025)

Domain Adaptation for Sustainable Soil Management using Causal and Contrastive Constraint Minimization
by: Sharma, Somya, et al.
Published: (2024)

Enabling Adoption of Regenerative Agriculture through Soil Carbon Copilots
by: Capetz, Margaret, et al.
Published: (2024)

Sibyll$^{\bigstar}$
by: Riehn, Felix, et al.
Published: (2024)

Head of a Sibyl
by: Scan-the-World
Published: (2026)

Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization
by: Kaur, Jivat Neet, et al.
Published: (2022)

Causal Reasoning and Large Language Models: Opening a New Frontier for Causality
by: Kıcıman, Emre, et al.
Published: (2023)

Inverse design of heterodeformations for strain soliton networks in bilayer 2D materials
by: Ahmed, Md Tusher, et al.
Published: (2026)

DeepSpecs: Expert-Level Questions Answering in 5G
by: Manvattira, Aman Ganapathy, et al.
Published: (2025)

Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy
by: Taghanaki, Saeid Asgari, et al.
Published: (2025)

Detecting Generative Parroting through Overfitting Masked Autoencoders
by: Taghanaki, Saeid Asgari, et al.
Published: (2024)

Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations
by: Matton, Katie, et al.
Published: (2025)

Narcotweets: Social Media in Wartime
by: Monroy-Hernández, Andrés, et al.
Published: (2015)

XFeat: Accelerated Features for Lightweight Image Matching
by: Potje, Guilherme, et al.
Published: (2024)

Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence
by: Cadar, Felipe, et al.
Published: (2024)

Sibyl: Forecasting Time-Evolving Query Workloads
by: Huang, Hanxian, et al.
Published: (2024)

Enterprise AI Must Enforce Participant-Aware Access Control
by: Bhatt, Shashank Shreedhar, et al.
Published: (2025)

MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing
by: Agarwal, Siddhant, et al.
Published: (2024)

MMLU-Pro+: Evaluating Higher-Order Reasoning and Shortcut Learning in LLMs
by: Taghanaki, Saeid Asgari, et al.
Published: (2024)

How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model?
by: Taghanaki, Saeid Asgari, et al.
Published: (2024)

Exploring the Efficiency of Renewable Energy-based Modular Data Centers at Scale
by: Sun, Jinghan, et al.
Published: (2024)

Wireless Spectrum in Rural Farmlands: Status, Challenges and Opportunities
by: Shahid, Mukaram, et al.
Published: (2024)

Defending Against Indirect Prompt Injection Attacks With Spotlighting
by: Hines, Keegan, et al.
Published: (2024)

Multiscale analysis of large twist ferroelectricity and swirling dislocations in bilayer hexagonal boron nitride
by: Ahmed, Md Tusher, et al.
Published: (2025)

Probing Protease‐Mediated Decrease of Albumin‐Catalyzed Kemp Elimination: Potential Application as Biosensors
by: Shikha Shikha, et al.
Published: (2025)

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
by: Wang, Yulong, et al.
Published: (2024)

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
by: Yi, Jingwei, et al.
Published: (2023)

Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning
by: Mecklenburg, Nick, et al.
Published: (2024)

Quantifying superlubricity of bilayer graphene from the mobility of interface dislocations
by: Ahmed, Md Tusher, et al.
Published: (2025)

Algebraic Structures In Closed Superstring Field Theory, Homotopy Transfer And Effective Actions
by: Singh, Ranveer Kumar
Published: (2024)

Scale-(in)dependence in quantum 4-body scattering
by: Mondal, Sourav, et al.
Published: (2024)

Tuning Interfacial Charge Transfer and Exploring Morphological Insight in Biocarbon/MoSe 2 Heterostructures for Enhanced Photodegradation of Organic Pollutants
by: Shivam Tyagi, et al.
Published: (2025)

Decoding Memes: Benchmarking Narrative Role Classification across Multilingual and Multimodal Models
by: Sharma, Shivam, et al.
Published: (2025)

SibylSat: Using SAT as an Oracle to Perform a Greedy Search on TOHTN Planning
by: Quenard, Gaspard, et al.
Published: (2024)

Configurable Preference Tuning with Rubric-Guided Synthetic Data
by: Gallego, Víctor
Published: (2025)

Beyond QA Pairs: Assessing Parameter-Efficient Fine-Tuning for Fact Embedding in LLMs
by: Ratnakar, Shivam, et al.
Published: (2025)

Exponential Lower Bounds for the Pfaffian Number of Graphs
by: Pant, Priyanshu, et al.
Published: (2026)