| Main Authors: | Xu, Yifei, Potje, Guilherme, Shandilya, Shivam, Yuan, Tiancheng, Nunes, Leonardo de Oliveira, Agarwal, Rakshanda, Asgari, Saeid, Atkinson, Adam, Kıcıman, Emre, Lu, Songwu, Chandra, Ranveer, Chakraborty, Tusher |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.20751 |
Similar Items
Diagnosing Capability Gaps in Fine-Tuning Data
by: Taghanaki, Saeid Asgari, et al.
Published: (2026)
Direct Reasoning Optimization: Token-Level Reasoning Reflectivity Meets Rubric Gates for Unverifiable Tasks
by: Xu, Yifei, et al.
Published: (2025)
RLTHF: Targeted Human Feedback for LLM Alignment
by: Xu, Yifei, et al.
Published: (2025)
Orchestration for Domain-specific Edge-Cloud Language Models
by: Patidar, Prasoon, et al.
Published: (2025)
Domain Adaptation for Sustainable Soil Management using Causal and Contrastive Constraint Minimization
by: Sharma, Somya, et al.
Published: (2024)
Enabling Adoption of Regenerative Agriculture through Soil Carbon Copilots
by: Capetz, Margaret, et al.
Published: (2024)
Sibyll$^{\bigstar}$
by: Riehn, Felix, et al.
Published: (2024)
Head of a Sibyl
by: Scan-the-World
Published: (2026)
Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization
by: Kaur, Jivat Neet, et al.
Published: (2022)
Causal Reasoning and Large Language Models: Opening a New Frontier for Causality
by: Kıcıman, Emre, et al.
Published: (2023)
Inverse design of heterodeformations for strain soliton networks in bilayer 2D materials
by: Ahmed, Md Tusher, et al.
Published: (2026)
DeepSpecs: Expert-Level Questions Answering in 5G
by: Manvattira, Aman Ganapathy, et al.
Published: (2025)
Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy
by: Taghanaki, Saeid Asgari, et al.
Published: (2025)
Detecting Generative Parroting through Overfitting Masked Autoencoders
by: Taghanaki, Saeid Asgari, et al.
Published: (2024)
Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations
by: Matton, Katie, et al.
Published: (2025)
Narcotweets: Social Media in Wartime
by: Monroy-Hernández, Andrés, et al.
Published: (2015)
XFeat: Accelerated Features for Lightweight Image Matching
by: Potje, Guilherme, et al.
Published: (2024)
Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence
by: Cadar, Felipe, et al.
Published: (2024)
Sibyl: Forecasting Time-Evolving Query Workloads
by: Huang, Hanxian, et al.
Published: (2024)
Enterprise AI Must Enforce Participant-Aware Access Control
by: Bhatt, Shashank Shreedhar, et al.
Published: (2025)
MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing
by: Agarwal, Siddhant, et al.
Published: (2024)
MMLU-Pro+: Evaluating Higher-Order Reasoning and Shortcut Learning in LLMs
by: Taghanaki, Saeid Asgari, et al.
Published: (2024)
How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model?
by: Taghanaki, Saeid Asgari, et al.
Published: (2024)
Exploring the Efficiency of Renewable Energy-based Modular Data Centers at Scale
by: Sun, Jinghan, et al.
Published: (2024)
Wireless Spectrum in Rural Farmlands: Status, Challenges and Opportunities
by: Shahid, Mukaram, et al.
Published: (2024)
Defending Against Indirect Prompt Injection Attacks With Spotlighting
by: Hines, Keegan, et al.
Published: (2024)
Multiscale analysis of large twist ferroelectricity and swirling dislocations in bilayer hexagonal boron nitride
by: Ahmed, Md Tusher, et al.
Published: (2025)
Probing Protease‐Mediated Decrease of Albumin‐Catalyzed Kemp Elimination: Potential Application as Biosensors
by: Shikha Shikha, et al.
Published: (2025)
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
by: Wang, Yulong, et al.
Published: (2024)
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
by: Yi, Jingwei, et al.
Published: (2023)
Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning
by: Mecklenburg, Nick, et al.
Published: (2024)
Quantifying superlubricity of bilayer graphene from the mobility of interface dislocations
by: Ahmed, Md Tusher, et al.
Published: (2025)
Algebraic Structures In Closed Superstring Field Theory, Homotopy Transfer And Effective Actions
by: Singh, Ranveer Kumar
Published: (2024)
Scale-(in)dependence in quantum 4-body scattering
by: Mondal, Sourav, et al.
Published: (2024)
Tuning Interfacial Charge Transfer and Exploring Morphological Insight in Biocarbon/MoSe₂ Heterostructures for Enhanced Photodegradation of Organic Pollutants
by: Shivam Tyagi, et al.
Published: (2025)
Decoding Memes: Benchmarking Narrative Role Classification across Multilingual and Multimodal Models
by: Sharma, Shivam, et al.
Published: (2025)
SibylSat: Using SAT as an Oracle to Perform a Greedy Search on TOHTN Planning
by: Quenard, Gaspard, et al.
Published: (2024)
Configurable Preference Tuning with Rubric-Guided Synthetic Data
by: Gallego, Víctor
Published: (2025)
Beyond QA Pairs: Assessing Parameter-Efficient Fine-Tuning for Fact Embedding in LLMs
by: Ratnakar, Shivam, et al.
Published: (2025)
Exponential Lower Bounds for the Pfaffian Number of Graphs
by: Pant, Priyanshu, et al.
Published: (2026)