| Main authors: | Chaudhary, Aryan; Agarwal, Prateek; Alladi, Tejasvi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2604.21120 |
Similar documents
Causal Reflection with Language Models
by: Aryan, Abi, et al.
Published: (2025)
Illuminate: A novel approach for depression detection with explainable analysis and proactive therapy using prompt engineering
by: Agrawal, Aryan
Published: (2024)
Measuring What LLMs Think They Do: SHAP Faithfulness and Deployability on Financial Tabular Classification
by: AlMarri, Saeed, et al.
Published: (2025)
Dissociating Decodability and Causal Use in Bracket-Sequence Transformers
by: Sharma, Aryan, et al.
Published: (2026)
Data Shapley in One Training Run
by: Wang, Jiachen T., et al.
Published: (2024)
Zero-shot Factual Consistency Evaluation Across Domains
by: Agarwal, Raunak
Published: (2024)
PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction
by: Shrimal, Anubhav, et al.
Published: (2025)
Prakriti200: A Questionnaire-Based Dataset of 200 Ayurvedic Prakriti Assessments
by: Singh, Aryan Kumar, et al.
Published: (2025)
One-Pass to Reason: Token Duplication and Block-Sparse Mask for Efficient Fine-Tuning on Multi-Turn Reasoning
by: Goru, Ritesh, et al.
Published: (2025)
Model Provenance Testing for Large Language Models
by: Nikolic, Ivica, et al.
Published: (2025)
Whisper-GPT: A Hybrid Representation Audio Large Language Model
by: Verma, Prateek
Published: (2024)
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
by: Jadon, Aryan, et al.
Published: (2025)
Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?
by: Sajith, Aryan, et al.
Published: (2024)
SafetyNet: Detecting Harmful Outputs in LLMs by Modeling and Monitoring Deceptive Behaviors
by: Chaudhary, Maheep, et al.
Published: (2025)
TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields
by: Arazi, Alan, et al.
Published: (2025)
Succeeding at Scale: Automated Dataset Construction and Query-Side Adaptation for Multi-Tenant Search
by: Jain, Prateek, et al.
Published: (2026)
I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift
by: Sahoo, Subramanyam, et al.
Published: (2026)
In-Context Environments Induce Evaluation-Awareness in Language Models
by: Chaudhary, Maheep
Published: (2026)
DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models
by: Tiwari, Utkarsh, et al.
Published: (2025)
Large Language Models aren't all that you need
by: Holla, Kiran Voderhobli, et al.
Published: (2024)
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
by: Yadav, Prateek, et al.
Published: (2023)
Influence Functions for Efficient Data Selection in Reasoning
by: Humane, Prateek, et al.
Published: (2025)
SHAP Meets Tensor Networks: Provably Tractable Explanations with Parallelism
by: Marzouk, Reda, et al.
Published: (2025)
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
Wavelet GPT: Wavelet Inspired Large Language Models
by: Verma, Prateek
Published: (2024)
The Vulnerability of Language Model Benchmarks: Do They Accurately Reflect True LLM Performance?
by: Banerjee, Sourav, et al.
Published: (2024)
Decoding Speculative Decoding
by: Yan, Minghao, et al.
Published: (2024)
Punctuation and Predicates in Language Models
by: Chauhan, Sonakshi, et al.
Published: (2025)
Predicting ATP binding sites in protein sequences using Deep Learning and Natural Language Processing
by: V, Shreyas, et al.
Published: (2024)
An Evaluation Benchmark for Autoformalization in Lean4
by: Gulati, Aryan, et al.
Published: (2024)
Glider: Global and Local Instruction-Driven Expert Router
by: Li, Pingzhi, et al.
Published: (2024)
Adaptive Large Language Models By Layerwise Attention Shortcuts
by: Verma, Prateek, et al.
Published: (2024)
Towards Signal Processing In Large Language Models
by: Verma, Prateek, et al.
Published: (2024)
Know Thyself? On the Incapability and Implications of AI Self-Recognition
by: Bai, Xiaoyan, et al.
Published: (2025)
Private Fine-tuning of Large Language Models with Zeroth-order Optimization
by: Tang, Xinyu, et al.
Published: (2024)
H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models
by: Dawes, Cutter, et al.
Published: (2026)
MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification
by: Shah, Siddhant Bikram, et al.
Published: (2024)
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
by: Setlur, Amrith, et al.
Published: (2024)
Certifying Knowledge Comprehension in LLMs
by: Chaudhary, Isha, et al.
Published: (2024)
Optimizing Pre-Training Data Mixtures with Mixtures of Data Expert Models
by: Belenki, Lior, et al.
Published: (2025)