Saved in:
| Main Author: | Kotte, Varun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.06151 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing
by: Kotte, Varun
Published: (2026)
by: Kotte, Varun
Published: (2026)
PASC: Pipeline-Aware Conformal Prediction with Joint Coverage Guarantees for Multi-Stage NLP and LLM Pipelines
by: Kotte, Varun
Published: (2026)
by: Kotte, Varun
Published: (2026)
Not All Queries Need Rewriting: When Prompt-Only LLM Refinement Helps and Hurts Dense Retrieval
by: Kotte, Varun
Published: (2026)
by: Kotte, Varun
Published: (2026)
Towards Reliable Latent Knowledge Estimation in LLMs: Zero-Prompt Many-Shot Based Factual Knowledge Extraction
by: Wu, Qinyuan, et al.
Published: (2024)
by: Wu, Qinyuan, et al.
Published: (2024)
Prompting Large Language Models for Clinical Temporal Relation Extraction
by: He, Jianping, et al.
Published: (2024)
by: He, Jianping, et al.
Published: (2024)
xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction
by: Chang, Chi-Chih, et al.
Published: (2025)
by: Chang, Chi-Chih, et al.
Published: (2025)
PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation
by: Roll, Nathan
Published: (2025)
by: Roll, Nathan
Published: (2025)
PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction
by: Shrimal, Anubhav, et al.
Published: (2025)
by: Shrimal, Anubhav, et al.
Published: (2025)
Retrieval Augmented Generation for Domain-specific Question Answering
by: Sharma, Sanat, et al.
Published: (2024)
by: Sharma, Sanat, et al.
Published: (2024)
Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models
by: Siddiqui, S M Tahmid, et al.
Published: (2026)
by: Siddiqui, S M Tahmid, et al.
Published: (2026)
Provable Knowledge Acquisition and Extraction in One-Layer Transformers
by: Xu, Ruichen, et al.
Published: (2025)
by: Xu, Ruichen, et al.
Published: (2025)
Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective
by: Le, Minh, et al.
Published: (2024)
by: Le, Minh, et al.
Published: (2024)
Better Prompt Compression Without Multi-Layer Perceptrons
by: Honig, Edouardo, et al.
Published: (2025)
by: Honig, Edouardo, et al.
Published: (2025)
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
by: Wang, Jingcun, et al.
Published: (2024)
by: Wang, Jingcun, et al.
Published: (2024)
Reward-based Input Construction for Cross-document Relation Extraction
by: Na, Byeonghu, et al.
Published: (2024)
by: Na, Byeonghu, et al.
Published: (2024)
Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision
by: Su, Jinyan, et al.
Published: (2024)
by: Su, Jinyan, et al.
Published: (2024)
Adaptive Prompt Structure Factorization: A Framework for Self-Discovering and Optimizing Compositional Prompt Programs
by: Liu, Haoyue, et al.
Published: (2026)
by: Liu, Haoyue, et al.
Published: (2026)
Which Client is Reliable?: A Reliable and Personalized Prompt-based Federated Learning for Medical Image Question Answering
by: Zhu, He, et al.
Published: (2024)
by: Zhu, He, et al.
Published: (2024)
An Aspect Extraction Framework using Different Embedding Types, Learning Models, and Dependency Structure
by: Erkan, Ali, et al.
Published: (2025)
by: Erkan, Ali, et al.
Published: (2025)
Structured Prompts Improve Evaluation of Language Models
by: Aali, Asad, et al.
Published: (2025)
by: Aali, Asad, et al.
Published: (2025)
Advancing Multi-Step Mathematical Reasoning in Large Language Models through Multi-Layered Self-Reflection with Auto-Prompting
by: Loureiro, André de Souza, et al.
Published: (2025)
by: Loureiro, André de Souza, et al.
Published: (2025)
CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs
by: Draye, Florent, et al.
Published: (2026)
by: Draye, Florent, et al.
Published: (2026)
Pause Tokens Strictly Increase the Expressivity of Constant-Depth Transformers
by: London, Charles, et al.
Published: (2025)
by: London, Charles, et al.
Published: (2025)
Cross-Layer Discrete Concept Discovery for Interpreting Language Models
by: Garg, Ankur, et al.
Published: (2025)
by: Garg, Ankur, et al.
Published: (2025)
CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill
by: McDanel, Bradley, et al.
Published: (2026)
by: McDanel, Bradley, et al.
Published: (2026)
Dynamic Prompt Fusion for Multi-Task and Cross-Domain Adaptation in LLMs
by: Hu, Xin, et al.
Published: (2025)
by: Hu, Xin, et al.
Published: (2025)
Real-Time Trustworthiness Scoring for LLM Structured Outputs and Data Extraction
by: Goh, Hui Wen, et al.
Published: (2026)
by: Goh, Hui Wen, et al.
Published: (2026)
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
by: Brandon, William, et al.
Published: (2024)
by: Brandon, William, et al.
Published: (2024)
On the Geometric Structure of Layer Updates in Deep Language Models
by: Yoo, Jun-Sik
Published: (2026)
by: Yoo, Jun-Sik
Published: (2026)
How Reliable is Language Model Micro-Benchmarking?
by: Yauney, Gregory, et al.
Published: (2025)
by: Yauney, Gregory, et al.
Published: (2025)
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
by: Bai, Yushi, et al.
Published: (2026)
by: Bai, Yushi, et al.
Published: (2026)
CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning
by: Feng, Yu, et al.
Published: (2024)
by: Feng, Yu, et al.
Published: (2024)
SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection
by: Joshi, Maithili, et al.
Published: (2025)
by: Joshi, Maithili, et al.
Published: (2025)
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
by: Bandarkar, Lucas, et al.
Published: (2024)
by: Bandarkar, Lucas, et al.
Published: (2024)
BenchAgents: Multi-Agent Systems for Structured Benchmark Creation
by: Butt, Natasha, et al.
Published: (2024)
by: Butt, Natasha, et al.
Published: (2024)
Attention Speaks Volumes: Localizing and Mitigating Bias in Language Models
by: Adiga, Rishabh, et al.
Published: (2024)
by: Adiga, Rishabh, et al.
Published: (2024)
Designing Informative Metrics for Few-Shot Example Selection
by: Adiga, Rishabh, et al.
Published: (2024)
by: Adiga, Rishabh, et al.
Published: (2024)
Improving Structural Diversity of Blackbox LLMs via Chain-of-Specification Prompting
by: Young, Halley, et al.
Published: (2024)
by: Young, Halley, et al.
Published: (2024)
A Single-Layer Model Can Do Language Modeling
by: Wang, Zanmin
Published: (2026)
by: Wang, Zanmin
Published: (2026)
Do Large Language Model Benchmarks Test Reliability?
by: Vendrow, Joshua, et al.
Published: (2025)
by: Vendrow, Joshua, et al.
Published: (2025)
Similar Items
-
UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing
by: Kotte, Varun
Published: (2026) -
PASC: Pipeline-Aware Conformal Prediction with Joint Coverage Guarantees for Multi-Stage NLP and LLM Pipelines
by: Kotte, Varun
Published: (2026) -
Not All Queries Need Rewriting: When Prompt-Only LLM Refinement Helps and Hurts Dense Retrieval
by: Kotte, Varun
Published: (2026) -
Towards Reliable Latent Knowledge Estimation in LLMs: Zero-Prompt Many-Shot Based Factual Knowledge Extraction
by: Wu, Qinyuan, et al.
Published: (2024) -
Prompting Large Language Models for Clinical Temporal Relation Extraction
by: He, Jianping, et al.
Published: (2024)