:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Kotte, Varun
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2601.06151
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing
by: Kotte, Varun
Published: (2026)

PASC: Pipeline-Aware Conformal Prediction with Joint Coverage Guarantees for Multi-Stage NLP and LLM Pipelines
by: Kotte, Varun
Published: (2026)

Not All Queries Need Rewriting: When Prompt-Only LLM Refinement Helps and Hurts Dense Retrieval
by: Kotte, Varun
Published: (2026)

Towards Reliable Latent Knowledge Estimation in LLMs: Zero-Prompt Many-Shot Based Factual Knowledge Extraction
by: Wu, Qinyuan, et al.
Published: (2024)

Prompting Large Language Models for Clinical Temporal Relation Extraction
by: He, Jianping, et al.
Published: (2024)

xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction
by: Chang, Chi-Chih, et al.
Published: (2025)

PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation
by: Roll, Nathan
Published: (2025)

PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction
by: Shrimal, Anubhav, et al.
Published: (2025)

Retrieval Augmented Generation for Domain-specific Question Answering
by: Sharma, Sanat, et al.
Published: (2024)

Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models
by: Siddiqui, S M Tahmid, et al.
Published: (2026)

Provable Knowledge Acquisition and Extraction in One-Layer Transformers
by: Xu, Ruichen, et al.
Published: (2025)

Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective
by: Le, Minh, et al.
Published: (2024)

Better Prompt Compression Without Multi-Layer Perceptrons
by: Honig, Edouardo, et al.
Published: (2025)

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
by: Wang, Jingcun, et al.
Published: (2024)

Reward-based Input Construction for Cross-document Relation Extraction
by: Na, Byeonghu, et al.
Published: (2024)

Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision
by: Su, Jinyan, et al.
Published: (2024)

Adaptive Prompt Structure Factorization: A Framework for Self-Discovering and Optimizing Compositional Prompt Programs
by: Liu, Haoyue, et al.
Published: (2026)

Which Client is Reliable?: A Reliable and Personalized Prompt-based Federated Learning for Medical Image Question Answering
by: Zhu, He, et al.
Published: (2024)

An Aspect Extraction Framework using Different Embedding Types, Learning Models, and Dependency Structure
by: Erkan, Ali, et al.
Published: (2025)

Structured Prompts Improve Evaluation of Language Models
by: Aali, Asad, et al.
Published: (2025)

Advancing Multi-Step Mathematical Reasoning in Large Language Models through Multi-Layered Self-Reflection with Auto-Prompting
by: Loureiro, André de Souza, et al.
Published: (2025)

CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs
by: Draye, Florent, et al.
Published: (2026)

Pause Tokens Strictly Increase the Expressivity of Constant-Depth Transformers
by: London, Charles, et al.
Published: (2025)

Cross-Layer Discrete Concept Discovery for Interpreting Language Models
by: Garg, Ankur, et al.
Published: (2025)

CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill
by: McDanel, Bradley, et al.
Published: (2026)

Dynamic Prompt Fusion for Multi-Task and Cross-Domain Adaptation in LLMs
by: Hu, Xin, et al.
Published: (2025)

Real-Time Trustworthiness Scoring for LLM Structured Outputs and Data Extraction
by: Goh, Hui Wen, et al.
Published: (2026)

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
by: Brandon, William, et al.
Published: (2024)

On the Geometric Structure of Layer Updates in Deep Language Models
by: Yoo, Jun-Sik
Published: (2026)

How Reliable is Language Model Micro-Benchmarking?
by: Yauney, Gregory, et al.
Published: (2025)

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
by: Bai, Yushi, et al.
Published: (2026)

CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning
by: Feng, Yu, et al.
Published: (2024)

SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection
by: Joshi, Maithili, et al.
Published: (2025)

Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
by: Bandarkar, Lucas, et al.
Published: (2024)

BenchAgents: Multi-Agent Systems for Structured Benchmark Creation
by: Butt, Natasha, et al.
Published: (2024)

Attention Speaks Volumes: Localizing and Mitigating Bias in Language Models
by: Adiga, Rishabh, et al.
Published: (2024)

Designing Informative Metrics for Few-Shot Example Selection
by: Adiga, Rishabh, et al.
Published: (2024)

Improving Structural Diversity of Blackbox LLMs via Chain-of-Specification Prompting
by: Young, Halley, et al.
Published: (2024)

A Single-Layer Model Can Do Language Modeling
by: Wang, Zanmin
Published: (2026)

Do Large Language Model Benchmarks Test Reliability?
by: Vendrow, Joshua, et al.
Published: (2025)