:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Pachot, Arnault, Petit, Thierry
Formato:	Preprint
Publicado:	2026
Materias:	Machine Learning Artificial Intelligence Computation and Language
Acceso en línea:	https://arxiv.org/abs/2604.19757
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Llms, Virtual Users, and Bias: Predicting Any Survey Question Without Human Data
por: Sinacola, Enzo, et al.
Publicado: (2025)

Exact Synthetic Populations for Scalable Societal and Market Modeling
por: Petit, Thierry, et al.
Publicado: (2025)

Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics
por: Petit, Thierry, et al.
Publicado: (2024)

Diagnosing Training Inference Mismatch in LLM Reinforcement Learning
por: Zhong, Tianle, et al.
Publicado: (2026)

PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training
por: Bobbili, Sarat Chandra, et al.
Publicado: (2025)

On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?
por: Geng, Mingmeng, et al.
Publicado: (2025)

R$^2$PO: Decoupling Training Trajectories from Inference Responses for LLM Reasoning
por: Wang, Jingchu, et al.
Publicado: (2026)

Towards Sustainable Artificial Intelligence: An Overview of Environmental Protection Uses and Issues
por: Pachot, Arnault, et al.
Publicado: (2022)

The Impact of Inference Acceleration on Bias of LLMs
por: Kirsten, Elisabeth, et al.
Publicado: (2024)

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
por: Pan, Bowen, et al.
Publicado: (2024)

Muon is Scalable for LLM Training
por: Liu, Jingyuan, et al.
Publicado: (2025)

Communication Compression for Tensor Parallel LLM Inference
por: Hansen-Palmus, Jan, et al.
Publicado: (2024)

Defeating the Training-Inference Mismatch via FP16
por: Qi, Penghui, et al.
Publicado: (2025)

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
por: Fu, Qichen, et al.
Publicado: (2024)

Training Proactive and Personalized LLM Agents
por: Sun, Weiwei, et al.
Publicado: (2025)

Concurrent Criterion Validation of a Validity Screen for LLM Confidence Signals via Selective Prediction
por: Cacioli, Jon-Paul
Publicado: (2026)

Star Attention: Efficient LLM Inference over Long Sequences
por: Acharya, Shantanu, et al.
Publicado: (2024)

Speculative Streaming: Fast LLM Inference without Auxiliary Models
por: Bhendawade, Nikhil, et al.
Publicado: (2024)

Vidur: A Large-Scale Simulation Framework For LLM Inference
por: Agrawal, Amey, et al.
Publicado: (2024)

KV Cache Transform Coding for Compact Storage in LLM Inference
por: Staniszewski, Konrad, et al.
Publicado: (2025)

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
por: Ma, Wenhan, et al.
Publicado: (2025)

Screening Is Enough
por: Nakanishi, Ken M.
Publicado: (2026)

Reparameterized LLM Training via Orthogonal Equivalence Transformation
por: Qiu, Zeju, et al.
Publicado: (2025)

Memory-Efficient LLM Training with Online Subspace Descent
por: Liang, Kaizhao, et al.
Publicado: (2024)

SABER: Switchable and Balanced Training for Efficient LLM Reasoning
por: Zhao, Kai, et al.
Publicado: (2025)

Scaling with Collapse: Efficient and Predictable Training of LLM Families
por: Bergsma, Shane, et al.
Publicado: (2025)

Training-free LLM Merging for Multi-task Learning
por: Fu, Zichuan, et al.
Publicado: (2025)

TTKV: Temporal-Tiered KV Cache for Long-Context LLM Inference
por: Dzikanyanga, Gradwell, et al.
Publicado: (2026)

Adaptive Layer Selection for Layer-Wise Token Pruning in LLM Inference
por: Taniguchi, Rei, et al.
Publicado: (2026)

Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes
por: Alipour, Mohammadsajad, et al.
Publicado: (2025)

PQCache: Product Quantization-based KVCache for Long Context LLM Inference
por: Zhang, Hailin, et al.
Publicado: (2024)

ComplexityNet: Increasing LLM Inference Efficiency by Learning Task Complexity
por: Bae, Henry, et al.
Publicado: (2023)

Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration
por: Wen, Zhuofan, et al.
Publicado: (2024)

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies
por: Timor, Nadav, et al.
Publicado: (2025)

FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference
por: Liu, Guangda, et al.
Publicado: (2025)

Green Prompting: Characterizing Prompt-driven Energy Costs of LLM Inference
por: Adamska, Marta, et al.
Publicado: (2025)

The Impact of Language Mixing on Bilingual LLM Reasoning
por: Li, Yihao, et al.
Publicado: (2025)

OpenELM: An Efficient Language Model Family with Open Training and Inference Framework
por: Mehta, Sachin, et al.
Publicado: (2024)

Resa: Transparent Reasoning Models via SAEs
por: Wang, Shangshang, et al.
Publicado: (2025)

On Designing Effective RL Reward at Training Time for LLM Reasoning
por: Gao, Jiaxuan, et al.
Publicado: (2024)