Guardado en:
| Autores principales: | Pachot, Arnault, Petit, Thierry |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2604.19757 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Llms, Virtual Users, and Bias: Predicting Any Survey Question Without Human Data
por: Sinacola, Enzo, et al.
Publicado: (2025)
por: Sinacola, Enzo, et al.
Publicado: (2025)
Exact Synthetic Populations for Scalable Societal and Market Modeling
por: Petit, Thierry, et al.
Publicado: (2025)
por: Petit, Thierry, et al.
Publicado: (2025)
Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics
por: Petit, Thierry, et al.
Publicado: (2024)
por: Petit, Thierry, et al.
Publicado: (2024)
Diagnosing Training Inference Mismatch in LLM Reinforcement Learning
por: Zhong, Tianle, et al.
Publicado: (2026)
por: Zhong, Tianle, et al.
Publicado: (2026)
PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training
por: Bobbili, Sarat Chandra, et al.
Publicado: (2025)
por: Bobbili, Sarat Chandra, et al.
Publicado: (2025)
On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?
por: Geng, Mingmeng, et al.
Publicado: (2025)
por: Geng, Mingmeng, et al.
Publicado: (2025)
R$^2$PO: Decoupling Training Trajectories from Inference Responses for LLM Reasoning
por: Wang, Jingchu, et al.
Publicado: (2026)
por: Wang, Jingchu, et al.
Publicado: (2026)
Towards Sustainable Artificial Intelligence: An Overview of Environmental Protection Uses and Issues
por: Pachot, Arnault, et al.
Publicado: (2022)
por: Pachot, Arnault, et al.
Publicado: (2022)
The Impact of Inference Acceleration on Bias of LLMs
por: Kirsten, Elisabeth, et al.
Publicado: (2024)
por: Kirsten, Elisabeth, et al.
Publicado: (2024)
Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
por: Pan, Bowen, et al.
Publicado: (2024)
por: Pan, Bowen, et al.
Publicado: (2024)
Muon is Scalable for LLM Training
por: Liu, Jingyuan, et al.
Publicado: (2025)
por: Liu, Jingyuan, et al.
Publicado: (2025)
Communication Compression for Tensor Parallel LLM Inference
por: Hansen-Palmus, Jan, et al.
Publicado: (2024)
por: Hansen-Palmus, Jan, et al.
Publicado: (2024)
Defeating the Training-Inference Mismatch via FP16
por: Qi, Penghui, et al.
Publicado: (2025)
por: Qi, Penghui, et al.
Publicado: (2025)
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
por: Fu, Qichen, et al.
Publicado: (2024)
por: Fu, Qichen, et al.
Publicado: (2024)
Training Proactive and Personalized LLM Agents
por: Sun, Weiwei, et al.
Publicado: (2025)
por: Sun, Weiwei, et al.
Publicado: (2025)
Concurrent Criterion Validation of a Validity Screen for LLM Confidence Signals via Selective Prediction
por: Cacioli, Jon-Paul
Publicado: (2026)
por: Cacioli, Jon-Paul
Publicado: (2026)
Star Attention: Efficient LLM Inference over Long Sequences
por: Acharya, Shantanu, et al.
Publicado: (2024)
por: Acharya, Shantanu, et al.
Publicado: (2024)
Speculative Streaming: Fast LLM Inference without Auxiliary Models
por: Bhendawade, Nikhil, et al.
Publicado: (2024)
por: Bhendawade, Nikhil, et al.
Publicado: (2024)
Vidur: A Large-Scale Simulation Framework For LLM Inference
por: Agrawal, Amey, et al.
Publicado: (2024)
por: Agrawal, Amey, et al.
Publicado: (2024)
KV Cache Transform Coding for Compact Storage in LLM Inference
por: Staniszewski, Konrad, et al.
Publicado: (2025)
por: Staniszewski, Konrad, et al.
Publicado: (2025)
Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
por: Ma, Wenhan, et al.
Publicado: (2025)
por: Ma, Wenhan, et al.
Publicado: (2025)
Screening Is Enough
por: Nakanishi, Ken M.
Publicado: (2026)
por: Nakanishi, Ken M.
Publicado: (2026)
Reparameterized LLM Training via Orthogonal Equivalence Transformation
por: Qiu, Zeju, et al.
Publicado: (2025)
por: Qiu, Zeju, et al.
Publicado: (2025)
Memory-Efficient LLM Training with Online Subspace Descent
por: Liang, Kaizhao, et al.
Publicado: (2024)
por: Liang, Kaizhao, et al.
Publicado: (2024)
SABER: Switchable and Balanced Training for Efficient LLM Reasoning
por: Zhao, Kai, et al.
Publicado: (2025)
por: Zhao, Kai, et al.
Publicado: (2025)
Scaling with Collapse: Efficient and Predictable Training of LLM Families
por: Bergsma, Shane, et al.
Publicado: (2025)
por: Bergsma, Shane, et al.
Publicado: (2025)
Training-free LLM Merging for Multi-task Learning
por: Fu, Zichuan, et al.
Publicado: (2025)
por: Fu, Zichuan, et al.
Publicado: (2025)
TTKV: Temporal-Tiered KV Cache for Long-Context LLM Inference
por: Dzikanyanga, Gradwell, et al.
Publicado: (2026)
por: Dzikanyanga, Gradwell, et al.
Publicado: (2026)
Adaptive Layer Selection for Layer-Wise Token Pruning in LLM Inference
por: Taniguchi, Rei, et al.
Publicado: (2026)
por: Taniguchi, Rei, et al.
Publicado: (2026)
Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes
por: Alipour, Mohammadsajad, et al.
Publicado: (2025)
por: Alipour, Mohammadsajad, et al.
Publicado: (2025)
PQCache: Product Quantization-based KVCache for Long Context LLM Inference
por: Zhang, Hailin, et al.
Publicado: (2024)
por: Zhang, Hailin, et al.
Publicado: (2024)
ComplexityNet: Increasing LLM Inference Efficiency by Learning Task Complexity
por: Bae, Henry, et al.
Publicado: (2023)
por: Bae, Henry, et al.
Publicado: (2023)
Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration
por: Wen, Zhuofan, et al.
Publicado: (2024)
por: Wen, Zhuofan, et al.
Publicado: (2024)
Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies
por: Timor, Nadav, et al.
Publicado: (2025)
por: Timor, Nadav, et al.
Publicado: (2025)
FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference
por: Liu, Guangda, et al.
Publicado: (2025)
por: Liu, Guangda, et al.
Publicado: (2025)
Green Prompting: Characterizing Prompt-driven Energy Costs of LLM Inference
por: Adamska, Marta, et al.
Publicado: (2025)
por: Adamska, Marta, et al.
Publicado: (2025)
The Impact of Language Mixing on Bilingual LLM Reasoning
por: Li, Yihao, et al.
Publicado: (2025)
por: Li, Yihao, et al.
Publicado: (2025)
OpenELM: An Efficient Language Model Family with Open Training and Inference Framework
por: Mehta, Sachin, et al.
Publicado: (2024)
por: Mehta, Sachin, et al.
Publicado: (2024)
Resa: Transparent Reasoning Models via SAEs
por: Wang, Shangshang, et al.
Publicado: (2025)
por: Wang, Shangshang, et al.
Publicado: (2025)
On Designing Effective RL Reward at Training Time for LLM Reasoning
por: Gao, Jiaxuan, et al.
Publicado: (2024)
por: Gao, Jiaxuan, et al.
Publicado: (2024)
Ejemplares similares
-
Llms, Virtual Users, and Bias: Predicting Any Survey Question Without Human Data
por: Sinacola, Enzo, et al.
Publicado: (2025) -
Exact Synthetic Populations for Scalable Societal and Market Modeling
por: Petit, Thierry, et al.
Publicado: (2025) -
Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics
por: Petit, Thierry, et al.
Publicado: (2024) -
Diagnosing Training Inference Mismatch in LLM Reinforcement Learning
por: Zhong, Tianle, et al.
Publicado: (2026) -
PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training
por: Bobbili, Sarat Chandra, et al.
Publicado: (2025)