Guardado en:
| Autores principales: | Fang, Alex, Voice, Thomas, Pang, Ruoming, Schmidt, Ludwig, Gunter, Tom |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2511.04234 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Large Language Model-guided Document Selection
por: Kong, Xiang, et al.
Publicado: (2024)
por: Kong, Xiang, et al.
Publicado: (2024)
Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality
por: Fang, Alex, et al.
Publicado: (2025)
por: Fang, Alex, et al.
Publicado: (2025)
Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?
por: Findeis, Arduin, et al.
Publicado: (2025)
por: Findeis, Arduin, et al.
Publicado: (2025)
RATTENTION: Towards the Minimal Sliding Window Size in Local-Global Attention Models
por: Wang, Bailin, et al.
Publicado: (2025)
por: Wang, Bailin, et al.
Publicado: (2025)
How to Select Pre-Trained Code Models for Reuse? A Learning Perspective
por: Bi, Zhangqian, et al.
Publicado: (2025)
por: Bi, Zhangqian, et al.
Publicado: (2025)
Chain of Methodologies: Scaling Test Time Computation without Training
por: Liu, Cong, et al.
Publicado: (2025)
por: Liu, Cong, et al.
Publicado: (2025)
Synthetic Pre-Pre-Training Improves Language Model Robustness to Noisy Pre-Training Data
por: Guo, Xu, et al.
Publicado: (2026)
por: Guo, Xu, et al.
Publicado: (2026)
CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute
por: Jin, Chen, et al.
Publicado: (2026)
por: Jin, Chen, et al.
Publicado: (2026)
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
por: Nguyen, Thao, et al.
Publicado: (2025)
por: Nguyen, Thao, et al.
Publicado: (2025)
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
por: Arora, Siddhant, et al.
Publicado: (2025)
por: Arora, Siddhant, et al.
Publicado: (2025)
Inverse Scaling in Test-Time Compute
por: Gema, Aryo Pradipta, et al.
Publicado: (2025)
por: Gema, Aryo Pradipta, et al.
Publicado: (2025)
Evaluating LLM Alignment on Personality Inference from Real-World Interview Data
por: Zhu, Jianfeng, et al.
Publicado: (2025)
por: Zhu, Jianfeng, et al.
Publicado: (2025)
Investigating Large Language Models in Inferring Personality Traits from User Conversations
por: Zhu, Jianfeng, et al.
Publicado: (2025)
por: Zhu, Jianfeng, et al.
Publicado: (2025)
Can LLMs Infer Personality from Real World Conversations?
por: Zhu, Jianfeng, et al.
Publicado: (2025)
por: Zhu, Jianfeng, et al.
Publicado: (2025)
Understanding Risk and Dependency in AI Chatbot Use from User Discourse
por: Zhu, Jianfeng, et al.
Publicado: (2026)
por: Zhu, Jianfeng, et al.
Publicado: (2026)
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
por: Porian, Tomer, et al.
Publicado: (2024)
por: Porian, Tomer, et al.
Publicado: (2024)
FLOP-Efficient Training: Early Stopping Based on Test-Time Compute Awareness
por: Amer, Hossam, et al.
Publicado: (2026)
por: Amer, Hossam, et al.
Publicado: (2026)
On Generalization across Measurement Systems: LLMs Entail More Test-Time Compute for Underrepresented Cultures
por: Bui, Minh Duc, et al.
Publicado: (2025)
por: Bui, Minh Duc, et al.
Publicado: (2025)
Language Models Improve When Pretraining Data Matches Target Tasks
por: Mizrahi, David, et al.
Publicado: (2025)
por: Mizrahi, David, et al.
Publicado: (2025)
Reinforcement Learning on Pre-Training Data
por: Li, Siheng, et al.
Publicado: (2025)
por: Li, Siheng, et al.
Publicado: (2025)
FlashMem: Distilling Intrinsic Latent Memory via Computation Reuse
por: Hou, Yubo, et al.
Publicado: (2026)
por: Hou, Yubo, et al.
Publicado: (2026)
Synthetic bootstrapped pretraining
por: Yang, Zitong, et al.
Publicado: (2025)
por: Yang, Zitong, et al.
Publicado: (2025)
What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
por: Wen, Xin, et al.
Publicado: (2024)
por: Wen, Xin, et al.
Publicado: (2024)
When to Ponder: Adaptive Compute Allocation for Code Generation via Test-Time Training
por: Sim, Gihyeon
Publicado: (2025)
por: Sim, Gihyeon
Publicado: (2025)
Large Scale Transfer Learning for Tabular Data via Language Modeling
por: Gardner, Josh, et al.
Publicado: (2024)
por: Gardner, Josh, et al.
Publicado: (2024)
TestNUC: Enhancing Test-Time Computing Approaches and Scaling through Neighboring Unlabeled Data Consistency
por: Zou, Henry Peng, et al.
Publicado: (2025)
por: Zou, Henry Peng, et al.
Publicado: (2025)
OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
por: Ngo, Huong, et al.
Publicado: (2025)
por: Ngo, Huong, et al.
Publicado: (2025)
Toxicity of the Commons: Curating Open-Source Pre-Training Data
por: Arnett, Catherine, et al.
Publicado: (2024)
por: Arnett, Catherine, et al.
Publicado: (2024)
Unifying Structured Data as Graph for Data-to-Text Pre-Training
por: Li, Shujie, et al.
Publicado: (2024)
por: Li, Shujie, et al.
Publicado: (2024)
When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening
por: Zhu, Jianfeng, et al.
Publicado: (2026)
por: Zhu, Jianfeng, et al.
Publicado: (2026)
What Do AI Agents Talk About? Discourse and Architectural Constraints in the First AI-Only Social Network
por: Dube, Taksch, et al.
Publicado: (2026)
por: Dube, Taksch, et al.
Publicado: (2026)
Reinforcement Pre-Training
por: Dong, Qingxiu, et al.
Publicado: (2025)
por: Dong, Qingxiu, et al.
Publicado: (2025)
Instruction-Following Pruning for Large Language Models
por: Hou, Bairu, et al.
Publicado: (2025)
por: Hou, Bairu, et al.
Publicado: (2025)
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
por: Geiping, Jonas, et al.
Publicado: (2025)
por: Geiping, Jonas, et al.
Publicado: (2025)
In-Place Test-Time Training
por: Feng, Guhao, et al.
Publicado: (2026)
por: Feng, Guhao, et al.
Publicado: (2026)
ForeCite: Adapting Pre-Trained Language Models to Predict Future Citation Rates of Academic Papers
por: Hull, Gavin, et al.
Publicado: (2025)
por: Hull, Gavin, et al.
Publicado: (2025)
MultiGPrompt for Multi-Task Pre-Training and Prompting on Graphs
por: Yu, Xingtong, et al.
Publicado: (2023)
por: Yu, Xingtong, et al.
Publicado: (2023)
Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis
por: Boughorbel, Sabri, et al.
Publicado: (2024)
por: Boughorbel, Sabri, et al.
Publicado: (2024)
Beyond Public Access in LLM Pre-Training Data
por: Rosenblat, Sruly, et al.
Publicado: (2025)
por: Rosenblat, Sruly, et al.
Publicado: (2025)
Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
por: Zhu, Qin, et al.
Publicado: (2024)
por: Zhu, Qin, et al.
Publicado: (2024)
Ejemplares similares
-
Large Language Model-guided Document Selection
por: Kong, Xiang, et al.
Publicado: (2024) -
Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality
por: Fang, Alex, et al.
Publicado: (2025) -
Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?
por: Findeis, Arduin, et al.
Publicado: (2025) -
RATTENTION: Towards the Minimal Sliding Window Size in Local-Global Attention Models
por: Wang, Bailin, et al.
Publicado: (2025) -
How to Select Pre-Trained Code Models for Reuse? A Learning Perspective
por: Bi, Zhangqian, et al.
Publicado: (2025)