:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Fang, Alex, Voice, Thomas, Pang, Ruoming, Schmidt, Ludwig, Gunter, Tom
Formato:	Preprint
Publicado:	2025
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2511.04234
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Large Language Model-guided Document Selection
por: Kong, Xiang, et al.
Publicado: (2024)

Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality
por: Fang, Alex, et al.
Publicado: (2025)

Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?
por: Findeis, Arduin, et al.
Publicado: (2025)

RATTENTION: Towards the Minimal Sliding Window Size in Local-Global Attention Models
por: Wang, Bailin, et al.
Publicado: (2025)

How to Select Pre-Trained Code Models for Reuse? A Learning Perspective
por: Bi, Zhangqian, et al.
Publicado: (2025)

Chain of Methodologies: Scaling Test Time Computation without Training
por: Liu, Cong, et al.
Publicado: (2025)

Synthetic Pre-Pre-Training Improves Language Model Robustness to Noisy Pre-Training Data
por: Guo, Xu, et al.
Publicado: (2026)

CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute
por: Jin, Chen, et al.
Publicado: (2026)

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
por: Nguyen, Thao, et al.
Publicado: (2025)

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
por: Arora, Siddhant, et al.
Publicado: (2025)

Inverse Scaling in Test-Time Compute
por: Gema, Aryo Pradipta, et al.
Publicado: (2025)

Evaluating LLM Alignment on Personality Inference from Real-World Interview Data
por: Zhu, Jianfeng, et al.
Publicado: (2025)

Investigating Large Language Models in Inferring Personality Traits from User Conversations
por: Zhu, Jianfeng, et al.
Publicado: (2025)

Can LLMs Infer Personality from Real World Conversations?
por: Zhu, Jianfeng, et al.
Publicado: (2025)

Understanding Risk and Dependency in AI Chatbot Use from User Discourse
por: Zhu, Jianfeng, et al.
Publicado: (2026)

Resolving Discrepancies in Compute-Optimal Scaling of Language Models
por: Porian, Tomer, et al.
Publicado: (2024)

FLOP-Efficient Training: Early Stopping Based on Test-Time Compute Awareness
por: Amer, Hossam, et al.
Publicado: (2026)

On Generalization across Measurement Systems: LLMs Entail More Test-Time Compute for Underrepresented Cultures
por: Bui, Minh Duc, et al.
Publicado: (2025)

Language Models Improve When Pretraining Data Matches Target Tasks
por: Mizrahi, David, et al.
Publicado: (2025)

Reinforcement Learning on Pre-Training Data
por: Li, Siheng, et al.
Publicado: (2025)

FlashMem: Distilling Intrinsic Latent Memory via Computation Reuse
por: Hou, Yubo, et al.
Publicado: (2026)

Synthetic bootstrapped pretraining
por: Yang, Zitong, et al.
Publicado: (2025)

What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
por: Wen, Xin, et al.
Publicado: (2024)

When to Ponder: Adaptive Compute Allocation for Code Generation via Test-Time Training
por: Sim, Gihyeon
Publicado: (2025)

Large Scale Transfer Learning for Tabular Data via Language Modeling
por: Gardner, Josh, et al.
Publicado: (2024)

TestNUC: Enhancing Test-Time Computing Approaches and Scaling through Neighboring Unlabeled Data Consistency
por: Zou, Henry Peng, et al.
Publicado: (2025)

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
por: Ngo, Huong, et al.
Publicado: (2025)

Toxicity of the Commons: Curating Open-Source Pre-Training Data
por: Arnett, Catherine, et al.
Publicado: (2024)

Unifying Structured Data as Graph for Data-to-Text Pre-Training
por: Li, Shujie, et al.
Publicado: (2024)

When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening
por: Zhu, Jianfeng, et al.
Publicado: (2026)

What Do AI Agents Talk About? Discourse and Architectural Constraints in the First AI-Only Social Network
por: Dube, Taksch, et al.
Publicado: (2026)

Reinforcement Pre-Training
por: Dong, Qingxiu, et al.
Publicado: (2025)

Instruction-Following Pruning for Large Language Models
por: Hou, Bairu, et al.
Publicado: (2025)

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
por: Geiping, Jonas, et al.
Publicado: (2025)

In-Place Test-Time Training
por: Feng, Guhao, et al.
Publicado: (2026)

ForeCite: Adapting Pre-Trained Language Models to Predict Future Citation Rates of Academic Papers
por: Hull, Gavin, et al.
Publicado: (2025)

MultiGPrompt for Multi-Task Pre-Training and Prompting on Graphs
por: Yu, Xingtong, et al.
Publicado: (2023)

Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis
por: Boughorbel, Sabri, et al.
Publicado: (2024)

Beyond Public Access in LLM Pre-Training Data
por: Rosenblat, Sruly, et al.
Publicado: (2025)

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
por: Zhu, Qin, et al.
Publicado: (2024)