Guardado en:
| Autores principales: | Zhou, Runlong, Zhang, Lefan, Wu, Shang-Chen, Zou, Kelvin, Zhou, Hanzhi, Ye, Ke, Feng, Yihao, Yin, Dong, Garcia, Alex Guillen, Babych, Dmytro, Chatterjee, Rohit, Hopkins, Matthew, Kong, Xiang, Lan, Chang, Li, Lezhi, Ma, Yiping, Molinari, Daniele, Tong, Senyu, Sun, Yanchao, Voice, Thomas, Wang, Jianyu, Wang, Chong, Wang, Simon, Weers, Floris, Xu, Yechen, Yin, Guolin, Yu, Muyang, Zhang, Yi, Zhou, Zheng, Zhuo, Danyang, Pang, Ruoming, Leong, Cheng |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.06392 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?
por: Findeis, Arduin, et al.
Publicado: (2025)
por: Findeis, Arduin, et al.
Publicado: (2025)
AXLearn: Modular, Hardware-Agnostic Large Model Training
por: Lee, Mark, et al.
Publicado: (2025)
por: Lee, Mark, et al.
Publicado: (2025)
Reusing Pre-Training Data at Test Time is a Compute Multiplier
por: Fang, Alex, et al.
Publicado: (2025)
por: Fang, Alex, et al.
Publicado: (2025)
Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization
por: Wang, Chong, et al.
Publicado: (2026)
por: Wang, Chong, et al.
Publicado: (2026)
Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs
por: Xu, Yechen, et al.
Publicado: (2026)
por: Xu, Yechen, et al.
Publicado: (2026)
Hilbert: Recursively Building Formal Proofs with Informal Reasoning
por: Varambally, Sumanth, et al.
Publicado: (2025)
por: Varambally, Sumanth, et al.
Publicado: (2025)
Multiscale Supervised Unbalanced Optimal Transport Flow Matching
por: Peng, Qiangwei, et al.
Publicado: (2026)
por: Peng, Qiangwei, et al.
Publicado: (2026)
Instruction-Following Pruning for Large Language Models
por: Hou, Bairu, et al.
Publicado: (2025)
por: Hou, Bairu, et al.
Publicado: (2025)
Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution
por: Xu, Yechen, et al.
Publicado: (2024)
por: Xu, Yechen, et al.
Publicado: (2024)
CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models
por: Zhou, Runlong, et al.
Publicado: (2025)
por: Zhou, Runlong, et al.
Publicado: (2025)
Semantic Representation Attack against Aligned Large Language Models
por: Lian, Jiawei, et al.
Publicado: (2025)
por: Lian, Jiawei, et al.
Publicado: (2025)
Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models
por: Lian, Jiawei, et al.
Publicado: (2025)
por: Lian, Jiawei, et al.
Publicado: (2025)
Efficient Top-k s-Biplexes Search over Large Bipartite Graphs
por: Xu, Zhenxiang, et al.
Publicado: (2024)
por: Xu, Zhenxiang, et al.
Publicado: (2024)
Brain Transcriptome Analysis Identifies Sex‐Associated Synaptic and Immune‐Associated Genes Involved in Alzheimer's Disease and Psychiatric Disorders
por: Muyang Zhang, et al.
Publicado: (2025)
por: Muyang Zhang, et al.
Publicado: (2025)
Kunlun Anomaly Troubleshooter: Enabling Kernel-Level Anomaly Detection and Causal Reasoning for Large Model Distributed Inference
por: Liu, Yuyang, et al.
Publicado: (2025)
por: Liu, Yuyang, et al.
Publicado: (2025)
How to Set the Batch Size for Large-Scale Pre-training?
por: Zhou, Yunhua, et al.
Publicado: (2026)
por: Zhou, Yunhua, et al.
Publicado: (2026)
Distillation Scaling Laws
por: Busbridge, Dan, et al.
Publicado: (2025)
por: Busbridge, Dan, et al.
Publicado: (2025)
Mini-Giants: "Small" Language Models and Open Source Win-Win
por: Zhou, Zhengping, et al.
Publicado: (2023)
por: Zhou, Zhengping, et al.
Publicado: (2023)
Analyzing Collision Rates in Large-Scale Mixed Traffic Control via Multi-Agent Reinforcement Learning
por: Fan, Muyang
Publicado: (2025)
por: Fan, Muyang
Publicado: (2025)
Inference for Spiked Eigenstructure under Generalized Covariance and Correlation Models
por: Yin, Yanqing, et al.
Publicado: (2024)
por: Yin, Yanqing, et al.
Publicado: (2024)
The Geometry of Spectral Fluctuations: On Near-Optimal Conditions for Universal Gaussian CLTs, with Statistical Applications
por: Yin, Yanqing, et al.
Publicado: (2026)
por: Yin, Yanqing, et al.
Publicado: (2026)
Toward a Socio‐Cognitive Framework for Validating L2 Listening Classroom Assessment: Comparing Teachers’ and Learners’ Perceptions on English Textbook Listening Tasks
por: Tong Sun, et al.
Publicado: (2025)
por: Tong Sun, et al.
Publicado: (2025)
Revisiting Local PageRank Estimation on Undirected Graphs: Simple and Optimal
por: Wang, Hanzhi
Publicado: (2024)
por: Wang, Hanzhi
Publicado: (2024)
Automatisierte Inhaltserschließung an der Bibliothek des Max-Planck-Instituts für Mathematik in den Naturwissenschaften
por: Weers, Beatrice Simon
Publicado: (2026)
por: Weers, Beatrice Simon
Publicado: (2026)
OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation
por: Wang, Lizhi, et al.
Publicado: (2024)
por: Wang, Lizhi, et al.
Publicado: (2024)
DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction
por: Liu, Bo, et al.
Publicado: (2025)
por: Liu, Bo, et al.
Publicado: (2025)
Large Language Model-guided Document Selection
por: Kong, Xiang, et al.
Publicado: (2024)
por: Kong, Xiang, et al.
Publicado: (2024)
SpecEval: Evaluating Code Comprehension in Large Language Models via Program Specifications
por: Ma, Lezhi, et al.
Publicado: (2024)
por: Ma, Lezhi, et al.
Publicado: (2024)
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
por: Zhou, Pei, et al.
Publicado: (2024)
por: Zhou, Pei, et al.
Publicado: (2024)
Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models
por: Liu, Ji, et al.
Publicado: (2024)
por: Liu, Ji, et al.
Publicado: (2024)
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
por: Zhou, Runlong, et al.
Publicado: (2024)
por: Zhou, Runlong, et al.
Publicado: (2024)
The Crucial Role of Samplers in Online Direct Preference Optimization
por: Shi, Ruizhe, et al.
Publicado: (2024)
por: Shi, Ruizhe, et al.
Publicado: (2024)
Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback
por: Zhou, Runlong, et al.
Publicado: (2025)
por: Zhou, Runlong, et al.
Publicado: (2025)
Generalizing Linear Autoencoder Recommenders with Decoupled Expected Quadratic Loss
por: Guo, Ruixin, et al.
Publicado: (2026)
por: Guo, Ruixin, et al.
Publicado: (2026)
LatencyPrism: Online Non-intrusive Latency Sculpting for SLO-Guaranteed LLM Inference
por: Du, Yin, et al.
Publicado: (2026)
por: Du, Yin, et al.
Publicado: (2026)
Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models
por: Zeng, Hongchuan, et al.
Publicado: (2024)
por: Zeng, Hongchuan, et al.
Publicado: (2024)
TIM: Teaching Large Language Models to Translate with Comparison
por: Zeng, Jiali, et al.
Publicado: (2023)
por: Zeng, Jiali, et al.
Publicado: (2023)
PAC-Bayes Bounds for Multivariate Linear Regression and Linear Autoencoders
por: Guo, Ruixin, et al.
Publicado: (2025)
por: Guo, Ruixin, et al.
Publicado: (2025)
Multimodal Gen-AI for Fundamental Investment Research
por: Li, Lezhi, et al.
Publicado: (2023)
por: Li, Lezhi, et al.
Publicado: (2023)
Benchmarking Quantum Red TEA on CPUs, GPUs, and TPUs
por: Jaschke, Daniel, et al.
Publicado: (2024)
por: Jaschke, Daniel, et al.
Publicado: (2024)
Ejemplares similares
-
Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?
por: Findeis, Arduin, et al.
Publicado: (2025) -
AXLearn: Modular, Hardware-Agnostic Large Model Training
por: Lee, Mark, et al.
Publicado: (2025) -
Reusing Pre-Training Data at Test Time is a Compute Multiplier
por: Fang, Alex, et al.
Publicado: (2025) -
Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization
por: Wang, Chong, et al.
Publicado: (2026) -
Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs
por: Xu, Yechen, et al.
Publicado: (2026)