| Main authors: | Sun, Haocheng; Wen, Cynthia Xin; Wang, Edward Hong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2510.03289 |
Similar items
Building A Unified AI-centric Language System: analysis, framework and future work
by: Wang, Edward Hong, et al.
Published: (2025)
Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
by: Li, Xiang, et al.
Published: (2024)
TransformerFAM: Feedback attention is working memory
by: Hwang, Dongseong, et al.
Published: (2024)
Where does output diversity collapse in post-training?
by: Karouzos, Constantinos, et al.
Published: (2026)
Universe Routing: Why Self-Evolving Agents Need Epistemic Control
by: Wang, Zhaohui Geoffrey
Published: (2026)
Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents
by: Wang, Zehong, et al.
Published: (2026)
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models
by: Wen, Yilin, et al.
Published: (2023)
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
by: Gu, Yuxian, et al.
Published: (2025)
Reveal and Release: Iterative LLM Unlearning with Self-generated Data
by: Xie, Linxi, et al.
Published: (2025)
FADE: Why Bad Descriptions Happen to Good Features
by: Puri, Bruno, et al.
Published: (2025)
Why Do Safety Guardrails Degrade Across Languages?
by: Zhang, Max, et al.
Published: (2026)
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
by: Choi, Yumin, et al.
Published: (2025)
Why Larger Language Models Do In-context Learning Differently?
by: Shi, Zhenmei, et al.
Published: (2024)
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models
by: Ni, Zanlin, et al.
Published: (2026)
Nevermind: Instruction Override and Moderation in Large Language Models
by: Kim, Edward
Published: (2024)
A Systematic Review of Data-to-Text NLG
by: Osuji, Chinonso Cynthia, et al.
Published: (2024)
Why is Your Language Model a Poor Implicit Reward Model?
by: Razin, Noam, et al.
Published: (2025)
Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?
by: Kang, Deokhyung, et al.
Published: (2025)
The Invisible Leash: Why RLVR May or May Not Escape Its Origin
by: Wu, Fang, et al.
Published: (2025)
Efficient and Personalized Mobile Health Event Prediction via Small Language Models
by: Wang, Xin, et al.
Published: (2024)
SP^2DPO: An LLM-assisted Semantic Per-Pair DPO Generalization
by: He, Chaoyue, et al.
Published: (2026)
Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails
by: Frank, Gregory N.
Published: (2026)
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
by: Öncel, Fırat, et al.
Published: (2024)
A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning
by: Hong, Guan Zhe, et al.
Published: (2024)
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
by: Schaeffer, Rylan, et al.
Published: (2024)
MediFact at MEDIQA-CORR 2024: Why AI Needs a Human Touch
by: Saeed, Nadia
Published: (2024)
Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration
by: Chen, Zhipeng, et al.
Published: (2026)
Why Don't Prompt-Based Fairness Metrics Correlate?
by: Zayed, Abdelrahman, et al.
Published: (2024)
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
by: Maheswaran, Monishwaran, et al.
Published: (2025)
Order-Independence Without Fine Tuning
by: McIlroy-Young, Reid, et al.
Published: (2024)
Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data
by: Kim, Yubin, et al.
Published: (2024)
Why Does ChatGPT "Delve" So Much? Exploring the Sources of Lexical Overrepresentation in Large Language Models
by: Juzek, Tom S., et al.
Published: (2024)
Bayesian WeakS-to-Strong from Text Classification to Generation
by: Cui, Ziyun, et al.
Published: (2024)
Value-Guided Search for Efficient Chain-of-Thought Reasoning
by: Wang, Kaiwen, et al.
Published: (2025)
Effectively Controlling Reasoning Models through Thinking Intervention
by: Wu, Tong, et al.
Published: (2025)
PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning
by: Kachuee, Mohammad, et al.
Published: (2025)
Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning
by: Liu, Hui, et al.
Published: (2024)
LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts
by: Xiao, Yijia, et al.
Published: (2024)
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
by: Luo, Kairong, et al.
Published: (2025)