:: Library Catalog

Bibliographic Details
Main authors:	Sun, Haocheng; Wen, Cynthia Xin; Wang, Edward Hong
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence; Computation and Language
Online access:	https://arxiv.org/abs/2510.03289

Similar items

Building A Unified AI-centric Language System: analysis, framework and future work
by: Wang, Edward Hong, et al.
Published: (2025)

Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
by: Li, Xiang, et al.
Published: (2024)

TransformerFAM: Feedback attention is working memory
by: Hwang, Dongseong, et al.
Published: (2024)

Where does output diversity collapse in post-training?
by: Karouzos, Constantinos, et al.
Published: (2026)

Universe Routing: Why Self-Evolving Agents Need Epistemic Control
by: Wang, Zhaohui Geoffrey
Published: (2026)

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents
by: Wang, Zehong, et al.
Published: (2026)

MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models
by: Wen, Yilin, et al.
Published: (2023)

Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
by: Gu, Yuxian, et al.
Published: (2025)

Reveal and Release: Iterative LLM Unlearning with Self-generated Data
by: Xie, Linxi, et al.
Published: (2025)

FADE: Why Bad Descriptions Happen to Good Features
by: Puri, Bruno, et al.
Published: (2025)

Why Do Safety Guardrails Degrade Across Languages?
by: Zhang, Max, et al.
Published: (2026)

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
by: Choi, Yumin, et al.
Published: (2025)

Why Larger Language Models Do In-context Learning Differently?
by: Shi, Zhenmei, et al.
Published: (2024)

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models
by: Ni, Zanlin, et al.
Published: (2026)

Nevermind: Instruction Override and Moderation in Large Language Models
by: Kim, Edward
Published: (2024)

A Systematic Review of Data-to-Text NLG
by: Osuji, Chinonso Cynthia, et al.
Published: (2024)

Why is Your Language Model a Poor Implicit Reward Model?
by: Razin, Noam, et al.
Published: (2025)

Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?
by: Kang, Deokhyung, et al.
Published: (2025)

The Invisible Leash: Why RLVR May or May Not Escape Its Origin
by: Wu, Fang, et al.
Published: (2025)

Efficient and Personalized Mobile Health Event Prediction via Small Language Models
by: Wang, Xin, et al.
Published: (2024)

SP^2DPO: An LLM-assisted Semantic Per-Pair DPO Generalization
by: He, Chaoyue, et al.
Published: (2026)

Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)

Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails
by: Frank, Gregory N.
Published: (2026)

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
by: Öncel, Fırat, et al.
Published: (2024)

A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning
by: Hong, Guan Zhe, et al.
Published: (2024)

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
by: Schaeffer, Rylan, et al.
Published: (2024)

MediFact at MEDIQA-CORR 2024: Why AI Needs a Human Touch
by: Saeed, Nadia
Published: (2024)

Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration
by: Chen, Zhipeng, et al.
Published: (2026)

Why Don't Prompt-Based Fairness Metrics Correlate?
by: Zayed, Abdelrahman, et al.
Published: (2024)

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
by: Maheswaran, Monishwaran, et al.
Published: (2025)

Order-Independence Without Fine Tuning
by: McIlroy-Young, Reid, et al.
Published: (2024)

Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data
by: Kim, Yubin, et al.
Published: (2024)

Why Does ChatGPT "Delve" So Much? Exploring the Sources of Lexical Overrepresentation in Large Language Models
by: Juzek, Tom S., et al.
Published: (2024)

Bayesian WeakS-to-Strong from Text Classification to Generation
by: Cui, Ziyun, et al.
Published: (2024)

Value-Guided Search for Efficient Chain-of-Thought Reasoning
by: Wang, Kaiwen, et al.
Published: (2025)

Effectively Controlling Reasoning Models through Thinking Intervention
by: Wu, Tong, et al.
Published: (2025)

PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning
by: Kachuee, Mohammad, et al.
Published: (2025)

Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning
by: Liu, Hui, et al.
Published: (2024)

LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts
by: Xiao, Yijia, et al.
Published: (2024)

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
by: Luo, Kairong, et al.
Published: (2025)