:: Library Catalog

Bibliographic Details
Main authors:	He, Pengfei, Li, Zitao, Xing, Yue, Li, Yaling, Tang, Jiliang, Ding, Bolin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online access:	https://arxiv.org/abs/2410.19000

Similar items

Video models are zero-shot learners and reasoners
by: Wiedemer, Thaddäus, et al.
Published: (2025)

Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)

A Simple Plug-in for Improving Eviction-Based KV Cache Compression
by: Lin, Yuping, et al.
Published: (2026)

Beyond Data Privacy: New Privacy Risks for Large Language Models
by: Du, Yuntao, et al.
Published: (2025)

Superiority of Multi-Head Attention in In-Context Linear Regression
by: Cui, Yingqian, et al.
Published: (2024)

Multi-Faceted Studies on Data Poisoning can Advance LLM Development
by: He, Pengfei, et al.
Published: (2025)

Improving LoRA in Privacy-preserving Federated Learning
by: Sun, Youbang, et al.
Published: (2024)

Exploring System 1 and 2 communication for latent reasoning in LLMs
by: Coda-Forno, Julian, et al.
Published: (2025)

A Bargaining-based Approach for Feature Trading in Vertical Federated Learning
by: Cui, Yue, et al.
Published: (2024)

A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
by: Cui, Yingqian, et al.
Published: (2024)

Are complicated loss functions necessary for teaching LLMs to reason?
by: Carrino, Gabriele, et al.
Published: (2026)

Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study
by: He, Pengfei, et al.
Published: (2024)

FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model
by: Wu, Feijie, et al.
Published: (2024)

HARP: A challenging human-annotated math reasoning benchmark
by: Yue, Albert S., et al.
Published: (2024)

QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
by: Li, Belinda Z., et al.
Published: (2025)

Making deep neural networks right for the right scientific reasons by interacting with their explanations
by: Schramowski, Patrick, et al.
Published: (2020)

Reinforcing privacy reasoning in LLMs via normative simulacra from fiction
by: Franchi, Matt, et al.
Published: (2026)

Retrieval Heads are Dynamic
by: Lin, Yuping, et al.
Published: (2026)

Self-rewarding correction for mathematical reasoning
by: Xiong, Wei, et al.
Published: (2025)

What is the objective of reasoning with reinforcement learning?
by: Davis, Damek, et al.
Published: (2025)

Mixture of Parrots: Experts improve memorization more than reasoning
by: Jelassi, Samy, et al.
Published: (2024)

Active inference and artificial reasoning
by: Friston, Karl, et al.
Published: (2025)

Intra-request branch orchestration for efficient LLM reasoning
by: Jiang, Weifan, et al.
Published: (2025)

Entropy After </Think> for reasoning model early exiting
by: Wang, Xi, et al.
Published: (2025)

Making medical vision-language models think causally across modalities with retrieval-augmented cross-modal reasoning
by: Yang, Weiqin, et al.
Published: (2026)

LLM-Cave: A benchmark and light environment for large language models reasoning and decision-making system
by: Li, Huanyu, et al.
Published: (2025)

Counterfactual reasoning: an analysis of in-context emergence
by: Miller, Moritz, et al.
Published: (2025)

LLMs cannot find reasoning errors, but can correct them given the error location
by: Tyen, Gladys, et al.
Published: (2023)

Branch-and-Browse: Efficient and Controllable Web Exploration with Tree-Structured Reasoning and Action Memory
by: He, Shiqi, et al.
Published: (2025)

On the generalization capacity of neural networks during generic multimodal reasoning
by: Ito, Takuya, et al.
Published: (2024)

Artificial Expert Intelligence through PAC-reasoning
by: Shalev-Shwartz, Shai, et al.
Published: (2024)

When can transformers reason with abstract symbols?
by: Boix-Adsera, Enric, et al.
Published: (2023)

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning
by: Liu, Jiashun, et al.
Published: (2025)

Reinforcement Learning in hyperbolic space for multi-step reasoning
by: Xu, Tao, et al.
Published: (2025)

ProvMind: Provenance-grounded reasoning for materials synthesis
by: Zhang, Yiming, et al.
Published: (2026)

Learning richness modulates equality reasoning in neural networks
by: Tong, William L., et al.
Published: (2025)

DiffusionShield: A Watermark for Copyright Protection against Generative Diffusion Models
by: Cui, Yingqian, et al.
Published: (2023)

Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents
by: Hariharan, Kaivalya, et al.
Published: (2025)

Relational reasoning and inductive bias in transformers and large language models
by: Geerts, Jesse, et al.
Published: (2025)

Sudoku-Bench: Evaluating creative reasoning with Sudoku variants
by: Seely, Jeffrey, et al.
Published: (2025)