Guardado en:
| Autores principales: | He, Pengfei, Li, Zitao, Xing, Yue, Li, Yaling, Tang, Jiliang, Ding, Bolin |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2410.19000 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Video models are zero-shot learners and reasoners
por: Wiedemer, Thaddäus, et al.
Publicado: (2025)
por: Wiedemer, Thaddäus, et al.
Publicado: (2025)
Is continuous CoT better suited for multi-lingual reasoning?
por: Bashir, Ali Hamza, et al.
Publicado: (2026)
por: Bashir, Ali Hamza, et al.
Publicado: (2026)
A Simple Plug-in for Improving Eviction-Based KV Cache Compression
por: Lin, Yuping, et al.
Publicado: (2026)
por: Lin, Yuping, et al.
Publicado: (2026)
Beyond Data Privacy: New Privacy Risks for Large Language Models
por: Du, Yuntao, et al.
Publicado: (2025)
por: Du, Yuntao, et al.
Publicado: (2025)
Superiority of Multi-Head Attention in In-Context Linear Regression
por: Cui, Yingqian, et al.
Publicado: (2024)
por: Cui, Yingqian, et al.
Publicado: (2024)
Multi-Faceted Studies on Data Poisoning can Advance LLM Development
por: He, Pengfei, et al.
Publicado: (2025)
por: He, Pengfei, et al.
Publicado: (2025)
Improving LoRA in Privacy-preserving Federated Learning
por: Sun, Youbang, et al.
Publicado: (2024)
por: Sun, Youbang, et al.
Publicado: (2024)
Exploring System 1 and 2 communication for latent reasoning in LLMs
por: Coda-Forno, Julian, et al.
Publicado: (2025)
por: Coda-Forno, Julian, et al.
Publicado: (2025)
A Bargaining-based Approach for Feature Trading in Vertical Federated Learning
por: Cui, Yue, et al.
Publicado: (2024)
por: Cui, Yue, et al.
Publicado: (2024)
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
por: Cui, Yingqian, et al.
Publicado: (2024)
por: Cui, Yingqian, et al.
Publicado: (2024)
Are complicated loss functions necessary for teaching LLMs to reason?
por: Carrino, Gabriele, et al.
Publicado: (2026)
por: Carrino, Gabriele, et al.
Publicado: (2026)
Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study
por: He, Pengfei, et al.
Publicado: (2024)
por: He, Pengfei, et al.
Publicado: (2024)
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model
por: Wu, Feijie, et al.
Publicado: (2024)
por: Wu, Feijie, et al.
Publicado: (2024)
HARP: A challenging human-annotated math reasoning benchmark
por: Yue, Albert S., et al.
Publicado: (2024)
por: Yue, Albert S., et al.
Publicado: (2024)
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
por: Li, Belinda Z., et al.
Publicado: (2025)
por: Li, Belinda Z., et al.
Publicado: (2025)
Making deep neural networks right for the right scientific reasons by interacting with their explanations
por: Schramowski, Patrick, et al.
Publicado: (2020)
por: Schramowski, Patrick, et al.
Publicado: (2020)
Reinforcing privacy reasoning in LLMs via normative simulacra from fiction
por: Franchi, Matt, et al.
Publicado: (2026)
por: Franchi, Matt, et al.
Publicado: (2026)
Retrieval Heads are Dynamic
por: Lin, Yuping, et al.
Publicado: (2026)
por: Lin, Yuping, et al.
Publicado: (2026)
Self-rewarding correction for mathematical reasoning
por: Xiong, Wei, et al.
Publicado: (2025)
por: Xiong, Wei, et al.
Publicado: (2025)
What is the objective of reasoning with reinforcement learning?
por: Davis, Damek, et al.
Publicado: (2025)
por: Davis, Damek, et al.
Publicado: (2025)
Mixture of Parrots: Experts improve memorization more than reasoning
por: Jelassi, Samy, et al.
Publicado: (2024)
por: Jelassi, Samy, et al.
Publicado: (2024)
Active inference and artificial reasoning
por: Friston, Karl, et al.
Publicado: (2025)
por: Friston, Karl, et al.
Publicado: (2025)
Intra-request branch orchestration for efficient LLM reasoning
por: Jiang, Weifan, et al.
Publicado: (2025)
por: Jiang, Weifan, et al.
Publicado: (2025)
Entropy After </Think> for reasoning model early exiting
por: Wang, Xi, et al.
Publicado: (2025)
por: Wang, Xi, et al.
Publicado: (2025)
Making medical vision-language models think causally across modalities with retrieval-augmented cross-modal reasoning
por: Yang, Weiqin, et al.
Publicado: (2026)
por: Yang, Weiqin, et al.
Publicado: (2026)
LLM-Cave: A benchmark and light environment for large language models reasoning and decision-making system
por: Li, Huanyu, et al.
Publicado: (2025)
por: Li, Huanyu, et al.
Publicado: (2025)
Counterfactual reasoning: an analysis of in-context emergence
por: Miller, Moritz, et al.
Publicado: (2025)
por: Miller, Moritz, et al.
Publicado: (2025)
LLMs cannot find reasoning errors, but can correct them given the error location
por: Tyen, Gladys, et al.
Publicado: (2023)
por: Tyen, Gladys, et al.
Publicado: (2023)
Branch-and-Browse: Efficient and Controllable Web Exploration with Tree-Structured Reasoning and Action Memory
por: He, Shiqi, et al.
Publicado: (2025)
por: He, Shiqi, et al.
Publicado: (2025)
On the generalization capacity of neural networks during generic multimodal reasoning
por: Ito, Takuya, et al.
Publicado: (2024)
por: Ito, Takuya, et al.
Publicado: (2024)
Artificial Expert Intelligence through PAC-reasoning
por: Shalev-Shwartz, Shai, et al.
Publicado: (2024)
por: Shalev-Shwartz, Shai, et al.
Publicado: (2024)
When can transformers reason with abstract symbols?
por: Boix-Adsera, Enric, et al.
Publicado: (2023)
por: Boix-Adsera, Enric, et al.
Publicado: (2023)
Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning
por: Liu, Jiashun, et al.
Publicado: (2025)
por: Liu, Jiashun, et al.
Publicado: (2025)
Reinforcement Learning in hyperbolic space for multi-step reasoning
por: Xu, Tao, et al.
Publicado: (2025)
por: Xu, Tao, et al.
Publicado: (2025)
ProvMind: Provenance-grounded reasoning for materials synthesis
por: Zhang, Yiming, et al.
Publicado: (2026)
por: Zhang, Yiming, et al.
Publicado: (2026)
Learning richness modulates equality reasoning in neural networks
por: Tong, William L., et al.
Publicado: (2025)
por: Tong, William L., et al.
Publicado: (2025)
DiffusionShield: A Watermark for Copyright Protection against Generative Diffusion Models
por: Cui, Yingqian, et al.
Publicado: (2023)
por: Cui, Yingqian, et al.
Publicado: (2023)
Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents
por: Hariharan, Kaivalya, et al.
Publicado: (2025)
por: Hariharan, Kaivalya, et al.
Publicado: (2025)
Relational reasoning and inductive bias in transformers and large language models
por: Geerts, Jesse, et al.
Publicado: (2025)
por: Geerts, Jesse, et al.
Publicado: (2025)
Sudoku-Bench: Evaluating creative reasoning with Sudoku variants
por: Seely, Jeffrey, et al.
Publicado: (2025)
por: Seely, Jeffrey, et al.
Publicado: (2025)
Ejemplares similares
-
Video models are zero-shot learners and reasoners
por: Wiedemer, Thaddäus, et al.
Publicado: (2025) -
Is continuous CoT better suited for multi-lingual reasoning?
por: Bashir, Ali Hamza, et al.
Publicado: (2026) -
A Simple Plug-in for Improving Eviction-Based KV Cache Compression
por: Lin, Yuping, et al.
Publicado: (2026) -
Beyond Data Privacy: New Privacy Risks for Large Language Models
por: Du, Yuntao, et al.
Publicado: (2025) -
Superiority of Multi-Head Attention in In-Context Linear Regression
por: Cui, Yingqian, et al.
Publicado: (2024)