| Main Authors: | Yu, Haofei, Xuan, Keyang, Li, Fenghai, Zhu, Kunlun, Lei, Zijie, Zhang, Jiaxun, Qi, Ziheng, Richardson, Kyle, You, Jiaxuan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.06579 |
Similar Items
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents
by: Zhu, Kunlun, et al.
Published: (2025)
LiveTradeBench: Seeking Real-World Alpha with Large Language Models
by: Yu, Haofei, et al.
Published: (2025)
ResearchTown: Simulator of Human Research Community
by: Yu, Haofei, et al.
Published: (2024)
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers
by: Xuan, Keyang, et al.
Published: (2026)
ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery
by: Yu, Haofei, et al.
Published: (2026)
Sotopia-RL: Reward Design for Social Intelligence
by: Yu, Haofei, et al.
Published: (2025)
Beyond Facts: Evaluating Intent Hallucination in Large Language Models
by: Hao, Yijie, et al.
Published: (2025)
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
by: Hong, Zhaochen, et al.
Published: (2025)
GraphPlanner: Graph Memory-Augmented Agentic Routing for Multi-Agent LLMs
by: Feng, Tao, et al.
Published: (2026)
ResearchArcade: Graph Interface for Academic Tasks
by: Xu, Jingjun, et al.
Published: (2025)
Measure Valued Solution to the Spatially Homogeneous Boltzmann Equation with Inelastic Long-Range Interactions
by: Qi, Kunlun
Published: (2020)
Table as Thought: Exploring Structured Thoughts in LLM Reasoning
by: Sun, Zhenjie, et al.
Published: (2025)
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models
by: Han, Pengrui, et al.
Published: (2024)
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
by: Chen, Yixing, et al.
Published: (2025)
Which LLM Multi-Agent Protocol to Choose?
by: Du, Hongyi, et al.
Published: (2025)
Time-R1: Towards Comprehensive Temporal Reasoning in LLMs
by: Liu, Zijia, et al.
Published: (2025)
Probing the Knowledge Boundary: An Interactive Agentic Framework for Deep Knowledge Extraction
by: Yang, Yuheng, et al.
Published: (2026)
Auto-Dreamer: Learning Offline Memory Consolidation for Language Agents
by: Ye, Chongrui, et al.
Published: (2026)
ResearchCube: Multi-Dimensional Trade-off Exploration for Research Ideation
by: Ding, Zijian, et al.
Published: (2026)
Where LLM Agents Fail and How They can Learn From Failures
by: Zhu, Kunlun, et al.
Published: (2025)
Interactive Evaluation Requires a Design Science
by: Xuan, Keyang, et al.
Published: (2026)
SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications?
by: Tian, Muxin, et al.
Published: (2026)
Debugging Tabular Log as Dynamic Graphs
by: Liang, Chumeng, et al.
Published: (2025)
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
by: Wang, Ruiyi, et al.
Published: (2024)
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents
by: Zhu, Kunlun, et al.
Published: (2025)
How Far Are We From AGI: Are LLMs All We Need?
by: Feng, Tao, et al.
Published: (2024)
OpenTinker: Separating Concerns in Agentic Reinforcement Learning
by: Zhu, Siqi, et al.
Published: (2026)
AI Urban Scientist: Multi-Agent Collaborative Automation for Urban Research
by: Xia, Tong, et al.
Published: (2025)
LRanker: LLM Ranker for Massive Candidates
by: Feng, Tao, et al.
Published: (2026)
R1-Ranker: Teaching LLM Rankers to Reason
by: Feng, Tao, et al.
Published: (2025)
FAIRSECO: An Extensible Framework for Impact Measurement of Research Software
by: Deekshitha, et al.
Published: (2024)
YASPS: A Symbolic Framework for Extensible, High-Performance IPC Simulation
by: Tang, Xuan, et al.
Published: (2026)
Solving the BGK Model and Boltzmann equation by Fourier Neural Operator with conservative constraints
by: Hu, Boyun, et al.
Published: (2025)
From the Boltzmann equation for gas mixture to the two-fluid incompressible hydrodynamic system
by: Fang, Zhendong, et al.
Published: (2024)
The Semiclassical Limit of the 2D Dirac--Hartree Equation with Periodic Potentials
by: Lee, Jinyeop, et al.
Published: (2025)
Hydrodynamic limit of the Vlasov-Poisson-Fokker-Planck system in low-field regime
by: Fang, Zhendong, et al.
Published: (2025)
Spectral convergence of a semi-discretized numerical system for the spatially homogeneous Boltzmann equation with uncertainties
by: Liu, Liu, et al.
Published: (2024)
Convergence of the Fourier-Galerkin spectral method for the Boltzmann equation with uncertainties
by: Liu, Liu, et al.
Published: (2022)
InternAgent: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification
by: InternAgent Team, et al.
Published: (2025)
A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs
by: Fang, Jiaxun, et al.
Published: (2025)