| Main Authors: | Yu, Haofei, Qi, Zhengyang, Zhao, Yining, Nottingham, Kolby, Xuan, Keyang, Majumder, Bodhisattwa Prasad, Zhu, Hao, Liang, Paul Pu, You, Jiaxuan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.03905 |
Similar Items
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
by: Nottingham, Kolby, et al.
Published: (2024)
ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery
by: Yu, Haofei, et al.
Published: (2026)
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers
by: Xuan, Keyang, et al.
Published: (2026)
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
by: Fan, Xianzhe, et al.
Published: (2025)
To Tell The Truth: Language of Deception and Language Models
by: Hazra, Sanchaita, et al.
Published: (2023)
Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos
by: Kalluri, Tarun, et al.
Published: (2024)
AI Safety Should Prioritize the Future of Work
by: Hazra, Sanchaita, et al.
Published: (2025)
TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
by: Yu, Haofei, et al.
Published: (2025)
ResearchTown: Simulator of Human Research Community
by: Yu, Haofei, et al.
Published: (2024)
Beyond Facts: Evaluating Intent Hallucination in Large Language Models
by: Hao, Yijie, et al.
Published: (2025)
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
by: Wang, Ruiyi, et al.
Published: (2024)
CTM-AI: A Blueprint for General AI Inspired by a Model of Consciousness
by: Yu, Haofei, et al.
Published: (2026)
Accepted with Minor Revisions: Value of AI-Assisted Scientific Writing
by: Hazra, Sanchaita, et al.
Published: (2025)
The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection
by: Bhattacharya, Haimanti, et al.
Published: (2024)
LiveTradeBench: Seeking Real-World Alpha with Large Language Models
by: Yu, Haofei, et al.
Published: (2025)
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
by: Hong, Zhaochen, et al.
Published: (2025)
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts
by: Yu, Haofei, et al.
Published: (2023)
Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
by: Chen, Jiangjie, et al.
Published: (2023)
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
by: Zhou, Xuhui, et al.
Published: (2023)
Table as Thought: Exploring Structured Thoughts in LLM Reasoning
by: Sun, Zhenjie, et al.
Published: (2025)
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models
by: Han, Pengrui, et al.
Published: (2024)
Data-driven Discovery with Large Generative Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
Auto-Dreamer: Learning Offline Memory Consolidation for Language Agents
by: Ye, Chongrui, et al.
Published: (2026)
Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning
by: White, Isadora, et al.
Published: (2025)
Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization
by: Lal, Yash Kumar, et al.
Published: (2023)
Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning
by: Xie, Zhouhang, et al.
Published: (2024)
On Designing Effective RL Reward at Training Time for LLM Reasoning
by: Gao, Jiaxuan, et al.
Published: (2024)
MINT: Multimodal Instruction Tuning with Multimodal Interaction Grouping
by: Shan, Xiaojun, et al.
Published: (2025)
Debugging Tabular Log as Dynamic Graphs
by: Liang, Chumeng, et al.
Published: (2025)
TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models
by: Lee, Jaewoo, et al.
Published: (2025)
Onondaga County Public Library, Final Performance Report for Library Services and Construction Act (LSCA) Title VI, Library Literacy Program.
by: Nottingham, Sharon
Published: (1993)
Time-R1: Towards Comprehensive Temporal Reasoning in LLMs
by: Liu, Zijia, et al.
Published: (2025)
ResearchArcade: Graph Interface for Academic Tasks
by: Xu, Jingjun, et al.
Published: (2025)
Foundations of Multisensory Artificial Intelligence
by: Liang, Paul Pu
Published: (2024)
Latent Factor Models Meets Instructions: Goal-conditioned Latent Factor Discovery without Task Supervision
by: Xie, Zhouhang, et al.
Published: (2025)
D6.2 User stories usage scenarios and use case validation v2
by: Nottingham Trent University
Published: (2025)
Towards Socially and Morally Aware RL agent: Reward Design With LLM
by: Wang, Zhaoyue
Published: (2024)
A Vision for Multisensory Intelligence: Sensing, Science, and Synergy
by: Liang, Paul Pu
Published: (2026)
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents
by: Zhu, Kunlun, et al.
Published: (2025)
Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners
by: Muslimani, Calarina, et al.
Published: (2025)