| Main Authors: | Wang, Qian, Wu, Jiaying, Jiang, Zichen, Tang, Zhenheng, Luo, Bingqiao, Chen, Nuo, Chen, Wei, He, Bingsheng |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2501.08579 |
Similar Items
Exploring LLM Cryptocurrency Trading Through Fact-Subjectivity Aware Reasoning
by: Wang, Qian, et al.
Published: (2024)
From ChatGPT to DeepSeek: Can LLMs Simulate Humanity?
by: Wang, Qian, et al.
Published: (2025)
Assessing Judging Bias in Large Reasoning Models: An Empirical Study
by: Wang, Qian, et al.
Published: (2025)
Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference
by: Chen, Nuo, et al.
Published: (2025)
XtraGPT: Context-Aware and Controllable Academic Paper Revision via Human-AI Collaboration
by: Chen, Nuo, et al.
Published: (2025)
A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading
by: Li, Yuan, et al.
Published: (2024)
JudgeLRM: Large Reasoning Models as a Judge
by: Chen, Nuo, et al.
Published: (2025)
The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?
by: Tang, Zhenheng, et al.
Published: (2025)
Towards Evaluting Fake Reasoning Bias in Language Models
by: Wang, Qian, et al.
Published: (2025)
Beyond Brainstorming: What Drives High-Quality Scientific Ideas? Lessons from Multi-Agent Collaboration
by: Chen, Nuo, et al.
Published: (2025)
Diversity Collapse in Multi-Agent LLM Systems: Structural Coupling and Collective Failure in Open-Ended Idea Generation
by: Chen, Nuo, et al.
Published: (2026)
MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs
by: Wang, Qian, et al.
Published: (2024)
XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs
by: Chen, Zichen, et al.
Published: (2023)
AI-powered Fraud Detection in Decentralized Finance: A Project Life Cycle Perspective
by: Luo, Bingqiao, et al.
Published: (2023)
LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
by: Sun, Lu, et al.
Published: (2025)
What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents
by: Xue, Zhaoqian, et al.
Published: (2024)
Can LLM Agents Simulate Multi-Turn Human Behavior? Evidence from Real Online Customer Behavior Data
by: Lu, Yuxuan, et al.
Published: (2025)
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet
by: Li, Nathaniel, et al.
Published: (2024)
Characterizing and Evaluating the Reliability of LLMs against Jailbreak Attacks
by: Chen, Kexin, et al.
Published: (2024)
Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration
by: Yao, Bingsheng, et al.
Published: (2025)
When Crypto Economics Meet Graph Analytics and Learning
by: Luo, Bingqiao
Published: (2024)
Is Your LLM-as-a-Recommender Agent Trustable? LLMs' Recommendation is Easily Hacked by Biases (Preferences)
by: Tang, Zichen, et al.
Published: (2026)
LLM-REVal: Can We Trust LLM Reviewers Yet?
by: Li, Rui, et al.
Published: (2025)
Dual-Pool Token-Budget Routing for Cost-Efficient and Reliable LLM Serving
by: Liu, Xunzhuo, et al.
Published: (2026)
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation
by: Chen, Han, et al.
Published: (2025)
UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems
by: Sekulić, Ivan, et al.
Published: (2024)
Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
by: Chen, Jiaju, et al.
Published: (2025)
MHRC-Bench: A Multilingual Hardware Repository-Level Code Completion benchmark
by: Zou, Qingyun, et al.
Published: (2026)
Reasoning Model Is Superior LLM-Judge, Yet Suffers from Biases
by: Huang, Hui, et al.
Published: (2026)
CLUE: Conflict-guided Localization for LLM Unlearning Framework
by: Chen, Hang, et al.
Published: (2025)
DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans
by: Yao, Bingsheng, et al.
Published: (2025)
Aggressive Post-Training Compression on Extremely Large Language Models
by: Zhang, Zining, et al.
Published: (2024)
MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs
by: Gao, Yufei, et al.
Published: (2025)
Making Bias Non-Predictive: Training Robust LLM Reasoning via Reinforcement Learning
by: Wang, Qian, et al.
Published: (2026)
Improving Text-to-Image Generation with Input-Side Inference-Time Scaling
by: Chen, Ruibo, et al.
Published: (2025)
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
by: Kong, Huanjun, et al.
Published: (2024)
GraphWiz: An Instruction-Following Language Model for Graph Problems
by: Chen, Nuo, et al.
Published: (2024)
Character-aware Transformers Learn an Irregular Morphological Pattern Yet None Generalize Like Humans
by: Ramarao, Akhilesh Kakolu, et al.
Published: (2026)
Reducing Tool Hallucination via Reliability Alignment
by: Xu, Hongshen, et al.
Published: (2024)