| Main Authors: | Zhou, Yuhang, Ni, Yuchen, Gan, Yunhui, Yin, Zhangyue, Liu, Xiang, Zhang, Jian, Liu, Sen, Qiu, Xipeng, Ye, Guangnan, Chai, Hongfeng |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.12713 |
Similar Items
$R^3$-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL
by: Zhou, Yuhang, et al.
Published: (2023)
SilverSight: A Multi-Task Chinese Financial Large Language Model Based on Adaptive Semantic Space Learning
by: Zhou, Yuhang, et al.
Published: (2024)
Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches
by: Zhou, Yuhang, et al.
Published: (2025)
RAGFormer: Learning Semantic Attributes and Topological Structure for Fraud Detection
by: Li, Haolin, et al.
Published: (2024)
FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning
by: Zhou, Liuzhi, et al.
Published: (2024)
DogLayout: Denoising Diffusion GAN for Discrete and Continuous Layout Generation
by: Gan, Zhaoxing, et al.
Published: (2024)
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
by: Sun, Qiushi, et al.
Published: (2023)
RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization
by: Zhiyuan, Zeng, et al.
Published: (2025)
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
by: Zeng, Zhiyuan, et al.
Published: (2025)
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
by: Sun, Yuhong, et al.
Published: (2025)
GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation
by: Wang, Yifan, et al.
Published: (2026)
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
by: Li, Yuan, et al.
Published: (2025)
Limited or Biased: Modeling Sub-Rational Human Investors in Financial Markets
by: Liu, Penghang, et al.
Published: (2022)
Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO
by: Zeng, Zhiyuan, et al.
Published: (2026)
Dynamic and Generalizable Process Reward Modeling
by: Yin, Zhangyue, et al.
Published: (2025)
LLatrieval: LLM-Verified Retrieval for Verifiable Generation
by: Li, Xiaonan, et al.
Published: (2023)
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem
by: Sun, Yuhong, et al.
Published: (2024)
Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation
by: Li, Zeping, et al.
Published: (2026)
Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
by: Li, Zeping, et al.
Published: (2026)
TransXion: A High-Fidelity Graph Benchmark for Realistic Anti-Money Laundering
by: Chen, Keyang, et al.
Published: (2026)
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
by: Li, Yuan, et al.
Published: (2026)
ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models
by: Yin, Zhangyue, et al.
Published: (2025)
H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection
by: Liu, Yuhang, et al.
Published: (2025)
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
by: Liu, Xiaoran, et al.
Published: (2025)
Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
by: Jiang, Botian, et al.
Published: (2024)
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
by: Song, Yuerong, et al.
Published: (2025)
Can AI Assistants Know What They Don't Know?
by: Cheng, Qinyuan, et al.
Published: (2024)
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)
Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective
by: Chandna, Bhavik, et al.
Published: (2025)
Perceived Political Bias in LLMs Reduces Persuasive Abilities
by: DiGiuseppe, Matthew, et al.
Published: (2026)
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
by: Liu, Xiaoran, et al.
Published: (2025)
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective
by: Zeng, Zhiyuan, et al.
Published: (2024)
MetaRank: Task-Aware Metric Selection for Model Transferability Estimation
by: Liu, Yuhang, et al.
Published: (2025)
How AI Agents Follow the Herd of AI? Network Effects, History, and Machine Optimism
by: Liu, Yu, et al.
Published: (2025)
When Machines Meet Each Other: Network Effects and the Strategic Role of History in Multi-Agent AI
by: Liu, Yu, et al.
Published: (2025)
Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes
by: Jiang, Mingxuan, et al.
Published: (2025)
Does Financial Statement Comparability Reduce Differences in Sentiment‐induced Investor Trading Behaviour?
by: Eun Hye Jo, et al.
Published: (2025)
Elucidating Mechanisms of Demographic Bias in LLMs for Healthcare
by: Ahsan, Hiba, et al.
Published: (2025)
Reducing False Positives in Static Bug Detection with LLMs: An Empirical Study in Industry
by: Du, Xueying, et al.
Published: (2026)
Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge
by: Fu, Jinlan, et al.
Published: (2024)