:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhou, Yuhang, Ni, Yuchen, Gan, Yunhui, Yin, Zhangyue, Liu, Xiang, Zhang, Jian, Liu, Sen, Qiu, Xipeng, Ye, Guangnan, Chai, Hongfeng
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2402.12713
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

$R^3$-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL
by: Zhou, Yuhang, et al.
Published: (2023)

SilverSight: A Multi-Task Chinese Financial Large Language Model Based on Adaptive Semantic Space Learning
by: Zhou, Yuhang, et al.
Published: (2024)

Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches
by: Zhou, Yuhang, et al.
Published: (2025)

RAGFormer: Learning Semantic Attributes and Topological Structure for Fraud Detection
by: Li, Haolin, et al.
Published: (2024)

FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning
by: Zhou, Liuzhi, et al.
Published: (2024)

DogLayout: Denoising Diffusion GAN for Discrete and Continuous Layout Generation
by: Gan, Zhaoxing, et al.
Published: (2024)

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
by: Sun, Qiushi, et al.
Published: (2023)

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization
by: Zhiyuan, Zeng, et al.
Published: (2025)

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
by: Zeng, Zhiyuan, et al.
Published: (2025)

Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
by: Sun, Yuhong, et al.
Published: (2025)

GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation
by: Wang, Yifan, et al.
Published: (2026)

R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
by: Li, Yuan, et al.
Published: (2025)

Limited or Biased: Modeling Sub-Rational Human Investors in Financial Markets
by: Liu, Penghang, et al.
Published: (2022)

Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO
by: Zeng, Zhiyuan, et al.
Published: (2026)

Dynamic and Generalizable Process Reward Modeling
by: Yin, Zhangyue, et al.
Published: (2025)

LLatrieval: LLM-Verified Retrieval for Verifiable Generation
by: Li, Xiaonan, et al.
Published: (2023)

Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem
by: Sun, Yuhong, et al.
Published: (2024)

Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation
by: Li, Zeping, et al.
Published: (2026)

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
by: Li, Zeping, et al.
Published: (2026)

TransXion: A High-Fidelity Graph Benchmark for Realistic Anti-Money Laundering
by: Chen, Keyang, et al.
Published: (2026)

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
by: Li, Yuan, et al.
Published: (2026)

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models
by: Yin, Zhangyue, et al.
Published: (2025)

H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection
by: Liu, Yuhang, et al.
Published: (2025)

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
by: Liu, Xiaoran, et al.
Published: (2025)

Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
by: Jiang, Botian, et al.
Published: (2024)

Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
by: Song, Yuerong, et al.
Published: (2025)

Can AI Assistants Know What They Don't Know?
by: Cheng, Qinyuan, et al.
Published: (2024)

BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)

Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective
by: Chandna, Bhavik, et al.
Published: (2025)

Perceived Political Bias in LLMs Reduces Persuasive Abilities
by: DiGiuseppe, Matthew, et al.
Published: (2026)

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
by: Liu, Xiaoran, et al.
Published: (2025)

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective
by: Zeng, Zhiyuan, et al.
Published: (2024)

MetaRank: Task-Aware Metric Selection for Model Transferability Estimation
by: Liu, Yuhang, et al.
Published: (2025)

How AI Agents Follow the Herd of AI? Network Effects, History, and Machine Optimism
by: Liu, Yu, et al.
Published: (2025)

When Machines Meet Each Other: Network Effects and the Strategic Role of History in Multi-Agent AI
by: Liu, Yu, et al.
Published: (2025)

Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes
by: Jiang, Mingxuan, et al.
Published: (2025)

Does Financial Statement Comparability Reduce Differences in Sentiment‐induced Investor Trading Behaviour?
by: Eun Hye Jo, et al.
Published: (2025)

Elucidating Mechanisms of Demographic Bias in LLMs for Healthcare
by: Ahsan, Hiba, et al.
Published: (2025)

Reducing False Positives in Static Bug Detection with LLMs: An Empirical Study in Industry
by: Du, Xueying, et al.
Published: (2026)

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge
by: Fu, Jinlan, et al.
Published: (2024)