:: Library Catalog

Bibliographic Details
Main Authors:	Guan, Jian; Wu, Wei; Wen, Zujie; Xu, Peng; Wang, Hongning; Huang, Minlie
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2402.01469

Similar Items

Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
by: Wen, Jiaxin, et al.
Published: (2024)

Language Model Decoding as Direct Metrics Optimization
by: Ji, Haozhe, et al.
Published: (2023)

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
by: Zhang, Zhexin, et al.
Published: (2023)

Learning Task Decomposition to Assist Humans in Competitive Programming
by: Wen, Jiaxin, et al.
Published: (2024)

Agent-SafetyBench: Evaluating the Safety of LLM Agents
by: Zhang, Zhexin, et al.
Published: (2024)

RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models
by: Feng, Andrew Zhuoer, et al.
Published: (2026)

Language Models Hallucinate, but May Excel at Fact Verification
by: Guan, Jian, et al.
Published: (2023)

Data Selection via Optimal Control for Language Models
by: Gu, Yuxian, et al.
Published: (2024)

RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis
by: Feng, Andrew Zhuoer, et al.
Published: (2026)

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
by: Wen, Bosi, et al.
Published: (2026)

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback
by: Hou, Zhenyu, et al.
Published: (2024)

Think Socially via Cognitive Reasoning
by: Zhou, Jinfeng, et al.
Published: (2025)

Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!
by: Zhang, Zhexin, et al.
Published: (2025)

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
by: Wen, Bosi, et al.
Published: (2025)

IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation
by: Wen, Bosi, et al.
Published: (2025)

SocialSim: Towards Socialized Simulation of Emotional Support Conversation
by: Chen, Zhuang, et al.
Published: (2025)

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
by: Cheng, Jiale, et al.
Published: (2023)

Towards Efficient Exact Optimization of Language Model Alignment
by: Ji, Haozhe, et al.
Published: (2024)

When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity
by: Cui, Shiyao, et al.
Published: (2025)

COKE: A Cognitive Knowledge Graph for Machine Theory of Mind
by: Wu, Jincenzi, et al.
Published: (2023)

Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering
by: Wen, Zhihua, et al.
Published: (2024)

A-MEM: Agentic Memory for LLM Agents
by: Xu, Wujiang, et al.
Published: (2025)

ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs
by: Cui, Shiyao, et al.
Published: (2025)

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
by: Gong, Zhuocheng, et al.
Published: (2024)

From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
by: Zhang, Zhexin, et al.
Published: (2024)

MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
by: Shi, Baorong, et al.
Published: (2026)

HoWToBench: Holistic Evaluation for LLM's Capability in Human-level Writing using Tree of Writing
by: Feng, Andrew Zhuoer, et al.
Published: (2026)

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
by: Gui, Jiayi, et al.
Published: (2024)

Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues
by: Zhou, Jinfeng, et al.
Published: (2025)

MiniPLM: Knowledge Distillation for Pre-Training Language Models
by: Gu, Yuxian, et al.
Published: (2024)

Benchmarking Complex Instruction-Following with Multiple Constraints Composition
by: Wen, Bosi, et al.
Published: (2024)

I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search
by: Liang, Zujie, et al.
Published: (2025)

Human Decision-making is Susceptible to AI-driven Manipulation
by: Sabour, Sahand, et al.
Published: (2025)

Towards Exception Safety Code Generation with Intermediate Representation Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)

MAGI: Multi-Agent Guided Interview for Psychiatric Assessment
by: Bi, Guanqun, et al.
Published: (2025)

LongSafety: Evaluating Long-Context Safety of Large Language Models
by: Lu, Yida, et al.
Published: (2025)

Ratchet: A Minimal Hygiene Recipe for Self-Evolving LLM Agents
by: Zhang, Xing, et al.
Published: (2026)

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
by: Ke, Pei, et al.
Published: (2023)

A Recipe For Building a Compliant Real Estate Chatbot
by: Madani, Navid, et al.
Published: (2024)

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
by: Cheng, Jiale, et al.
Published: (2024)