:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Zezhong, Yang, Fangkai, Wang, Lu, Zhao, Pu, Wang, Hongru, Chen, Liang, Lin, Qingwei, Wong, Kam-Fai
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2310.15851
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
by: Du, Yiming, et al.
Published: (2024)

MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)

FReM: A Flexible Reasoning Mechanism for Balancing Quick and Slow Thinking in Long-Context Question Answering
by: Zhao, Zhengyi, et al.
Published: (2025)

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems
by: Wang, Hongru, et al.
Published: (2024)

Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs
by: Wang, Qibin, et al.
Published: (2025)

A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
by: Wang, Hongru, et al.
Published: (2023)

OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
by: Cao, Jingtao, et al.
Published: (2024)

VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models
by: Cao, Jingtao, et al.
Published: (2024)

Self-Evolved Reward Learning for LLMs
by: Huang, Chenghua, et al.
Published: (2024)

Counterspeech for Mitigating the Influence of Media Bias: Comparing Human and LLM-Generated Responses
by: Lin, Luyang, et al.
Published: (2025)

Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting
by: Wang, Rui, et al.
Published: (2023)

Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst
by: Wang, Hongru, et al.
Published: (2025)

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis
by: Wang, Zezhong, et al.
Published: (2024)

T$^2$: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
by: Zhao, Zhengyi, et al.
Published: (2025)

Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
by: Wang, Rui, et al.
Published: (2024)

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
by: Wang, Rui, et al.
Published: (2025)

DAST: Difficulty-Aware Self-Training on Large Language Models
by: Xue, Boyang, et al.
Published: (2025)

Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning
by: Chen, Liang, et al.
Published: (2025)

SeRTS: Self-Rewarding Tree Search for Biomedical Retrieval-Augmented Generation
by: Hu, Minda, et al.
Published: (2024)

IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias Indicators
by: Lin, Luyang, et al.
Published: (2024)

UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
by: Wang, Hongru, et al.
Published: (2024)

Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation
by: Zhao, Zhengyi, et al.
Published: (2026)

UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)

Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions
by: Wang, Hongru, et al.
Published: (2024)

Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs' Reasoning
by: Wang, Zezhong, et al.
Published: (2025)

Chain-of-Probe: Examining the Necessity and Accuracy of CoT Step-by-Step
by: Wang, Zezhong, et al.
Published: (2024)

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models
by: Feng, Huawen, et al.
Published: (2024)

PEARL: Towards Permutation-Resilient LLMs
by: Chen, Liang, et al.
Published: (2025)

WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework
by: Chen, Yue, et al.
Published: (2025)

Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges
by: Wang, Hongru, et al.
Published: (2025)

Analysing the Residual Stream of Language Models Under Knowledge Conflicts
by: Zhao, Yu, et al.
Published: (2024)

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
by: Zhao, Yu, et al.
Published: (2024)

MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
by: Du, Yiming, et al.
Published: (2025)

WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation
by: Zhao, Zhengyi, et al.
Published: (2025)

GLiGuard: Schema-Conditioned Classification for LLM Safeguard
by: Zaratiana, Urchade, et al.
Published: (2026)

ReliableMath: Benchmark of Reliable Mathematical Reasoning on Large Language Models
by: Xue, Boyang, et al.
Published: (2025)

Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
by: Li, Miaomiao, et al.
Published: (2025)

Acting Less is Reasoning More! Teaching Model to Act Efficiently
by: Wang, Hongru, et al.
Published: (2025)

Detoxification for LLM: From Dataset Itself
by: Shao, Wei, et al.
Published: (2026)