Saved in:
| Main Authors: | Wang, Zezhong, Yang, Fangkai, Wang, Lu, Zhao, Pu, Wang, Hongru, Chen, Liang, Lin, Qingwei, Wong, Kam-Fai |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.15851 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)
by: Xue, Boyang, et al.
Published: (2024)
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
by: Du, Yiming, et al.
Published: (2024)
by: Du, Yiming, et al.
Published: (2024)
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)
by: Xue, Boyang, et al.
Published: (2024)
FReM: A Flexible Reasoning Mechanism for Balancing Quick and Slow Thinking in Long-Context Question Answering
by: Zhao, Zhengyi, et al.
Published: (2025)
by: Zhao, Zhengyi, et al.
Published: (2025)
UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems
by: Wang, Hongru, et al.
Published: (2024)
by: Wang, Hongru, et al.
Published: (2024)
Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs
by: Wang, Qibin, et al.
Published: (2025)
by: Wang, Qibin, et al.
Published: (2025)
A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
by: Wang, Hongru, et al.
Published: (2023)
by: Wang, Hongru, et al.
Published: (2023)
OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
by: Cao, Jingtao, et al.
Published: (2024)
by: Cao, Jingtao, et al.
Published: (2024)
VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models
by: Cao, Jingtao, et al.
Published: (2024)
by: Cao, Jingtao, et al.
Published: (2024)
Self-Evolved Reward Learning for LLMs
by: Huang, Chenghua, et al.
Published: (2024)
by: Huang, Chenghua, et al.
Published: (2024)
Counterspeech for Mitigating the Influence of Media Bias: Comparing Human and LLM-Generated Responses
by: Lin, Luyang, et al.
Published: (2025)
by: Lin, Luyang, et al.
Published: (2025)
Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting
by: Wang, Rui, et al.
Published: (2023)
by: Wang, Rui, et al.
Published: (2023)
Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst
by: Wang, Hongru, et al.
Published: (2025)
by: Wang, Hongru, et al.
Published: (2025)
ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis
by: Wang, Zezhong, et al.
Published: (2024)
by: Wang, Zezhong, et al.
Published: (2024)
T$^2$: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
by: Zhao, Zhengyi, et al.
Published: (2025)
by: Zhao, Zhengyi, et al.
Published: (2025)
Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
by: Wang, Rui, et al.
Published: (2024)
by: Wang, Rui, et al.
Published: (2024)
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
by: Wang, Rui, et al.
Published: (2025)
by: Wang, Rui, et al.
Published: (2025)
DAST: Difficulty-Aware Self-Training on Large Language Models
by: Xue, Boyang, et al.
Published: (2025)
by: Xue, Boyang, et al.
Published: (2025)
Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning
by: Chen, Liang, et al.
Published: (2025)
by: Chen, Liang, et al.
Published: (2025)
SeRTS: Self-Rewarding Tree Search for Biomedical Retrieval-Augmented Generation
by: Hu, Minda, et al.
Published: (2024)
by: Hu, Minda, et al.
Published: (2024)
IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias Indicators
by: Lin, Luyang, et al.
Published: (2024)
by: Lin, Luyang, et al.
Published: (2024)
UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
by: Wang, Hongru, et al.
Published: (2024)
by: Wang, Hongru, et al.
Published: (2024)
Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation
by: Zhao, Zhengyi, et al.
Published: (2026)
by: Zhao, Zhengyi, et al.
Published: (2026)
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)
by: Xue, Boyang, et al.
Published: (2024)
Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions
by: Wang, Hongru, et al.
Published: (2024)
by: Wang, Hongru, et al.
Published: (2024)
Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs' Reasoning
by: Wang, Zezhong, et al.
Published: (2025)
by: Wang, Zezhong, et al.
Published: (2025)
Chain-of-Probe: Examining the Necessity and Accuracy of CoT Step-by-Step
by: Wang, Zezhong, et al.
Published: (2024)
by: Wang, Zezhong, et al.
Published: (2024)
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models
by: Feng, Huawen, et al.
Published: (2024)
by: Feng, Huawen, et al.
Published: (2024)
PEARL: Towards Permutation-Resilient LLMs
by: Chen, Liang, et al.
Published: (2025)
by: Chen, Liang, et al.
Published: (2025)
WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework
by: Chen, Yue, et al.
Published: (2025)
by: Chen, Yue, et al.
Published: (2025)
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges
by: Wang, Hongru, et al.
Published: (2025)
by: Wang, Hongru, et al.
Published: (2025)
Analysing the Residual Stream of Language Models Under Knowledge Conflicts
by: Zhao, Yu, et al.
Published: (2024)
by: Zhao, Yu, et al.
Published: (2024)
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
by: Zhao, Yu, et al.
Published: (2024)
by: Zhao, Yu, et al.
Published: (2024)
MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
by: Du, Yiming, et al.
Published: (2025)
by: Du, Yiming, et al.
Published: (2025)
WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation
by: Zhao, Zhengyi, et al.
Published: (2025)
by: Zhao, Zhengyi, et al.
Published: (2025)
GLiGuard: Schema-Conditioned Classification for LLM Safeguard
by: Zaratiana, Urchade, et al.
Published: (2026)
by: Zaratiana, Urchade, et al.
Published: (2026)
ReliableMath: Benchmark of Reliable Mathematical Reasoning on Large Language Models
by: Xue, Boyang, et al.
Published: (2025)
by: Xue, Boyang, et al.
Published: (2025)
Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
by: Li, Miaomiao, et al.
Published: (2025)
by: Li, Miaomiao, et al.
Published: (2025)
Acting Less is Reasoning More! Teaching Model to Act Efficiently
by: Wang, Hongru, et al.
Published: (2025)
by: Wang, Hongru, et al.
Published: (2025)
Detoxification for LLM: From Dataset Itself
by: Shao, Wei, et al.
Published: (2026)
by: Shao, Wei, et al.
Published: (2026)
Similar Items
-
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024) -
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
by: Du, Yiming, et al.
Published: (2024) -
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024) -
FReM: A Flexible Reasoning Mechanism for Balancing Quick and Slow Thinking in Long-Context Question Answering
by: Zhao, Zhengyi, et al.
Published: (2025) -
UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems
by: Wang, Hongru, et al.
Published: (2024)