:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Xie, Liangru, Liu, Hui, Zeng, Jingying, Tang, Xianfeng, Han, Yan, Luo, Chen, Huang, Jing, Li, Zhen, Wang, Suhang, He, Qi
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Artificial Intelligence Computation and Language
Online-Zugang:	https://arxiv.org/abs/2412.12767
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models
von: Cui, Yingqian, et al.
Veröffentlicht: (2025)

Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents
von: Zeng, Jingying, et al.
Veröffentlicht: (2025)

A General Framework to Enhance Fine-tuning-based LLM Unlearning
von: Ren, Jie, et al.
Veröffentlicht: (2025)

AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks
von: Wang, Fali, et al.
Veröffentlicht: (2025)

Examples as the Prompt: A Scalable Approach for Efficient LLM Adaptation in E-Commerce
von: Zeng, Jingying, et al.
Veröffentlicht: (2025)

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary
von: Zhang, Zhiwei, et al.
Veröffentlicht: (2025)

Adaptive Test-Time Reasoning via Reward-Guided Dual-Phase Search
von: Cui, Yingqian, et al.
Veröffentlicht: (2025)

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
von: Lin, Minhua, et al.
Veröffentlicht: (2025)

How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities
von: Lin, Minhua, et al.
Veröffentlicht: (2025)

Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning
von: Han, Haoyu, et al.
Veröffentlicht: (2025)

Catastrophic Failure of LLM Unlearning via Quantization
von: Zhang, Zhiwei, et al.
Veröffentlicht: (2024)

ViLBench: A Suite for Vision-Language Process Reward Modeling
von: Tu, Haoqin, et al.
Veröffentlicht: (2025)

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
von: Wang, Fali, et al.
Veröffentlicht: (2025)

How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use
von: Lin, Minhua, et al.
Veröffentlicht: (2026)

Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
von: Zhang, Zhiwei, et al.
Veröffentlicht: (2025)

Divide-Verify-Refine: Can LLMs Self-Align with Complex Instructions?
von: Zhang, Xianren, et al.
Veröffentlicht: (2024)

Beyond the Black Box: A Survey on the Theory and Mechanism of Large Language Models
von: Gan, Zeyu, et al.
Veröffentlicht: (2026)

A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
von: Wang, Fali, et al.
Veröffentlicht: (2024)

Group Fairness Meets the Black Box: Enabling Fair Algorithms on Closed LLMs via Post-Processing
von: Xian, Ruicheng, et al.
Veröffentlicht: (2025)

Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
von: Fu, Yu, et al.
Veröffentlicht: (2025)

CrossTune: Black-Box Few-Shot Classification with Label Enhancement
von: Luo, Danqing, et al.
Veröffentlicht: (2024)

A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
von: Cui, Yingqian, et al.
Veröffentlicht: (2024)

To trust or not to trust: Attention-based Trust Management for LLM Multi-Agent Systems
von: He, Pengfei, et al.
Veröffentlicht: (2025)

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
von: Huang, Xiaoke, et al.
Veröffentlicht: (2025)

Consistency Matters: Explore LLMs Consistency From a Black-Box Perspective
von: Zhao, Fufangchen, et al.
Veröffentlicht: (2024)

FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks
von: Chen, Bocheng, et al.
Veröffentlicht: (2024)

Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts
von: Zhang, Chiyu, et al.
Veröffentlicht: (2025)

Keeping an Eye on LLM Unlearning: The Hidden Risk and Remedy
von: Ren, Jie, et al.
Veröffentlicht: (2025)

Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey
von: Liu, Xiaoou, et al.
Veröffentlicht: (2025)

WEBSERV: A Full-Stack and RL-Ready Web Environment for Training Web Agents at Scale
von: Lu, Yuxuan, et al.
Veröffentlicht: (2025)

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
von: Li, Changhao, et al.
Veröffentlicht: (2024)

Beyond Black-Box Interventions: Latent Probing for Faithful Retrieval-Augmented Generation
von: Gao, Linfeng, et al.
Veröffentlicht: (2025)

Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond
von: Wei, Tianxin, et al.
Veröffentlicht: (2024)

Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation
von: Zhang, Jiankun, et al.
Veröffentlicht: (2025)

A Survey on Collaborating Small and Large Language Models for Performance, Cost-effectiveness, Cloud-edge Privacy, and Trustworthiness
von: Wang, Fali, et al.
Veröffentlicht: (2025)

Knowledge Distillation of Black-Box Large Language Models
von: Chen, Hongzhan, et al.
Veröffentlicht: (2024)

SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
von: Xu, Ran, et al.
Veröffentlicht: (2024)

BOSCH: Black-Box Binary Optimization for Short-Context Attention-Head Selection in LLMs
von: Ghaddar, Abbas, et al.
Veröffentlicht: (2026)

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
von: Chen, Hardy, et al.
Veröffentlicht: (2025)

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
von: Jin, Bowen, et al.
Veröffentlicht: (2024)