:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Sisate, Colin, Goldfinch, Alistair, Waterstone, Vincent, Kingsley, Sebastian, Blackthorn, Mariana
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Machine Learning Artificial Intelligence Computation and Language
Online-Zugang:	https://arxiv.org/abs/2502.00048
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

User-LLM: Efficient LLM Contextualization with User Embeddings
von: Ning, Lin, et al.
Veröffentlicht: (2024)

A Case Study on Contextual Machine Translation in a Professional Scenario of Subtitling
von: Vincent, Sebastian, et al.
Veröffentlicht: (2024)

Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings
von: Xu, Austin, et al.
Veröffentlicht: (2025)

Contextual Drag: How Errors in the Context Affect LLM Reasoning
von: Cheng, Yun, et al.
Veröffentlicht: (2026)

ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
von: Akhauri, Yash, et al.
Veröffentlicht: (2024)

RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
von: Kagaya, Tomoyuki, et al.
Veröffentlicht: (2024)

Efficient Contextual LLM Cascades through Budget-Constrained Policy Learning
von: Zhang, Xuechen, et al.
Veröffentlicht: (2024)

Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
von: Chuang, Yung-Sung, et al.
Veröffentlicht: (2024)

Code Comprehension then Auditing for Unsupervised LLM Evaluation
von: Patel, Bhrij, et al.
Veröffentlicht: (2024)

Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization
von: Sun, Jingyi, et al.
Veröffentlicht: (2026)

Position: The Most Expensive Part of an LLM should be its Training Data
von: Kandpal, Nikhil, et al.
Veröffentlicht: (2025)

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning
von: Zhang, Yifan, et al.
Veröffentlicht: (2025)

Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient
von: Li, Changming, et al.
Veröffentlicht: (2026)

LLM-Guided Indoor Navigation with Multimodal Map Understanding
von: Coffrini, Alberto, et al.
Veröffentlicht: (2025)

TransformLLM: Adapting Large Language Models via LLM-Transformed Reading Comprehension Text
von: Arbel, Iftach, et al.
Veröffentlicht: (2024)

GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning
von: Yang, Ningyuan, et al.
Veröffentlicht: (2026)

Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
von: Jung, Jaehun, et al.
Veröffentlicht: (2025)

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
von: Cheng, Wenhua, et al.
Veröffentlicht: (2023)

PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents
von: Gu, Zhuohan, et al.
Veröffentlicht: (2026)

Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
von: Liu, Yibai, et al.
Veröffentlicht: (2025)

Token Trails: Navigating Contextual Depths in Conversational AI with ChatLLM
von: Kowsher, Md., et al.
Veröffentlicht: (2024)

Towards Optimizing the Costs of LLM Usage
von: Shekhar, Shivanshu, et al.
Veröffentlicht: (2024)

From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models
von: Zhu, Mingkang, et al.
Veröffentlicht: (2025)

GTPO: Stabilizing Group Relative Policy Optimization via Gradient and Entropy Control
von: Simoni, Marco, et al.
Veröffentlicht: (2025)

LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
von: Shimabucoro, Luísa, et al.
Veröffentlicht: (2024)

Where Did It Go Wrong? Attributing Undesirable LLM Behaviors via Representation Gradient Tracing
von: Li, Zhe, et al.
Veröffentlicht: (2025)

CCRS: A Zero-Shot LLM-as-a-Judge Framework for Comprehensive RAG Evaluation
von: Muhamed, Aashiq
Veröffentlicht: (2025)

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
von: Yao, Jiarui, et al.
Veröffentlicht: (2025)

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
von: Su, Zhenpeng, et al.
Veröffentlicht: (2025)

Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models
von: Li, Chengao, et al.
Veröffentlicht: (2025)

A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications
von: Xiao, Wenyi, et al.
Veröffentlicht: (2024)

Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
von: Collot, Stephane, et al.
Veröffentlicht: (2025)

MPO: Boosting LLM Agents with Meta Plan Optimization
von: Xiong, Weimin, et al.
Veröffentlicht: (2025)

Teaching LLMs How to Learn with Contextual Fine-Tuning
von: Choi, Younwoo, et al.
Veröffentlicht: (2025)

ReDit: Reward Dithering for Improved LLM Policy Optimization
von: Wei, Chenxing, et al.
Veröffentlicht: (2025)

Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning
von: Fan, Chongyu, et al.
Veröffentlicht: (2024)

Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
von: Song, Yifan, et al.
Veröffentlicht: (2024)

Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration
von: Chen, Zhipeng, et al.
Veröffentlicht: (2026)

RCStat: A Statistical Framework for using Relative Contextualization in Transformers
von: Mahapatra, Debabrata, et al.
Veröffentlicht: (2025)

Less is More: Local Intrinsic Dimensions of Contextual Language Models
von: Ruppik, Benjamin Matthias, et al.
Veröffentlicht: (2025)