| Main Authors: | Katuka, Gloria Ashiya, Gain, Alexander, Yu, Yen-Yun |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.00602 |
Similar Items
Automatic Prompt Selection for Large Language Models
by: Do, Viet-Tung, et al.
Published: (2024)
Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation
by: Vu, Tu, et al.
Published: (2024)
Automatically Interpreting Millions of Features in Large Language Models
by: Paulo, Gonçalo, et al.
Published: (2024)
How to Train Long-Context Language Models (Effectively)
by: Gao, Tianyu, et al.
Published: (2024)
Investigating Symbolic Capabilities of Large Language Models
by: Dave, Neisarg, et al.
Published: (2024)
Investigating Layer Importance in Large Language Models
by: Zhang, Yang, et al.
Published: (2024)
Comparison of Scoring Rationales Between Large Language Models and Human Raters
by: Hua, Haowei, et al.
Published: (2025)
Generalizing Large Language Model Usability Across Resource-Constrained
by: Tsai, Yun-Da
Published: (2025)
On Calibration of Large Language Models: From Response To Capability
by: Yang, Sin-Han, et al.
Published: (2026)
Policy Improvement using Language Feedback Models
by: Zhong, Victor, et al.
Published: (2024)
LongReward: Improving Long-context Large Language Models with AI Feedback
by: Zhang, Jiajie, et al.
Published: (2024)
GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback
by: Abdelaal, Mohamed, et al.
Published: (2025)
From Distributional to Overton Pluralism: Investigating Large Language Model Alignment
by: Lake, Thom, et al.
Published: (2024)
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
by: Dong, Guanting, et al.
Published: (2024)
Benchmarking Large Language Model Uncertainty for Prompt Optimization
by: Guo, Pei-Fu, et al.
Published: (2024)
QA-Calibration of Language Model Confidence Scores
by: Manggala, Putra, et al.
Published: (2024)
Feedback Loops With Language Models Drive In-Context Reward Hacking
by: Pan, Alexander, et al.
Published: (2024)
Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models
by: Solway, Alec
Published: (2024)
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models
by: Gao, Shangqian, et al.
Published: (2024)
Calibrated Large Language Models for Binary Question Answering
by: Giovannotti, Patrizio, et al.
Published: (2024)
Discursive Circuits: How Do Language Models Understand Discourse Relations?
by: Miao, Yisong, et al.
Published: (2025)
Large Language Models in the Task of Automatic Validation of Text Classifier Predictions
by: Tsymbalov, Aleksandr, et al.
Published: (2025)
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation
by: Ranaldi, Federico, et al.
Published: (2024)
Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection
by: Das, Sourya Dipta, et al.
Published: (2024)
A Critical Evaluation of AI Feedback for Aligning Large Language Models
by: Sharma, Archit, et al.
Published: (2024)
RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring
by: Mohammadkhani, Ali Ghiasvand
Published: (2024)
Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contrastive Scoring
by: Hua, Peichun, et al.
Published: (2025)
Entity Matching using Large Language Models
by: Peeters, Ralph, et al.
Published: (2023)
Towards Trustable Language Models: Investigating Information Quality of Large Language Models
by: Rejeleene, Rick, et al.
Published: (2024)
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
by: Yu, Xiaodong, et al.
Published: (2023)
RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents
by: Chen, Weizhe, et al.
Published: (2024)
Self-ensemble: Mitigating Confidence Mis-calibration for Large Language Models
by: Xu, Zicheng, et al.
Published: (2025)
LatentBreak: Jailbreaking Large Language Models through Latent Space Feedback
by: Mura, Raffaele, et al.
Published: (2025)
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models
by: Bansal, Hritik, et al.
Published: (2023)
Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation
by: Phan, Buu, et al.
Published: (2025)
Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors
by: Chaszczewicz, Alicja, et al.
Published: (2024)
UltraFeedback: Boosting Language Models with Scaled AI Feedback
by: Cui, Ganqu, et al.
Published: (2023)
FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing
by: Smith, James Seale, et al.
Published: (2025)
Training Language Models with Language Feedback at Scale
by: Scheurer, Jérémy, et al.
Published: (2023)
Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models
by: Hengle, Amey, et al.
Published: (2024)