Library Catalog

Bibliographic Details
Main Author:	Iourovitski, Dmitri
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.12043

Similar Items

Hide and Seek: Fingerprinting Large Language Models with Evolutionary Learning
by: Iourovitski, Dmitri, et al.
Published: (2024)

LLMs May Perform MCQA by Selecting the Least Incorrect Option
by: Wang, Haochun, et al.
Published: (2024)

Mitigating Selection Bias with Node Pruning and Auxiliary Options
by: Choi, Hyeong Kyu, et al.
Published: (2024)

Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
by: Nair, Lakshmi, et al.
Published: (2025)

Retention Score: Quantifying Jailbreak Risks for Vision Language Models
by: Li, Zaitang, et al.
Published: (2024)

From Parameter Dynamics to Risk Scoring: Quantifying Sample-Level Safety Degradation in LLM Fine-tuning
by: Wang, Xiao, et al.
Published: (2026)

Silicon Showdown: Performance, Efficiency, and Ecosystem Barriers in Consumer-Grade LLM Inference
by: Javat, Abdurrahman, et al.
Published: (2026)

OptionZero: Planning with Learned Options
by: Huang, Po-Wei, et al.
Published: (2025)

LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning
by: Yao, Chang, et al.
Published: (2026)

Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning
by: Shek, Chak Lam, et al.
Published: (2025)

Data Compressibility Quantifies LLM Memorization
by: Huang, Yizhan, et al.
Published: (2025)

Pipeline for Verifying LLM-Generated Mathematical Solutions
by: Sazonova, Varvara, et al.
Published: (2026)

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations
by: Wang, Benlu, et al.
Published: (2025)

Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing
by: Kim, JongWoo, et al.
Published: (2024)

Epistemic Reject Option Prediction
by: Franc, Vojtech, et al.
Published: (2025)

Crucible: Quantifying the Potential of Control Algorithms through LLM Agents
by: Jia, Lianchen, et al.
Published: (2025)

Quantifying Cross-Query Contradictions in Multi-Query LLM Reasoning
by: Salla, Rohit Kumar, et al.
Published: (2026)

LLM-Guided Quantified SMT Solving over Uninterpreted Functions
by: Lv, Kunhang, et al.
Published: (2026)

GradingAttack: Exposing Security Vulnerabilities in LLM Based Educational Grading Agents
by: Li, Xueyi, et al.
Published: (2026)

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors
by: Li, Lingfeng, et al.
Published: (2026)

From Description to Score: Can LLMs Quantify Vulnerabilities?
by: Jafarikhah, Sima, et al.
Published: (2025)

Optimizing In-Context Demonstrations for LLM-based Automated Grading
by: Chu, Yucheng, et al.
Published: (2026)

Human-in-the-Loop LLM Grading for Handwritten Mathematics Assessments
by: Vanhoyweghen, Arne, et al.
Published: (2026)

Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages
by: Li, Zihao, et al.
Published: (2024)

Grading Scale Impact on LLM-as-a-Judge: Human-LLM Alignment Is Highest on 0-5 Grading Scale
by: Li, Weiyue, et al.
Published: (2026)

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
by: Ye, Jiayi, et al.
Published: (2024)

Quantifying Frontier LLM Capabilities for Container Sandbox Escape
by: Marchand, Rahul, et al.
Published: (2026)

GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking
by: Deshpande, Darshan, et al.
Published: (2024)

Confusion-Aware Rubric Optimization for LLM-based Automated Grading
by: Chu, Yucheng, et al.
Published: (2026)

Accelerate Scaling of LLM Finetuning via Quantifying the Coverage and Depth of Instruction Set
by: Wu, Chengwei, et al.
Published: (2025)

Ran Score: a LLM-based Evaluation Score for Radiology Report Generation
by: Zhang, Ran, et al.
Published: (2026)

How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment
by: Li, Hang, et al.
Published: (2026)

Speech Emotion Recognition via Entropy-Aware Score Selection
by: Chua, ChenYi, et al.
Published: (2025)

The Signal is in the Steps: Local Scoring for Reasoning Data Selection
by: Just, Hoang Anh, et al.
Published: (2025)

OLLM: Options-based Large Language Models
by: Sharma, Shashank, et al.
Published: (2026)

Unveiling Options with Neural Decomposition
by: Alikhasi, Mahdi, et al.
Published: (2024)

Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)

RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)

Quantifying Loss Aversion in Cyber Adversaries via LLM Analysis
by: Hans, Soham, et al.
Published: (2025)

Quantifying LLM Attention-Head Stability: Implications for Circuit Universality
by: Bali, Karan, et al.
Published: (2026)