Saved in:
| Main Author: | Iourovitski, Dmitri |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.12043 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hide and Seek: Fingerprinting Large Language Models with Evolutionary Learning
by: Iourovitski, Dmitri, et al.
Published: (2024)
by: Iourovitski, Dmitri, et al.
Published: (2024)
LLMs May Perform MCQA by Selecting the Least Incorrect Option
by: Wang, Haochun, et al.
Published: (2024)
by: Wang, Haochun, et al.
Published: (2024)
Mitigating Selection Bias with Node Pruning and Auxiliary Options
by: Choi, Hyeong Kyu, et al.
Published: (2024)
by: Choi, Hyeong Kyu, et al.
Published: (2024)
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
by: Nair, Lakshmi, et al.
Published: (2025)
by: Nair, Lakshmi, et al.
Published: (2025)
Retention Score: Quantifying Jailbreak Risks for Vision Language Models
by: Li, Zaitang, et al.
Published: (2024)
by: Li, Zaitang, et al.
Published: (2024)
From Parameter Dynamics to Risk Scoring : Quantifying Sample-Level Safety Degradation in LLM Fine-tuning
by: Wang, Xiao, et al.
Published: (2026)
by: Wang, Xiao, et al.
Published: (2026)
Silicon Showdown: Performance, Efficiency, and Ecosystem Barriers in Consumer-Grade LLM Inference
by: Javat, Abdurrahman, et al.
Published: (2026)
by: Javat, Abdurrahman, et al.
Published: (2026)
OptionZero: Planning with Learned Options
by: Huang, Po-Wei, et al.
Published: (2025)
by: Huang, Po-Wei, et al.
Published: (2025)
LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning
by: Yao, Chang, et al.
Published: (2026)
by: Yao, Chang, et al.
Published: (2026)
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning
by: Shek, Chak Lam, et al.
Published: (2025)
by: Shek, Chak Lam, et al.
Published: (2025)
Data Compressibility Quantifies LLM Memorization
by: Huang, Yizhan, et al.
Published: (2025)
by: Huang, Yizhan, et al.
Published: (2025)
Pipeline for Verifying LLM-Generated Mathematical Solutions
by: Sazonova, Varvara, et al.
Published: (2026)
by: Sazonova, Varvara, et al.
Published: (2026)
From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations
by: Wang, Benlu, et al.
Published: (2025)
by: Wang, Benlu, et al.
Published: (2025)
Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing
by: Kim, JongWoo, et al.
Published: (2024)
by: Kim, JongWoo, et al.
Published: (2024)
Epistemic Reject Option Prediction
by: Franc, Vojtech, et al.
Published: (2025)
by: Franc, Vojtech, et al.
Published: (2025)
Crucible: Quantifying the Potential of Control Algorithms through LLM Agents
by: Jia, Lianchen, et al.
Published: (2025)
by: Jia, Lianchen, et al.
Published: (2025)
Quantifying Cross-Query Contradictions in Multi-Query LLM Reasoning
by: Salla, Rohit Kumar, et al.
Published: (2026)
by: Salla, Rohit Kumar, et al.
Published: (2026)
LLM-Guided Quantified SMT Solving over Uninterpreted Functions
by: Lv, Kunhang, et al.
Published: (2026)
by: Lv, Kunhang, et al.
Published: (2026)
GradingAttack: Exposing Security Vulnerabilities in LLM Based Educational Grading Agents
by: Li, Xueyi, et al.
Published: (2026)
by: Li, Xueyi, et al.
Published: (2026)
BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors
by: Li, Lingfeng, et al.
Published: (2026)
by: Li, Lingfeng, et al.
Published: (2026)
From Description to Score: Can LLMs Quantify Vulnerabilities?
by: Jafarikhah, Sima, et al.
Published: (2025)
by: Jafarikhah, Sima, et al.
Published: (2025)
Optimizing In-Context Demonstrations for LLM-based Automated Grading
by: Chu, Yucheng, et al.
Published: (2026)
by: Chu, Yucheng, et al.
Published: (2026)
Human-in-the-Loop LLM Grading for Handwritten Mathematics Assessments
by: Vanhoyweghen, Arne, et al.
Published: (2026)
by: Vanhoyweghen, Arne, et al.
Published: (2026)
Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages
by: Li, Zihao, et al.
Published: (2024)
by: Li, Zihao, et al.
Published: (2024)
Grading Scale Impact on LLM-as-a-Judge: Human-LLM Alignment Is Highest on 0-5 Grading Scale
by: Li, Weiyue, et al.
Published: (2026)
by: Li, Weiyue, et al.
Published: (2026)
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
by: Ye, Jiayi, et al.
Published: (2024)
by: Ye, Jiayi, et al.
Published: (2024)
Quantifying Frontier LLM Capabilities for Container Sandbox Escape
by: Marchand, Rahul, et al.
Published: (2026)
by: Marchand, Rahul, et al.
Published: (2026)
GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking
by: Deshpande, Darshan, et al.
Published: (2024)
by: Deshpande, Darshan, et al.
Published: (2024)
Confusion-Aware Rubric Optimization for LLM-based Automated Grading
by: Chu, Yucheng, et al.
Published: (2026)
by: Chu, Yucheng, et al.
Published: (2026)
Accelerate Scaling of LLM Finetuning via Quantifying the Coverage and Depth of Instruction Set
by: Wu, Chengwei, et al.
Published: (2025)
by: Wu, Chengwei, et al.
Published: (2025)
Ran Score: a LLM-based Evaluation Score for Radiology Report Generation
by: Zhang, Ran, et al.
Published: (2026)
by: Zhang, Ran, et al.
Published: (2026)
How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment
by: Li, Hang, et al.
Published: (2026)
by: Li, Hang, et al.
Published: (2026)
Speech Emotion Recognition via Entropy-Aware Score Selection
by: Chua, ChenYi, et al.
Published: (2025)
by: Chua, ChenYi, et al.
Published: (2025)
The Signal is in the Steps: Local Scoring for Reasoning Data Selection
by: Just, Hoang Anh, et al.
Published: (2025)
by: Just, Hoang Anh, et al.
Published: (2025)
OLLM: Options-based Large Language Models
by: Sharma, Shashank, et al.
Published: (2026)
by: Sharma, Shashank, et al.
Published: (2026)
Unveiling Options with Neural Decomposition
by: Alikhasi, Mahdi, et al.
Published: (2024)
by: Alikhasi, Mahdi, et al.
Published: (2024)
Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)
by: Kamat, Anand, et al.
Published: (2020)
RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)
by: Zhang, Ziqian, et al.
Published: (2026)
Quantifying Loss Aversion in Cyber Adversaries via LLM Analysis
by: Hans, Soham, et al.
Published: (2025)
by: Hans, Soham, et al.
Published: (2025)
Quantifying LLM Attention-Head Stability: Implications for Circuit Universality
by: Bali, Karan, et al.
Published: (2026)
by: Bali, Karan, et al.
Published: (2026)
Similar Items
-
Hide and Seek: Fingerprinting Large Language Models with Evolutionary Learning
by: Iourovitski, Dmitri, et al.
Published: (2024) -
LLMs May Perform MCQA by Selecting the Least Incorrect Option
by: Wang, Haochun, et al.
Published: (2024) -
Mitigating Selection Bias with Node Pruning and Auxiliary Options
by: Choi, Hyeong Kyu, et al.
Published: (2024) -
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
by: Nair, Lakshmi, et al.
Published: (2025) -
Retention Score: Quantifying Jailbreak Risks for Vision Language Models
by: Li, Zaitang, et al.
Published: (2024)