Saved in:
| Main Authors: | Lorenz, Tobias, Fritz, Mario |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.08889 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MIBP-Cert: Certified Training against Data Perturbations with Mixed-Integer Bilinear Programs
by: Lorenz, Tobias, et al.
Published: (2024)
by: Lorenz, Tobias, et al.
Published: (2024)
FullCert: Deterministic End-to-End Certification for Training and Inference of Neural Networks
by: Lorenz, Tobias, et al.
Published: (2024)
by: Lorenz, Tobias, et al.
Published: (2024)
Pixel-level Certified Explanations via Randomized Smoothing
by: Anani, Alaa, et al.
Published: (2025)
by: Anani, Alaa, et al.
Published: (2025)
Scalable Task Planning via Large Language Models and Structured World Representations
by: Pérez-Dattari, Rodrigo, et al.
Published: (2024)
by: Pérez-Dattari, Rodrigo, et al.
Published: (2024)
Certified Circuits: Stability Guarantees for Mechanistic Circuits
by: Anani, Alaa, et al.
Published: (2026)
by: Anani, Alaa, et al.
Published: (2026)
Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling
by: Feng, Mingqian, et al.
Published: (2026)
by: Feng, Mingqian, et al.
Published: (2026)
Fundamental Risks in the Current Deployment of General-Purpose AI Models: What Have We (Not) Learnt From Cybersecurity?
by: Fritz, Mario
Published: (2024)
by: Fritz, Mario
Published: (2024)
A Scalable Pipeline for Estimating Verb Frame Frequencies Using Large Language Models
by: Morgan, Adam M., et al.
Published: (2025)
by: Morgan, Adam M., et al.
Published: (2025)
Risk Structures: Towards Engineering Risk-aware Autonomous Systems
by: Gleirscher, Mario
Published: (2019)
by: Gleirscher, Mario
Published: (2019)
Post-training Large Language Models for Diverse High-Quality Responses
by: Chen, Yilei, et al.
Published: (2025)
by: Chen, Yilei, et al.
Published: (2025)
An Interpretable and Scalable Framework for Evaluating Large Language Models
by: Qu, Xinhao, et al.
Published: (2026)
by: Qu, Xinhao, et al.
Published: (2026)
Transforming Expert Knowledge into Scalable Ontology via Large Language Models
by: Itoku, Ikkei, et al.
Published: (2025)
by: Itoku, Ikkei, et al.
Published: (2025)
Towards Scalable Schema Mapping using Large Language Models
by: Buss, Christopher, et al.
Published: (2025)
by: Buss, Christopher, et al.
Published: (2025)
Estimating Tail Risks in Language Model Output Distributions
by: Angell, Rico, et al.
Published: (2026)
by: Angell, Rico, et al.
Published: (2026)
Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models
by: Yang, Haowei, et al.
Published: (2024)
by: Yang, Haowei, et al.
Published: (2024)
Risk-Averse Finetuning of Large Language Models
by: Chaudhary, Sapana, et al.
Published: (2025)
by: Chaudhary, Sapana, et al.
Published: (2025)
Risks of Cultural Erasure in Large Language Models
by: Qadri, Rida, et al.
Published: (2025)
by: Qadri, Rida, et al.
Published: (2025)
SAFER: Risk-Constrained Sample-then-Filter in Large Language Models
by: Wang, Qingni, et al.
Published: (2025)
by: Wang, Qingni, et al.
Published: (2025)
Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending
by: Sanz-Guerrero, Mario, et al.
Published: (2024)
by: Sanz-Guerrero, Mario, et al.
Published: (2024)
CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
by: Yu, Huimu, et al.
Published: (2024)
by: Yu, Huimu, et al.
Published: (2024)
Scalable and Explainable Learner-Video Interaction Prediction using Multimodal Large Language Models
by: Glandorf, Dominik, et al.
Published: (2026)
by: Glandorf, Dominik, et al.
Published: (2026)
Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models
by: Wang, Hui-Po, et al.
Published: (2024)
by: Wang, Hui-Po, et al.
Published: (2024)
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
by: Zhu, Lianghui, et al.
Published: (2023)
by: Zhu, Lianghui, et al.
Published: (2023)
GWT: Scalable Optimizer State Compression for Large Language Model Training
by: Wen, Ziqing, et al.
Published: (2025)
by: Wen, Ziqing, et al.
Published: (2025)
InverseScope: Scalable Activation Inversion for Interpreting Large Language Models
by: Luo, Yifan, et al.
Published: (2025)
by: Luo, Yifan, et al.
Published: (2025)
Selecting and Combining Large Language Models for Scalable Code Clone Detection
by: Chochlov, Muslim, et al.
Published: (2025)
by: Chochlov, Muslim, et al.
Published: (2025)
Error Detection and Correction for Interpretable Mathematics in Large Language Models
by: Yang, Yijin, et al.
Published: (2025)
by: Yang, Yijin, et al.
Published: (2025)
Language-Agnostic Suicidal Risk Detection Using Large Language Models
by: Kim, June-Woo, et al.
Published: (2025)
by: Kim, June-Woo, et al.
Published: (2025)
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models
by: AhmadiTeshnizi, Ali, et al.
Published: (2024)
by: AhmadiTeshnizi, Ali, et al.
Published: (2024)
Scalable Token-Level Hallucination Detection in Large Language Models
by: Min, Rui, et al.
Published: (2026)
by: Min, Rui, et al.
Published: (2026)
LANTERN: Scalable Distillation of Large Language Models for Job-Person Fit and Explanation
by: Fu, Zhoutong, et al.
Published: (2025)
by: Fu, Zhoutong, et al.
Published: (2025)
Metacognitive Myopia in Large Language Models
by: Scholten, Florian, et al.
Published: (2024)
by: Scholten, Florian, et al.
Published: (2024)
Exploring the Secondary Risks of Large Language Models
by: Chen, Jiawei, et al.
Published: (2025)
by: Chen, Jiawei, et al.
Published: (2025)
The Human-AI Hybrid Delphi Model: A Structured Framework for Context-Rich, Expert Consensus in Complex Domains
by: Speed, Cathy, et al.
Published: (2025)
by: Speed, Cathy, et al.
Published: (2025)
The Cost of Thinking: Increased Jailbreak Risk in Large Language Models
by: Yang, Fan
Published: (2025)
by: Yang, Fan
Published: (2025)
Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication
by: Chung, Philip, et al.
Published: (2024)
by: Chung, Philip, et al.
Published: (2024)
Understanding Privacy Risks of Embeddings Induced by Large Language Models
by: Zhu, Zhihao, et al.
Published: (2024)
by: Zhu, Zhihao, et al.
Published: (2024)
Semantic Structure in Large Language Model Embeddings
by: Kozlowski, Austin C., et al.
Published: (2025)
by: Kozlowski, Austin C., et al.
Published: (2025)
Structured Chemistry Reasoning with Large Language Models
by: Ouyang, Siru, et al.
Published: (2023)
by: Ouyang, Siru, et al.
Published: (2023)
Scientific Computing with Large Language Models
by: Culver, Christopher, et al.
Published: (2024)
by: Culver, Christopher, et al.
Published: (2024)
Similar Items
-
MIBP-Cert: Certified Training against Data Perturbations with Mixed-Integer Bilinear Programs
by: Lorenz, Tobias, et al.
Published: (2024) -
FullCert: Deterministic End-to-End Certification for Training and Inference of Neural Networks
by: Lorenz, Tobias, et al.
Published: (2024) -
Pixel-level Certified Explanations via Randomized Smoothing
by: Anani, Alaa, et al.
Published: (2025) -
Scalable Task Planning via Large Language Models and Structured World Representations
by: Pérez-Dattari, Rodrigo, et al.
Published: (2024) -
Certified Circuits: Stability Guarantees for Mechanistic Circuits
by: Anani, Alaa, et al.
Published: (2026)