:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jawad, Hussein, Chenik, Yassine, Brunel, Nicolas J. -B.
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2406.02044
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
by: Jawad, Huseein, et al.
Published: (2025)

ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering
by: Jawad, Hussein, et al.
Published: (2026)

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
by: Mehrotra, Anay, et al.
Published: (2023)

Audit Me If You Can: Query-Efficient Active Fairness Auditing of Black-Box LLMs
by: Hartmann, David, et al.
Published: (2026)

Predicting the Performance of Black-box LLMs through Follow-up Queries
by: Sam, Dylan, et al.
Published: (2025)

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
by: Li, Changhao, et al.
Published: (2024)

PCS: Perceived Confidence Scoring of Black Box LLMs with Metamorphic Relations
by: Salimian, Sina, et al.
Published: (2025)

SafePassage: High-Fidelity Information Extraction with Black Box LLMs
by: Barrow, Joe, et al.
Published: (2025)

In-Context Explainers: Harnessing LLMs for Explaining Black Box Models
by: Kroeger, Nicholas, et al.
Published: (2023)

Bias Similarity Measurement: A Black-Box Audit of Fairness Across LLMs
by: Jeong, Hyejun, et al.
Published: (2024)

How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
by: Asawa, Parth, et al.
Published: (2025)

FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
by: Sawczyn, Albert, et al.
Published: (2025)

PAL: Proxy-Guided Black-Box Attack on Large Language Models
by: Sitawarin, Chawin, et al.
Published: (2024)

Does It Make Sense to Explain a Black Box With Another Black Box?
by: Delaunay, Julien, et al.
Published: (2024)

Group Fairness Meets the Black Box: Enabling Fair Algorithms on Closed LLMs via Post-Processing
by: Xian, Ruicheng, et al.
Published: (2025)

Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
by: Kaneko, Masahiro, et al.
Published: (2025)

Kov: Transferable and Naturalistic Black-Box LLM Attacks using Markov Decision Processes and Tree Search
by: Moss, Robert J.
Published: (2024)

Universal Response and Emergence of Induction in LLMs
by: Luick, Niclas
Published: (2024)

Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Sampling
by: Shi, Yuhui, et al.
Published: (2024)

Bounded Behavioral Indistinguishability for Black-Box LLM Distillation
by: Hasan, Munawar
Published: (2026)

Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting
by: Zhao, Xiaohan, et al.
Published: (2026)

Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models
by: Siddiqui, S M Tahmid, et al.
Published: (2026)

ACING: Actor-Critic for Instruction Learning in Black-Box LLMs
by: Kharrat, Salma, et al.
Published: (2024)

An Evaluation of Explanation Methods for Black-Box Detectors of Machine-Generated Text
by: Schoenegger, Loris, et al.
Published: (2024)

Hierarchical Text Classification Using Black Box Large Language Models
by: Yoshimura, Kosuke, et al.
Published: (2025)

SODA: Semi On-Policy Black-Box Distillation for Large Language Models
by: Chen, Xiwen, et al.
Published: (2026)

Unlocking the Black Box of Latent Reasoning: An Interpretability-Guided Approach to Intervention
by: Chang, Shuochen, et al.
Published: (2026)

A Watermark for Black-Box Language Models
by: Bahri, Dara, et al.
Published: (2024)

Deep Learning-based Method for Expressing Knowledge Boundary of Black-Box LLM
by: Sheng, Haotian, et al.
Published: (2026)

Topic Modelling Black Box Optimization
by: Akramov, Roman, et al.
Published: (2025)

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
by: Wei, Zhepei, et al.
Published: (2026)

Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs
by: Mendu, Sai Krishna, et al.
Published: (2025)

Density estimation with LLMs: a geometric investigation of in-context learning trajectories
by: Liu, Toni J. B., et al.
Published: (2024)

Training Deliberative Monitors for Black-Box Scheming Detection
by: Sinha, Aditya, et al.
Published: (2026)

Does Unlearning Truly Unlearn? A Black Box Evaluation of LLM Unlearning Methods
by: Doshi, Jai, et al.
Published: (2024)

TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
by: Liu, Yilun, et al.
Published: (2026)

Emergent Response Planning in LLMs
by: Dong, Zhichen, et al.
Published: (2025)

Towards Modular LLMs by Building and Reusing a Library of LoRAs
by: Ostapenko, Oleksiy, et al.
Published: (2024)

Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers
by: Bouchard, Dylan, et al.
Published: (2025)

HYDRA: Model Factorization Framework for Black-Box LLM Personalization
by: Zhuang, Yuchen, et al.
Published: (2024)