Saved in:
| Main Authors: | Jawad, Hussein, Chenik, Yassine, Brunel, Nicolas J. -B. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.02044 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
by: Jawad, Huseein, et al.
Published: (2025)
by: Jawad, Huseein, et al.
Published: (2025)
ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering
by: Jawad, Hussein, et al.
Published: (2026)
by: Jawad, Hussein, et al.
Published: (2026)
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
by: Mehrotra, Anay, et al.
Published: (2023)
by: Mehrotra, Anay, et al.
Published: (2023)
Audit Me If You Can: Query-Efficient Active Fairness Auditing of Black-Box LLMs
by: Hartmann, David, et al.
Published: (2026)
by: Hartmann, David, et al.
Published: (2026)
Predicting the Performance of Black-box LLMs through Follow-up Queries
by: Sam, Dylan, et al.
Published: (2025)
by: Sam, Dylan, et al.
Published: (2025)
Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
by: Li, Changhao, et al.
Published: (2024)
by: Li, Changhao, et al.
Published: (2024)
PCS: Perceived Confidence Scoring of Black Box LLMs with Metamorphic Relations
by: Salimian, Sina, et al.
Published: (2025)
by: Salimian, Sina, et al.
Published: (2025)
SafePassage: High-Fidelity Information Extraction with Black Box LLMs
by: Barrow, Joe, et al.
Published: (2025)
by: Barrow, Joe, et al.
Published: (2025)
In-Context Explainers: Harnessing LLMs for Explaining Black Box Models
by: Kroeger, Nicholas, et al.
Published: (2023)
by: Kroeger, Nicholas, et al.
Published: (2023)
Bias Similarity Measurement: A Black-Box Audit of Fairness Across LLMs
by: Jeong, Hyejun, et al.
Published: (2024)
by: Jeong, Hyejun, et al.
Published: (2024)
How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
by: Asawa, Parth, et al.
Published: (2025)
by: Asawa, Parth, et al.
Published: (2025)
FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
by: Sawczyn, Albert, et al.
Published: (2025)
by: Sawczyn, Albert, et al.
Published: (2025)
PAL: Proxy-Guided Black-Box Attack on Large Language Models
by: Sitawarin, Chawin, et al.
Published: (2024)
by: Sitawarin, Chawin, et al.
Published: (2024)
Does It Make Sense to Explain a Black Box With Another Black Box?
by: Delaunay, Julien, et al.
Published: (2024)
by: Delaunay, Julien, et al.
Published: (2024)
Group Fairness Meets the Black Box: Enabling Fair Algorithms on Closed LLMs via Post-Processing
by: Xian, Ruicheng, et al.
Published: (2025)
by: Xian, Ruicheng, et al.
Published: (2025)
Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
by: Kaneko, Masahiro, et al.
Published: (2025)
by: Kaneko, Masahiro, et al.
Published: (2025)
Kov: Transferable and Naturalistic Black-Box LLM Attacks using Markov Decision Processes and Tree Search
by: Moss, Robert J.
Published: (2024)
by: Moss, Robert J.
Published: (2024)
Universal Response and Emergence of Induction in LLMs
by: Luick, Niclas
Published: (2024)
by: Luick, Niclas
Published: (2024)
Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Sampling
by: Shi, Yuhui, et al.
Published: (2024)
by: Shi, Yuhui, et al.
Published: (2024)
Bounded Behavioral Indistinguishability for Black-Box LLM Distillation
by: Hasan, Munawar
Published: (2026)
by: Hasan, Munawar
Published: (2026)
Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting
by: Zhao, Xiaohan, et al.
Published: (2026)
by: Zhao, Xiaohan, et al.
Published: (2026)
Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models
by: Siddiqui, S M Tahmid, et al.
Published: (2026)
by: Siddiqui, S M Tahmid, et al.
Published: (2026)
ACING: Actor-Critic for Instruction Learning in Black-Box LLMs
by: Kharrat, Salma, et al.
Published: (2024)
by: Kharrat, Salma, et al.
Published: (2024)
An Evaluation of Explanation Methods for Black-Box Detectors of Machine-Generated Text
by: Schoenegger, Loris, et al.
Published: (2024)
by: Schoenegger, Loris, et al.
Published: (2024)
Hierarchical Text Classification Using Black Box Large Language Models
by: Yoshimura, Kosuke, et al.
Published: (2025)
by: Yoshimura, Kosuke, et al.
Published: (2025)
SODA: Semi On-Policy Black-Box Distillation for Large Language Models
by: Chen, Xiwen, et al.
Published: (2026)
by: Chen, Xiwen, et al.
Published: (2026)
Unlocking the Black Box of Latent Reasoning: An Interpretability-Guided Approach to Intervention
by: Chang, Shuochen, et al.
Published: (2026)
by: Chang, Shuochen, et al.
Published: (2026)
A Watermark for Black-Box Language Models
by: Bahri, Dara, et al.
Published: (2024)
by: Bahri, Dara, et al.
Published: (2024)
Deep Learning-based Method for Expressing Knowledge Boundary of Black-Box LLM
by: Sheng, Haotian, et al.
Published: (2026)
by: Sheng, Haotian, et al.
Published: (2026)
Topic Modelling Black Box Optimization
by: Akramov, Roman, et al.
Published: (2025)
by: Akramov, Roman, et al.
Published: (2025)
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
by: Wei, Zhepei, et al.
Published: (2026)
by: Wei, Zhepei, et al.
Published: (2026)
Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs
by: Mendu, Sai Krishna, et al.
Published: (2025)
by: Mendu, Sai Krishna, et al.
Published: (2025)
Density estimation with LLMs: a geometric investigation of in-context learning trajectories
by: Liu, Toni J. B., et al.
Published: (2024)
by: Liu, Toni J. B., et al.
Published: (2024)
Training Deliberative Monitors for Black-Box Scheming Detection
by: Sinha, Aditya, et al.
Published: (2026)
by: Sinha, Aditya, et al.
Published: (2026)
Does Unlearning Truly Unlearn? A Black Box Evaluation of LLM Unlearning Methods
by: Doshi, Jai, et al.
Published: (2024)
by: Doshi, Jai, et al.
Published: (2024)
TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
by: Liu, Yilun, et al.
Published: (2026)
by: Liu, Yilun, et al.
Published: (2026)
Emergent Response Planning in LLMs
by: Dong, Zhichen, et al.
Published: (2025)
by: Dong, Zhichen, et al.
Published: (2025)
Towards Modular LLMs by Building and Reusing a Library of LoRAs
by: Ostapenko, Oleksiy, et al.
Published: (2024)
by: Ostapenko, Oleksiy, et al.
Published: (2024)
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers
by: Bouchard, Dylan, et al.
Published: (2025)
by: Bouchard, Dylan, et al.
Published: (2025)
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
by: Zhuang, Yuchen, et al.
Published: (2024)
by: Zhuang, Yuchen, et al.
Published: (2024)
Similar Items
-
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
by: Jawad, Huseein, et al.
Published: (2025) -
ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering
by: Jawad, Hussein, et al.
Published: (2026) -
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
by: Mehrotra, Anay, et al.
Published: (2023) -
Audit Me If You Can: Query-Efficient Active Fairness Auditing of Black-Box LLMs
by: Hartmann, David, et al.
Published: (2026) -
Predicting the Performance of Black-box LLMs through Follow-up Queries
by: Sam, Dylan, et al.
Published: (2025)