Saved in:
| Main Authors: | Parmar, Maulik, Narayan, Apurva |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2204.02058 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
by: Sun, Qian, et al.
Published: (2024)
by: Sun, Qian, et al.
Published: (2024)
Exploring Prompt-Based Methods for Zero-Shot Hypernym Prediction with Large Language Models
by: Tikhomirov, Mikhail, et al.
Published: (2024)
by: Tikhomirov, Mikhail, et al.
Published: (2024)
Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning
by: Zhang, Mingtian, et al.
Published: (2024)
by: Zhang, Mingtian, et al.
Published: (2024)
A Survey of Calibration Process for Black-Box LLMs
by: Xie, Liangru, et al.
Published: (2024)
by: Xie, Liangru, et al.
Published: (2024)
From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy
by: Zabolotnii, Serhii, et al.
Published: (2026)
by: Zabolotnii, Serhii, et al.
Published: (2026)
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers
by: Bouchard, Dylan, et al.
Published: (2025)
by: Bouchard, Dylan, et al.
Published: (2025)
Black-Box On-Policy Distillation of Large Language Models
by: Ye, Tianzhu, et al.
Published: (2025)
by: Ye, Tianzhu, et al.
Published: (2025)
Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate
by: Moslonka, Charles, et al.
Published: (2025)
by: Moslonka, Charles, et al.
Published: (2025)
From Black Boxes to Conversations: Incorporating XAI in a Conversational Agent
by: Nguyen, Van Bach, et al.
Published: (2022)
by: Nguyen, Van Bach, et al.
Published: (2022)
Black-Box Hallucination Detection via Consistency Under the Uncertain Expression
by: Joo, Seongho, et al.
Published: (2025)
by: Joo, Seongho, et al.
Published: (2025)
SwissNYF: Tool Grounded LLM Agents for Black Box Setting
by: Kumar, Somnath Sendhil, et al.
Published: (2024)
by: Kumar, Somnath Sendhil, et al.
Published: (2024)
AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction
by: Shi, Ruijie, et al.
Published: (2026)
by: Shi, Ruijie, et al.
Published: (2026)
High Risk of Political Bias in Black Box Emotion Inference Models
by: Plisiecki, Hubert, et al.
Published: (2024)
by: Plisiecki, Hubert, et al.
Published: (2024)
Topic Modelling Black Box Optimization
by: Akramov, Roman, et al.
Published: (2025)
by: Akramov, Roman, et al.
Published: (2025)
You've Changed: Detecting Modification of Black-Box Large Language Models
by: Dima, Alden, et al.
Published: (2025)
by: Dima, Alden, et al.
Published: (2025)
Inference-Aware Prompt Optimization for Aligning Black-Box Large Language Models
by: Mahmud, Saaduddin, et al.
Published: (2025)
by: Mahmud, Saaduddin, et al.
Published: (2025)
PropNet: a White-Box and Human-Like Network for Sentence Representation
by: Yang, Fei
Published: (2025)
by: Yang, Fei
Published: (2025)
Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection
by: Xue, Yihao, et al.
Published: (2025)
by: Xue, Yihao, et al.
Published: (2025)
Training Deliberative Monitors for Black-Box Scheming Detection
by: Sinha, Aditya, et al.
Published: (2026)
by: Sinha, Aditya, et al.
Published: (2026)
Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement
by: Kersting, Nicholas S., et al.
Published: (2026)
by: Kersting, Nicholas S., et al.
Published: (2026)
Opening the Black Box: A Survey on the Mechanisms of Multi-Step Reasoning in Large Language Models
by: Pan, Liangming, et al.
Published: (2026)
by: Pan, Liangming, et al.
Published: (2026)
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds
by: Wang, Lei, et al.
Published: (2024)
by: Wang, Lei, et al.
Published: (2024)
All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks
by: Takemoto, Kazuhiro
Published: (2024)
by: Takemoto, Kazuhiro
Published: (2024)
Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs
by: Akbar-Tajari, Mohammad, et al.
Published: (2025)
by: Akbar-Tajari, Mohammad, et al.
Published: (2025)
Effective Black Box Testing of Sentiment Analysis Classification Networks
by: Karbasizadeh, Parsa, et al.
Published: (2024)
by: Karbasizadeh, Parsa, et al.
Published: (2024)
Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
by: Li, Changhao, et al.
Published: (2024)
by: Li, Changhao, et al.
Published: (2024)
In-Context Explainers: Harnessing LLMs for Explaining Black Box Models
by: Kroeger, Nicholas, et al.
Published: (2023)
by: Kroeger, Nicholas, et al.
Published: (2023)
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
by: Zhuang, Yuchen, et al.
Published: (2024)
by: Zhuang, Yuchen, et al.
Published: (2024)
From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms
by: Jiang, Zhaokun, et al.
Published: (2025)
by: Jiang, Zhaokun, et al.
Published: (2025)
OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization
by: Sun, Yiyou, et al.
Published: (2025)
by: Sun, Yiyou, et al.
Published: (2025)
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
by: Liu, Han, et al.
Published: (2024)
by: Liu, Han, et al.
Published: (2024)
Bias Similarity Measurement: A Black-Box Audit of Fairness Across LLMs
by: Jeong, Hyejun, et al.
Published: (2024)
by: Jeong, Hyejun, et al.
Published: (2024)
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration
by: Xu, Ran, et al.
Published: (2025)
by: Xu, Ran, et al.
Published: (2025)
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
by: Sun, Haotian, et al.
Published: (2024)
by: Sun, Haotian, et al.
Published: (2024)
Large Language Model Confidence Estimation via Black-Box Access
by: Pedapati, Tejaswini, et al.
Published: (2024)
by: Pedapati, Tejaswini, et al.
Published: (2024)
Effective and Efficient Jailbreaks of Black-Box LLMs with Cross-Behavior Attacks
by: Gohil, Vasudev
Published: (2025)
by: Gohil, Vasudev
Published: (2025)
Box Maze: A Process-Control Architecture for Reliable LLM Reasoning
by: Qiang, Zou
Published: (2026)
by: Qiang, Zou
Published: (2026)
Conformal Sets in Multiple-Choice Question Answering under Black-Box Settings with Provable Coverage Guarantees
by: Yang, Guang, et al.
Published: (2025)
by: Yang, Guang, et al.
Published: (2025)
SeSE: Black-Box Uncertainty Quantification for Large Language Models Based on Structural Information Theory
by: Zhao, Xingtao, et al.
Published: (2025)
by: Zhao, Xingtao, et al.
Published: (2025)
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
by: Mehrotra, Anay, et al.
Published: (2023)
by: Mehrotra, Anay, et al.
Published: (2023)
Similar Items
-
On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
by: Sun, Qian, et al.
Published: (2024) -
Exploring Prompt-Based Methods for Zero-Shot Hypernym Prediction with Large Language Models
by: Tikhomirov, Mikhail, et al.
Published: (2024) -
Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning
by: Zhang, Mingtian, et al.
Published: (2024) -
A Survey of Calibration Process for Black-Box LLMs
by: Xie, Liangru, et al.
Published: (2024) -
From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy
by: Zabolotnii, Serhii, et al.
Published: (2026)