:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Parmar, Maulik, Narayan, Apurva
Format:	Preprint
Published:	2022
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2204.02058
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
by: Sun, Qian, et al.
Published: (2024)

Exploring Prompt-Based Methods for Zero-Shot Hypernym Prediction with Large Language Models
by: Tikhomirov, Mikhail, et al.
Published: (2024)

Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning
by: Zhang, Mingtian, et al.
Published: (2024)

A Survey of Calibration Process for Black-Box LLMs
by: Xie, Liangru, et al.
Published: (2024)

From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy
by: Zabolotnii, Serhii, et al.
Published: (2026)

Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers
by: Bouchard, Dylan, et al.
Published: (2025)

Black-Box On-Policy Distillation of Large Language Models
by: Ye, Tianzhu, et al.
Published: (2025)

Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate
by: Moslonka, Charles, et al.
Published: (2025)

From Black Boxes to Conversations: Incorporating XAI in a Conversational Agent
by: Nguyen, Van Bach, et al.
Published: (2022)

Black-Box Hallucination Detection via Consistency Under the Uncertain Expression
by: Joo, Seongho, et al.
Published: (2025)

SwissNYF: Tool Grounded LLM Agents for Black Box Setting
by: Kumar, Somnath Sendhil, et al.
Published: (2024)

AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction
by: Shi, Ruijie, et al.
Published: (2026)

High Risk of Political Bias in Black Box Emotion Inference Models
by: Plisiecki, Hubert, et al.
Published: (2024)

Topic Modelling Black Box Optimization
by: Akramov, Roman, et al.
Published: (2025)

You've Changed: Detecting Modification of Black-Box Large Language Models
by: Dima, Alden, et al.
Published: (2025)

Inference-Aware Prompt Optimization for Aligning Black-Box Large Language Models
by: Mahmud, Saaduddin, et al.
Published: (2025)

PropNet: a White-Box and Human-Like Network for Sentence Representation
by: Yang, Fei
Published: (2025)

Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection
by: Xue, Yihao, et al.
Published: (2025)

Training Deliberative Monitors for Black-Box Scheming Detection
by: Sinha, Aditya, et al.
Published: (2026)

Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement
by: Kersting, Nicholas S., et al.
Published: (2026)

Opening the Black Box: A Survey on the Mechanisms of Multi-Step Reasoning in Large Language Models
by: Pan, Liangming, et al.
Published: (2026)

CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds
by: Wang, Lei, et al.
Published: (2024)

All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks
by: Takemoto, Kazuhiro
Published: (2024)

Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs
by: Akbar-Tajari, Mohammad, et al.
Published: (2025)

Effective Black Box Testing of Sentiment Analysis Classification Networks
by: Karbasizadeh, Parsa, et al.
Published: (2024)

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
by: Li, Changhao, et al.
Published: (2024)

In-Context Explainers: Harnessing LLMs for Explaining Black Box Models
by: Kroeger, Nicholas, et al.
Published: (2023)

HYDRA: Model Factorization Framework for Black-Box LLM Personalization
by: Zhuang, Yuchen, et al.
Published: (2024)

From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms
by: Jiang, Zhaokun, et al.
Published: (2025)

OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization
by: Sun, Yiyou, et al.
Published: (2025)

HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
by: Liu, Han, et al.
Published: (2024)

Bias Similarity Measurement: A Black-Box Audit of Fairness Across LLMs
by: Jeong, Hyejun, et al.
Published: (2024)

Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration
by: Xu, Ran, et al.
Published: (2025)

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
by: Sun, Haotian, et al.
Published: (2024)

Large Language Model Confidence Estimation via Black-Box Access
by: Pedapati, Tejaswini, et al.
Published: (2024)

Effective and Efficient Jailbreaks of Black-Box LLMs with Cross-Behavior Attacks
by: Gohil, Vasudev
Published: (2025)

Box Maze: A Process-Control Architecture for Reliable LLM Reasoning
by: Qiang, Zou
Published: (2026)

Conformal Sets in Multiple-Choice Question Answering under Black-Box Settings with Provable Coverage Guarantees
by: Yang, Guang, et al.
Published: (2025)

SeSE: Black-Box Uncertainty Quantification for Large Language Models Based on Structural Information Theory
by: Zhao, Xingtao, et al.
Published: (2025)

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
by: Mehrotra, Anay, et al.
Published: (2023)