Saved in:
| Main Authors: | Kim, Chaeyun, Lim, YongTaek, Kim, Kihyun, Kim, Junghwan, Kim, Minwoo |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.20170 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
STAR-Teaming: A Strategy-Response Multiplex Network Approach to Automated LLM Red Teaming
by: Jung, MinJae, et al.
Published: (2026)
by: Jung, MinJae, et al.
Published: (2026)
PersonaTeaming: Supporting Persona-Driven Red-Teaming for Generative AI
by: Deng, Wesley Hanwen, et al.
Published: (2026)
by: Deng, Wesley Hanwen, et al.
Published: (2026)
EPLKG: Efficient Prompt Learning with Knowledge Graph
by: Lim, YongTaek, et al.
Published: (2023)
by: Lim, YongTaek, et al.
Published: (2023)
Nunchi-Bench: Benchmarking Language Models on Cultural Reasoning with a Focus on Korean Superstition
by: Kim, Kyuhee, et al.
Published: (2025)
by: Kim, Kyuhee, et al.
Published: (2025)
The ASIR Courage Model: A Phase-Dynamic Framework for Truth Transitions in Human and AI Systems
by: Kim, Hyo Jin
Published: (2026)
by: Kim, Hyo Jin
Published: (2026)
Red Teaming AI Red Teaming
by: Majumdar, Subhabrata, et al.
Published: (2025)
by: Majumdar, Subhabrata, et al.
Published: (2025)
Addressing Selection Bias in Computerized Adaptive Testing: A User-Wise Aggregate Influence Function Approach
by: Kwon, Soonwoo, et al.
Published: (2023)
by: Kwon, Soonwoo, et al.
Published: (2023)
A Study on the Framework for Evaluating the Ethics and Trustworthiness of Generative AI
by: Jeong, Cheonsu, et al.
Published: (2025)
by: Jeong, Cheonsu, et al.
Published: (2025)
Designing Ethical Learning for Agentic AI: Toegye Yi Hwang's Ethical Emotion Regulation Framework
by: Kim, Ji Yeon
Published: (2026)
by: Kim, Ji Yeon
Published: (2026)
SAIF: A Comprehensive Framework for Evaluating the Risks of Generative AI in the Public Sector
by: Lee, Kyeongryul, et al.
Published: (2025)
by: Lee, Kyeongryul, et al.
Published: (2025)
Accuracy of a Large Language Model in Distinguishing Anti- And Pro-vaccination Messages on Social Media: The Case of Human Papillomavirus Vaccination
by: Kim, Soojong, et al.
Published: (2024)
by: Kim, Soojong, et al.
Published: (2024)
Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping
by: Shin, Seunghyeok, et al.
Published: (2026)
by: Shin, Seunghyeok, et al.
Published: (2026)
AgentSUMO: An Agentic Framework for Interactive Simulation Scenario Generation in SUMO via Large Language Models
by: Jeong, Minwoo, et al.
Published: (2025)
by: Jeong, Minwoo, et al.
Published: (2025)
Model Fusion through Bayesian Optimization in Language Model Fine-Tuning
by: Jang, Chaeyun, et al.
Published: (2024)
by: Jang, Chaeyun, et al.
Published: (2024)
Benign-to-Toxic Jailbreaking: Inducing Harmful Responses from Harmless Prompts
by: Kim, Hee-Seon, et al.
Published: (2025)
by: Kim, Hee-Seon, et al.
Published: (2025)
KTCF: Actionable Recourse in Knowledge Tracing via Counterfactual Explanations for Education
by: Kim, Woojin, et al.
Published: (2026)
by: Kim, Woojin, et al.
Published: (2026)
Knowledge Tracing in Programming Education Integrating Students' Questions
by: Kim, Doyoun, et al.
Published: (2025)
by: Kim, Doyoun, et al.
Published: (2025)
Perceptions of Discriminatory Decisions of Artificial Intelligence: Unpacking the Role of Individual Characteristics
by: Kim, Soojong
Published: (2024)
by: Kim, Soojong
Published: (2024)
How to Capture and Study Conversations Between Research Participants and ChatGPT: GPT for Researchers (g4r.org)
by: Kim, Jin
Published: (2025)
by: Kim, Jin
Published: (2025)
Susceptibility of Large Language Models to User-Driven Factors in Medical Queries
by: Lim, Kyung Ho, et al.
Published: (2025)
by: Lim, Kyung Ho, et al.
Published: (2025)
FAST-DIPS: Adjoint-Free Analytic Steps and Hard-Constrained Likelihood Correction for Diffusion-Prior Inverse Problems
by: Kim, Minwoo, et al.
Published: (2026)
by: Kim, Minwoo, et al.
Published: (2026)
Rethinking Test-Time Scaling for Medical AI: Model and Task-Aware Strategies for LLMs and VLMs
by: Oh, Gyutaek, et al.
Published: (2025)
by: Oh, Gyutaek, et al.
Published: (2025)
Anecdoctoring: Automated Red-Teaming Across Language and Place
by: Cuevas, Alejandro, et al.
Published: (2025)
by: Cuevas, Alejandro, et al.
Published: (2025)
Benchmark Early and Red Team Often: A Framework for Assessing and Managing Dual-Use Hazards of AI Foundation Models
by: Barrett, Anthony M., et al.
Published: (2024)
by: Barrett, Anthony M., et al.
Published: (2024)
Red Teaming AI Policy: A Taxonomy of Avoision and the EU AI Act
by: Yew, Rui-Jie, et al.
Published: (2025)
by: Yew, Rui-Jie, et al.
Published: (2025)
Bridging the Missing-Modality Gap: Improving Text-Only Calibration of Vision Language Models
by: Kim, Mingyeong, et al.
Published: (2026)
by: Kim, Mingyeong, et al.
Published: (2026)
Red Teaming for Generative AI, Report on a Copyright-Focused Exercise Completed in an Academic Medical Center
by: Wen, James, et al.
Published: (2025)
by: Wen, James, et al.
Published: (2025)
CharDiff-LP: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration
by: Na, Kihyun, et al.
Published: (2025)
by: Na, Kihyun, et al.
Published: (2025)
The Precautionary Principle and the Innovation Principle: Incompatible Guides for AI Innovation Governance?
by: Kaivanto, Kim
Published: (2025)
by: Kaivanto, Kim
Published: (2025)
Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class
by: Song, Yukyeong, et al.
Published: (2024)
by: Song, Yukyeong, et al.
Published: (2024)
Can You Share Your Story? Modeling Clients' Metacognition and Openness for LLM Therapist Evaluation
by: Kim, Minju, et al.
Published: (2025)
by: Kim, Minju, et al.
Published: (2025)
Integrating Generative AI in BIM Education: Insights from Classroom Implementation
by: Sahraoui, Islem, et al.
Published: (2025)
by: Sahraoui, Islem, et al.
Published: (2025)
Rethinking Garment Conditioning in Diffusion-based Virtual Try-On
by: Na, Kihyun, et al.
Published: (2025)
by: Na, Kihyun, et al.
Published: (2025)
Taxonomy and Analysis of Sensitive User Queries in Generative AI Search
by: Jo, Hwiyeol, et al.
Published: (2024)
by: Jo, Hwiyeol, et al.
Published: (2024)
RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models
by: Ding, Jiale, et al.
Published: (2025)
by: Ding, Jiale, et al.
Published: (2025)
RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity
by: Shin, Jisu, et al.
Published: (2025)
by: Shin, Jisu, et al.
Published: (2025)
Position: The AI Conference Peer Review Crisis Demands Author Feedback and Reviewer Rewards
by: Kim, Jaeho, et al.
Published: (2025)
by: Kim, Jaeho, et al.
Published: (2025)
Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision
by: Jang, Chaeyun, et al.
Published: (2025)
by: Jang, Chaeyun, et al.
Published: (2025)
Can AI be a moral victim? The role of moral patiency and ownership perceptions in ethical judgments of using AI-generated content
by: Choung, Hyesun, et al.
Published: (2026)
by: Choung, Hyesun, et al.
Published: (2026)
Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services
by: Dimino, Fabrizio, et al.
Published: (2026)
by: Dimino, Fabrizio, et al.
Published: (2026)
Similar Items
-
STAR-Teaming: A Strategy-Response Multiplex Network Approach to Automated LLM Red Teaming
by: Jung, MinJae, et al.
Published: (2026) -
PersonaTeaming: Supporting Persona-Driven Red-Teaming for Generative AI
by: Deng, Wesley Hanwen, et al.
Published: (2026) -
EPLKG: Efficient Prompt Learning with Knowledge Graph
by: Lim, YongTaek, et al.
Published: (2023) -
Nunchi-Bench: Benchmarking Language Models on Cultural Reasoning with a Focus on Korean Superstition
by: Kim, Kyuhee, et al.
Published: (2025) -
The ASIR Courage Model: A Phase-Dynamic Framework for Truth Transitions in Human and AI Systems
by: Kim, Hyo Jin
Published: (2026)