| Main Authors: | Lu, Yida, Fang, Jianwei, Shao, Xuyang, Chen, Zixuan, Cui, Shiyao, Bian, Shanshan, Su, Guangyao, Ke, Pei, Qiu, Han, Huang, Minlie |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.05028 |
Similar Items
From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
by: Zhang, Zhexin, et al.
Published: (2024)
The Superalignment of Superhuman Intelligence with Large Language Models
by: Huang, Minlie, et al.
Published: (2024)
Agent-SafetyBench: Evaluating the Safety of LLM Agents
by: Zhang, Zhexin, et al.
Published: (2024)
ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs
by: Cui, Shiyao, et al.
Published: (2025)
Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings
by: Yang, Shujian, et al.
Published: (2025)
The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning
by: Chen, Renmiao, et al.
Published: (2026)
Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
by: Yang, Junxiao, et al.
Published: (2025)
Enhanced Survival Trees
by: Zhou, Ruiwen, et al.
Published: (2025)
The Missing Half: Unveiling Training-time Implicit Safety Risks Beyond Deployment
by: Zhang, Zhexin, et al.
Published: (2026)
LongSafety: Evaluating Long-Context Safety of Large Language Models
by: Lu, Yida, et al.
Published: (2025)
Frailty and the Survival of Patients With Endometrial Cancer: A Meta‐Analysis
by: Shanshan Jia, et al.
Published: (2025)
Survival under Dictatorships
by: Borhi, László
Published: (2025)
SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions
by: Tu, Jinzhe, et al.
Published: (2026)
End-to-end Multi-source Visual Prompt Tuning for Survival Analysis in Whole Slide Images
by: Qiu, Zhongwei, et al.
Published: (2024)
The Impact of Epistemic Curiosity on Traffic Risky Behavior: The Mediating Role of Conformity
by: YiMeng Cui, et al.
Published: (2025)
MMSF: Multitask and Multimodal Supervised Framework for WSI Classification and Survival Analysis
by: She, Chengying, et al.
Published: (2026)
Learning Task Decomposition to Assist Humans in Competitive Programming
by: Wen, Jiaxin, et al.
Published: (2024)
How We Survive: The Cost of (Ground)Water
by: Breanna Rivera Waterman
Published: (2024)
Language Model Decoding as Direct Metrics Optimization
by: Ji, Haozhe, et al.
Published: (2023)
End-Cut Preference in Survival Trees
by: Su, Xiaogang
Published: (2025)
LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety
by: Yang, Junxiao, et al.
Published: (2026)
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
by: Cheng, Jiale, et al.
Published: (2024)
When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity
by: Cui, Shiyao, et al.
Published: (2025)
Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!
by: Zhang, Zhexin, et al.
Published: (2025)
How to Survive in Industry. Cost Justifying Library Services
by: Kramer, Joseph
Published: (1971)
Modifying Survival Models To Accommodate Thresholding Behavior
by: Betancourt, Michael
Published: (2022)
Does Time Pressure Alter the Affect Gap in Risky Choice?
by: R. Philips, et al.
Published: (2025)
Survival analysis under label shift
by: Zong, Yuxiang, et al.
Published: (2025)
Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity
by: Chen, Zhihong, et al.
Published: (2025)
Survival of the Cheapest: Cost-Aware Hardware Adaptation for Adversarial Robustness
by: Meyers, Charles, et al.
Published: (2024)
Survival Benefits Outweigh Germline Competition Costs in Kin Chimeras.
by: Voskoboynik, R, et al.
Published: (2026)
A Behavioral Scorecard Model Using Survival Analysis
by: Lee, Cheng, et al.
Published: (2025)
Correction to “Does Time Pressure Alter the Affect Gap in Risky Choice?”
Published: (2025)
Group Survival Probability under Contagion in Microlending
by: Jasso-Fuentes, Héctor, et al.
Published: (2025)
Survival mechanisms of plants under hypoxic stress: Physiological acclimation and molecular regulation
by: Lin‐Na Wang, et al.
Published: (2025)
Exploring the Horizontal Dimension of Political Trust in China
by: Yida Zhai
Published: (2025)
Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment
by: Zheng, Jingnan, et al.
Published: (2026)
An Investigation on Integral Emotions as Parallel Predictors for Risky Financial Behavior
by: Miriam Rustam, et al.
Published: (2026)
Power-Law Inflation Survives Observational Constraints
by: Yu, Yao, et al.
Published: (2025)
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
by: Zhang, Zhexin, et al.
Published: (2024)