:: Library Catalog

Bibliographic details
Main Authors:	Liu, Jian; Zhang, Rui; Szyller, Sebastian; Ren, Kui; Asokan, N.
Format:	Preprint
Published:	2023
Subjects:	Cryptography and Security; Artificial Intelligence
Online access:	https://arxiv.org/abs/2304.06607

Related records

Amulet: a Python Library for Assessing Interactions Among ML Defenses and Risks
By: Waheed, Asim, et al.
Published: (2025)

Atlas: A Framework for ML Lifecycle Provenance & Transparency
By: Spoczynski, Marcin, et al.
Published: (2025)

SoK: Unintended Interactions among Machine Learning Defenses and Risks
By: Duddu, Vasisht, et al.
Published: (2023)

FedTracker: Furnishing Ownership Verification and Traceability for Federated Learning Model
By: Shao, Shuo, et al.
Published: (2022)

FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint
By: Shao, Shuo, et al.
Published: (2025)

Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution
By: Shao, Shuo, et al.
Published: (2024)

Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models
By: Chen, Mingjie, et al.
Published: (2025)

Watermarking Graph Neural Networks via Explanations for Ownership Protection
By: Downer, Jane, et al.
Published: (2025)

DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems
By: Ou, Haoran, et al.
Published: (2026)

RobWE: Robust Watermark Embedding for Personalized Federated Learning Model Ownership Protection
By: Xu, Yang, et al.
Published: (2024)

Towards Understanding and Enhancing Security of Proof-of-Training for DNN Model Ownership Verification
By: Chang, Yijia, et al.
Published: (2024)

Extracting Training Data from Diffusion Language Models via Infilling
By: Wang, Yihan, et al.
Published: (2026)

Textual Unlearning Gives a False Sense of Unlearning
By: Du, Jiacheng, et al.
Published: (2024)

Membership Inference Attacks Against Vision-Language Models
By: Hu, Yuke, et al.
Published: (2025)

PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility
By: Gai, Keke, et al.
Published: (2025)

ERASER: Machine Unlearning in MLaaS via an Inference Serving-Aware Approach
By: Hu, Yuke, et al.
Published: (2023)

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense
By: Zhang, Jiawen, et al.
Published: (2025)

Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation
By: Zhang, Wenhui, et al.
Published: (2025)

PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking
By: Deng, Haiyu, et al.
Published: (2025)

Neural Honeytrace: Plug&Play Watermarking Framework against Model Extraction Attacks
By: Xu, Yixiao, et al.
Published: (2025)

SSCL-BW: Sample-Specific Clean-Label Backdoor Watermarking for Dataset Ownership Verification
By: Wang, Yingjia, et al.
Published: (2025)

Fingerprinting Deep Neural Networks for Ownership Protection: An Analytical Approach
By: Yang, Guang, et al.
Published: (2026)

iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification
By: Xiong, Zixun, et al.
Published: (2025)

QUEEN: Query Unlearning against Model Extraction
By: Chen, Huajie, et al.
Published: (2024)

Depth Gives a False Sense of Privacy: LLM Internal States Inversion
By: Dong, Tian, et al.
Published: (2025)

Real Money, Fake Models: Deceptive Model Claims in Shadow APIs
By: Zhang, Yage, et al.
Published: (2026)

CHIP: Chameleon Hash-based Irreversible Passport for Robust Deep Model Ownership Verification and Active Usage Control
By: Xu, Chaohui, et al.
Published: (2025)

False Friends in the Shell: Unveiling the Emoticon Semantic Confusion in Large Language Models
By: Jiang, Weipeng, et al.
Published: (2026)

MOVE: Effective and Harmless Ownership Verification via Embedded External Features
By: Li, Yiming, et al.
Published: (2022)

ShallowJail: Steering Jailbreaks against Large Language Models
By: Liu, Shang, et al.
Published: (2026)

RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
By: Xu, Huiyu, et al.
Published: (2024)

LLM Safeguard is a Double-Edged Sword: Exploiting False Positives for Denial-of-Service Attacks
By: Zhang, Qingzhao, et al.
Published: (2024)

Combating Concept Drift with Explanatory Detection and Adaptation for Android Malware Classification
By: He, Yiling, et al.
Published: (2024)

RAGShield: Detecting Numerical Claim Manipulation in Government RAG Systems
By: Patil, KrishnaSaiReddy
Published: (2026)

LoopTrap: Termination Poisoning Attacks on LLM Agents
By: Xu, Huiyu, et al.
Published: (2026)

Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models
By: Wang, Youze, et al.
Published: (2025)

Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing
By: Eghtesad, Taha, et al.
Published: (2026)

TEAM: Temporal Adversarial Examples Attack Model against Network Intrusion Detection System Applied to RNN
By: Liu, Ziyi, et al.
Published: (2024)

Integrating Identity-Based Identification against Adaptive Adversaries in Federated Learning
By: Szelag, Jakub Kacper, et al.
Published: (2025)

DSSmoothing: Toward Certified Dataset Ownership Verification for Pre-trained Language Models via Dual-Space Smoothing
By: Qiao, Ting, et al.
Published: (2025)