Na minha lista:
| Main Authors: | Liu, Jian, Zhang, Rui, Szyller, Sebastian, Ren, Kui, Asokan, N. |
|---|---|
| Formato: | Preprint |
| Publicado em: |
2023
|
| Assuntos: | |
| Acesso em linha: | https://arxiv.org/abs/2304.06607 |
| Tags: |
Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!
|
Registos relacionados
Amulet: a Python Library for Assessing Interactions Among ML Defenses and Risks
Por: Waheed, Asim, et al.
Publicado em: (2025)
Por: Waheed, Asim, et al.
Publicado em: (2025)
Atlas: A Framework for ML Lifecycle Provenance & Transparency
Por: Spoczynski, Marcin, et al.
Publicado em: (2025)
Por: Spoczynski, Marcin, et al.
Publicado em: (2025)
SoK: Unintended Interactions among Machine Learning Defenses and Risks
Por: Duddu, Vasisht, et al.
Publicado em: (2023)
Por: Duddu, Vasisht, et al.
Publicado em: (2023)
FedTracker: Furnishing Ownership Verification and Traceability for Federated Learning Model
Por: Shao, Shuo, et al.
Publicado em: (2022)
Por: Shao, Shuo, et al.
Publicado em: (2022)
FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint
Por: Shao, Shuo, et al.
Publicado em: (2025)
Por: Shao, Shuo, et al.
Publicado em: (2025)
Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution
Por: Shao, Shuo, et al.
Publicado em: (2024)
Por: Shao, Shuo, et al.
Publicado em: (2024)
Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models
Por: Chen, Mingjie, et al.
Publicado em: (2025)
Por: Chen, Mingjie, et al.
Publicado em: (2025)
Watermarking Graph Neural Networks via Explanations for Ownership Protection
Por: Downer, Jane, et al.
Publicado em: (2025)
Por: Downer, Jane, et al.
Publicado em: (2025)
DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems
Por: Ou, Haoran, et al.
Publicado em: (2026)
Por: Ou, Haoran, et al.
Publicado em: (2026)
RobWE: Robust Watermark Embedding for Personalized Federated Learning Model Ownership Protection
Por: Xu, Yang, et al.
Publicado em: (2024)
Por: Xu, Yang, et al.
Publicado em: (2024)
Towards Understanding and Enhancing Security of Proof-of-Training for DNN Model Ownership Verification
Por: Chang, Yijia, et al.
Publicado em: (2024)
Por: Chang, Yijia, et al.
Publicado em: (2024)
Extracting Training Data from Diffusion Language Models via Infilling
Por: Wang, Yihan, et al.
Publicado em: (2026)
Por: Wang, Yihan, et al.
Publicado em: (2026)
Textual Unlearning Gives a False Sense of Unlearning
Por: Du, Jiacheng, et al.
Publicado em: (2024)
Por: Du, Jiacheng, et al.
Publicado em: (2024)
Membership Inference Attacks Against Vision-Language Models
Por: Hu, Yuke, et al.
Publicado em: (2025)
Por: Hu, Yuke, et al.
Publicado em: (2025)
PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility
Por: Gai, Keke, et al.
Publicado em: (2025)
Por: Gai, Keke, et al.
Publicado em: (2025)
ERASER: Machine Unlearning in MLaaS via an Inference Serving-Aware Approach
Por: Hu, Yuke, et al.
Publicado em: (2023)
Por: Hu, Yuke, et al.
Publicado em: (2023)
Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense
Por: Zhang, Jiawen, et al.
Publicado em: (2025)
Por: Zhang, Jiawen, et al.
Publicado em: (2025)
Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation
Por: Zhang, Wenhui, et al.
Publicado em: (2025)
Por: Zhang, Wenhui, et al.
Publicado em: (2025)
PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking
Por: Deng, Haiyu, et al.
Publicado em: (2025)
Por: Deng, Haiyu, et al.
Publicado em: (2025)
Neural Honeytrace: Plug&Play Watermarking Framework against Model Extraction Attacks
Por: Xu, Yixiao, et al.
Publicado em: (2025)
Por: Xu, Yixiao, et al.
Publicado em: (2025)
SSCL-BW: Sample-Specific Clean-Label Backdoor Watermarking for Dataset Ownership Verification
Por: Wang, Yingjia, et al.
Publicado em: (2025)
Por: Wang, Yingjia, et al.
Publicado em: (2025)
Fingerprinting Deep Neural Networks for Ownership Protection: An Analytical Approach
Por: Yang, Guang, et al.
Publicado em: (2026)
Por: Yang, Guang, et al.
Publicado em: (2026)
iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification
Por: Xiong, Zixun, et al.
Publicado em: (2025)
Por: Xiong, Zixun, et al.
Publicado em: (2025)
QUEEN: Query Unlearning against Model Extraction
Por: Chen, Huajie, et al.
Publicado em: (2024)
Por: Chen, Huajie, et al.
Publicado em: (2024)
Depth Gives a False Sense of Privacy: LLM Internal States Inversion
Por: Dong, Tian, et al.
Publicado em: (2025)
Por: Dong, Tian, et al.
Publicado em: (2025)
Real Money, Fake Models: Deceptive Model Claims in Shadow APIs
Por: Zhang, Yage, et al.
Publicado em: (2026)
Por: Zhang, Yage, et al.
Publicado em: (2026)
CHIP: Chameleon Hash-based Irreversible Passport for Robust Deep Model Ownership Verification and Active Usage Control
Por: Xu, Chaohui, et al.
Publicado em: (2025)
Por: Xu, Chaohui, et al.
Publicado em: (2025)
False Friends in the Shell: Unveiling the Emoticon Semantic Confusion in Large Language Models
Por: Jiang, Weipeng, et al.
Publicado em: (2026)
Por: Jiang, Weipeng, et al.
Publicado em: (2026)
MOVE: Effective and Harmless Ownership Verification via Embedded External Features
Por: Li, Yiming, et al.
Publicado em: (2022)
Por: Li, Yiming, et al.
Publicado em: (2022)
ShallowJail: Steering Jailbreaks against Large Language Models
Por: Liu, Shang, et al.
Publicado em: (2026)
Por: Liu, Shang, et al.
Publicado em: (2026)
RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
Por: Xu, Huiyu, et al.
Publicado em: (2024)
Por: Xu, Huiyu, et al.
Publicado em: (2024)
LLM Safeguard is a Double-Edged Sword: Exploiting False Positives for Denial-of-Service Attacks
Por: Zhang, Qingzhao, et al.
Publicado em: (2024)
Por: Zhang, Qingzhao, et al.
Publicado em: (2024)
Combating Concept Drift with Explanatory Detection and Adaptation for Android Malware Classification
Por: He, Yiling, et al.
Publicado em: (2024)
Por: He, Yiling, et al.
Publicado em: (2024)
RAGShield: Detecting Numerical Claim Manipulation in Government RAG Systems
Por: Patil, KrishnaSaiReddy
Publicado em: (2026)
Por: Patil, KrishnaSaiReddy
Publicado em: (2026)
LoopTrap: Termination Poisoning Attacks on LLM Agents
Por: Xu, Huiyu, et al.
Publicado em: (2026)
Por: Xu, Huiyu, et al.
Publicado em: (2026)
Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models
Por: Wang, Youze, et al.
Publicado em: (2025)
Por: Wang, Youze, et al.
Publicado em: (2025)
Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing
Por: Eghtesad, Taha, et al.
Publicado em: (2026)
Por: Eghtesad, Taha, et al.
Publicado em: (2026)
TEAM: Temporal Adversarial Examples Attack Model against Network Intrusion Detection System Applied to RNN
Por: Liu, Ziyi, et al.
Publicado em: (2024)
Por: Liu, Ziyi, et al.
Publicado em: (2024)
Integrating Identity-Based Identification against Adaptive Adversaries in Federated Learning
Por: Szelag, Jakub Kacper, et al.
Publicado em: (2025)
Por: Szelag, Jakub Kacper, et al.
Publicado em: (2025)
DSSmoothing: Toward Certified Dataset Ownership Verification for Pre-trained Language Models via Dual-Space Smoothing
Por: Qiao, Ting, et al.
Publicado em: (2025)
Por: Qiao, Ting, et al.
Publicado em: (2025)
Registos relacionados
-
Amulet: a Python Library for Assessing Interactions Among ML Defenses and Risks
Por: Waheed, Asim, et al.
Publicado em: (2025) -
Atlas: A Framework for ML Lifecycle Provenance & Transparency
Por: Spoczynski, Marcin, et al.
Publicado em: (2025) -
SoK: Unintended Interactions among Machine Learning Defenses and Risks
Por: Duddu, Vasisht, et al.
Publicado em: (2023) -
FedTracker: Furnishing Ownership Verification and Traceability for Federated Learning Model
Por: Shao, Shuo, et al.
Publicado em: (2022) -
FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint
Por: Shao, Shuo, et al.
Publicado em: (2025)