Saved in:
| Main Authors: | Fu, Shaopeng, Sun, Xuexue, Qing, Ke, Zheng, Tianhang, Wang, Di |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.02814 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets
by: Zhang, Yechao, et al.
Published: (2025)
by: Zhang, Yechao, et al.
Published: (2025)
Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
by: Han, Tingxu, et al.
Published: (2024)
by: Han, Tingxu, et al.
Published: (2024)
Understanding and Improving Continuous Adversarial Training for LLMs via In-context Learning Theory
by: Fu, Shaopeng, et al.
Published: (2026)
by: Fu, Shaopeng, et al.
Published: (2026)
Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence
by: Fu, Shaopeng, et al.
Published: (2025)
by: Fu, Shaopeng, et al.
Published: (2025)
FedReview: A Review Mechanism for Rejecting Poisoned Updates in Federated Learning
by: Zheng, Tianhang, et al.
Published: (2024)
by: Zheng, Tianhang, et al.
Published: (2024)
Privacy Auditing of Multi-domain Graph Pre-trained Model under Membership Inference Attacks
by: Luo, Jiayi, et al.
Published: (2025)
by: Luo, Jiayi, et al.
Published: (2025)
DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
by: Hou, Sizai, et al.
Published: (2024)
by: Hou, Sizai, et al.
Published: (2024)
Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
by: Wen, Yuxin, et al.
Published: (2024)
by: Wen, Yuxin, et al.
Published: (2024)
Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection
by: Dinh, Phai Vu, et al.
Published: (2024)
by: Dinh, Phai Vu, et al.
Published: (2024)
How to Enhance Downstream Adversarial Robustness (almost) without Touching the Pre-Trained Foundation Model?
by: Liu, Meiqi, et al.
Published: (2025)
by: Liu, Meiqi, et al.
Published: (2025)
A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference
by: Kumar, Abhinav, et al.
Published: (2023)
by: Kumar, Abhinav, et al.
Published: (2023)
Mitigating Error Amplification in Fast Adversarial Training
by: Zhao, Mengnan, et al.
Published: (2026)
by: Zhao, Mengnan, et al.
Published: (2026)
Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing
by: Ding, Ruyi, et al.
Published: (2024)
by: Ding, Ruyi, et al.
Published: (2024)
Mitigating Noise Detriment in Differentially Private Federated Learning with Model Pre-training
by: Jin, Huitong, et al.
Published: (2024)
by: Jin, Huitong, et al.
Published: (2024)
Mitigating Downstream Model Risks via Model Provenance
by: Wang, Keyu, et al.
Published: (2024)
by: Wang, Keyu, et al.
Published: (2024)
Selective Pre-training for Private Fine-tuning
by: Yu, Da, et al.
Published: (2023)
by: Yu, Da, et al.
Published: (2023)
Training Large ASR Encoders with Differential Privacy
by: Chauhan, Geeticka, et al.
Published: (2024)
by: Chauhan, Geeticka, et al.
Published: (2024)
CorruptEncoder: Data Poisoning based Backdoor Attacks to Contrastive Learning
by: Zhang, Jinghuai, et al.
Published: (2022)
by: Zhang, Jinghuai, et al.
Published: (2022)
Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective
by: Naderloui, Nima, et al.
Published: (2025)
by: Naderloui, Nima, et al.
Published: (2025)
Transferable Watermarking to Self-supervised Pre-trained Graph Encoders by Trigger Embeddings
by: Zhao, Xiangyu, et al.
Published: (2024)
by: Zhao, Xiangyu, et al.
Published: (2024)
Machine Unlearning of Pre-trained Large Language Models
by: Yao, Jin, et al.
Published: (2024)
by: Yao, Jin, et al.
Published: (2024)
ArmSSL: Adversarial Robust Black-Box Watermarking for Self-Supervised Learning Pre-trained Encoders
by: Jiang, Yongqi, et al.
Published: (2026)
by: Jiang, Yongqi, et al.
Published: (2026)
MIA-Tuner: Adapting Large Language Models as Pre-training Text Detector
by: Fu, Wenjie, et al.
Published: (2024)
by: Fu, Wenjie, et al.
Published: (2024)
Unveiling the Backdoor Mechanism Hidden Behind Catastrophic Overfitting in Fast Adversarial Training
by: Zhao, Mengnan, et al.
Published: (2026)
by: Zhao, Mengnan, et al.
Published: (2026)
Continuous Multi-Task Pre-training for Malicious URL Detection and Webpage Classification
by: Li, Yujie, et al.
Published: (2024)
by: Li, Yujie, et al.
Published: (2024)
Attacking Attention of Foundation Models Disrupts Downstream Tasks
by: Silva, Hondamunige Prasanna, et al.
Published: (2025)
by: Silva, Hondamunige Prasanna, et al.
Published: (2025)
Multiple-Input Auto-Encoder Guided Feature Selection for IoT Intrusion Detection Systems
by: Dinh, Phai Vu, et al.
Published: (2024)
by: Dinh, Phai Vu, et al.
Published: (2024)
SecEncoder: Logs are All You Need in Security
by: Bulut, Muhammed Fatih, et al.
Published: (2024)
by: Bulut, Muhammed Fatih, et al.
Published: (2024)
Pre-training Differentially Private Models with Limited Public Data
by: Bu, Zhiqi, et al.
Published: (2024)
by: Bu, Zhiqi, et al.
Published: (2024)
Indiscriminate Data Poisoning Attacks on Pre-trained Feature Extractors
by: Lu, Yiwei, et al.
Published: (2024)
by: Lu, Yiwei, et al.
Published: (2024)
MPC-Pipe: an Efficient Pipeline Scheme for Secure Multi-party Machine Learning Inference
by: Wang, Yongqin, et al.
Published: (2022)
by: Wang, Yongqin, et al.
Published: (2022)
Learning to Poison Large Language Models for Downstream Manipulation
by: Zhou, Xiangyu, et al.
Published: (2024)
by: Zhou, Xiangyu, et al.
Published: (2024)
Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction
by: Wu, Shuchi, et al.
Published: (2023)
by: Wu, Shuchi, et al.
Published: (2023)
Detecting AutoEncoder is Enough to Catch LDM Generated Images
by: Vesnin, Dmitry, et al.
Published: (2024)
by: Vesnin, Dmitry, et al.
Published: (2024)
Your Text Encoder Can Be An Object-Level Watermarking Controller
by: Devulapally, Naresh Kumar, et al.
Published: (2025)
by: Devulapally, Naresh Kumar, et al.
Published: (2025)
Augmenting Parameter-Efficient Pre-trained Language Models with Large Language Models
by: Anand, Saurabh, et al.
Published: (2026)
by: Anand, Saurabh, et al.
Published: (2026)
Accuracy Improvement in Differentially Private Logistic Regression: A Pre-training Approach
by: Hoseinpour, Mohammad, et al.
Published: (2023)
by: Hoseinpour, Mohammad, et al.
Published: (2023)
DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
by: Wu, Huiwen, et al.
Published: (2024)
by: Wu, Huiwen, et al.
Published: (2024)
On the (In)Security of Loading Machine Learning Models
by: Digregorio, Gabriele, et al.
Published: (2025)
by: Digregorio, Gabriele, et al.
Published: (2025)
FT-PrivacyScore: Personalized Privacy Scoring Service for Machine Learning Participation
by: Gu, Yuechun, et al.
Published: (2024)
by: Gu, Yuechun, et al.
Published: (2024)
Similar Items
-
Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets
by: Zhang, Yechao, et al.
Published: (2025) -
Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
by: Han, Tingxu, et al.
Published: (2024) -
Understanding and Improving Continuous Adversarial Training for LLMs via In-context Learning Theory
by: Fu, Shaopeng, et al.
Published: (2026) -
Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence
by: Fu, Shaopeng, et al.
Published: (2025) -
FedReview: A Review Mechanism for Rejecting Poisoned Updates in Federated Learning
by: Zheng, Tianhang, et al.
Published: (2024)