:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Mostafa, Ahmed, Nahid, Raisul Arefin, Mulder, Samuel
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Artificial Intelligence Computation and Language Cryptography and Security Machine Learning
Online-Zugang:	https://arxiv.org/abs/2511.03825
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis
von: Alhazbi, Saeif, et al.
Veröffentlicht: (2025)

Automated Software Vulnerability Static Code Analysis Using Generative Pre-Trained Transformer Models
von: Pelofske, Elijah, et al.
Veröffentlicht: (2024)

Less Data, More Security: Advancing Cybersecurity LLMs Specialization via Resource-Efficient Domain-Adaptive Continuous Pre-training with Minimal Tokens
von: Salahuddin, Salahuddin, et al.
Veröffentlicht: (2025)

Interpreting the Repeated Token Phenomenon in Large Language Models
von: Yona, Itay, et al.
Veröffentlicht: (2025)

Gandalf the Red: Adaptive Security for LLMs
von: Pfister, Niklas, et al.
Veröffentlicht: (2025)

MOCHA: Are Code Language Models Robust Against Multi-Turn Malicious Coding Prompts?
von: Wahed, Muntasir, et al.
Veröffentlicht: (2025)

Sparse Tokens Suffice: Jailbreaking Audio Language Models via Token-Aware Gradient Optimization
von: Fang, Zheng, et al.
Veröffentlicht: (2026)

ObfuscaTune: Obfuscated Offsite Fine-tuning and Inference of Proprietary LLMs on Private Datasets
von: Frikha, Ahmed, et al.
Veröffentlicht: (2024)

SecureCode: A Production-Grade Multi-Turn Dataset for Training Security-Aware Code Generation Models
von: Thornton, Scott
Veröffentlicht: (2025)

Rethinking How to Evaluate Language Model Jailbreak
von: Cai, Hongyu, et al.
Veröffentlicht: (2024)

Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models
von: Russinovich, Mark, et al.
Veröffentlicht: (2025)

Fingerprinting Deep Learning Models via Network Traffic Patterns in Federated Learning
von: Shuvo, Md Nahid Hasan, et al.
Veröffentlicht: (2025)

Time Travel in LLMs: Tracing Data Contamination in Large Language Models
von: Golchin, Shahriar, et al.
Veröffentlicht: (2023)

Teach LLMs to Phish: Stealing Private Information from Language Models
von: Panda, Ashwinee, et al.
Veröffentlicht: (2024)

A Generative Approach to LLM Harmfulness Mitigation with Red Flag Tokens
von: Dobre, David, et al.
Veröffentlicht: (2025)

When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models
von: Wang, Kai, et al.
Veröffentlicht: (2025)

LLMs can be Dangerous Reasoners: Analyzing-based Jailbreak Attack on Large Language Models
von: Lin, Shi, et al.
Veröffentlicht: (2024)

BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints
von: Gill, Waris, et al.
Veröffentlicht: (2025)

Jailbreaking LLMs via Calibration
von: Lu, Yuxuan, et al.
Veröffentlicht: (2026)

The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks
von: Chen, Xiaoyi, et al.
Veröffentlicht: (2023)

Tool Preferences in Agentic LLMs are Unreliable
von: Faghih, Kazem, et al.
Veröffentlicht: (2025)

Shh, don't say that! Domain Certification in LLMs
von: Emde, Cornelius, et al.
Veröffentlicht: (2025)

Early Signs of Steganographic Capabilities in Frontier LLMs
von: Zolkowski, Artur, et al.
Veröffentlicht: (2025)

AutoBaxBuilder: Bootstrapping Code Security Benchmarking
von: von Arx, Tobias, et al.
Veröffentlicht: (2025)

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
von: Mehrotra, Anay, et al.
Veröffentlicht: (2023)

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
von: Paulus, Anselm, et al.
Veröffentlicht: (2024)

Can Neural Decompilation Assist Vulnerability Prediction on Binary Code?
von: Cotroneo, D., et al.
Veröffentlicht: (2024)

SequentialBreak: Large Language Models Can be Fooled by Embedding Jailbreak Prompts into Sequential Prompt Chains
von: Saiem, Bijoy Ahmed, et al.
Veröffentlicht: (2024)

Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning
von: Ahmed, Nesreen K., et al.
Veröffentlicht: (2026)

Bypassing the Safety Training of Open-Source LLMs with Priming Attacks
von: Vega, Jason, et al.
Veröffentlicht: (2023)

JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs
von: Chu, Junjie, et al.
Veröffentlicht: (2024)

Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
von: Shao, Zedian, et al.
Veröffentlicht: (2024)

HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection
von: Wang, Yuxin, et al.
Veröffentlicht: (2024)

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
von: Betley, Jan, et al.
Veröffentlicht: (2025)

BaxBench: Can LLMs Generate Correct and Secure Backends?
von: Vero, Mark, et al.
Veröffentlicht: (2025)

LLMs can hide text in other text of the same length
von: Norelli, Antonio, et al.
Veröffentlicht: (2025)

Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
von: Rando, Javier, et al.
Veröffentlicht: (2024)

Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs
von: Xu, Xiaoyu, et al.
Veröffentlicht: (2025)

Tell me about yourself: LLMs are aware of their learned behaviors
von: Betley, Jan, et al.
Veröffentlicht: (2025)

Cross-Session Threats in AI Agents: Benchmark, Evaluation, and Algorithms
von: Azarafrooz, Ari
Veröffentlicht: (2026)