Saved in:
| Main Authors: | Banerjee, Dave, Aarne, Onni |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.00036 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
International Security Applications of Flexible Hardware-Enabled Guarantees
by: Aarne, Onni, et al.
Published: (2025)
by: Aarne, Onni, et al.
Published: (2025)
Technical Options for Flexible Hardware-Enabled Guarantees
by: Petrie, James, et al.
Published: (2025)
by: Petrie, James, et al.
Published: (2025)
Flexible Hardware-Enabled Guarantees for AI Compute
by: Petrie, James, et al.
Published: (2025)
by: Petrie, James, et al.
Published: (2025)
Defending Against Intelligent Attackers at Large Scales
by: Lohn, Andrew J.
Published: (2025)
by: Lohn, Andrew J.
Published: (2025)
Defending Compute Thresholds Against Legal Loopholes
by: Pistillo, Matteo, et al.
Published: (2025)
by: Pistillo, Matteo, et al.
Published: (2025)
To Defend Against Cyber Attacks, We Must Teach AI Agents to Hack
by: Zhuo, Terry Yue, et al.
Published: (2026)
by: Zhuo, Terry Yue, et al.
Published: (2026)
When Style Breaks Safety: Defending LLMs Against Superficial Style Alignment
by: Xiao, Yuxin, et al.
Published: (2025)
by: Xiao, Yuxin, et al.
Published: (2025)
Open Problems in Technical AI Governance
by: Reuel, Anka, et al.
Published: (2024)
by: Reuel, Anka, et al.
Published: (2024)
Examining the Implications of Deepfakes for Election Integrity
by: Ranka, Hriday, et al.
Published: (2024)
by: Ranka, Hriday, et al.
Published: (2024)
Contextual Integrity Games
by: Wolff, Ran
Published: (2024)
by: Wolff, Ran
Published: (2024)
How Generative AI Empowers Attackers and Defenders Across the Trust & Safety Landscape
by: Kelley, Patrick Gage, et al.
Published: (2025)
by: Kelley, Patrick Gage, et al.
Published: (2025)
Asymmetry by Design: Boosting Cyber Defenders with Differential Access to AI
by: Ee, Shaun, et al.
Published: (2025)
by: Ee, Shaun, et al.
Published: (2025)
Research Integrity and GenAI: A Systematic Analysis of Ethical Challenges Across Research Phases
by: Bjelobaba, Sonja, et al.
Published: (2024)
by: Bjelobaba, Sonja, et al.
Published: (2024)
Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity
by: Najjar, Ayat A., et al.
Published: (2025)
by: Najjar, Ayat A., et al.
Published: (2025)
SecureGaze: Defending Gaze Estimation Against Backdoor Attacks
by: Du, Lingyu, et al.
Published: (2025)
by: Du, Lingyu, et al.
Published: (2025)
Informed Consent: We Can Do Better to Defend Privacy
by: Borgesius, Frederik Zuiderveen
Published: (2025)
by: Borgesius, Frederik Zuiderveen
Published: (2025)
Mapping the Regulatory Learning Space for the EU AI Act
by: Lewis, Dave, et al.
Published: (2025)
by: Lewis, Dave, et al.
Published: (2025)
Use of AI Tools: Guidelines to Maintain Academic Integrity in Computing Colleges
by: El-boghdadi, Hatem M., et al.
Published: (2026)
by: El-boghdadi, Hatem M., et al.
Published: (2026)
Computational Foundations for Strategic Coopetition: Formalizing Collective Action and Loyalty
by: Pant, Vik, et al.
Published: (2026)
by: Pant, Vik, et al.
Published: (2026)
AICat: An AI Cataloguing Approach to Support the EU AI Act
by: Golpayegani, Delaram, et al.
Published: (2024)
by: Golpayegani, Delaram, et al.
Published: (2024)
(Beyond) Reasonable Doubt: Challenges that Public Defenders Face in Scrutinizing AI in Court
by: Jin, Angela, et al.
Published: (2024)
by: Jin, Angela, et al.
Published: (2024)
Towards Meaningful Transparency in Civic AI Systems
by: Murray-Rust, Dave, et al.
Published: (2025)
by: Murray-Rust, Dave, et al.
Published: (2025)
Anti-Regulatory AI: How "AI Safety" is Leveraged Against Regulatory Oversight
by: Yew, Rui-Jie, et al.
Published: (2025)
by: Yew, Rui-Jie, et al.
Published: (2025)
An Open Knowledge Graph-Based Approach for Mapping Concepts and Requirements between the EU AI Act and International Standards
by: Hernandez, Julio, et al.
Published: (2024)
by: Hernandez, Julio, et al.
Published: (2024)
Jump off the Bandwagon? Characterizing Bandwagon Fans' Future Loyalty in Online NBA Fan Communities
by: Wang, Yichen, et al.
Published: (2024)
by: Wang, Yichen, et al.
Published: (2024)
Position: Contextual Integrity is Inadequately Applied to Language Models
by: Shvartzshnaider, Yan, et al.
Published: (2025)
by: Shvartzshnaider, Yan, et al.
Published: (2025)
T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models
by: Wang, Zhongqi, et al.
Published: (2024)
by: Wang, Zhongqi, et al.
Published: (2024)
The Narrow Depth and Breadth of Corporate Responsible AI Research
by: Ahmed, Nur, et al.
Published: (2024)
by: Ahmed, Nur, et al.
Published: (2024)
Trust and Transparency in AI: Industry Voices on Data, Ethics, and Compliance
by: McCormack, Louise, et al.
Published: (2025)
by: McCormack, Louise, et al.
Published: (2025)
Integrating Differential Privacy and Contextual Integrity
by: Benthall, Sebastian, et al.
Published: (2024)
by: Benthall, Sebastian, et al.
Published: (2024)
Speciesism in AI: Evaluating Discrimination Against Animals in Large Language Models
by: Jotautaitė, Monika, et al.
Published: (2025)
by: Jotautaitė, Monika, et al.
Published: (2025)
Evaluating the Contextual Integrity of False Positives in Algorithmic Travel Surveillance
by: Wernick, Alina, et al.
Published: (2025)
by: Wernick, Alina, et al.
Published: (2025)
Climate AI for Corporate Decarbonization Metrics Extraction
by: Dave, Aditya, et al.
Published: (2024)
by: Dave, Aditya, et al.
Published: (2024)
The Dirty Secret of SSDs: Embodied Carbon
by: Tannu, Swamit, et al.
Published: (2022)
by: Tannu, Swamit, et al.
Published: (2022)
Dead Zone of Accountability: Why Social Claims in Machine Learning Research Should Be Articulated and Defended
by: Kou, Tianqi, et al.
Published: (2025)
by: Kou, Tianqi, et al.
Published: (2025)
Examining Popular Arguments Against AI Existential Risk: A Philosophical Analysis
by: Swoboda, Torben, et al.
Published: (2025)
by: Swoboda, Torben, et al.
Published: (2025)
Sark: Oblivious Integrity Without Global State
by: Lynham, Alex, et al.
Published: (2025)
by: Lynham, Alex, et al.
Published: (2025)
Rethinking Review Citations: Impact on Scientific Integrity
by: Aguilar-Ruiz, Jesus S.
Published: (2025)
by: Aguilar-Ruiz, Jesus S.
Published: (2025)
AI Cards: Towards an Applied Framework for Machine-Readable AI and Risk Documentation Inspired by the EU AI Act
by: Golpayegani, Delaram, et al.
Published: (2024)
by: Golpayegani, Delaram, et al.
Published: (2024)
Defending Our Privacy With Backdoors
by: Hintersdorf, Dominik, et al.
Published: (2023)
by: Hintersdorf, Dominik, et al.
Published: (2023)
Similar Items
-
International Security Applications of Flexible Hardware-Enabled Guarantees
by: Aarne, Onni, et al.
Published: (2025) -
Technical Options for Flexible Hardware-Enabled Guarantees
by: Petrie, James, et al.
Published: (2025) -
Flexible Hardware-Enabled Guarantees for AI Compute
by: Petrie, James, et al.
Published: (2025) -
Defending Against Intelligent Attackers at Large Scales
by: Lohn, Andrew J.
Published: (2025) -
Defending Compute Thresholds Against Legal Loopholes
by: Pistillo, Matteo, et al.
Published: (2025)