| Main Authors: | Cârlan, Carmen, Gomez, Francesca, Mathew, Yohan, Krishna, Ketana, King, René, Gebauer, Peter, Smith, Ben R. |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.17618 |
Similar Items
Assessing confidence in frontier AI safety cases
by: Barrett, Stephen, et al.
Published: (2025)
Third-party compliance reviews for frontier AI safety frameworks
by: Homewood, Aidan, et al.
Published: (2025)
Safety cases for frontier AI
by: Buhl, Marie Davidsen, et al.
Published: (2024)
The coordination gap in frontier AI safety policies
by: Mengesha, Isaak
Published: (2026)
How frontier AI companies could implement an internal audit function
by: Gomez, Francesca, et al.
Published: (2025)
The rising costs of training frontier AI models
by: Cottier, Ben, et al.
Published: (2024)
Risk thresholds for frontier AI
by: Koessler, Leonie, et al.
Published: (2024)
Safety case template for frontier AI: A cyber inability argument
by: Goemans, Arthur, et al.
Published: (2024)
Domestic frontier AI regulation, an IAEA for AI, an NPT for AI, and a US-led Allied Public-Private Partnership for AI: Four institutions for governing and developing frontier AI
by: Belfield, Haydn
Published: (2025)
Affirmative safety: An approach to risk management for high-risk AI
by: Wasil, Akash R., et al.
Published: (2024)
Understanding engagement with platform safety technology for reducing exposure to online harms
by: Bright, Jonathan, et al.
Published: (2024)
Measurement challenges in AI catastrophic risk governance and safety frameworks
by: Kasirzadeh, Atoosa
Published: (2024)
Scrapyard AI
by: Böhlen, Marc, et al.
Published: (2026)
Using LLMs as prompt modifier to avoid biases in AI image generators
by: Peinl, René
Published: (2025)
How malicious AI swarms can threaten democracy: The fusion of agentic AI and LLMs marks a new frontier in information warfare
by: Schroeder, Daniel Thilo, et al.
Published: (2025)
Adapting cybersecurity frameworks to manage frontier AI risks: A defense-in-depth approach
by: Ee, Shaun, et al.
Published: (2024)
Evaluating energy inefficiency in energy-poor households in India: A frontier analysis approach
by: Gupta, Vallary, et al.
Published: (2025)
Who Decides in AI-Mediated Learning? The Agency Allocation Framework
by: Borchers, Conrad, et al.
Published: (2026)
Governing frontier general-purpose AI in the public sector: adaptive risk management and policy capacity under uncertainty through 2030
by: Xavier, Fabio Correa
Published: (2026)
Modeling the Feedback of AI Price Estimations on Actual Market Values
by: Silaghi, Viorel, et al.
Published: (2024)
Modernizing Ground Truth: Four Shifts Toward Improving Reliability and Validity in AI in Education
by: Thomas, Danielle R., et al.
Published: (2026)
Designing escalation criteria for international AI incident response: criteria, triggers, and thresholds
by: Gomez, Francesca, et al.
Published: (2026)
Revenge Porn: A Peep into its Awareness among the Youth of Tamilnadu, India
by: M, Mohammed Marzuk T, et al.
Published: (2025)
Small models, big threats: Characterizing safety challenges from low-compute AI models
by: Puri, Prateek
Published: (2026)
Developing a Responsible AI Framework for Healthcare in Low Resource Countries: A Case Study in Nepal and Ghana
by: Neupane, Hari Krishna, et al.
Published: (2025)
A cross-regional review of AI safety regulations in the commercial aviation
by: Barr, Penny A., et al.
Published: (2025)
Human and AI collaboration in Fitness Education: A Longitudinal Study with a Pilates Instructor
by: Huang, Qian, et al.
Published: (2025)
SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education
by: Huang, Qian, et al.
Published: (2026)
Mindsets and Management: AI and Gender (In)Equitable Access to Finance
by: Smith, Genevieve
Published: (2025)
Designing Knowledge Tools: How Students Transition from Using to Creating Generative AI in STEAM classroom
by: Huang, Qian, et al.
Published: (2025)
Semi-automated analysis of audio-recorded lessons: The case of teachers' engaging messages
by: Falcon, Samuel, et al.
Published: (2024)
Foregrounding Artist Opinions: A Survey Study on Transparency, Ownership, and Fairness in AI Generative Art
by: Lovato, Juniper, et al.
Published: (2024)
AI data transparency: an exploration through the lens of AI incidents
by: Worth, Sophia, et al.
Published: (2024)
A safety risk assessment framework for children's online safety based on a novel safety weakness assessment approach
by: Ta, Vinh-Thong
Published: (2024)
Towards a Harms Taxonomy of AI Likeness Generation
by: Bariach, Ben, et al.
Published: (2024)
Fairness Hub Technical Briefs: Definition and Detection of Distribution Shift
by: Acevedo, Nicolas, et al.
Published: (2024)
Position Paper: Model Access should be a Key Concern in AI Governance
by: Kembery, Edward, et al.
Published: (2024)
Dialect prejudice predicts AI decisions about people's character, employability, and criminality
by: Hofmann, Valentin, et al.
Published: (2024)
AI-Generated Letters from the Future: A Randomized Test of Personalized Climate Communication
by: Powdthavee, Nattavudh, et al.
Published: (2026)
Teaching at Scale: Leveraging AI to Evaluate and Elevate Engineering Education
by: Chamberland, Jean-Francois, et al.
Published: (2025)