Saved in:
| Main Authors: | Guo, Wenqi Marshall, Du, Yiyang, Tworek, Heidi J. S., Du, Shan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.08833 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Position: Universal Aesthetic Alignment Narrows Artistic Expression
by: Guo, Wenqi Marshall, et al.
Published: (2025)
by: Guo, Wenqi Marshall, et al.
Published: (2025)
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset
by: Guo, Wenqi, et al.
Published: (2025)
by: Guo, Wenqi, et al.
Published: (2025)
Pluralistic Alignment Over Time
by: Klassen, Toryn Q., et al.
Published: (2024)
by: Klassen, Toryn Q., et al.
Published: (2024)
Mind the Gap: Pitfalls of LLM Alignment with Asian Public Opinion
by: Shankar, Hari, et al.
Published: (2026)
by: Shankar, Hari, et al.
Published: (2026)
Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina
by: Gao, Yuan, et al.
Published: (2024)
by: Gao, Yuan, et al.
Published: (2024)
Evaluating the Promise and Pitfalls of LLMs in Hiring Decisions
by: Anzenberg, Eitan, et al.
Published: (2025)
by: Anzenberg, Eitan, et al.
Published: (2025)
AI Identity, Empowerment, and Mindfulness in Mitigating Unethical AI Use
by: Shaayesteh, Mayssam Tarighi, et al.
Published: (2025)
by: Shaayesteh, Mayssam Tarighi, et al.
Published: (2025)
Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs
by: Said, Muhammad Abdullahi, et al.
Published: (2025)
by: Said, Muhammad Abdullahi, et al.
Published: (2025)
Navigating Pitfalls: Evaluating LLMs in Machine Learning Programming Education
by: Kumar, Smitha, et al.
Published: (2025)
by: Kumar, Smitha, et al.
Published: (2025)
The AI Risk Spectrum: From Dangerous Capabilities to Existential Threats
by: Grey, Markov, et al.
Published: (2025)
by: Grey, Markov, et al.
Published: (2025)
FaceLinkGen: Rethinking Identity Leakage in Privacy-Preserving Face Recognition with Identity Extraction
by: Guo, Wenqi, et al.
Published: (2026)
by: Guo, Wenqi, et al.
Published: (2026)
TikTok Engagement Traces Over Time and Health Risky Behaviors: Combining Data Linkage and Computational Methods
by: Zhao, Xinyan, et al.
Published: (2024)
by: Zhao, Xinyan, et al.
Published: (2024)
VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip
by: Guo, Wenqi, et al.
Published: (2025)
by: Guo, Wenqi, et al.
Published: (2025)
Toward Responsible and Beneficial AI: Comparing Regulatory and Guidance-Based Approaches -A Comprehensive Comparative Analysis of Artificial Intelligence Governance Frameworks across the European Union, United States, China, and IEEE
by: Du, Jian
Published: (2025)
by: Du, Jian
Published: (2025)
No Size Fits All: The Perils and Pitfalls of Leveraging LLMs Vary with Company Size
by: Urlana, Ashok, et al.
Published: (2024)
by: Urlana, Ashok, et al.
Published: (2024)
Dynamic Bayesian Item Response Model with Decomposition (D-BIRD): Modeling Cohort and Individual Learning Over Time
by: Lee, Hansol, et al.
Published: (2025)
by: Lee, Hansol, et al.
Published: (2025)
Boosting Fairness and Robustness in Over-the-Air Federated Learning
by: Oksuz, Halil Yigit, et al.
Published: (2024)
by: Oksuz, Halil Yigit, et al.
Published: (2024)
Pitfalls of Evidence-Based AI Policy
by: Casper, Stephen, et al.
Published: (2025)
by: Casper, Stephen, et al.
Published: (2025)
Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge
by: Cai, Yunna, et al.
Published: (2025)
by: Cai, Yunna, et al.
Published: (2025)
Verify with Caution: The Pitfalls of Relying on Imperfect Factuality Metrics
by: Godbole, Ameya, et al.
Published: (2025)
by: Godbole, Ameya, et al.
Published: (2025)
Faults and Pitfalls in Implementing the Right to be Forgotten
by: Sun, Chen, et al.
Published: (2026)
by: Sun, Chen, et al.
Published: (2026)
Technical Requirements for Halting Dangerous AI Activities
by: Barnett, Peter, et al.
Published: (2025)
by: Barnett, Peter, et al.
Published: (2025)
Risks of AI Scientists: Prioritizing Safeguarding Over Autonomy
by: Tang, Xiangru, et al.
Published: (2024)
by: Tang, Xiangru, et al.
Published: (2024)
Access Over Deception: Fighting Deceptive Patterns through Accessibility
by: Pellkvist, Tobias, et al.
Published: (2026)
by: Pellkvist, Tobias, et al.
Published: (2026)
Recommendation Fairness in Social Networks Over Time
by: Cao, Meng, et al.
Published: (2024)
by: Cao, Meng, et al.
Published: (2024)
Large Language Models as Students Who Think Aloud: Overly Coherent, Verbose, and Confident
by: Borchers, Conrad, et al.
Published: (2026)
by: Borchers, Conrad, et al.
Published: (2026)
The Disintegration of Free Speech
by: Mei, Yiyang
Published: (2026)
by: Mei, Yiyang
Published: (2026)
Personalized Parsons Puzzles as Scaffolding Enhance Practice Engagement Over Just Showing LLM-Powered Solutions
by: Hou, Xinying, et al.
Published: (2025)
by: Hou, Xinying, et al.
Published: (2025)
Out of the Loop Again: How Dangerous is Weaponizing Automated Nuclear Systems?
by: Schwartz, Joshua A., et al.
Published: (2025)
by: Schwartz, Joshua A., et al.
Published: (2025)
Impact of AI Tools on Learning Outcomes: Decreasing Knowledge and Over-Reliance
by: Benedek, Márton, et al.
Published: (2025)
by: Benedek, Márton, et al.
Published: (2025)
Expanding External Access To Frontier AI Models For Dangerous Capability Evaluations
by: Charnock, Jacob, et al.
Published: (2026)
by: Charnock, Jacob, et al.
Published: (2026)
Fluent but Foreign: Even Regional LLMs Lack Cultural Alignment
by: Agarwal, Dhruv, et al.
Published: (2025)
by: Agarwal, Dhruv, et al.
Published: (2025)
Measuring Compliance with the California Consumer Privacy Act Over Space and Time
by: Tran, Van, et al.
Published: (2024)
by: Tran, Van, et al.
Published: (2024)
Reclaiming Constitutional Authority of Algorithmic Power
by: Mei, Yiyang, et al.
Published: (2025)
by: Mei, Yiyang, et al.
Published: (2025)
Optimizing Mastery Learning by Fast-Forwarding Over-Practice Steps
by: Xia, Meng, et al.
Published: (2025)
by: Xia, Meng, et al.
Published: (2025)
Evaluating the Clinical Safety of LLMs in Response to High-Risk Mental Health Disclosures
by: Shah, Siddharth, et al.
Published: (2025)
by: Shah, Siddharth, et al.
Published: (2025)
Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs
by: Khan, Ariba, et al.
Published: (2025)
by: Khan, Ariba, et al.
Published: (2025)
ZS-VCOS: Zero-Shot Video Camouflaged Object Segmentation By Optical Flow and Open Vocabulary Object Detection
by: Guo, Wenqi, et al.
Published: (2025)
by: Guo, Wenqi, et al.
Published: (2025)
An FDA for AI? Pitfalls and Plausibility of Approval Regulation for Frontier Artificial Intelligence
by: Carpenter, Daniel, et al.
Published: (2024)
by: Carpenter, Daniel, et al.
Published: (2024)
The Fair Game: Auditing & Debiasing AI Algorithms Over Time
by: Basu, Debabrota, et al.
Published: (2025)
by: Basu, Debabrota, et al.
Published: (2025)
Similar Items
-
Position: Universal Aesthetic Alignment Narrows Artistic Expression
by: Guo, Wenqi Marshall, et al.
Published: (2025) -
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset
by: Guo, Wenqi, et al.
Published: (2025) -
Pluralistic Alignment Over Time
by: Klassen, Toryn Q., et al.
Published: (2024) -
Mind the Gap: Pitfalls of LLM Alignment with Asian Public Opinion
by: Shankar, Hari, et al.
Published: (2026) -
Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina
by: Gao, Yuan, et al.
Published: (2024)