Saved in:
| Main Author: | Kuznetsova, Anna |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.25082 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
In-IDE Programming Courses: Learning Software Development in a Real-World Setting
by: Birillo, Anastasiia, et al.
Published: (2025)
by: Birillo, Anastasiia, et al.
Published: (2025)
Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications
by: Goh, Jia Yi, et al.
Published: (2025)
by: Goh, Jia Yi, et al.
Published: (2025)
RealHarm: A Collection of Real-World Language Model Application Failures
by: Jeune, Pierre Le, et al.
Published: (2025)
by: Jeune, Pierre Le, et al.
Published: (2025)
Case Study of GAI for Generating Novel Images for Real-World Embroidery
by: Glazko, Kate, et al.
Published: (2025)
by: Glazko, Kate, et al.
Published: (2025)
Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
by: Lin, Justin W., et al.
Published: (2025)
by: Lin, Justin W., et al.
Published: (2025)
Who's in Charge? Disempowerment Patterns in Real-World LLM Usage
by: Sharma, Mrinank, et al.
Published: (2026)
by: Sharma, Mrinank, et al.
Published: (2026)
REALM: A Dataset of Real-World LLM Use Cases
by: Cheng, Jingwen, et al.
Published: (2025)
by: Cheng, Jingwen, et al.
Published: (2025)
Cyclic Adaptive Private Synthesis for Sharing Real-World Data in Education
by: Ito, Hibiki, et al.
Published: (2026)
by: Ito, Hibiki, et al.
Published: (2026)
SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users
by: Zhang, Xinnong, et al.
Published: (2025)
by: Zhang, Xinnong, et al.
Published: (2025)
When LLMs Can't Help: Real-World Evaluation of LLMs in Nutrition
by: Li, Karen Jia-Hui, et al.
Published: (2025)
by: Li, Karen Jia-Hui, et al.
Published: (2025)
Documenting Deployment with Fabric: A Repository of Real-World AI Governance
by: Jorgensen, Mackenzie, et al.
Published: (2025)
by: Jorgensen, Mackenzie, et al.
Published: (2025)
Street-Level AI: Are Large Language Models Ready for Real-World Judgments?
by: Pokharel, Gaurab, et al.
Published: (2025)
by: Pokharel, Gaurab, et al.
Published: (2025)
Whose ChatGPT? Unveiling Real-World Educational Inequalities Introduced by Large Language Models
by: Yu, Renzhe, et al.
Published: (2024)
by: Yu, Renzhe, et al.
Published: (2024)
AEDHunter: Investigating AED Retrieval in the Real World via Gamified Mobile Interaction and Sensing
by: Peng, Helinyi, et al.
Published: (2026)
by: Peng, Helinyi, et al.
Published: (2026)
From Framework to Practice: Designing a Real-World Telehealth Application for Palliative Care
by: Zhou, Wei, et al.
Published: (2025)
by: Zhou, Wei, et al.
Published: (2025)
Clio: Privacy-Preserving Insights into Real-World AI Use
by: Tamkin, Alex, et al.
Published: (2024)
by: Tamkin, Alex, et al.
Published: (2024)
FairJob: A Real-World Dataset for Fairness in Online Systems
by: Vladimirova, Mariia, et al.
Published: (2024)
by: Vladimirova, Mariia, et al.
Published: (2024)
Generative AI Purpose-built for Social and Mental Health: A Real-World Pilot
by: Hull, Thomas D., et al.
Published: (2025)
by: Hull, Thomas D., et al.
Published: (2025)
Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset
by: Ward, Abbi, et al.
Published: (2024)
by: Ward, Abbi, et al.
Published: (2024)
A Large-Scale Real-World Evaluation of LLM-Based Virtual Teaching Assistant
by: Kweon, Sunjun, et al.
Published: (2025)
by: Kweon, Sunjun, et al.
Published: (2025)
Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology
by: Jiang, Roy, et al.
Published: (2026)
by: Jiang, Roy, et al.
Published: (2026)
Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions
by: Huang, Saffron, et al.
Published: (2025)
by: Huang, Saffron, et al.
Published: (2025)
Understanding Collective Stability of ACC Systems: From Theory to Real-World Observations
by: Korbmacher, Raphael, et al.
Published: (2025)
by: Korbmacher, Raphael, et al.
Published: (2025)
A Closer Look at the Existing Risks of Generative AI: Mapping the Who, What, and How of Real-World Incidents
by: Li, Megan, et al.
Published: (2025)
by: Li, Megan, et al.
Published: (2025)
GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks
by: Patwardhan, Tejal, et al.
Published: (2025)
by: Patwardhan, Tejal, et al.
Published: (2025)
PLawBench: A Rubric-Based Benchmark for Evaluating LLMs in Real-World Legal Practice
by: Shi, Yuzhen, et al.
Published: (2026)
by: Shi, Yuzhen, et al.
Published: (2026)
Longitudinal and Multimodal Recording System to Capture Real-World Patient-Clinician Conversations for AI and Encounter Research: Protocol
by: Zahidy, Misk Al, et al.
Published: (2025)
by: Zahidy, Misk Al, et al.
Published: (2025)
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims
by: Yoon, Yejun, et al.
Published: (2024)
by: Yoon, Yejun, et al.
Published: (2024)
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects
by: Schwartz, Reva, et al.
Published: (2025)
by: Schwartz, Reva, et al.
Published: (2025)
An Investigation of Large Language Models for Real-World Hate Speech Detection
by: Guo, Keyan, et al.
Published: (2024)
by: Guo, Keyan, et al.
Published: (2024)
Face Recognition: to Deploy or not to Deploy? A Framework for Assessing the Proportional Use of Face Recognition Systems in Real-World Scenarios
by: Negri, Pablo, et al.
Published: (2024)
by: Negri, Pablo, et al.
Published: (2024)
The Third-Party Access Effect: An Overlooked Challenge in Secondary Use of Educational Real-World Data
by: Ito, Hibiki, et al.
Published: (2026)
by: Ito, Hibiki, et al.
Published: (2026)
Towards Apples to Apples for AI Evaluations: From Real-World Use Cases to Evaluation Scenarios
by: Choong, Yee-Yin, et al.
Published: (2026)
by: Choong, Yee-Yin, et al.
Published: (2026)
Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings
by: Pouget, Angéline, et al.
Published: (2025)
by: Pouget, Angéline, et al.
Published: (2025)
Real-World AI Evaluation: How FRAME Generates Systematic Evidence to Resolve the Decision-Maker's Dilemma
by: Schwartz, Reva, et al.
Published: (2026)
by: Schwartz, Reva, et al.
Published: (2026)
The AI Model Risk Catalog: What Developers and Researchers Miss About Real-World AI Harms
by: Rao, Pooja S. B., et al.
Published: (2025)
by: Rao, Pooja S. B., et al.
Published: (2025)
Even More Kawaii than Real-Person-Driven VTubers? Understanding How Viewers Perceive AI-Driven VTubers
by: Wei, Yiluo, et al.
Published: (2025)
by: Wei, Yiluo, et al.
Published: (2025)
Real-World Receptivity to Adaptive Mental Health Interventions: Findings from an In-the-Wild Study
by: Sahu, Nilesh Kumar, et al.
Published: (2025)
by: Sahu, Nilesh Kumar, et al.
Published: (2025)
Scaffolding Research Projects in Theory of Computing Courses
by: Dougherty, Ryan E.
Published: (2024)
by: Dougherty, Ryan E.
Published: (2024)
Real-World Deployment and Evaluation of Kwame for Science, An AI Teaching Assistant for Science Education in West Africa
by: Boateng, George, et al.
Published: (2023)
by: Boateng, George, et al.
Published: (2023)
Similar Items
-
In-IDE Programming Courses: Learning Software Development in a Real-World Setting
by: Birillo, Anastasiia, et al.
Published: (2025) -
Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications
by: Goh, Jia Yi, et al.
Published: (2025) -
RealHarm: A Collection of Real-World Language Model Application Failures
by: Jeune, Pierre Le, et al.
Published: (2025) -
Case Study of GAI for Generating Novel Images for Real-World Embroidery
by: Glazko, Kate, et al.
Published: (2025) -
Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
by: Lin, Justin W., et al.
Published: (2025)