:: Library Catalog

Bibliographic details
Main authors:	Kapoor, Radhika; Truong, Sang T.; Haber, Nick; Ruiz-Primo, Maria Araceli; Domingue, Benjamin W.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online access:	https://arxiv.org/abs/2502.20663

Similar items

The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory
by: Schmucker, Robin, et al.
Published: (2025)

A Multi-Agent Framework for Feature-Constrained Difficulty Control in Reading Comprehension Item Generation
by: Hwang, Seonjeong, et al.
Published: (2026)

Using Vision + Language Models to Predict Item Difficulty
by: Khan, Samin
Published: (2026)

Can LLMs Estimate Cognitive Complexity of Reading Comprehension Items?
by: Hwang, Seonjeong, et al.
Published: (2025)

SMART: Simulated Students Aligned with Item Response Theory for Question Difficulty Prediction
by: Scarlatos, Alexander, et al.
Published: (2025)

Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language Models
by: Säuberli, Andreas, et al.
Published: (2024)

RIDE: Difficulty Evolving Perturbation with Item Response Theory for Mathematical Reasoning
by: Li, Xinyuan, et al.
Published: (2025)

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction
by: Li, Ming, et al.
Published: (2025)

Items contextualizados de ciencias en PISA: la conexión entre las demandas cognitivas y las características de contexto de los items
by: Maria-Araceli Ruiz-Primo
Published: (2016)

Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?
by: Zotos, Leonidas, et al.
Published: (2024)

Take Out Your Calculators: Estimating the Real Difficulty of Question Items with LLM Student Simulations
by: Acquaye, Christabel, et al.
Published: (2026)

UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions
by: Rogoz, Ana-Cristina, et al.
Published: (2024)

Polytomous Explanatory Item Response Models for Item Discrimination: Assessing Negative-Framing Effects in Social-Emotional Learning Surveys
by: Gilbert, Joshua B., et al.
Published: (2024)

Estimating Heterogeneous Treatment Effects with Item-Level Outcome Data: Insights from Item Response Theory
by: Gilbert, Joshua B., et al.
Published: (2024)

Estimating Item Difficulty Using Large Language Models and Tree-Based Machine Learning Algorithms
by: Razavi, Pooya, et al.
Published: (2025)

Controlling Cloze-test Question Item Difficulty with PLM-based Surrogate Models for IRT Assessment
by: Zhang, Jingshen, et al.
Published: (2024)

Text-Based Approaches to Item Alignment to Content Standards in Large-Scale Reading & Writing Tests
by: Fu, Yanbin, et al.
Published: (2025)

Item-Language Model for Conversational Recommendation
by: Yang, Li, et al.
Published: (2024)

Efficient Detection of Bad Benchmark Items with Novel Scalability Coefficients
by: Hardy, Michael, et al.
Published: (2026)

Auditing LLM Benchmarks with Item Response Theory
by: Land, Sander, et al.
Published: (2026)

Estimating LLM Grading Ability and Response Difficulty in Automatic Short Answer Grading via Item Response Theory
by: Cong, Longwei, et al.
Published: (2026)

Dynamic Bayesian Item Response Model with Decomposition (D-BIRD): Modeling Cohort and Individual Learning Over Time
by: Lee, Hansol, et al.
Published: (2025)

Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025)

IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
by: Lin, Fan, et al.
Published: (2024)

RAGSys: Item-Cold-Start Recommender as RAG System
by: Contal, Emile, et al.
Published: (2024)

Leveraging LLM-Respondents for Item Evaluation: a Psychometric Analysis
by: Liu, Yunting, et al.
Published: (2024)

From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation
by: Wei, KaiWen, et al.
Published: (2025)

Enriching Social Science Research via Survey Item Linking
by: Tsereteli, Tornike, et al.
Published: (2024)

Semantic Cells: Evolutional Process to Acquire Sense Diversity of Items
by: Ohsawa, Yukio, et al.
Published: (2024)

Action-Item-Driven Summarization of Long Meeting Transcripts
by: Golia, Logan, et al.
Published: (2023)

The Sound of Syntax: Finetuning and Comprehensive Evaluation of Language Models for Speech Pathology
by: Patel, Fagun, et al.
Published: (2025)

Break Out the Silverware -- Semantic Understanding of Stored Household Items
by: Levi-Richter, Michaela, et al.
Published: (2025)

Difficulty as a Proxy for Measuring Intrinsic Cognitive Load Item
by: Cai, Minghao, et al.
Published: (2025)

Finding Words Associated with DIF: Predicting Differential Item Functioning using LLMs and Explainable AI
by: Maeda, Hotaka, et al.
Published: (2025)

Multi-Agent Collaborative Filtering: Orchestrating Users and Items for Agentic Recommendations
by: Xia, Yu, et al.
Published: (2025)

Measuring Competency, Not Performance: Item-Aware Evaluation Across Medical Benchmarks
by: Luo, Zhimeng, et al.
Published: (2025)

Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
by: Lim, Sungjib, et al.
Published: (2025)

Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores
by: Balkır, Esma, et al.
Published: (2026)

Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
by: Zhou, Hongli, et al.
Published: (2025)

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs
by: Feucht, Sheridan, et al.
Published: (2024)