:: Library Catalog

Bibliographic Details
Main Authors:	Kumar, Nischal Ashok; Pham, Chau Minh; Iyyer, Mohit; Lan, Andrew
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.13028

Similar Items

CLIPPER: Compression enables long-context synthetic data generation
by: Pham, Chau Minh, et al.
Published: (2025)

Suri: Multi-constraint Instruction Following for Long-form Text Generation
by: Pham, Chau Minh, et al.
Published: (2024)

Frankentext: Stitching random text fragments into long-form narratives
by: Pham, Chau Minh, et al.
Published: (2025)

Argument Collapse: LLMs Flatten Long-Form Public Debate
by: Kim, Yekyung, et al.
Published: (2026)

Improving Socratic Question Generation using Data Augmentation and Preference Optimization
by: Kumar, Nischal Ashok, et al.
Published: (2024)

Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education
by: Kumar, Nischal Ashok, et al.
Published: (2024)

TopicGPT: A Prompt-based Topic Modeling Framework
by: Pham, Chau Minh, et al.
Published: (2023)

StoryScope: Investigating idiosyncrasies in AI fiction
by: Russell, Jenna, et al.
Published: (2026)

Interactive Topic Models with Optimal Transport
by: Dhanania, Garima, et al.
Published: (2024)

BEARCUBS: A benchmark for computer-using web agents
by: Song, Yixiao, et al.
Published: (2025)

OWL: Probing Cross-Lingual Recall of Memorized Texts via World Literature
by: Srivastava, Alisha, et al.
Published: (2025)

VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation
by: Song, Yixiao, et al.
Published: (2024)

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text
by: Russell, Jenna, et al.
Published: (2025)

Literary Evidence Retrieval via Long-Context Language Models
by: Thai, Katherine, et al.
Published: (2025)

Beyond Precision: Importance-Aware Recall for Factuality Evaluation in Long-Form LLM Generation
by: Jafari, Nazanin, et al.
Published: (2026)

Recovering Diversity Without Losing Alignment: A DPO Recipe for Post-Trained LLMs
by: Samuel, Vinay, et al.
Published: (2026)

SRS-Stories: Vocabulary-constrained multilingual story generation for language learning
by: Kamzela, Wiktor, et al.
Published: (2025)

One ruler to measure them all: Benchmarking multilingual long-context language models
by: Kim, Yekyung, et al.
Published: (2025)

EditLens: Quantifying the Extent of AI Editing in Text
by: Thai, Katherine, et al.
Published: (2025)

Localizing and Mitigating Errors in Long-form Question Answering
by: Sachdeva, Rachneet, et al.
Published: (2024)

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
by: Naseh, Ali, et al.
Published: (2024)

How good is my story? Towards quantitative metrics for evaluating LLM-generated XAI narratives
by: Ichmoukhamedov, Timour, et al.
Published: (2024)

BooookScore: A systematic exploration of book-length summarization in the era of LLMs
by: Chang, Yapei, et al.
Published: (2023)

VeriFastScore: Speeding up long-form factuality evaluation
by: Rajendhran, Rishanth, et al.
Published: (2025)

WHODUNIT: Evaluation benchmark for culprit detection in mystery stories
by: Gupta, Kshitij
Published: (2025)

Does quantization affect models' performance on long-context tasks?
by: Mekala, Anmol, et al.
Published: (2025)

One Thousand and One Pairs: A "novel" challenge for long-context language models
by: Karpinska, Marzena, et al.
Published: (2024)

Surprisal reveals diversity gaps in image captioning and different scorers change the story
by: Ilinykh, Nikolai, et al.
Published: (2025)

LongStory: Coherent, Complete and Length Controlled Long story Generation
by: Park, Kyeongman, et al.
Published: (2023)

Contextualized Evaluations: Judging Language Model Responses to Underspecified Queries
by: Malaviya, Chaitanya, et al.
Published: (2024)

"Image, Tell me your story!" Predicting the original meta-context of visual misinformation
by: Tonglet, Jonathan, et al.
Published: (2024)

PostMark: A Robust Blackbox Watermark for Large Language Models
by: Chang, Yapei, et al.
Published: (2024)

AI use in American newspapers is widespread, uneven, and rarely disclosed
by: Russell, Jenna, et al.
Published: (2025)

Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation
by: Zhao, Jiachen, et al.
Published: (2023)

A Multi-Agent Approach to Validate and Refine LLM-Generated Personalized Math Problems
by: Ikram, Fareya, et al.
Published: (2026)

Reinforcement Learning from Human Feedback: Whose Culture, Whose Values, Whose Perspectives?
by: Barman, Kristian González, et al.
Published: (2024)

CaLMQA: Exploring culturally specific long-form question answering across 23 languages
by: Arora, Shane, et al.
Published: (2024)

Rethinking Multimodal Sentiment Analysis: A High-Accuracy, Simplified Fusion Architecture
by: Mandal, Nischal, et al.
Published: (2025)

Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance
by: Nguyen, Chau, et al.
Published: (2024)

Movie2Story: A framework for understanding videos and telling stories in the form of novel text
by: Li, Kangning, et al.
Published: (2024)