:: Library Catalog

Bibliographic Details
Main Authors:	Jansen, Peter; Tafjord, Oyvind; Radensky, Marissa; Siangliulue, Pao; Hope, Tom; Mishra, Bhavana Dalvi; Majumder, Bodhisattwa Prasad; Weld, Daniel S.; Clark, Peter
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence; Computation and Language
Online Access:	https://arxiv.org/abs/2503.22708

Similar Items

DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
by: Jansen, Peter, et al.
Published: (2024)

Literature-Grounded Novelty Assessment of Scientific Ideas
by: Shahid, Simra, et al.
Published: (2025)

Human-LLM Compound System for Scientific Ideation through Facet Recombination and Novelty Evaluation
by: Radensky, Marissa, et al.
Published: (2024)

BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability
by: Clark, Peter, et al.
Published: (2023)

Papers-to-Posts: Supporting Detailed Long-Document Summarization with an Interactive LLM-Powered Source Outline
by: Radensky, Marissa, et al.
Published: (2024)

HARPA: A Testability-Driven, Literature-Grounded Framework for Research Ideation
by: Vasu, Rosni, et al.
Published: (2025)

Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
by: Nottingham, Kolby, et al.
Published: (2024)

Latent Factor Models Meets Instructions: Goal-conditioned Latent Factor Discovery without Task Supervision
by: Xie, Zhouhang, et al.
Published: (2025)

AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise
by: Agarwal, Dhruv, et al.
Published: (2025)

Facets, Taxonomies, and Syntheses: Navigating Structured Representations in LLM-Assisted Literature Review
by: Fok, Raymond, et al.
Published: (2025)

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)

From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
by: Weir, Nathaniel, et al.
Published: (2024)

ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery
by: Yu, Haofei, et al.
Published: (2026)

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic
by: Weir, Nathaniel, et al.
Published: (2024)

IdeaSynth: Iterative Research Idea Development Through Evolving and Composing Idea Facets with Literature-Grounded Feedback
by: Pu, Kevin, et al.
Published: (2024)

Digital Socrates: Evaluating LLMs through Explanation Critiques
by: Gu, Yuling, et al.
Published: (2023)

LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape
by: Kambhamettu, Hita, et al.
Published: (2026)

Generating Literature-Driven Scientific Theories at Scale
by: Jansen, Peter, et al.
Published: (2026)

PreScience: A Benchmark for Forecasting Scientific Contributions
by: Ajith, Anirudh, et al.
Published: (2026)

Data-driven Discovery with Large Generative Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)

EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery
by: Lyu, Yougang, et al.
Published: (2026)

Omakase: proactive assistance with actionable suggestions for evolving scientific research projects
by: Siangliulue, Pao, et al.
Published: (2026)

To Tell The Truth: Language of Deception and Language Models
by: Hazra, Sanchaita, et al.
Published: (2023)

Accepted with Minor Revisions: Value of AI-Assisted Scientific Writing
by: Hazra, Sanchaita, et al.
Published: (2025)

ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
by: Newman, Benjamin, et al.
Published: (2024)

HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance
by: Vasu, Rosni, et al.
Published: (2025)

CodeDistiller: Automatically Generating Code Libraries for Scientific Coding Agents
by: Jansen, Peter, et al.
Published: (2025)

Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos
by: Kalluri, Tarun, et al.
Published: (2024)

AI Safety Should Prioritize the Future of Work
by: Hazra, Sanchaita, et al.
Published: (2025)

Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization
by: Lal, Yash Kumar, et al.
Published: (2023)

Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
by: Zhou, Lianhao, et al.
Published: (2025)

The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection
by: Bhattacharya, Haimanti, et al.
Published: (2024)

ResearchEVO: An End-to-End Framework for Automated Scientific Discovery and Documentation
by: Zhao, Zhe, et al.
Published: (2026)

SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
by: Gu, Yuling, et al.
Published: (2024)

Neologism Learning for Controllability and Self-Verbalization
by: Hewitt, John, et al.
Published: (2025)

Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
by: Chen, Jiangjie, et al.
Published: (2023)

LMVC: An End-to-End Learned Multiview Video Coding Framework
by: Sheng, Xihua, et al.
Published: (2025)

AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite
by: Bragg, Jonathan, et al.
Published: (2025)

End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data
by: Pothula, Aishwarya, et al.
Published: (2025)

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym
by: Du, Weihua, et al.
Published: (2025)