Saved in:
| Main Authors: | Jansen, Peter, Tafjord, Oyvind, Radensky, Marissa, Siangliulue, Pao, Hope, Tom, Mishra, Bhavana Dalvi, Majumder, Bodhisattwa Prasad, Weld, Daniel S., Clark, Peter |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.22708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
by: Jansen, Peter, et al.
Published: (2024)
by: Jansen, Peter, et al.
Published: (2024)
Literature-Grounded Novelty Assessment of Scientific Ideas
by: Shahid, Simra, et al.
Published: (2025)
by: Shahid, Simra, et al.
Published: (2025)
Human-LLM Compound System for Scientific Ideation through Facet Recombination and Novelty Evaluation
by: Radensky, Marissa, et al.
Published: (2024)
by: Radensky, Marissa, et al.
Published: (2024)
BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability
by: Clark, Peter, et al.
Published: (2023)
by: Clark, Peter, et al.
Published: (2023)
Papers-to-Posts: Supporting Detailed Long-Document Summarization with an Interactive LLM-Powered Source Outline
by: Radensky, Marissa, et al.
Published: (2024)
by: Radensky, Marissa, et al.
Published: (2024)
HARPA: A Testability-Driven, Literature-Grounded Framework for Research Ideation
by: Vasu, Rosni, et al.
Published: (2025)
by: Vasu, Rosni, et al.
Published: (2025)
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
by: Nottingham, Kolby, et al.
Published: (2024)
by: Nottingham, Kolby, et al.
Published: (2024)
Latent Factor Models Meets Instructions: Goal-conditioned Latent Factor Discovery without Task Supervision
by: Xie, Zhouhang, et al.
Published: (2025)
by: Xie, Zhouhang, et al.
Published: (2025)
AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise
by: Agarwal, Dhruv, et al.
Published: (2025)
by: Agarwal, Dhruv, et al.
Published: (2025)
Facets, Taxonomies, and Syntheses: Navigating Structured Representations in LLM-Assisted Literature Review
by: Fok, Raymond, et al.
Published: (2025)
by: Fok, Raymond, et al.
Published: (2025)
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
by: Weir, Nathaniel, et al.
Published: (2024)
by: Weir, Nathaniel, et al.
Published: (2024)
ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery
by: Yu, Haofei, et al.
Published: (2026)
by: Yu, Haofei, et al.
Published: (2026)
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic
by: Weir, Nathaniel, et al.
Published: (2024)
by: Weir, Nathaniel, et al.
Published: (2024)
IdeaSynth: Iterative Research Idea Development Through Evolving and Composing Idea Facets with Literature-Grounded Feedback
by: Pu, Kevin, et al.
Published: (2024)
by: Pu, Kevin, et al.
Published: (2024)
Digital Socrates: Evaluating LLMs through Explanation Critiques
by: Gu, Yuling, et al.
Published: (2023)
by: Gu, Yuling, et al.
Published: (2023)
LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape
by: Kambhamettu, Hita, et al.
Published: (2026)
by: Kambhamettu, Hita, et al.
Published: (2026)
Generating Literature-Driven Scientific Theories at Scale
by: Jansen, Peter, et al.
Published: (2026)
by: Jansen, Peter, et al.
Published: (2026)
PreScience: A Benchmark for Forecasting Scientific Contributions
by: Ajith, Anirudh, et al.
Published: (2026)
by: Ajith, Anirudh, et al.
Published: (2026)
Data-driven Discovery with Large Generative Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery
by: Lyu, Yougang, et al.
Published: (2026)
by: Lyu, Yougang, et al.
Published: (2026)
Omakase: proactive assistance with actionable suggestions for evolving scientific research projects
by: Siangliulue, Pao, et al.
Published: (2026)
by: Siangliulue, Pao, et al.
Published: (2026)
To Tell The Truth: Language of Deception and Language Models
by: Hazra, Sanchaita, et al.
Published: (2023)
by: Hazra, Sanchaita, et al.
Published: (2023)
Accepted with Minor Revisions: Value of AI-Assisted Scientific Writing
by: Hazra, Sanchaita, et al.
Published: (2025)
by: Hazra, Sanchaita, et al.
Published: (2025)
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models
by: Newman, Benjamin, et al.
Published: (2024)
by: Newman, Benjamin, et al.
Published: (2024)
HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance
by: Vasu, Rosni, et al.
Published: (2025)
by: Vasu, Rosni, et al.
Published: (2025)
CodeDistiller: Automatically Generating Code Libraries for Scientific Coding Agents
by: Jansen, Peter, et al.
Published: (2025)
by: Jansen, Peter, et al.
Published: (2025)
Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos
by: Kalluri, Tarun, et al.
Published: (2024)
by: Kalluri, Tarun, et al.
Published: (2024)
AI Safety Should Prioritize the Future of Work
by: Hazra, Sanchaita, et al.
Published: (2025)
by: Hazra, Sanchaita, et al.
Published: (2025)
Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization
by: Lal, Yash Kumar, et al.
Published: (2023)
by: Lal, Yash Kumar, et al.
Published: (2023)
Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
by: Zhou, Lianhao, et al.
Published: (2025)
by: Zhou, Lianhao, et al.
Published: (2025)
The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection
by: Bhattacharya, Haimanti, et al.
Published: (2024)
by: Bhattacharya, Haimanti, et al.
Published: (2024)
ResearchEVO: An End-to-End Framework for Automated Scientific Discovery and Documentation
by: Zhao, Zhe, et al.
Published: (2026)
by: Zhao, Zhe, et al.
Published: (2026)
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
by: Gu, Yuling, et al.
Published: (2024)
by: Gu, Yuling, et al.
Published: (2024)
Neologism Learning for Controllability and Self-Verbalization
by: Hewitt, John, et al.
Published: (2025)
by: Hewitt, John, et al.
Published: (2025)
Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
by: Chen, Jiangjie, et al.
Published: (2023)
by: Chen, Jiangjie, et al.
Published: (2023)
LMVC: An End-to-End Learned Multiview Video Coding Framework
by: Sheng, Xihua, et al.
Published: (2025)
by: Sheng, Xihua, et al.
Published: (2025)
AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite
by: Bragg, Jonathan, et al.
Published: (2025)
by: Bragg, Jonathan, et al.
Published: (2025)
End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data
by: Pothula, Aishwarya, et al.
Published: (2025)
by: Pothula, Aishwarya, et al.
Published: (2025)
Generalizable End-to-End Tool-Use RL with Synthetic CodeGym
by: Du, Weihua, et al.
Published: (2025)
by: Du, Weihua, et al.
Published: (2025)
Similar Items
-
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents
by: Jansen, Peter, et al.
Published: (2024) -
Literature-Grounded Novelty Assessment of Scientific Ideas
by: Shahid, Simra, et al.
Published: (2025) -
Human-LLM Compound System for Scientific Ideation through Facet Recombination and Novelty Evaluation
by: Radensky, Marissa, et al.
Published: (2024) -
BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability
by: Clark, Peter, et al.
Published: (2023) -
Papers-to-Posts: Supporting Detailed Long-Document Summarization with an Interactive LLM-Powered Source Outline
by: Radensky, Marissa, et al.
Published: (2024)