Saved in:
| Main Authors: | Yin, Congrui, Wei, Evan, Zhang, Zhongxing, Zhan, Zaifu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.14271 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models
by: Zhan, Zaifu, et al.
Published: (2024)
by: Zhan, Zaifu, et al.
Published: (2024)
Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study
by: Zhan, Zaifu, et al.
Published: (2026)
by: Zhan, Zaifu, et al.
Published: (2026)
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR
by: Burgess, James, et al.
Published: (2026)
by: Burgess, James, et al.
Published: (2026)
RuPLaR : Efficient Latent Compression of LLM Reasoning Chains with Rule-Based Priors From Multi-Step to One-Step
by: Luo, Xiaocheng, et al.
Published: (2026)
by: Luo, Xiaocheng, et al.
Published: (2026)
Paper Summary Attack: Jailbreaking LLMs through LLM Safety Papers
by: Lin, Liang, et al.
Published: (2025)
by: Lin, Liang, et al.
Published: (2025)
PaperAsk: A Benchmark for Reliability Evaluation of LLMs in Paper Search and Reading
by: Wu, Yutao, et al.
Published: (2025)
by: Wu, Yutao, et al.
Published: (2025)
SciCoQA: Quality Assurance for Scientific Paper--Code Alignment
by: Baumgärtner, Tim, et al.
Published: (2026)
by: Baumgärtner, Tim, et al.
Published: (2026)
Paper2Web: Let's Make Your Paper Alive!
by: Chen, Yuhang, et al.
Published: (2025)
by: Chen, Yuhang, et al.
Published: (2025)
RAMIE: Retrieval-Augmented Multi-task Information Extraction with Large Language Models on Dietary Supplements
by: Zhan, Zaifu, et al.
Published: (2024)
by: Zhan, Zaifu, et al.
Published: (2024)
Scientific Paper Retrieval with LLM-Guided Semantic-Based Ranking
by: Zhang, Yunyi, et al.
Published: (2025)
by: Zhang, Yunyi, et al.
Published: (2025)
MMRAG: Multi-Mode Retrieval-Augmented Generation with Large Language Models for Biomedical In-Context Learning
by: Zhan, Zaifu, et al.
Published: (2025)
by: Zhan, Zaifu, et al.
Published: (2025)
Usefulness of LLMs as an Author Checklist Assistant for Scientific Papers: NeurIPS'24 Experiment
by: Goldberg, Alexander, et al.
Published: (2024)
by: Goldberg, Alexander, et al.
Published: (2024)
Data-Efficient Biomedical In-Context Learning: A Diversity-Enhanced Submodular Perspective
by: Wang, Jun, et al.
Published: (2025)
by: Wang, Jun, et al.
Published: (2025)
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers
by: Singh, Shruti, et al.
Published: (2024)
by: Singh, Shruti, et al.
Published: (2024)
CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
by: Ou, Jiefu, et al.
Published: (2025)
by: Ou, Jiefu, et al.
Published: (2025)
EPEE: Towards Efficient and Effective Foundation Models in Biomedicine
by: Zhan, Zaifu, et al.
Published: (2025)
by: Zhan, Zaifu, et al.
Published: (2025)
Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness
by: Li, Mingchen, et al.
Published: (2024)
by: Li, Mingchen, et al.
Published: (2024)
PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers
by: Lee, Yoonjoo, et al.
Published: (2024)
by: Lee, Yoonjoo, et al.
Published: (2024)
When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life
by: Lou, Xinyue, et al.
Published: (2026)
by: Lou, Xinyue, et al.
Published: (2026)
KGQuest: Template-Driven QA Generation from Knowledge Graphs with LLM-Based Refinement
by: Nayab, Sania, et al.
Published: (2025)
by: Nayab, Sania, et al.
Published: (2025)
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning
by: Chen, Jiaju, et al.
Published: (2023)
by: Chen, Jiaju, et al.
Published: (2023)
IAPT: Instruction-Aware Prompt Tuning for Large Language Models
by: Zhu, Wei, et al.
Published: (2024)
by: Zhu, Wei, et al.
Published: (2024)
JobResQA: A Benchmark for LLM Machine Reading Comprehension on Multilingual Résumés and JDs
by: Carrino, Casimiro Pio, et al.
Published: (2026)
by: Carrino, Casimiro Pio, et al.
Published: (2026)
Analyzing 16,193 LLM Papers for Fun and Profits
by: Xia, Zhiqiu, et al.
Published: (2025)
by: Xia, Zhiqiu, et al.
Published: (2025)
From Isolated Scoring to Collaborative Ranking: A Comparison-Native Framework for LLM-Based Paper Evaluation
by: Zheng, Pujun, et al.
Published: (2026)
by: Zheng, Pujun, et al.
Published: (2026)
Question-Answering (QA) Model for a Personalized Learning Assistant for Arabic Language
by: Sammoudi, Mohammad, et al.
Published: (2024)
by: Sammoudi, Mohammad, et al.
Published: (2024)
Beyond Paper-to-Paper: Structured Profiling and Rubric Scoring for Paper-Reviewer Matching
by: Pan, Yicheng, et al.
Published: (2026)
by: Pan, Yicheng, et al.
Published: (2026)
Benchmarking GPT-5 for biomedical natural language processing
by: Hou, Yu, et al.
Published: (2025)
by: Hou, Yu, et al.
Published: (2025)
Navigating Through Paper Flood: Advancing LLM-based Paper Evaluation through Domain-Aware Retrieval and Latent Reasoning
by: Zheng, Wuqiang, et al.
Published: (2025)
by: Zheng, Wuqiang, et al.
Published: (2025)
Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers
by: Zhao, Yilun, et al.
Published: (2025)
by: Zhao, Yilun, et al.
Published: (2025)
DebateQA: Evaluating Question Answering on Debatable Knowledge
by: Xu, Rongwu, et al.
Published: (2024)
by: Xu, Rongwu, et al.
Published: (2024)
GraphReview: Scientific Paper Evaluation via LLM-Based Graph Message Passing
by: Zheng, Pujun, et al.
Published: (2026)
by: Zheng, Pujun, et al.
Published: (2026)
Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers
by: Andreev, Nikita, et al.
Published: (2024)
by: Andreev, Nikita, et al.
Published: (2024)
BioGraphletQA: Knowledge-Anchored Generation of Complex QA Datasets
by: Jonker, Richard A. A., et al.
Published: (2026)
by: Jonker, Richard A. A., et al.
Published: (2026)
Medical Knowledge Graph QA for Drug-Drug Interaction Prediction based on Multi-hop Machine Reading Comprehension
by: Gao, Peng, et al.
Published: (2022)
by: Gao, Peng, et al.
Published: (2022)
Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates
by: Li, Shuaimin, et al.
Published: (2025)
by: Li, Shuaimin, et al.
Published: (2025)
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
by: Pang, Wei, et al.
Published: (2025)
by: Pang, Wei, et al.
Published: (2025)
LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points
by: Zhang, Xuemiao, et al.
Published: (2025)
by: Zhang, Xuemiao, et al.
Published: (2025)
A Hierarchical Framework for Measuring Scientific Paper Innovation via Large Language Models
by: Tan, Hongming, et al.
Published: (2025)
by: Tan, Hongming, et al.
Published: (2025)
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
by: Seo, Minju, et al.
Published: (2025)
by: Seo, Minju, et al.
Published: (2025)
Similar Items
-
Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models
by: Zhan, Zaifu, et al.
Published: (2024) -
Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study
by: Zhan, Zaifu, et al.
Published: (2026) -
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR
by: Burgess, James, et al.
Published: (2026) -
RuPLaR : Efficient Latent Compression of LLM Reasoning Chains with Rule-Based Priors From Multi-Step to One-Step
by: Luo, Xiaocheng, et al.
Published: (2026) -
Paper Summary Attack: Jailbreaking LLMs through LLM Safety Papers
by: Lin, Liang, et al.
Published: (2025)