:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhuang, Xinlin, Peng, Jiahui, Ma, Ren, Wang, Yinfan, Bai, Tianyi, Wei, Xingjian, Qiu, Jiantao, Zhang, Chi, Qian, Ying, He, Conghui
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2504.14194
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Topic Over Source: The Key to Effective Data Mixing for Language Models Pre-training
by: Peng, Jiahui, et al.
Published: (2025)

Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration
by: Bai, Tianyi, et al.
Published: (2024)

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models
by: Zhang, Chi, et al.
Published: (2024)

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification
by: Bai, Tianyi, et al.
Published: (2025)

VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL
by: Hu, Zengjie, et al.
Published: (2025)

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
by: Bai, Tianyi, et al.
Published: (2025)

Wasserstein distributional adversarial training for deep neural networks
by: Bai, Xingjian, et al.
Published: (2025)

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
by: Wang, Jingchao, et al.
Published: (2025)

Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing
by: Song, Jiahe, et al.
Published: (2026)

AFL: A Single-Round Analytic Approach for Federated Learning with Pre-trained Models
by: He, Run, et al.
Published: (2024)

APEX: Learning Adaptive Priorities for Multi-Objective Alignment in Vision-Language Generation
by: Chen, Dongliang, et al.
Published: (2026)

RxnCaption: Reformulating Reaction Diagram Parsing as Visual Prompt Guided Captioning
by: Song, Jiahe, et al.
Published: (2025)

Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering
by: Fei, Ben, et al.
Published: (2024)

“P‐Strengthening Strategy” of Nickel Single‐Atom Catalyst With Boosting Selective Generation of Nonradicals: Synergy of Metal Center and Substrate
by: Jiantao Tong, et al.
Published: (2025)

Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM
by: Liu, Mengjie, et al.
Published: (2025)

VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving
by: Zhang, Haiming, et al.
Published: (2024)

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)

LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
by: Zhu, Tong, et al.
Published: (2024)

Human-AI Collaborative Multi-modal Multi-rater Learning for Endometriosis Diagnosis
by: Wang, Hu, et al.
Published: (2024)

An Aggregation‐Induced Emission Active Peptide‐Based Fluorescent Probe for Highly Selective and Sensitive Detection of Hg(II) Ions and Its Multifield Applications
by: Shiyi Xiong, et al.
Published: (2025)

3D Scene Graph Guided Vision-Language Pre-training
by: Liu, Hao, et al.
Published: (2024)

SHapley Estimated Explanation (SHEP): A Fast Post-Hoc Attribution Method for Interpreting Intelligent Fault Diagnosis
by: Chen, Qian, et al.
Published: (2025)

Universal Adversarial Perturbations for Vision-Language Pre-trained Models
by: Zhang, Peng-Fei, et al.
Published: (2024)

Rational Design of Cobalt Phthalocyanine (CoPc)‐Anchored TiO 2 Nanorods for High‐Efficiency Selective Catalytic Oxidation
by: Simeng Zhu, et al.
Published: (2024)

Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting
by: Liu, Hao, et al.
Published: (2024)

KeyVideoLLM: Towards Large-scale Video Keyframe Selection
by: Liang, Hao, et al.
Published: (2024)

MultiOrg: A Multi-rater Organoid-detection Dataset
by: Bukas, Christina, et al.
Published: (2024)

Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models
by: Zheng, Junhao, et al.
Published: (2023)

Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection
by: Peng, Xinlin, et al.
Published: (2024)

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
by: Wang, Shaobo, et al.
Published: (2026)

Diversified and Personalized Multi-rater Medical Image Segmentation
by: Wu, Yicheng, et al.
Published: (2024)

Multi-rater Prompting for Ambiguous Medical Image Segmentation
by: Wang, Jinhong, et al.
Published: (2024)

ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution
by: Zhou, Yuanbo, et al.
Published: (2024)

FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models
by: Li, Wei, et al.
Published: (2024)

Fractional Denoising for 3D Molecular Pre-training
by: Feng, Shikun, et al.
Published: (2023)

MAA: Meticulous Adversarial Attack against Vision-Language Pre-trained Models
by: Zhang, Peng-Fei, et al.
Published: (2025)

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model
by: Ma, Shuailei, et al.
Published: (2023)

PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction
by: Xu, Xingjian, et al.
Published: (2024)

Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training
by: Zeng, Shuang, et al.
Published: (2023)

SwiftTS: A Swift Selection Framework for Time Series Pre-trained Models via Multi-task Meta-Learning
by: Zhang, Tengxue, et al.
Published: (2025)