Saved in:
| Main Authors: | Zhuang, Xinlin, Peng, Jiahui, Ma, Ren, Wang, Yinfan, Bai, Tianyi, Wei, Xingjian, Qiu, Jiantao, Zhang, Chi, Qian, Ying, He, Conghui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.14194 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Topic Over Source: The Key to Effective Data Mixing for Language Models Pre-training
by: Peng, Jiahui, et al.
Published: (2025)
by: Peng, Jiahui, et al.
Published: (2025)
Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration
by: Bai, Tianyi, et al.
Published: (2024)
by: Bai, Tianyi, et al.
Published: (2024)
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models
by: Zhang, Chi, et al.
Published: (2024)
by: Zhang, Chi, et al.
Published: (2024)
Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification
by: Bai, Tianyi, et al.
Published: (2025)
by: Bai, Tianyi, et al.
Published: (2025)
VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL
by: Hu, Zengjie, et al.
Published: (2025)
by: Hu, Zengjie, et al.
Published: (2025)
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
by: Bai, Tianyi, et al.
Published: (2025)
by: Bai, Tianyi, et al.
Published: (2025)
Wasserstein distributional adversarial training for deep neural networks
by: Bai, Xingjian, et al.
Published: (2025)
by: Bai, Xingjian, et al.
Published: (2025)
GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
by: Wang, Jingchao, et al.
Published: (2025)
by: Wang, Jingchao, et al.
Published: (2025)
Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing
by: Song, Jiahe, et al.
Published: (2026)
by: Song, Jiahe, et al.
Published: (2026)
AFL: A Single-Round Analytic Approach for Federated Learning with Pre-trained Models
by: He, Run, et al.
Published: (2024)
by: He, Run, et al.
Published: (2024)
APEX: Learning Adaptive Priorities for Multi-Objective Alignment in Vision-Language Generation
by: Chen, Dongliang, et al.
Published: (2026)
by: Chen, Dongliang, et al.
Published: (2026)
RxnCaption: Reformulating Reaction Diagram Parsing as Visual Prompt Guided Captioning
by: Song, Jiahe, et al.
Published: (2025)
by: Song, Jiahe, et al.
Published: (2025)
Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering
by: Fei, Ben, et al.
Published: (2024)
by: Fei, Ben, et al.
Published: (2024)
“P‐Strengthening Strategy” of Nickel Single‐Atom Catalyst With Boosting Selective Generation of Nonradicals: Synergy of Metal Center and Substrate
by: Jiantao Tong, et al.
Published: (2025)
by: Jiantao Tong, et al.
Published: (2025)
Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM
by: Liu, Mengjie, et al.
Published: (2025)
by: Liu, Mengjie, et al.
Published: (2025)
VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving
by: Zhang, Haiming, et al.
Published: (2024)
by: Zhang, Haiming, et al.
Published: (2024)
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)
by: Cao, Chenjie, et al.
Published: (2024)
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
by: Zhu, Tong, et al.
Published: (2024)
by: Zhu, Tong, et al.
Published: (2024)
Human-AI Collaborative Multi-modal Multi-rater Learning for Endometriosis Diagnosis
by: Wang, Hu, et al.
Published: (2024)
by: Wang, Hu, et al.
Published: (2024)
An Aggregation‐Induced Emission Active Peptide‐Based Fluorescent Probe for Highly Selective and Sensitive Detection of Hg(II) Ions and Its Multifield Applications
by: Shiyi Xiong, et al.
Published: (2025)
by: Shiyi Xiong, et al.
Published: (2025)
3D Scene Graph Guided Vision-Language Pre-training
by: Liu, Hao, et al.
Published: (2024)
by: Liu, Hao, et al.
Published: (2024)
SHapley Estimated Explanation (SHEP): A Fast Post-Hoc Attribution Method for Interpreting Intelligent Fault Diagnosis
by: Chen, Qian, et al.
Published: (2025)
by: Chen, Qian, et al.
Published: (2025)
Universal Adversarial Perturbations for Vision-Language Pre-trained Models
by: Zhang, Peng-Fei, et al.
Published: (2024)
by: Zhang, Peng-Fei, et al.
Published: (2024)
Rational Design of Cobalt Phthalocyanine (CoPc)‐Anchored TiO 2 Nanorods for High‐Efficiency Selective Catalytic Oxidation
by: Simeng Zhu, et al.
Published: (2024)
by: Simeng Zhu, et al.
Published: (2024)
Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting
by: Liu, Hao, et al.
Published: (2024)
by: Liu, Hao, et al.
Published: (2024)
KeyVideoLLM: Towards Large-scale Video Keyframe Selection
by: Liang, Hao, et al.
Published: (2024)
by: Liang, Hao, et al.
Published: (2024)
MultiOrg: A Multi-rater Organoid-detection Dataset
by: Bukas, Christina, et al.
Published: (2024)
by: Bukas, Christina, et al.
Published: (2024)
Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models
by: Zheng, Junhao, et al.
Published: (2023)
by: Zheng, Junhao, et al.
Published: (2023)
Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection
by: Peng, Xinlin, et al.
Published: (2024)
by: Peng, Xinlin, et al.
Published: (2024)
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
by: Wang, Shaobo, et al.
Published: (2026)
by: Wang, Shaobo, et al.
Published: (2026)
Diversified and Personalized Multi-rater Medical Image Segmentation
by: Wu, Yicheng, et al.
Published: (2024)
by: Wu, Yicheng, et al.
Published: (2024)
Multi-rater Prompting for Ambiguous Medical Image Segmentation
by: Wang, Jinhong, et al.
Published: (2024)
by: Wang, Jinhong, et al.
Published: (2024)
ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution
by: Zhou, Yuanbo, et al.
Published: (2024)
by: Zhou, Yuanbo, et al.
Published: (2024)
FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models
by: Li, Wei, et al.
Published: (2024)
by: Li, Wei, et al.
Published: (2024)
Fractional Denoising for 3D Molecular Pre-training
by: Feng, Shikun, et al.
Published: (2023)
by: Feng, Shikun, et al.
Published: (2023)
MAA: Meticulous Adversarial Attack against Vision-Language Pre-trained Models
by: Zhang, Peng-Fei, et al.
Published: (2025)
by: Zhang, Peng-Fei, et al.
Published: (2025)
Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model
by: Ma, Shuailei, et al.
Published: (2023)
by: Ma, Shuailei, et al.
Published: (2023)
PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction
by: Xu, Xingjian, et al.
Published: (2024)
by: Xu, Xingjian, et al.
Published: (2024)
Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training
by: Zeng, Shuang, et al.
Published: (2023)
by: Zeng, Shuang, et al.
Published: (2023)
SwiftTS: A Swift Selection Framework for Time Series Pre-trained Models via Multi-task Meta-Learning
by: Zhang, Tengxue, et al.
Published: (2025)
by: Zhang, Tengxue, et al.
Published: (2025)
Similar Items
-
Topic Over Source: The Key to Effective Data Mixing for Language Models Pre-training
by: Peng, Jiahui, et al.
Published: (2025) -
Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration
by: Bai, Tianyi, et al.
Published: (2024) -
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models
by: Zhang, Chi, et al.
Published: (2024) -
Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification
by: Bai, Tianyi, et al.
Published: (2025) -
VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL
by: Hu, Zengjie, et al.
Published: (2025)