Saved in:
| Main Authors: | Chu, Zhendong, Xie, Jian, Wang, Shen, Wang, Zichao, Wen, Qingsong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.20701 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLM Agents for Education: Advances and Applications
by: Chu, Zhendong, et al.
Published: (2025)
by: Chu, Zhendong, et al.
Published: (2025)
ARM2: Adaptive Reasoning Model with Vision Understanding and Executable Code
by: Xie, Jian, et al.
Published: (2025)
by: Xie, Jian, et al.
Published: (2025)
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
by: Yan, Yibo, et al.
Published: (2025)
by: Yan, Yibo, et al.
Published: (2025)
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
by: Wang, Zhecan, et al.
Published: (2023)
by: Wang, Zhecan, et al.
Published: (2023)
PairUni: Pairwise Training for Unified Multimodal Language Models
by: Zheng, Jiani, et al.
Published: (2025)
by: Zheng, Jiani, et al.
Published: (2025)
Large Language Models for Education: A Survey and Outlook
by: Wang, Shen, et al.
Published: (2024)
by: Wang, Shen, et al.
Published: (2024)
ESURF: Simple and Effective EDU Segmentation
by: Sediqin, Mohammadreza, et al.
Published: (2025)
by: Sediqin, Mohammadreza, et al.
Published: (2025)
UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models
by: Chen, Qizhou, et al.
Published: (2025)
by: Chen, Qizhou, et al.
Published: (2025)
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
by: Ye, Jingheng, et al.
Published: (2025)
by: Ye, Jingheng, et al.
Published: (2025)
UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs
by: Jiang, Houcheng, et al.
Published: (2026)
by: Jiang, Houcheng, et al.
Published: (2026)
AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction
by: Xu, Tianlong, et al.
Published: (2024)
by: Xu, Tianlong, et al.
Published: (2024)
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
by: Wan, Zhongwei, et al.
Published: (2023)
by: Wan, Zhongwei, et al.
Published: (2023)
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
by: Bawazir, Ameera, et al.
Published: (2024)
by: Bawazir, Ameera, et al.
Published: (2024)
UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation
by: Gao, Jun, et al.
Published: (2024)
by: Gao, Jun, et al.
Published: (2024)
UniChange: Unifying Change Detection with Multimodal Large Language Model
by: Zhang, Xu, et al.
Published: (2025)
by: Zhang, Xu, et al.
Published: (2025)
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
by: Yan, Yibo, et al.
Published: (2024)
by: Yan, Yibo, et al.
Published: (2024)
SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents
by: Hu, Wentao, et al.
Published: (2026)
by: Hu, Wentao, et al.
Published: (2026)
UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets
by: Wang, Pengyu, et al.
Published: (2025)
by: Wang, Pengyu, et al.
Published: (2025)
UniMoT: Unified Molecule-Text Language Model with Discrete Token Representation
by: Guo, Shuhan, et al.
Published: (2024)
by: Guo, Shuhan, et al.
Published: (2024)
UniDial-EvalKit: A Unified Toolkit for Evaluating Multi-Faceted Conversational Abilities
by: Jia, Qi, et al.
Published: (2026)
by: Jia, Qi, et al.
Published: (2026)
MIRL: Mutual Information-Guided Reinforcement Learning for Vision-Language Models
by: Zhang, Yin, et al.
Published: (2026)
by: Zhang, Yin, et al.
Published: (2026)
UniARM: Towards a Unified Autoregressive Reward Model for Multi-Objective Test-Time Alignment
by: Xie, Hongyan, et al.
Published: (2026)
by: Xie, Hongyan, et al.
Published: (2026)
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
by: Zheng, Sipeng, et al.
Published: (2024)
by: Zheng, Sipeng, et al.
Published: (2024)
MathEDU: Feedback Generation on Problem-Solving Processes for Mathematical Learning Support
by: Hsu, Wei-Ling, et al.
Published: (2025)
by: Hsu, Wei-Ling, et al.
Published: (2025)
Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
by: Qin, Luozheng, et al.
Published: (2025)
by: Qin, Luozheng, et al.
Published: (2025)
UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause
by: Hu, Guimin, et al.
Published: (2024)
by: Hu, Guimin, et al.
Published: (2024)
CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries
by: Liu, Shudong, et al.
Published: (2025)
by: Liu, Shudong, et al.
Published: (2025)
UniECG: Understanding and Generating ECG in One Unified Model
by: Jin, Jiarui, et al.
Published: (2025)
by: Jin, Jiarui, et al.
Published: (2025)
UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations
by: Mo, Fengran, et al.
Published: (2025)
by: Mo, Fengran, et al.
Published: (2025)
PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
by: Yang, Dingkang, et al.
Published: (2024)
by: Yang, Dingkang, et al.
Published: (2024)
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
by: Girrbach, Leander, et al.
Published: (2024)
by: Girrbach, Leander, et al.
Published: (2024)
CulturePark: Boosting Cross-cultural Understanding in Large Language Models
by: Li, Cheng, et al.
Published: (2024)
by: Li, Cheng, et al.
Published: (2024)
UniSD: Towards a Unified Self-Distillation Framework for Large Language Models
by: Jin, Yiqiao, et al.
Published: (2026)
by: Jin, Yiqiao, et al.
Published: (2026)
EDU-NER-2025: Named Entity Recognition in Urdu Educational Texts using XLM-RoBERTa with X (formerly Twitter)
by: Ullah, Fida, et al.
Published: (2025)
by: Ullah, Fida, et al.
Published: (2025)
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
by: Shi, Dachuan, et al.
Published: (2023)
by: Shi, Dachuan, et al.
Published: (2023)
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
by: Pham, Trinh, et al.
Published: (2024)
by: Pham, Trinh, et al.
Published: (2024)
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
by: Zhu, Minjie, et al.
Published: (2024)
by: Zhu, Minjie, et al.
Published: (2024)
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges
by: Yan, Yibo, et al.
Published: (2024)
by: Yan, Yibo, et al.
Published: (2024)
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
by: Liu, Lei, et al.
Published: (2024)
by: Liu, Lei, et al.
Published: (2024)
Less Redundancy: Boosting Practicality of Vision Language Model in Walking Assistants
by: Li, Chongyang, et al.
Published: (2025)
by: Li, Chongyang, et al.
Published: (2025)
Similar Items
-
LLM Agents for Education: Advances and Applications
by: Chu, Zhendong, et al.
Published: (2025) -
ARM2: Adaptive Reasoning Model with Vision Understanding and Executable Code
by: Xie, Jian, et al.
Published: (2025) -
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
by: Yan, Yibo, et al.
Published: (2025) -
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
by: Wang, Zhecan, et al.
Published: (2023) -
PairUni: Pairwise Training for Unified Multimodal Language Models
by: Zheng, Jiani, et al.
Published: (2025)