Saved in:
| Main Authors: | Wei, Hongyang, Liu, Hongbo, Wang, Zidong, Peng, Yi, Xu, Baixin, Wu, Size, Zhang, Xuying, He, Xianglong, Liu, Zexiang, Wang, Peiyu, Song, Xuchen, Li, Yangguang, Liu, Yang, Zhou, Yahui |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.15664 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
by: Wei, Hongyang, et al.
Published: (2025)
by: Wei, Hongyang, et al.
Published: (2025)
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
by: Wang, Peiyu, et al.
Published: (2025)
by: Wang, Peiyu, et al.
Published: (2025)
Skywork-R1V3 Technical Report
by: Shen, Wei, et al.
Published: (2025)
by: Shen, Wei, et al.
Published: (2025)
Matrix-game 2.0: An open-source real-time and streaming interactive world model
by: He, Xianglong, et al.
Published: (2025)
by: He, Xianglong, et al.
Published: (2025)
Advances in GRPO for Generation Models: A Survey
by: Liu, Zexiang, et al.
Published: (2026)
by: Liu, Zexiang, et al.
Published: (2026)
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning
by: Wang, Xiaokun, et al.
Published: (2025)
by: Wang, Xiaokun, et al.
Published: (2025)
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory
by: Wang, Zile, et al.
Published: (2026)
by: Wang, Zile, et al.
Published: (2026)
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning
by: Wang, Peiyu, et al.
Published: (2025)
by: Wang, Peiyu, et al.
Published: (2025)
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought
by: Peng, Yi, et al.
Published: (2025)
by: Peng, Yi, et al.
Published: (2025)
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch
by: Zhang, Yifan, et al.
Published: (2025)
by: Zhang, Yifan, et al.
Published: (2025)
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
by: Zeng, Liang, et al.
Published: (2025)
by: Zeng, Liang, et al.
Published: (2025)
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
by: Liu, Zexiang, et al.
Published: (2023)
by: Liu, Zexiang, et al.
Published: (2023)
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
by: Liu, Chris Yuhao, et al.
Published: (2024)
by: Liu, Chris Yuhao, et al.
Published: (2024)
Skywork Open Reasoner 1 Technical Report
by: He, Jujie, et al.
Published: (2025)
by: He, Jujie, et al.
Published: (2025)
ShapeGen: Towards High-Quality 3D Shape Synthesis
by: Li, Yangguang, et al.
Published: (2025)
by: Li, Yangguang, et al.
Published: (2025)
MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs
by: He, Xianglong, et al.
Published: (2025)
by: He, Xianglong, et al.
Published: (2025)
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)
by: Wang, Dianyi, et al.
Published: (2026)
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
by: Liu, Chris Yuhao, et al.
Published: (2025)
by: Liu, Chris Yuhao, et al.
Published: (2025)
UniEP: Unified Expert-Parallel MoE MegaKernel for LLM Training
by: Zheng, Size, et al.
Published: (2026)
by: Zheng, Size, et al.
Published: (2026)
LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models
by: Zhao, Liang, et al.
Published: (2024)
by: Zhao, Liang, et al.
Published: (2024)
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
by: Zeng, Liang, et al.
Published: (2024)
by: Zeng, Liang, et al.
Published: (2024)
Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
by: Zou, Kai, et al.
Published: (2025)
by: Zou, Kai, et al.
Published: (2025)
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
by: Li, Kunchang, et al.
Published: (2022)
by: Li, Kunchang, et al.
Published: (2022)
OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation
by: Wu, Size, et al.
Published: (2025)
by: Wu, Size, et al.
Published: (2025)
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
by: Wei, Tianwen, et al.
Published: (2024)
by: Wei, Tianwen, et al.
Published: (2024)
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
by: Zheng, Dian, et al.
Published: (2026)
by: Zheng, Dian, et al.
Published: (2026)
UniMo: Unified Motion Generation and Understanding with Chain of Thought
by: Wang, Guocun, et al.
Published: (2026)
by: Wang, Guocun, et al.
Published: (2026)
Particle manipulation by hydrodynamic effects in vortical Stokes flow
by: Liu, Xuchen
Published: (2025)
by: Liu, Xuchen
Published: (2025)
Cryptocephalus inhumeralis Pic 1922
by: Duan, Wen-Yuan, et al.
Published: (2025)
by: Duan, Wen-Yuan, et al.
Published: (2025)
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
by: Zhang, Xuying, et al.
Published: (2024)
by: Zhang, Xuying, et al.
Published: (2024)
UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and Perception
by: Song, Xinyang, et al.
Published: (2025)
by: Song, Xinyang, et al.
Published: (2025)
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs
by: Jian, Ai, et al.
Published: (2025)
by: Jian, Ai, et al.
Published: (2025)
UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
by: Chu, Xuangeng, et al.
Published: (2025)
by: Chu, Xuangeng, et al.
Published: (2025)
ResFormer: All-Time Reservoir Memory for Long Sequence Classification
by: Liu, Hongbo, et al.
Published: (2025)
by: Liu, Hongbo, et al.
Published: (2025)
UniShield: Unified Face Attack Detection via KG-Informed Multimodal Reasoning
by: Li, Hongrui, et al.
Published: (2026)
by: Li, Hongrui, et al.
Published: (2026)
UniVideo: Unified Understanding, Generation, and Editing for Videos
by: Wei, Cong, et al.
Published: (2025)
by: Wei, Cong, et al.
Published: (2025)
UniHM: Unified Dexterous Hand Manipulation with Vision Language Model
by: Zhang, Zhenhao, et al.
Published: (2026)
by: Zhang, Zhenhao, et al.
Published: (2026)
Squrve: A Unified and Modular Framework for Complex Real-World Text-to-SQL Tasks
by: Wang, Yihan, et al.
Published: (2025)
by: Wang, Yihan, et al.
Published: (2025)
Uni-Animator: Towards Unified Visual Colorization
by: Chen, Xinyuan, et al.
Published: (2026)
by: Chen, Xinyuan, et al.
Published: (2026)
Does Unification Come at a Cost? Uni-SafeBench: A Safety Benchmark for Unified Multimodal Large Models
by: Peng, Zixiang, et al.
Published: (2026)
by: Peng, Zixiang, et al.
Published: (2026)
Similar Items
-
Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
by: Wei, Hongyang, et al.
Published: (2025) -
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
by: Wang, Peiyu, et al.
Published: (2025) -
Skywork-R1V3 Technical Report
by: Shen, Wei, et al.
Published: (2025) -
Matrix-game 2.0: An open-source real-time and streaming interactive world model
by: He, Xianglong, et al.
Published: (2025) -
Advances in GRPO for Generation Models: A Survey
by: Liu, Zexiang, et al.
Published: (2026)