:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wei, Hongyang, Liu, Hongbo, Wang, Zidong, Peng, Yi, Xu, Baixin, Wu, Size, Zhang, Xuying, He, Xianglong, Liu, Zexiang, Wang, Peiyu, Song, Xuchen, Li, Yangguang, Liu, Yang, Zhou, Yahui
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.15664
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
by: Wei, Hongyang, et al.
Published: (2025)

Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
by: Wang, Peiyu, et al.
Published: (2025)

Skywork-R1V3 Technical Report
by: Shen, Wei, et al.
Published: (2025)

Matrix-game 2.0: An open-source real-time and streaming interactive world model
by: He, Xianglong, et al.
Published: (2025)

Advances in GRPO for Generation Models: A Survey
by: Liu, Zexiang, et al.
Published: (2026)

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning
by: Wang, Xiaokun, et al.
Published: (2025)

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory
by: Wang, Zile, et al.
Published: (2026)

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning
by: Wang, Peiyu, et al.
Published: (2025)

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought
by: Peng, Yi, et al.
Published: (2025)

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch
by: Zhang, Yifan, et al.
Published: (2025)

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
by: Zeng, Liang, et al.
Published: (2025)

UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
by: Liu, Zexiang, et al.
Published: (2023)

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
by: Liu, Chris Yuhao, et al.
Published: (2024)

Skywork Open Reasoner 1 Technical Report
by: He, Jujie, et al.
Published: (2025)

ShapeGen: Towards High-Quality 3D Shape Synthesis
by: Li, Yangguang, et al.
Published: (2025)

MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs
by: He, Xianglong, et al.
Published: (2025)

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
by: Liu, Chris Yuhao, et al.
Published: (2025)

UniEP: Unified Expert-Parallel MoE MegaKernel for LLM Training
by: Zheng, Size, et al.
Published: (2026)

LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models
by: Zhao, Liang, et al.
Published: (2024)

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
by: Zeng, Liang, et al.
Published: (2024)

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
by: Zou, Kai, et al.
Published: (2025)

UniFormer: Unifying Convolution and Self-attention for Visual Recognition
by: Li, Kunchang, et al.
Published: (2022)

OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation
by: Wu, Size, et al.
Published: (2025)

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
by: Wei, Tianwen, et al.
Published: (2024)

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
by: Zheng, Dian, et al.
Published: (2026)

UniMo: Unified Motion Generation and Understanding with Chain of Thought
by: Wang, Guocun, et al.
Published: (2026)

Particle manipulation by hydrodynamic effects in vortical Stokes flow
by: Liu, Xuchen
Published: (2025)

Cryptocephalus inhumeralis Pic 1922
by: Duan, Wen-Yuan, et al.
Published: (2025)

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
by: Zhang, Xuying, et al.
Published: (2024)

UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and Perception
by: Song, Xinyang, et al.
Published: (2025)

CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs
by: Jian, Ai, et al.
Published: (2025)

UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
by: Chu, Xuangeng, et al.
Published: (2025)

ResFormer: All-Time Reservoir Memory for Long Sequence Classification
by: Liu, Hongbo, et al.
Published: (2025)

UniShield: Unified Face Attack Detection via KG-Informed Multimodal Reasoning
by: Li, Hongrui, et al.
Published: (2026)

UniVideo: Unified Understanding, Generation, and Editing for Videos
by: Wei, Cong, et al.
Published: (2025)

UniHM: Unified Dexterous Hand Manipulation with Vision Language Model
by: Zhang, Zhenhao, et al.
Published: (2026)

Squrve: A Unified and Modular Framework for Complex Real-World Text-to-SQL Tasks
by: Wang, Yihan, et al.
Published: (2025)

Uni-Animator: Towards Unified Visual Colorization
by: Chen, Xinyuan, et al.
Published: (2026)

Does Unification Come at a Cost? Uni-SafeBench: A Safety Benchmark for Unified Multimodal Large Models
by: Peng, Zixiang, et al.
Published: (2026)