Saved in:
| Main Authors: | Li, Boxun, Li, Yadong, Li, Zhiyuan, Liu, Congyi, Liu, Weilin, Niu, Guowei, Tan, Zheyue, Xu, Haiyang, Yao, Zhuyu, Yuan, Tao, Zhou, Dong, Zhuang, Yueqing, Yan, Shengen, Dai, Guohao, Wang, Yu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.15803 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Megrez2 Technical Report
by: Li, Boxun, et al.
Published: (2025)
by: Li, Boxun, et al.
Published: (2025)
ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts
by: Tan, Zheyue, et al.
Published: (2025)
by: Tan, Zheyue, et al.
Published: (2025)
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K
by: Yuan, Tao, et al.
Published: (2024)
by: Yuan, Tao, et al.
Published: (2024)
Baichuan-Omni Technical Report
by: Li, Yadong, et al.
Published: (2024)
by: Li, Yadong, et al.
Published: (2024)
Kling-Omni Technical Report
by: Kling Team, et al.
Published: (2025)
by: Kling Team, et al.
Published: (2025)
EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models
by: Tan, Zheyue, et al.
Published: (2025)
by: Tan, Zheyue, et al.
Published: (2025)
SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model
by: Lin, Lin, et al.
Published: (2025)
by: Lin, Lin, et al.
Published: (2025)
Baichuan-Omni-1.5 Technical Report
by: Li, Yadong, et al.
Published: (2025)
by: Li, Yadong, et al.
Published: (2025)
BitSnap: Checkpoint Sparsification and Quantization in LLM Training
by: Peng, Yanxin, et al.
Published: (2025)
by: Peng, Yanxin, et al.
Published: (2025)
Logics-Parsing-Omni Technical Report
by: An, Xin, et al.
Published: (2026)
by: An, Xin, et al.
Published: (2026)
LongCat-Flash-Omni Technical Report
by: Meituan LongCat Team, et al.
Published: (2025)
by: Meituan LongCat Team, et al.
Published: (2025)
CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
by: Wang, Luning, et al.
Published: (2024)
by: Wang, Luning, et al.
Published: (2024)
Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation
by: Liu, Enshu, et al.
Published: (2025)
by: Liu, Enshu, et al.
Published: (2025)
Qwen3-Omni Technical Report
by: Xu, Jin, et al.
Published: (2025)
by: Xu, Jin, et al.
Published: (2025)
OmniFusion Technical Report
by: Goncharova, Elizaveta, et al.
Published: (2024)
by: Goncharova, Elizaveta, et al.
Published: (2024)
Evaluating Quantized Large Language Models
by: Li, Shiyao, et al.
Published: (2024)
by: Li, Shiyao, et al.
Published: (2024)
Baichuan Alignment Technical Report
by: Lin, Mingan, et al.
Published: (2024)
by: Lin, Mingan, et al.
Published: (2024)
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Vision Language Models
by: Fu, Tianyu, et al.
Published: (2024)
by: Fu, Tianyu, et al.
Published: (2024)
PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs
by: Liu, Tengxuan, et al.
Published: (2025)
by: Liu, Tengxuan, et al.
Published: (2025)
An Economical and Efficient Helium Recovery System for Vibration-Sensitive Applications
by: Yin, Zhiyuan, et al.
Published: (2024)
by: Yin, Zhiyuan, et al.
Published: (2024)
Qwen3.5-Omni Technical Report
by: Qwen Team
Published: (2026)
by: Qwen Team
Published: (2026)
Qwen2.5-Omni Technical Report
by: Xu, Jin, et al.
Published: (2025)
by: Xu, Jin, et al.
Published: (2025)
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
by: Chen, Wei, et al.
Published: (2024)
by: Chen, Wei, et al.
Published: (2024)
QuarkMed Medical Foundation Model Technical Report
by: Li, Ao, et al.
Published: (2025)
by: Li, Ao, et al.
Published: (2025)
COPF: An Online Framework for Deployment-Stable Counterfactual Fairness in Evolving Graphs
by: Li, Sheng'en, et al.
Published: (2026)
by: Li, Sheng'en, et al.
Published: (2026)
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
by: Zhao, Tianchen, et al.
Published: (2024)
by: Zhao, Tianchen, et al.
Published: (2024)
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
by: Fu, Tianyu, et al.
Published: (2025)
by: Fu, Tianyu, et al.
Published: (2025)
X-OmniClaw Technical Report: A Unified Mobile Agent for Multimodal Understanding and Interaction
by: Ren, Xiaoming, et al.
Published: (2026)
by: Ren, Xiaoming, et al.
Published: (2026)
Causal Relationship Between Autism Spectrum Disorder and Inflammatory Bowel Disease: A Bidirectional Mendelian Randomization Study
by: Weilin Li, et al.
Published: (2024)
by: Weilin Li, et al.
Published: (2024)
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
by: Yuan, Zhihang, et al.
Published: (2025)
by: Yuan, Zhihang, et al.
Published: (2025)
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
by: Liu, Enshu, et al.
Published: (2024)
by: Liu, Enshu, et al.
Published: (2024)
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
by: Yuan, Huining, et al.
Published: (2025)
by: Yuan, Huining, et al.
Published: (2025)
ABot-OCR Technical Report
by: Jiang, Kaitao, et al.
Published: (2026)
by: Jiang, Kaitao, et al.
Published: (2026)
Scripts for DN1 single cell data analysis
by: Hu, Shengen
Published: (2025)
by: Hu, Shengen
Published: (2025)
asvspoof2015 in WebDataset Format
by: Yadong, Niu
Published: (2025)
by: Yadong, Niu
Published: (2025)
vocalimitationset in WebDataset Format
by: Yadong, Niu
Published: (2025)
by: Yadong, Niu
Published: (2025)
CASTLE2026 Team WDL Technical Report
by: Li, Zhengyang, et al.
Published: (2026)
by: Li, Zhengyang, et al.
Published: (2026)
Improvement on Mechanical Properties through Acquiring Multiheterogeneous Austenite Morphologies in Warm‐Rolled 3Mn Steel
by: Haibo Wang, et al.
Published: (2025)
by: Haibo Wang, et al.
Published: (2025)
MBQ: Modality-Balanced Quantization for Large Vision-Language Models
by: Li, Shiyao, et al.
Published: (2024)
by: Li, Shiyao, et al.
Published: (2024)
Yi-Lightning Technical Report
by: Wake, Alan, et al.
Published: (2024)
by: Wake, Alan, et al.
Published: (2024)
Similar Items
-
Megrez2 Technical Report
by: Li, Boxun, et al.
Published: (2025) -
ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts
by: Tan, Zheyue, et al.
Published: (2025) -
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K
by: Yuan, Tao, et al.
Published: (2024) -
Baichuan-Omni Technical Report
by: Li, Yadong, et al.
Published: (2024) -
Kling-Omni Technical Report
by: Kling Team, et al.
Published: (2025)