Saved in:
| Main Authors: | Guo, Xiuyuan, Xu, Chengqi, Guo, Guinan, Zhu, Feiyu, Cai, Changpeng, Wang, Peizhe, Wei, Xiaoming, Su, Junhao, Gao, Jialin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.12780 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank
by: Zhu, Feiyu, et al.
Published: (2024)
by: Zhu, Feiyu, et al.
Published: (2024)
MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network
by: Zhang, Yuming, et al.
Published: (2024)
by: Zhang, Yuming, et al.
Published: (2024)
Rethinking Local Learning: A Cheaper and Faster Recipe for LLM Post-Training
by: Shi, Hengyu, et al.
Published: (2026)
by: Shi, Hengyu, et al.
Published: (2026)
SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation
by: Cai, Changpeng, et al.
Published: (2024)
by: Cai, Changpeng, et al.
Published: (2024)
Momentum Auxiliary Network for Supervised Local Learning
by: Su, Junhao, et al.
Published: (2024)
by: Su, Junhao, et al.
Published: (2024)
MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks
by: Su, Junhao, et al.
Published: (2025)
by: Su, Junhao, et al.
Published: (2025)
Fine, I'll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging
by: Su, Guinan, et al.
Published: (2025)
by: Su, Guinan, et al.
Published: (2025)
Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
by: Su, Guinan, et al.
Published: (2026)
by: Su, Guinan, et al.
Published: (2026)
Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
by: Geiping, Jonas, et al.
Published: (2025)
by: Geiping, Jonas, et al.
Published: (2025)
Replacement Learning: Training Vision Tasks with Fewer Learnable Parameters
by: Zhang, Yuming, et al.
Published: (2024)
by: Zhang, Yuming, et al.
Published: (2024)
HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion
by: Su, Junhao, et al.
Published: (2024)
by: Su, Junhao, et al.
Published: (2024)
AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models
by: Guo, Jihu, et al.
Published: (2025)
by: Guo, Jihu, et al.
Published: (2025)
Replacement Learning: Training Neural Networks with Fewer Parameters
by: Zhang, Yuming, et al.
Published: (2026)
by: Zhang, Yuming, et al.
Published: (2026)
GPU-accelerated Multi-relational Parallel Graph Retrieval for Web-scale Recommendations
by: Guo, Zhuoning, et al.
Published: (2025)
by: Guo, Zhuoning, et al.
Published: (2025)
Local Grammar Approach to Critical Discourse Analysis: A Case Study on the Discursive Representation of Climate Change in the UN News
by: Jun Ye, et al.
Published: (2024)
by: Jun Ye, et al.
Published: (2024)
An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference
by: Yao, Feiyu, et al.
Published: (2026)
by: Yao, Feiyu, et al.
Published: (2026)
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
by: Wan, Xinyi, et al.
Published: (2025)
by: Wan, Xinyi, et al.
Published: (2025)
MemFactory: Unified Inference & Training Framework for Agent Memory
by: Guo, Ziliang, et al.
Published: (2026)
by: Guo, Ziliang, et al.
Published: (2026)
Scalable Multi-QPU Circuit Design for Dicke State Preparation: Optimizing Communication Complexity and Local Circuit Costs
by: Chen, Ziheng, et al.
Published: (2026)
by: Chen, Ziheng, et al.
Published: (2026)
4-Pipeline Parallel Dispatch for Low-Bit GPU Neural Inference
by: Pirolo, Andres
Published: (2026)
by: Pirolo, Andres
Published: (2026)
A Flexible Programmable Pipeline Parallelism Framework for Efficient DNN Training
by: Jiang, Lijuan, et al.
Published: (2025)
by: Jiang, Lijuan, et al.
Published: (2025)
DawnPiper: A Memory-scablable Pipeline Parallel Training Framework
by: Peng, Xuan, et al.
Published: (2025)
by: Peng, Xuan, et al.
Published: (2025)
Heimdall++: Optimizing GPU Utilization and Pipeline Parallelism for Efficient Single-Pulse Detection
by: Xia, Bingzheng, et al.
Published: (2025)
by: Xia, Bingzheng, et al.
Published: (2025)
Faster Diffusion Sampling with Randomized Midpoints: Sequential and Parallel
by: Gupta, Shivam, et al.
Published: (2024)
by: Gupta, Shivam, et al.
Published: (2024)
SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading
by: Chen, Qiaoling, et al.
Published: (2025)
by: Chen, Qiaoling, et al.
Published: (2025)
LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation
by: Shi, Hengyu, et al.
Published: (2025)
by: Shi, Hengyu, et al.
Published: (2025)
Adaptra: Straggler-Resilient Hybrid-Parallel Training with Pipeline Adaptation
by: Wu, Tianyuan, et al.
Published: (2025)
by: Wu, Tianyuan, et al.
Published: (2025)
AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training
by: Chen, Ling, et al.
Published: (2026)
by: Chen, Ling, et al.
Published: (2026)
CollaPipe: Adaptive Segment-Optimized Pipeline Parallelism for Collaborative LLM Training in Heterogeneous Edge Networks
by: Chen, Jiewei, et al.
Published: (2025)
by: Chen, Jiewei, et al.
Published: (2025)
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation
by: Wang, Keheng, et al.
Published: (2024)
by: Wang, Keheng, et al.
Published: (2024)
Scaling Deep Learning Training with MPMD Pipeline Parallelism
by: Xhebraj, Anxhelo, et al.
Published: (2024)
by: Xhebraj, Anxhelo, et al.
Published: (2024)
Synergistic Tensor and Pipeline Parallelism
by: Qi, Mengshi, et al.
Published: (2025)
by: Qi, Mengshi, et al.
Published: (2025)
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization
by: Wu, Yize, et al.
Published: (2025)
by: Wu, Yize, et al.
Published: (2025)
SiPipe: Bridging the CPU-GPU Utilization Gap for Efficient Pipeline-Parallel LLM Inference
by: He, Yongchao, et al.
Published: (2025)
by: He, Yongchao, et al.
Published: (2025)
HARP: Orchestrating Automated Parallel Training on Heterogeneous GPU Clusters
by: Liang, Antian, et al.
Published: (2025)
by: Liang, Antian, et al.
Published: (2025)
ResiHP: Taming LLM Training Failures with Dynamic Hybrid Parallelism
by: Ma, Tenghui, et al.
Published: (2026)
by: Ma, Tenghui, et al.
Published: (2026)
A Readiness-Driven Runtime for Pipeline-Parallel Training under Runtime Variability
by: Liu, Ruitao, et al.
Published: (2026)
by: Liu, Ruitao, et al.
Published: (2026)
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge
by: Du, Chenpeng, et al.
Published: (2023)
by: Du, Chenpeng, et al.
Published: (2023)
Memory Efficient and Staleness Free Pipeline Parallel DNN Training Framework with Improved Convergence Speed
by: Dutta, Ankita, et al.
Published: (2025)
by: Dutta, Ankita, et al.
Published: (2025)
MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation
by: Yang, Yanwu, et al.
Published: (2025)
by: Yang, Yanwu, et al.
Published: (2025)
Similar Items
-
Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank
by: Zhu, Feiyu, et al.
Published: (2024) -
MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network
by: Zhang, Yuming, et al.
Published: (2024) -
Rethinking Local Learning: A Cheaper and Faster Recipe for LLM Post-Training
by: Shi, Hengyu, et al.
Published: (2026) -
SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation
by: Cai, Changpeng, et al.
Published: (2024) -
Momentum Auxiliary Network for Supervised Local Learning
by: Su, Junhao, et al.
Published: (2024)