Saved in:
| Main Authors: | Li, Conglong, Yao, Zhewei, Wu, Xiaoxia, Zhang, Minjia, Holmes, Connor, Li, Cheng, He, Yuxiong |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2212.03597 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
by: Holmes, Connor, et al.
Published: (2024)
by: Holmes, Connor, et al.
Published: (2024)
Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads
by: Trinh, Huy, et al.
Published: (2026)
by: Trinh, Huy, et al.
Published: (2026)
MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility
by: He, Yexiao, et al.
Published: (2025)
by: He, Yexiao, et al.
Published: (2025)
DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science
by: Shu, Fan, et al.
Published: (2026)
by: Shu, Fan, et al.
Published: (2026)
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
by: Qiao, Aurick, et al.
Published: (2024)
by: Qiao, Aurick, et al.
Published: (2024)
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback
by: Zhai, Bohan, et al.
Published: (2025)
by: Zhai, Bohan, et al.
Published: (2025)
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences
by: Bekman, Stas, et al.
Published: (2025)
by: Bekman, Stas, et al.
Published: (2025)
Learning to Hint for Reinforcement Learning
by: Xia, Yu, et al.
Published: (2026)
by: Xia, Yu, et al.
Published: (2026)
Computational, Label, and Data Efficiency in Deep Learning for Sparse 3D Data
by: Li, Lanxiao
Published: (2024)
by: Li, Lanxiao
Published: (2024)
Deep Learning HASTE for Upper Abdominal MRI: Improved Image Quality, Speed, and Energy Efficiency in a Prospective Study
by: Jennifer Gotta, et al.
Published: (2025)
by: Jennifer Gotta, et al.
Published: (2025)
Learning to Self-Evolve
by: Chen, Xiaoyin, et al.
Published: (2026)
by: Chen, Xiaoyin, et al.
Published: (2026)
R$^3$-SQL: Ranking Reward and Resampling for Text-to-SQL
by: Han, Hojae, et al.
Published: (2026)
by: Han, Hojae, et al.
Published: (2026)
Data-Efficient Training by Evolved Sampling
by: Cheng, Ziheng, et al.
Published: (2025)
by: Cheng, Ziheng, et al.
Published: (2025)
FastPersist: Accelerating Model Checkpointing in Deep Learning
by: Wang, Guanhua, et al.
Published: (2024)
by: Wang, Guanhua, et al.
Published: (2024)
Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization
by: Kim, Hyeonah, et al.
Published: (2023)
by: Kim, Hyeonah, et al.
Published: (2023)
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
by: Wang, Zhaoyang, et al.
Published: (2026)
by: Wang, Zhaoyang, et al.
Published: (2026)
On Improving the Algorithm-, Model-, and Data- Efficiency of Self-Supervised Learning
by: Cao, Yun-Hao, et al.
Published: (2024)
by: Cao, Yun-Hao, et al.
Published: (2024)
Inference Scaling for Bridging Retrieval and Augmented Generation
by: Lee, Youngwon, et al.
Published: (2024)
by: Lee, Youngwon, et al.
Published: (2024)
CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation
by: Lee, Youngwon, et al.
Published: (2024)
by: Lee, Youngwon, et al.
Published: (2024)
Energy-Efficient Prediction in Textile Manufacturing: Enhancing Accuracy and Data Efficiency With Ensemble Deep Transfer Learning
by: Chen, Yan-Chen, et al.
Published: (2026)
by: Chen, Yan-Chen, et al.
Published: (2026)
Efficiency Robustness of Dynamic Deep Learning Systems
by: Rathnasuriya, Ravishka, et al.
Published: (2025)
by: Rathnasuriya, Ravishka, et al.
Published: (2025)
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
by: Xia, Haojun, et al.
Published: (2024)
by: Xia, Haojun, et al.
Published: (2024)
Deep Learning for Data-Driven Districting-and-Routing
by: Ferraz, Arthur, et al.
Published: (2024)
by: Ferraz, Arthur, et al.
Published: (2024)
Improving Sampling Efficiency in RLVR through Adaptive Rollout and Response Reuse
by: Zhang, Yuheng, et al.
Published: (2025)
by: Zhang, Yuheng, et al.
Published: (2025)
One Sample to Rule Them All: Extreme Data Efficiency in Multidiscipline Reasoning with Reinforcement Learning
by: Li, Yiyuan, et al.
Published: (2026)
by: Li, Yiyuan, et al.
Published: (2026)
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
by: Wilson, Alec, et al.
Published: (2024)
by: Wilson, Alec, et al.
Published: (2024)
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
by: Lee, Jaeseong, et al.
Published: (2024)
by: Lee, Jaeseong, et al.
Published: (2024)
Training Data Efficiency in Multimodal Process Reward Models
by: Li, Jinyuan, et al.
Published: (2026)
by: Li, Jinyuan, et al.
Published: (2026)
Improving Memory Efficiency for Training KANs via Meta Learning
by: Zhao, Zhangchi, et al.
Published: (2025)
by: Zhao, Zhangchi, et al.
Published: (2025)
An Empirical Investigation of Speed Patterns on S‐Curves Using Naturalistic Driving Data and Mixed Logit Model
by: Cailin Lei, et al.
Published: (2025)
by: Cailin Lei, et al.
Published: (2025)
Data-Aware Training Quality Monitoring and Certification for Reliable Deep Learning
by: Yeganegi, Farhang, et al.
Published: (2024)
by: Yeganegi, Farhang, et al.
Published: (2024)
To Train or Not to Train: Balancing Efficiency and Training Cost in Deep Reinforcement Learning for Mobile Edge Computing
by: Boscaro, Maddalena, et al.
Published: (2024)
by: Boscaro, Maddalena, et al.
Published: (2024)
Deep Reinforcement Learning‐Driven Cooperative Routing for Energy Efficiency in Wireless Multimedia Sensor Networks
by: M. Nagalingayya, et al.
Published: (2025)
by: M. Nagalingayya, et al.
Published: (2025)
Weighted Low-rank Approximation via Stochastic Gradient Descent on Manifolds
by: Xu, Conglong, et al.
Published: (2025)
by: Xu, Conglong, et al.
Published: (2025)
Optical Routing via High Efficiency Composite Acoustic Diffraction
by: Zhao, Yuxiang, et al.
Published: (2024)
by: Zhao, Yuxiang, et al.
Published: (2024)
Training a General Spiking Neural Network with Improved Efficiency and Minimum Latency
by: Yao, Yunpeng, et al.
Published: (2024)
by: Yao, Yunpeng, et al.
Published: (2024)
Improving the Trade-off Between Watermark Strength and Speculative Sampling Efficiency for Language Models
by: He, Weiqing, et al.
Published: (2026)
by: He, Weiqing, et al.
Published: (2026)
Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution
by: Kowal, Matthew, et al.
Published: (2026)
by: Kowal, Matthew, et al.
Published: (2026)
Safety Filtering While Training: Improving the Performance and Sample Efficiency of Reinforcement Learning Agents
by: Bejarano, Federico Pizarro, et al.
Published: (2024)
by: Bejarano, Federico Pizarro, et al.
Published: (2024)
Quality over Quantity: Boosting Data Efficiency Through Ensembled Multimodal Data Curation
by: Xu, Jinda, et al.
Published: (2025)
by: Xu, Jinda, et al.
Published: (2025)
Similar Items
-
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
by: Holmes, Connor, et al.
Published: (2024) -
Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads
by: Trinh, Huy, et al.
Published: (2026) -
MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility
by: He, Yexiao, et al.
Published: (2025) -
DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science
by: Shu, Fan, et al.
Published: (2026) -
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
by: Qiao, Aurick, et al.
Published: (2024)