Guardado en:
| Autores principales: | Xu, Yang, Shi, Huihong, Wang, Zhongfeng |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2409.04829 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design
por: Liang, Yanbiao, et al.
Publicado: (2025)
por: Liang, Yanbiao, et al.
Publicado: (2025)
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
por: Ji, Mengfei, et al.
Publicado: (2024)
por: Ji, Mengfei, et al.
Publicado: (2024)
M$^2$-ViT: Accelerating Hybrid Vision Transformers with Two-Level Mixed Quantization
por: Liang, Yanbiao, et al.
Publicado: (2024)
por: Liang, Yanbiao, et al.
Publicado: (2024)
An FPGA-Based Reconfigurable Accelerator for Convolution-Transformer Hybrid EfficientViT
por: Shao, Haikuo, et al.
Publicado: (2024)
por: Shao, Haikuo, et al.
Publicado: (2024)
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
por: You, Haoran, et al.
Publicado: (2022)
por: You, Haoran, et al.
Publicado: (2022)
ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
por: You, Haoran, et al.
Publicado: (2023)
por: You, Haoran, et al.
Publicado: (2023)
P$^2$-ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer
por: Shi, Huihong, et al.
Publicado: (2024)
por: Shi, Huihong, et al.
Publicado: (2024)
Trio-ViT: Post-Training Quantization and Acceleration for Softmax-Free Efficient Vision Transformer
por: Shi, Huihong, et al.
Publicado: (2024)
por: Shi, Huihong, et al.
Publicado: (2024)
Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores
por: Ma, Shaobo, et al.
Publicado: (2024)
por: Ma, Shaobo, et al.
Publicado: (2024)
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
por: You, Haoran, et al.
Publicado: (2024)
por: You, Haoran, et al.
Publicado: (2024)
Is Data Shapley Not Better than Random in Data Selection? Ask NASH
por: Tian, Xiao, et al.
Publicado: (2026)
por: Tian, Xiao, et al.
Publicado: (2026)
Graph Neural Architecture Search with GPT-4
por: Wang, Haishuai, et al.
Publicado: (2023)
por: Wang, Haishuai, et al.
Publicado: (2023)
Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment
por: Ji, Yuhao, et al.
Publicado: (2024)
por: Ji, Yuhao, et al.
Publicado: (2024)
APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration
por: Ma, Shaobo, et al.
Publicado: (2025)
por: Ma, Shaobo, et al.
Publicado: (2025)
Multi-Objective Neural Architecture Search by Learning Search Space Partitions
por: Zhao, Yiyang, et al.
Publicado: (2024)
por: Zhao, Yiyang, et al.
Publicado: (2024)
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search
por: Gao, Yang, et al.
Publicado: (2025)
por: Gao, Yang, et al.
Publicado: (2025)
MicroNAS: Zero-Shot Neural Architecture Search for MCUs
por: Qiao, Ye, et al.
Publicado: (2024)
por: Qiao, Ye, et al.
Publicado: (2024)
Causal-aware Graph Neural Architecture Search under Distribution Shifts
por: Li, Peiwen, et al.
Publicado: (2024)
por: Li, Peiwen, et al.
Publicado: (2024)
Knowledge-aware Evolutionary Graph Neural Architecture Search
por: Wang, Chao, et al.
Publicado: (2024)
por: Wang, Chao, et al.
Publicado: (2024)
OptiProxy-NAS: Optimization Proxy based End-to-End Neural Architecture Search
por: Lyu, Bo, et al.
Publicado: (2025)
por: Lyu, Bo, et al.
Publicado: (2025)
Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness
por: Wang, Dingrong, et al.
Publicado: (2024)
por: Wang, Dingrong, et al.
Publicado: (2024)
Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision
por: Zhang, Zeyang, et al.
Publicado: (2024)
por: Zhang, Zeyang, et al.
Publicado: (2024)
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
por: Gu, Yuxian, et al.
Publicado: (2025)
por: Gu, Yuxian, et al.
Publicado: (2025)
Transferrable Surrogates in Expressive Neural Architecture Search Spaces
por: Qin, Shiwen, et al.
Publicado: (2025)
por: Qin, Shiwen, et al.
Publicado: (2025)
Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search
por: Liu, Zhen, et al.
Publicado: (2026)
por: Liu, Zhen, et al.
Publicado: (2026)
DPFNAS: Differential Privacy-Enhanced Federated Neural Architecture Search for 6G Edge Intelligence
por: Lv, Yang, et al.
Publicado: (2025)
por: Lv, Yang, et al.
Publicado: (2025)
A Lightweight Neural Architecture Search Model for Medical Image Classification
por: Xie, Lunchen, et al.
Publicado: (2024)
por: Xie, Lunchen, et al.
Publicado: (2024)
SeqNAS: Neural Architecture Search for Event Sequence Classification
por: Udovichenko, Igor, et al.
Publicado: (2024)
por: Udovichenko, Igor, et al.
Publicado: (2024)
A Survey on Neural Architecture Search Based on Reinforcement Learning
por: Shao, Wenzhu
Publicado: (2024)
por: Shao, Wenzhu
Publicado: (2024)
Weight-Entanglement Meets Gradient-Based Neural Architecture Search
por: Sukthanker, Rhea Sanjay, et al.
Publicado: (2023)
por: Sukthanker, Rhea Sanjay, et al.
Publicado: (2023)
Neural Architecture Search: Two Constant Shared Weights Initialisations
por: Gracheva, Ekaterina
Publicado: (2023)
por: Gracheva, Ekaterina
Publicado: (2023)
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
por: Wang, Junxiong, et al.
Publicado: (2024)
por: Wang, Junxiong, et al.
Publicado: (2024)
PlatformX: An End-to-End Transferable Platform for Energy-Efficient Neural Architecture Search
por: Tu, Xiaolong, et al.
Publicado: (2025)
por: Tu, Xiaolong, et al.
Publicado: (2025)
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
por: Yang, Yuchen, et al.
Publicado: (2024)
por: Yang, Yuchen, et al.
Publicado: (2024)
SEval-NAS: A Search-Agnostic Evaluation for Neural Architecture Search
por: Mih, Atah Nuh, et al.
Publicado: (2026)
por: Mih, Atah Nuh, et al.
Publicado: (2026)
Kernel-Level Energy-Efficient Neural Architecture Search for Tabular Dataset
por: La, Hoang-Loc, et al.
Publicado: (2025)
por: La, Hoang-Loc, et al.
Publicado: (2025)
Zero-Shot Neural Architecture Search with Weighted Response Correlation
por: Jing, Kun, et al.
Publicado: (2025)
por: Jing, Kun, et al.
Publicado: (2025)
Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset
por: Sheng, Yi, et al.
Publicado: (2024)
por: Sheng, Yi, et al.
Publicado: (2024)
Learning to Reduce Search Space for Generalizable Neural Routing Solver
por: Zhou, Changliang, et al.
Publicado: (2025)
por: Zhou, Changliang, et al.
Publicado: (2025)
Beyond Frequency: The Role of Redundancy in Large Language Model Memorization
por: Zhang, Jie, et al.
Publicado: (2025)
por: Zhang, Jie, et al.
Publicado: (2025)
Ejemplares similares
-
AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design
por: Liang, Yanbiao, et al.
Publicado: (2025) -
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
por: Ji, Mengfei, et al.
Publicado: (2024) -
M$^2$-ViT: Accelerating Hybrid Vision Transformers with Two-Level Mixed Quantization
por: Liang, Yanbiao, et al.
Publicado: (2024) -
An FPGA-Based Reconfigurable Accelerator for Convolution-Transformer Hybrid EfficientViT
por: Shao, Haikuo, et al.
Publicado: (2024) -
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
por: You, Haoran, et al.
Publicado: (2022)