Saved in:
| Main Authors: | Bi, Xiaohan, Qi, Binhang, Sun, Hailong, Gao, Xiang, Yu, Yue, Liang, Xiaojun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.11348 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
by: Hintersdorf, Dominik, et al.
Published: (2024)
by: Hintersdorf, Dominik, et al.
Published: (2024)
Training Video Foundation Models with NVIDIA NeMo
by: Patel, Zeeshan, et al.
Published: (2025)
by: Patel, Zeeshan, et al.
Published: (2025)
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
by: Shen, Gerald, et al.
Published: (2024)
by: Shen, Gerald, et al.
Published: (2024)
CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
by: Yang, Zongzhen, et al.
Published: (2025)
by: Yang, Zongzhen, et al.
Published: (2025)
NeMo: Needle in a Montage for Video-Language Understanding
by: Hu, Zi-Yuan, et al.
Published: (2025)
by: Hu, Zi-Yuan, et al.
Published: (2025)
NeMo-Inspector: A Visualization Tool for LLM Generation Analysis
by: Gitman, Daria, et al.
Published: (2025)
by: Gitman, Daria, et al.
Published: (2025)
Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
by: Ha, Seongsu, et al.
Published: (2024)
by: Ha, Seongsu, et al.
Published: (2024)
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention
by: Zhao, Xiaochen, et al.
Published: (2025)
by: Zhao, Xiaochen, et al.
Published: (2025)
NeMo-map: Neural Implicit Flow Fields for Spatio-Temporal Motion Mapping
by: Zhu, Yufei, et al.
Published: (2025)
by: Zhu, Yufei, et al.
Published: (2025)
NeFT: Negative Feedback Training to Improve Robustness of Compute-In-Memory DNN Accelerators
by: Qin, Yifan, et al.
Published: (2023)
by: Qin, Yifan, et al.
Published: (2023)
Harnessing Neuron Stability to Improve DNN Verification
by: Duong, Hai, et al.
Published: (2024)
by: Duong, Hai, et al.
Published: (2024)
SoftSignSGD(S3): An Enhanced Optimizer for Practical DNN Training and Loss Spikes Minimization Beyond Adam
by: Peng, Hanyang, et al.
Published: (2025)
by: Peng, Hanyang, et al.
Published: (2025)
Decomposing Attention To Find Context-Sensitive Neurons
by: Gibson, Alex
Published: (2025)
by: Gibson, Alex
Published: (2025)
Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement
by: Monir, Nasser-Eddine, et al.
Published: (2025)
by: Monir, Nasser-Eddine, et al.
Published: (2025)
ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training
by: Yang, Rushuai, et al.
Published: (2026)
by: Yang, Rushuai, et al.
Published: (2026)
Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning
by: Bi, Xiaojun, et al.
Published: (2024)
by: Bi, Xiaojun, et al.
Published: (2024)
Decomposing the Time Series Forecasting Pipeline: A Modular Approach for Time Series Representation, Information Extraction, and Projection
by: Leppich, Robert, et al.
Published: (2025)
by: Leppich, Robert, et al.
Published: (2025)
Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts
by: Wang, Qi, et al.
Published: (2025)
by: Wang, Qi, et al.
Published: (2025)
M$^3$Searcher: Modular Multimodal Information Seeking Agency with Retrieval-Oriented Reasoning
by: Yu, Xiaohan, et al.
Published: (2026)
by: Yu, Xiaohan, et al.
Published: (2026)
Investigating White-Box Attacks for On-Device Models
by: Zhou, Mingyi, et al.
Published: (2024)
by: Zhou, Mingyi, et al.
Published: (2024)
Neurons for Neutrons: A Transformer Model for Computation Load Estimation on Domain-Decomposed Neutron Transport Problems
by: Mote, Alexander, et al.
Published: (2024)
by: Mote, Alexander, et al.
Published: (2024)
Why Inference in Large Models Becomes Decomposable After Training
by: Jin, Jidong
Published: (2026)
by: Jin, Jidong
Published: (2026)
Beyond Direct Generation: A Decomposed Approach to Well-Crafted Screenwriting with LLMs
by: Lei, Hang, et al.
Published: (2025)
by: Lei, Hang, et al.
Published: (2025)
VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL
by: Hu, Zengjie, et al.
Published: (2025)
by: Hu, Zengjie, et al.
Published: (2025)
FourCastNeXt: Optimizing FourCastNet Training for Limited Compute
by: Guo, Edison, et al.
Published: (2024)
by: Guo, Edison, et al.
Published: (2024)
Enabling Large Batch Size Training for DNN Models Beyond the Memory Limit While Maintaining Performance
by: Piao, XinYu, et al.
Published: (2021)
by: Piao, XinYu, et al.
Published: (2021)
Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
by: Liu, Zhenyu, et al.
Published: (2024)
by: Liu, Zhenyu, et al.
Published: (2024)
Advancing Direct Training for Spiking Neural Networks with Circulate-Firing Neurons and Learnable Gradients
by: Zhou, Feifan, et al.
Published: (2026)
by: Zhou, Feifan, et al.
Published: (2026)
Towards Understanding and Enhancing Security of Proof-of-Training for DNN Model Ownership Verification
by: Chang, Yijia, et al.
Published: (2024)
by: Chang, Yijia, et al.
Published: (2024)
NeSTR: A Neuro-Symbolic Abductive Framework for Temporal Reasoning in Large Language Models
by: Liang, Feng, et al.
Published: (2025)
by: Liang, Feng, et al.
Published: (2025)
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
by: Zhang, Rongyu, et al.
Published: (2024)
by: Zhang, Rongyu, et al.
Published: (2024)
Identifying Good and Bad Neurons for Task-Level Controllable LLMs
by: Li, Wenjie, et al.
Published: (2026)
by: Li, Wenjie, et al.
Published: (2026)
Efficient DNN-Powered Software with Fair Sparse Models
by: Gao, Xuanqi, et al.
Published: (2024)
by: Gao, Xuanqi, et al.
Published: (2024)
BadSNN: Backdoor Attacks on Spiking Neural Networks via Adversarial Spiking Neuron
by: Miah, Abdullah Arafat, et al.
Published: (2026)
by: Miah, Abdullah Arafat, et al.
Published: (2026)
AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning
by: Mei, Lang, et al.
Published: (2025)
by: Mei, Lang, et al.
Published: (2025)
DNN Modularization via Activation-Driven Training
by: Ngo, Tuan, et al.
Published: (2024)
by: Ngo, Tuan, et al.
Published: (2024)
GeNeRT: A Physics-Informed Approach to Intelligent Wireless Channel Modeling via Generalizable Neural Ray Tracing
by: Bian, Kejia, et al.
Published: (2025)
by: Bian, Kejia, et al.
Published: (2025)
DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning
by: Guo, Ke, et al.
Published: (2025)
by: Guo, Ke, et al.
Published: (2025)
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
by: Chen, Sishuo, et al.
Published: (2024)
by: Chen, Sishuo, et al.
Published: (2024)
Mirage: An RNS-Based Photonic Accelerator for DNN Training
by: Demirkiran, Cansu, et al.
Published: (2023)
by: Demirkiran, Cansu, et al.
Published: (2023)
Similar Items
-
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
by: Hintersdorf, Dominik, et al.
Published: (2024) -
Training Video Foundation Models with NVIDIA NeMo
by: Patel, Zeeshan, et al.
Published: (2025) -
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
by: Shen, Gerald, et al.
Published: (2024) -
CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
by: Yang, Zongzhen, et al.
Published: (2025) -
NeMo: Needle in a Montage for Video-Language Understanding
by: Hu, Zi-Yuan, et al.
Published: (2025)