:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bi, Xiaohan, Qi, Binhang, Sun, Hailong, Gao, Xiang, Yu, Yue, Liang, Xiaojun
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.11348
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
by: Hintersdorf, Dominik, et al.
Published: (2024)

Training Video Foundation Models with NVIDIA NeMo
by: Patel, Zeeshan, et al.
Published: (2025)

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
by: Shen, Gerald, et al.
Published: (2024)

CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
by: Yang, Zongzhen, et al.
Published: (2025)

NeMo: Needle in a Montage for Video-Language Understanding
by: Hu, Zi-Yuan, et al.
Published: (2025)

NeMo-Inspector: A Visualization Tool for LLM Generation Analysis
by: Gitman, Daria, et al.
Published: (2025)

Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
by: Ha, Seongsu, et al.
Published: (2024)

X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention
by: Zhao, Xiaochen, et al.
Published: (2025)

NeMo-map: Neural Implicit Flow Fields for Spatio-Temporal Motion Mapping
by: Zhu, Yufei, et al.
Published: (2025)

NeFT: Negative Feedback Training to Improve Robustness of Compute-In-Memory DNN Accelerators
by: Qin, Yifan, et al.
Published: (2023)

Harnessing Neuron Stability to Improve DNN Verification
by: Duong, Hai, et al.
Published: (2024)

SoftSignSGD(S3): An Enhanced Optimizer for Practical DNN Training and Loss Spikes Minimization Beyond Adam
by: Peng, Hanyang, et al.
Published: (2025)

Decomposing Attention To Find Context-Sensitive Neurons
by: Gibson, Alex
Published: (2025)

Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement
by: Monir, Nasser-Eddine, et al.
Published: (2025)

ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training
by: Yang, Rushuai, et al.
Published: (2026)

Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning
by: Bi, Xiaojun, et al.
Published: (2024)

Decomposing the Time Series Forecasting Pipeline: A Modular Approach for Time Series Representation, Information Extraction, and Projection
by: Leppich, Robert, et al.
Published: (2025)

Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts
by: Wang, Qi, et al.
Published: (2025)

M$^3$Searcher: Modular Multimodal Information Seeking Agency with Retrieval-Oriented Reasoning
by: Yu, Xiaohan, et al.
Published: (2026)

Investigating White-Box Attacks for On-Device Models
by: Zhou, Mingyi, et al.
Published: (2024)

Neurons for Neutrons: A Transformer Model for Computation Load Estimation on Domain-Decomposed Neutron Transport Problems
by: Mote, Alexander, et al.
Published: (2024)

Why Inference in Large Models Becomes Decomposable After Training
by: Jin, Jidong
Published: (2026)

Beyond Direct Generation: A Decomposed Approach to Well-Crafted Screenwriting with LLMs
by: Lei, Hang, et al.
Published: (2025)

VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL
by: Hu, Zengjie, et al.
Published: (2025)

FourCastNeXt: Optimizing FourCastNet Training for Limited Compute
by: Guo, Edison, et al.
Published: (2024)

Enabling Large Batch Size Training for DNN Models Beyond the Memory Limit While Maintaining Performance
by: Piao, XinYu, et al.
Published: (2021)

Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
by: Liu, Zhenyu, et al.
Published: (2024)

Advancing Direct Training for Spiking Neural Networks with Circulate-Firing Neurons and Learnable Gradients
by: Zhou, Feifan, et al.
Published: (2026)

Towards Understanding and Enhancing Security of Proof-of-Training for DNN Model Ownership Verification
by: Chang, Yijia, et al.
Published: (2024)

NeSTR: A Neuro-Symbolic Abductive Framework for Temporal Reasoning in Large Language Models
by: Liang, Feng, et al.
Published: (2025)

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
by: Zhang, Rongyu, et al.
Published: (2024)

Identifying Good and Bad Neurons for Task-Level Controllable LLMs
by: Li, Wenjie, et al.
Published: (2026)

Efficient DNN-Powered Software with Fair Sparse Models
by: Gao, Xuanqi, et al.
Published: (2024)

BadSNN: Backdoor Attacks on Spiking Neural Networks via Adversarial Spiking Neuron
by: Miah, Abdullah Arafat, et al.
Published: (2026)

AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning
by: Mei, Lang, et al.
Published: (2025)

DNN Modularization via Activation-Driven Training
by: Ngo, Tuan, et al.
Published: (2024)

GeNeRT: A Physics-Informed Approach to Intelligent Wireless Channel Modeling via Generalizable Neural Ray Tracing
by: Bian, Kejia, et al.
Published: (2025)

DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning
by: Guo, Ke, et al.
Published: (2025)

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
by: Chen, Sishuo, et al.
Published: (2024)

Mirage: An RNS-Based Photonic Accelerator for DNN Training
by: Demirkiran, Cansu, et al.
Published: (2023)