Saved in:
| Main Authors: | Shen, Qianli, Wang, Yezhen, Yang, Zhouhao, Li, Xiang, Wang, Haonan, Zhang, Yang, Scarlett, Jonathan, Zhu, Zhanxing, Kawaguchi, Kenji |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.14095 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning
by: Li, Xiang, et al.
Published: (2025)
by: Li, Xiang, et al.
Published: (2025)
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline
by: Wang, Haonan, et al.
Published: (2024)
by: Wang, Haonan, et al.
Published: (2024)
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
by: Wang, Yezhen, et al.
Published: (2025)
by: Wang, Yezhen, et al.
Published: (2025)
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models
by: Li, Xiang, et al.
Published: (2023)
by: Li, Xiang, et al.
Published: (2023)
FilDeep: Learning Large Deformations of Elastic-Plastic Solids with Multi-Fidelity Data
by: Tang, Jianheng, et al.
Published: (2026)
by: Tang, Jianheng, et al.
Published: (2026)
Efficient Linear Attention for Multivariate Time Series Modeling via Entropy Equality
by: Zhang, Mingtao, et al.
Published: (2025)
by: Zhang, Mingtao, et al.
Published: (2025)
SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From
by: Tong, Yao, et al.
Published: (2025)
by: Tong, Yao, et al.
Published: (2025)
Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher
by: Ji, Guangda, et al.
Published: (2020)
by: Ji, Guangda, et al.
Published: (2020)
On Copyright Risks of Text-to-Image Diffusion Models
by: Zhang, Yang, et al.
Published: (2023)
by: Zhang, Yang, et al.
Published: (2023)
PrefixMemory-Tuning: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention
by: Wang, Haonan, et al.
Published: (2025)
by: Wang, Haonan, et al.
Published: (2025)
MAVEN: A Mesh-Aware Volumetric Encoding Network for Simulating 3D Flexible Deformation
by: Feng, Zhe, et al.
Published: (2026)
by: Feng, Zhe, et al.
Published: (2026)
Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints
by: Yang, Jing, et al.
Published: (2025)
by: Yang, Jing, et al.
Published: (2025)
Advancing Multimodal Teacher Sentiment Analysis:The Large-Scale T-MED Dataset & The Effective AAM-TSA Model
by: Duan, Zhiyi, et al.
Published: (2025)
by: Duan, Zhiyi, et al.
Published: (2025)
Auto-Unrolled Proximal Gradient Descent: An AutoML Approach to Interpretable Waveform Optimization
by: Kaplan, Ahmet
Published: (2026)
by: Kaplan, Ahmet
Published: (2026)
Initialization is Half the Battle: Generating Diverse Images from a Guidance Potential Posterior
by: Li, Xiang, et al.
Published: (2026)
by: Li, Xiang, et al.
Published: (2026)
Efficient Agent: Optimizing Planning Capability for Multimodal Retrieval Augmented Generation
by: Wang, Yuechen, et al.
Published: (2025)
by: Wang, Yuechen, et al.
Published: (2025)
BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design
by: Xiang, Chuyang, et al.
Published: (2026)
by: Xiang, Chuyang, et al.
Published: (2026)
Clustering Inductive Biases with Unrolled Networks
by: Huml, Jonathan, et al.
Published: (2023)
by: Huml, Jonathan, et al.
Published: (2023)
ViTE: Virtual Graph Trajectory Expert Router for Pedestrian Trajectory Prediction
by: Li, Ruochen, et al.
Published: (2025)
by: Li, Ruochen, et al.
Published: (2025)
AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
by: Zhao, Yiran, et al.
Published: (2024)
by: Zhao, Yiran, et al.
Published: (2024)
Fostering Video Reasoning via Next-Event Prediction
by: Wang, Haonan, et al.
Published: (2025)
by: Wang, Haonan, et al.
Published: (2025)
Enhancing Multilingual Counterfactual Generation through Alignment-as-Preference Optimization
by: Wang, Yilong, et al.
Published: (2026)
by: Wang, Yilong, et al.
Published: (2026)
DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning
by: He, Yang, et al.
Published: (2026)
by: He, Yang, et al.
Published: (2026)
Statistical Mean Estimation with Coded Relayed Observations
by: Ling, Yan Hao, et al.
Published: (2025)
by: Ling, Yan Hao, et al.
Published: (2025)
Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration
by: Zhao, Yang, et al.
Published: (2026)
by: Zhao, Yang, et al.
Published: (2026)
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
by: Liu, Yuhang, et al.
Published: (2025)
by: Liu, Yuhang, et al.
Published: (2025)
iScheduler: Reinforcement Learning-Driven Continual Optimization for Large-Scale Resource Investment Problems
by: Hu, Yi-Xiang, et al.
Published: (2026)
by: Hu, Yi-Xiang, et al.
Published: (2026)
Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL
by: Zheng, Yuxuan, et al.
Published: (2025)
by: Zheng, Yuxuan, et al.
Published: (2025)
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
by: Yu, Yang, et al.
Published: (2025)
by: Yu, Yang, et al.
Published: (2025)
How do Large Language Models Handle Multilingualism?
by: Zhao, Yiran, et al.
Published: (2024)
by: Zhao, Yiran, et al.
Published: (2024)
Efficient Diffusion as Low Light Enhancer
by: Lan, Guanzhou, et al.
Published: (2024)
by: Lan, Guanzhou, et al.
Published: (2024)
Pruning General Large Language Models into Customized Expert Models
by: Zhao, Yirao, et al.
Published: (2025)
by: Zhao, Yirao, et al.
Published: (2025)
Large Language Models Are Still Misled by Simple Bias Ensembles
by: Sun, Zhouhao, et al.
Published: (2025)
by: Sun, Zhouhao, et al.
Published: (2025)
Causal-Guided Active Learning for Debiasing Large Language Models
by: Du, Li, et al.
Published: (2024)
by: Du, Li, et al.
Published: (2024)
Self-Route: Automatic Mode Switching via Capability Estimation for Efficient Reasoning
by: He, Yang, et al.
Published: (2025)
by: He, Yang, et al.
Published: (2025)
$δ$-mem: Efficient Online Memory for Large Language Models
by: Lei, Jingdi, et al.
Published: (2026)
by: Lei, Jingdi, et al.
Published: (2026)
DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems
by: Zhao, Hang, et al.
Published: (2024)
by: Zhao, Hang, et al.
Published: (2024)
Variational Learning of Gaussian Process Latent Variable Models through Stochastic Gradient Annealed Importance Sampling
by: Xu, Jian, et al.
Published: (2024)
by: Xu, Jian, et al.
Published: (2024)
Deep Learning-Enhanced Preconditioning for Efficient Conjugate Gradient Solvers in Large-Scale PDE Systems
by: Li, Rui, et al.
Published: (2024)
by: Li, Rui, et al.
Published: (2024)
Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models
by: Sun, Zhouhao, et al.
Published: (2025)
by: Sun, Zhouhao, et al.
Published: (2025)
Similar Items
-
LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning
by: Li, Xiang, et al.
Published: (2025) -
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline
by: Wang, Haonan, et al.
Published: (2024) -
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
by: Wang, Yezhen, et al.
Published: (2025) -
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models
by: Li, Xiang, et al.
Published: (2023) -
FilDeep: Learning Large Deformations of Elastic-Plastic Solids with Multi-Fidelity Data
by: Tang, Jianheng, et al.
Published: (2026)