| Main Authors: | Ito, Akira, Yamada, Masanori, Chijiwa, Daiki, Kumagai, Atsutoshi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2510.08023 |
Similar Items
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
by: Ito, Akira, et al.
Published: (2024)
Transfer Learning with Pre-trained Conditional Generative Models
by: Yamaguchi, Shin'ya, et al.
Published: (2022)
Toward Data Efficient Model Merging between Different Datasets without Performance Degradation
by: Yamada, Masanori, et al.
Published: (2023)
Relative Density Ratio Optimization for Stable and Statistically Consistent Model Alignment
by: Takahashi, Hiroshi, et al.
Published: (2026)
Do We Really Even Need Data?
by: Hoffman, Kentaro, et al.
Published: (2024)
Meta-learning for Positive-unlabeled Classification
by: Kumagai, Atsutoshi, et al.
Published: (2024)
Covariance-aware Feature Alignment with Pre-computed Source Statistics for Test-time Adaptation to Multiple Image Corruptions
by: Adachi, Kazuki, et al.
Published: (2022)
Analyzing the Role of Permutation Invariance in Linear Mode Connectivity
by: Zhan, Keyao, et al.
Published: (2025)
MambaOut: Do We Really Need Mamba for Vision?
by: Yu, Weihao, et al.
Published: (2024)
Is the Hard-Label Cryptanalytic Model Extraction Really Polynomial?
by: Ito, Akira, et al.
Published: (2025)
Deep Positive-Unlabeled Anomaly Detection for Contaminated Unlabeled Data
by: Takahashi, Hiroshi, et al.
Published: (2024)
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
by: Takahashi, Hiroshi, et al.
Published: (2025)
Test-time Adaptation for Regression by Subspace Alignment
by: Adachi, Kazuki, et al.
Published: (2024)
Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
by: Yamashita, Tomoya, et al.
Published: (2025)
Meta-learning Representations for Learning from Multiple Annotators
by: Kumagai, Atsutoshi, et al.
Published: (2025)
Do We Really Need to Design New Byzantine-robust Aggregation Rules?
by: Fang, Minghong, et al.
Published: (2025)
Do We Really Even Need Data? A Modern Look at Drawing Inference with Predicted Data
by: Salerno, Stephen, et al.
Published: (2025)
Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought
by: Yamaguchi, Shin'ya, et al.
Published: (2025)
Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
by: Chijiwa, Daiki, et al.
Published: (2025)
Zero-shot Concept Bottleneck Models
by: Yamaguchi, Shin'ya, et al.
Published: (2025)
Parallel In-context Learning for Large Vision Language Models
by: Yamaguchi, Shin'ya, et al.
Published: (2026)
Triple-BERT: Do We Really Need MARL for Order Dispatch on Ride-Sharing Platforms?
by: Zhao, Zijian, et al.
Published: (2025)
Do Large Language Models (Really) Need Statistical Foundations?
by: Su, Weijie
Published: (2025)
Do We Really Need Quantum Machine Learning?: A Multidimensional Empirical Study
by: Vhaduri, Sudip, et al.
Published: (2026)
Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation
by: Zhang, Weizhi, et al.
Published: (2024)
Adaptive Random Feature Regularization on Fine-tuning Deep Neural Networks
by: Yamaguchi, Shin'ya, et al.
Published: (2024)
Post-pre-training for Modality Alignment in Vision-Language Foundation Models
by: Yamaguchi, Shin'ya, et al.
Published: (2025)
Landscaping Linear Mode Connectivity
by: Singh, Sidak Pal, et al.
Published: (2024)
Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?
by: Wang, Yanbo, et al.
Published: (2025)
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
by: Otsuka, Hikari, et al.
Published: (2025)
Generalized Linear Mode Connectivity for Transformers
by: Theus, Alexander, et al.
Published: (2025)
Layer-wise Linear Mode Connectivity
by: Adilova, Linara, et al.
Published: (2023)
On Linear Mode Connectivity of Mixture-of-Experts Architectures
by: Tran, Viet-Hoang, et al.
Published: (2025)
Linear Mode Connectivity in Differentiable Tree Ensembles
by: Kanoh, Ryuichi, et al.
Published: (2024)
Do We Need Transformers to Play FPS Video Games?
by: Batth, Karmanbir, et al.
Published: (2025)
Do We Need Frontier Models to Verify Mathematical Proofs?
by: Naik, Aaditya, et al.
Published: (2026)
Lossless Vocabulary Reduction for Auto-Regressive Language Models
by: Chijiwa, Daiki, et al.
Published: (2025)
Linear Mode Connectivity in Sparse Neural Networks
by: McDermott, Luke, et al.
Published: (2023)
Why Do We Need Weight Decay in Modern Deep Learning?
by: D'Angelo, Francesco, et al.
Published: (2023)
Do You Really Need Public Data? Surrogate Public Data for Differential Privacy on Tabular Data
by: Hod, Shlomi, et al.
Published: (2025)