:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ito, Akira, Yamada, Masanori, Chijiwa, Daiki, Kumagai, Atsutoshi
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.08023
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
by: Ito, Akira, et al.
Published: (2024)

Transfer Learning with Pre-trained Conditional Generative Models
by: Yamaguchi, Shin'ya, et al.
Published: (2022)

Toward Data Efficient Model Merging between Different Datasets without Performance Degradation
by: Yamada, Masanori, et al.
Published: (2023)

Relative Density Ratio Optimization for Stable and Statistically Consistent Model Alignment
by: Takahashi, Hiroshi, et al.
Published: (2026)

Do We Really Even Need Data?
by: Hoffman, Kentaro, et al.
Published: (2024)

Meta-learning for Positive-unlabeled Classification
by: Kumagai, Atsutoshi, et al.
Published: (2024)

Covariance-aware Feature Alignment with Pre-computed Source Statistics for Test-time Adaptation to Multiple Image Corruptions
by: Adachi, Kazuki, et al.
Published: (2022)

Analyzing the Role of Permutation Invariance in Linear Mode Connectivity
by: Zhan, Keyao, et al.
Published: (2025)

MambaOut: Do We Really Need Mamba for Vision?
by: Yu, Weihao, et al.
Published: (2024)

Is the Hard-Label Cryptanalytic Model Extraction Really Polynomial?
by: Ito, Akira, et al.
Published: (2025)

Deep Positive-Unlabeled Anomaly Detection for Contaminated Unlabeled Data
by: Takahashi, Hiroshi, et al.
Published: (2024)

Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
by: Takahashi, Hiroshi, et al.
Published: (2025)

Test-time Adaptation for Regression by Subspace Alignment
by: Adachi, Kazuki, et al.
Published: (2024)

Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
by: Yamashita, Tomoya, et al.
Published: (2025)

Meta-learning Representations for Learning from Multiple Annotators
by: Kumagai, Atsutoshi, et al.
Published: (2025)

Do We Really Need to Design New Byzantine-robust Aggregation Rules?
by: Fang, Minghong, et al.
Published: (2025)

Do We Really Even Need Data? A Modern Look at Drawing Inference with Predicted Data
by: Salerno, Stephen, et al.
Published: (2025)

Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought
by: Yamaguchi, Shin'ya, et al.
Published: (2025)

Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
by: Chijiwa, Daiki, et al.
Published: (2025)

Zero-shot Concept Bottleneck Models
by: Yamaguchi, Shin'ya, et al.
Published: (2025)

Parallel In-context Learning for Large Vision Language Models
by: Yamaguchi, Shin'ya, et al.
Published: (2026)

Triple-BERT: Do We Really Need MARL for Order Dispatch on Ride-Sharing Platforms?
by: Zhao, Zijian, et al.
Published: (2025)

Do Large Language Models (Really) Need Statistical Foundations?
by: Su, Weijie
Published: (2025)

Do We Really Need Quantum Machine Learning?: A Multidimensional Empirical Study
by: Vhaduri, Sudip, et al.
Published: (2026)

Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation
by: Zhang, Weizhi, et al.
Published: (2024)

Adaptive Random Feature Regularization on Fine-tuning Deep Neural Networks
by: Yamaguchi, Shin'ya, et al.
Published: (2024)

Post-pre-training for Modality Alignment in Vision-Language Foundation Models
by: Yamaguchi, Shin'ya, et al.
Published: (2025)

Landscaping Linear Mode Connectivity
by: Singh, Sidak Pal, et al.
Published: (2024)

Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?
by: Wang, Yanbo, et al.
Published: (2025)

The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
by: Otsuka, Hikari, et al.
Published: (2025)

Generalized Linear Mode Connectivity for Transformers
by: Theus, Alexander, et al.
Published: (2025)

Layer-wise Linear Mode Connectivity
by: Adilova, Linara, et al.
Published: (2023)

On Linear Mode Connectivity of Mixture-of-Experts Architectures
by: Tran, Viet-Hoang, et al.
Published: (2025)

Linear Mode Connectivity in Differentiable Tree Ensembles
by: Kanoh, Ryuichi, et al.
Published: (2024)

Do We Need Transformers to Play FPS Video Games?
by: Batth, Karmanbir, et al.
Published: (2025)

Do We Need Frontier Models to Verify Mathematical Proofs?
by: Naik, Aaditya, et al.
Published: (2026)

Lossless Vocabulary Reduction for Auto-Regressive Language Models
by: Chijiwa, Daiki, et al.
Published: (2025)

Linear Mode Connectivity in Sparse Neural Networks
by: McDermott, Luke, et al.
Published: (2023)

Why Do We Need Weight Decay in Modern Deep Learning?
by: D'Angelo, Francesco, et al.
Published: (2023)

Do You Really Need Public Data? Surrogate Public Data for Differential Privacy on Tabular Data
by: Hod, Shlomi, et al.
Published: (2025)