:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Oyama, Yosuke, Majima, Yusuke, Ohta, Eiji, Sakai, Yasufumi
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.22208
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Semi-Supervised Reward Modeling via Iterative Self-Training
by: He, Yifei, et al.
Published: (2024)

Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
by: Sakai, Yusuke, et al.
Published: (2023)

Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
by: Kišš, Martin, et al.
Published: (2025)

Domain Generalization Guided by Large-Scale Pre-Trained Priors
by: Wang, Zongbin, et al.
Published: (2024)

The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training
by: Feng, Xincan, et al.
Published: (2026)

Multilinguality of Large Language Models From a Structural Perspective
by: Sakajo, Haruki, et al.
Published: (2026)

Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
by: Moradi, Fatemeh, et al.
Published: (2025)

Self-Supervised Contrastive Pre-Training for Multivariate Point Processes
by: Shou, Xiao, et al.
Published: (2024)

Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training
by: Wang, Hong, et al.
Published: (2025)

DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training
by: Jimenez-Perez, Guillermo, et al.
Published: (2024)

Self-Supervised Pre-Training for Precipitation Post-Processor
by: An, Sojung, et al.
Published: (2023)

Provable Target Sample Complexity Improvements as Pre-Trained Models Scale
by: Fukuchi, Kazuto, et al.
Published: (2026)

Scaling Law for Large-Scale Pre-Training Using Chaotic Time Series and Predictability in Financial Time Series
by: Takemoto, Yuki
Published: (2025)

Semi-Supervised Learning with Multi-Head Co-Training
by: Chen, Mingcai, et al.
Published: (2021)

Dataset Distillation for Pre-Trained Self-Supervised Vision Models
by: Cazenavette, George, et al.
Published: (2025)

TSPulse: Tiny Pre-Trained Models with Disentangled Representations for Rapid Time-Series Analysis
by: Ekambaram, Vijay, et al.
Published: (2025)

Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
by: Sakai, Yusuke, et al.
Published: (2024)

Heterogeneous Self-Supervised Acoustic Pre-Training with Local Constraints
by: Cui, Xiaodong, et al.
Published: (2025)

Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
by: Sakai, Yusuke, et al.
Published: (2024)

mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
by: Sakai, Yusuke, et al.
Published: (2024)

A Large Dimensional Analysis of Multi-task Semi-Supervised Learning
by: Leger, Victor, et al.
Published: (2024)

Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size
by: Bozorgnia, Farid, et al.
Published: (2025)

FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
by: Patel, Ajay, et al.
Published: (2026)

Global path preference and local response: A reward decomposition approach for network path choice analysis in the presence of locally perceived attributes
by: Oyama, Yuki
Published: (2023)

Semi-Supervised Learning for Large Language Models Safety and Content Moderation
by: Dinuta, Eduard Stefan, et al.
Published: (2025)

Semi-Supervised Learning under General Causal Models
by: Moore, Archer, et al.
Published: (2025)

DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
by: Hao, Zhongkai, et al.
Published: (2024)

In-Context Semi-Supervised Learning
by: Fan, Jiashuo, et al.
Published: (2025)

From Human Labels to Literature: Semi-Supervised Learning of NMR Chemical Shifts at Scale
by: Jin, Yongqi, et al.
Published: (2026)

Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers
by: Sakai, Mana, et al.
Published: (2026)

Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients
by: Li, Yan, et al.
Published: (2024)

GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
by: Su, DiJia, et al.
Published: (2025)

Self-Supervised Modality-Agnostic Pre-Training of Swin Transformers
by: Talasila, Abhiroop, et al.
Published: (2024)

MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models
by: Huang, Zimeng, et al.
Published: (2025)

From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction
by: Shoghi, Nima, et al.
Published: (2023)

Hypergraph Transformer for Semi-Supervised Classification
by: Liu, Zexi, et al.
Published: (2023)

AaSP: Aliasing-aware Self-Supervised Pre-Training for Audio Spectrogram Transformers
by: Yamamoto, Kohei, et al.
Published: (2025)

TEMPO: Scaling Test-time Training for Large Reasoning Models
by: Zhang, Qingyang, et al.
Published: (2026)

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
by: Fan, Ziqing, et al.
Published: (2025)

Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning
by: Gan, Kai, et al.
Published: (2024)