Saved in:
| Main Authors: | Oyama, Yosuke, Majima, Yusuke, Ohta, Eiji, Sakai, Yasufumi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.22208 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Semi-Supervised Reward Modeling via Iterative Self-Training
by: He, Yifei, et al.
Published: (2024)
by: He, Yifei, et al.
Published: (2024)
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
by: Sakai, Yusuke, et al.
Published: (2023)
by: Sakai, Yusuke, et al.
Published: (2023)
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
by: Kišš, Martin, et al.
Published: (2025)
by: Kišš, Martin, et al.
Published: (2025)
Domain Generalization Guided by Large-Scale Pre-Trained Priors
by: Wang, Zongbin, et al.
Published: (2024)
by: Wang, Zongbin, et al.
Published: (2024)
The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training
by: Feng, Xincan, et al.
Published: (2026)
by: Feng, Xincan, et al.
Published: (2026)
Multilinguality of Large Language Models From a Structural Perspective
by: Sakajo, Haruki, et al.
Published: (2026)
by: Sakajo, Haruki, et al.
Published: (2026)
Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
by: Moradi, Fatemeh, et al.
Published: (2025)
by: Moradi, Fatemeh, et al.
Published: (2025)
Self-Supervised Contrastive Pre-Training for Multivariate Point Processes
by: Shou, Xiao, et al.
Published: (2024)
by: Shou, Xiao, et al.
Published: (2024)
Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training
by: Wang, Hong, et al.
Published: (2025)
by: Wang, Hong, et al.
Published: (2025)
DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training
by: Jimenez-Perez, Guillermo, et al.
Published: (2024)
by: Jimenez-Perez, Guillermo, et al.
Published: (2024)
Self-Supervised Pre-Training for Precipitation Post-Processor
by: An, Sojung, et al.
Published: (2023)
by: An, Sojung, et al.
Published: (2023)
Provable Target Sample Complexity Improvements as Pre-Trained Models Scale
by: Fukuchi, Kazuto, et al.
Published: (2026)
by: Fukuchi, Kazuto, et al.
Published: (2026)
Scaling Law for Large-Scale Pre-Training Using Chaotic Time Series and Predictability in Financial Time Series
by: Takemoto, Yuki
Published: (2025)
by: Takemoto, Yuki
Published: (2025)
Semi-Supervised Learning with Multi-Head Co-Training
by: Chen, Mingcai, et al.
Published: (2021)
by: Chen, Mingcai, et al.
Published: (2021)
Dataset Distillation for Pre-Trained Self-Supervised Vision Models
by: Cazenavette, George, et al.
Published: (2025)
by: Cazenavette, George, et al.
Published: (2025)
TSPulse: Tiny Pre-Trained Models with Disentangled Representations for Rapid Time-Series Analysis
by: Ekambaram, Vijay, et al.
Published: (2025)
by: Ekambaram, Vijay, et al.
Published: (2025)
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
by: Sakai, Yusuke, et al.
Published: (2024)
by: Sakai, Yusuke, et al.
Published: (2024)
Heterogeneous Self-Supervised Acoustic Pre-Training with Local Constraints
by: Cui, Xiaodong, et al.
Published: (2025)
by: Cui, Xiaodong, et al.
Published: (2025)
Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
by: Sakai, Yusuke, et al.
Published: (2024)
by: Sakai, Yusuke, et al.
Published: (2024)
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
by: Sakai, Yusuke, et al.
Published: (2024)
by: Sakai, Yusuke, et al.
Published: (2024)
A Large Dimensional Analysis of Multi-task Semi-Supervised Learning
by: Leger, Victor, et al.
Published: (2024)
by: Leger, Victor, et al.
Published: (2024)
Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size
by: Bozorgnia, Farid, et al.
Published: (2025)
by: Bozorgnia, Farid, et al.
Published: (2025)
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
by: Patel, Ajay, et al.
Published: (2026)
by: Patel, Ajay, et al.
Published: (2026)
Global path preference and local response: A reward decomposition approach for network path choice analysis in the presence of locally perceived attributes
by: Oyama, Yuki
Published: (2023)
by: Oyama, Yuki
Published: (2023)
Semi-Supervised Learning for Large Language Models Safety and Content Moderation
by: Dinuta, Eduard Stefan, et al.
Published: (2025)
by: Dinuta, Eduard Stefan, et al.
Published: (2025)
Semi-Supervised Learning under General Causal Models
by: Moore, Archer, et al.
Published: (2025)
by: Moore, Archer, et al.
Published: (2025)
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
by: Hao, Zhongkai, et al.
Published: (2024)
by: Hao, Zhongkai, et al.
Published: (2024)
In-Context Semi-Supervised Learning
by: Fan, Jiashuo, et al.
Published: (2025)
by: Fan, Jiashuo, et al.
Published: (2025)
From Human Labels to Literature: Semi-Supervised Learning of NMR Chemical Shifts at Scale
by: Jin, Yongqi, et al.
Published: (2026)
by: Jin, Yongqi, et al.
Published: (2026)
Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers
by: Sakai, Mana, et al.
Published: (2026)
by: Sakai, Mana, et al.
Published: (2026)
Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients
by: Li, Yan, et al.
Published: (2024)
by: Li, Yan, et al.
Published: (2024)
GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
by: Su, DiJia, et al.
Published: (2025)
by: Su, DiJia, et al.
Published: (2025)
Self-Supervised Modality-Agnostic Pre-Training of Swin Transformers
by: Talasila, Abhiroop, et al.
Published: (2024)
by: Talasila, Abhiroop, et al.
Published: (2024)
MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models
by: Huang, Zimeng, et al.
Published: (2025)
by: Huang, Zimeng, et al.
Published: (2025)
From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction
by: Shoghi, Nima, et al.
Published: (2023)
by: Shoghi, Nima, et al.
Published: (2023)
Hypergraph Transformer for Semi-Supervised Classification
by: Liu, Zexi, et al.
Published: (2023)
by: Liu, Zexi, et al.
Published: (2023)
AaSP: Aliasing-aware Self-Supervised Pre-Training for Audio Spectrogram Transformers
by: Yamamoto, Kohei, et al.
Published: (2025)
by: Yamamoto, Kohei, et al.
Published: (2025)
TEMPO: Scaling Test-time Training for Large Reasoning Models
by: Zhang, Qingyang, et al.
Published: (2026)
by: Zhang, Qingyang, et al.
Published: (2026)
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
by: Fan, Ziqing, et al.
Published: (2025)
by: Fan, Ziqing, et al.
Published: (2025)
Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning
by: Gan, Kai, et al.
Published: (2024)
by: Gan, Kai, et al.
Published: (2024)
Similar Items
-
Semi-Supervised Reward Modeling via Iterative Self-Training
by: He, Yifei, et al.
Published: (2024) -
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
by: Sakai, Yusuke, et al.
Published: (2023) -
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
by: Kišš, Martin, et al.
Published: (2025) -
Domain Generalization Guided by Large-Scale Pre-Trained Priors
by: Wang, Zongbin, et al.
Published: (2024) -
The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training
by: Feng, Xincan, et al.
Published: (2026)