Saved in:
| Main Authors: | Fuller, Anthony, Green, James R., Shelhamer, Evan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02890 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LookWhen? Fast Video Recognition by Learning When, Where, and What to Compute
by: Salamatian, Ali, et al.
Published: (2026)
by: Salamatian, Ali, et al.
Published: (2026)
Self-Distillation of Hidden Layers for Self-Supervised Representation Learning
by: Lowe, Scott C., et al.
Published: (2026)
by: Lowe, Scott C., et al.
Published: (2026)
A Closer Look at In-Distribution vs. Out-of-Distribution Accuracy for Open-Set Test-time Adaptation
by: Li, Zefeng, et al.
Published: (2026)
by: Li, Zefeng, et al.
Published: (2026)
Asymmetric Duos: Sidekicks Improve Uncertainty
by: Zhou, Tim G., et al.
Published: (2025)
by: Zhou, Tim G., et al.
Published: (2025)
Thicker and Quicker: A Jumbo Token for Fast Plain Vision Transformers
by: Fuller, Anthony, et al.
Published: (2025)
by: Fuller, Anthony, et al.
Published: (2025)
LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision
by: Fuller, Anthony, et al.
Published: (2025)
by: Fuller, Anthony, et al.
Published: (2025)
ProtoTTA: Prototype-Guided Test-Time Adaptation
by: Abootorabi, Mohammad Mahdi, et al.
Published: (2026)
by: Abootorabi, Mohammad Mahdi, et al.
Published: (2026)
Adaptive Randomized Smoothing: Certified Adversarial Robustness for Multi-Step Defences
by: Lyu, Saiyue, et al.
Published: (2024)
by: Lyu, Saiyue, et al.
Published: (2024)
GeoCrossBench: Cross-Band Generalization for Remote Sensing
by: Tamazyan, Hakob, et al.
Published: (2025)
by: Tamazyan, Hakob, et al.
Published: (2025)
LT-Soups: Bridging Head and Tail Classes via Subsampled Model Soups
by: Aminbeidokhti, Masih, et al.
Published: (2025)
by: Aminbeidokhti, Masih, et al.
Published: (2025)
Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation
by: Xie, Guofu, et al.
Published: (2025)
by: Xie, Guofu, et al.
Published: (2025)
Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy
by: Li, Tao, et al.
Published: (2024)
by: Li, Tao, et al.
Published: (2024)
Enhanced Soups for Graph Neural Networks
by: Zuber, Joseph, et al.
Published: (2025)
by: Zuber, Joseph, et al.
Published: (2025)
Spectral Souping: A Unified Framework for Online Preference Alignment
by: Chow, Yinlam, et al.
Published: (2026)
by: Chow, Yinlam, et al.
Published: (2026)
Improving Robustness of Foundation Models in Domain Adaptation with Soup-Adapters
by: Roschkowski, Marco
Published: (2025)
by: Roschkowski, Marco
Published: (2025)
RADIN: Souping on a Budget
by: Menes, Thibaut, et al.
Published: (2024)
by: Menes, Thibaut, et al.
Published: (2024)
Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging
by: Ablin, Pierre, et al.
Published: (2025)
by: Ablin, Pierre, et al.
Published: (2025)
ECG-Soup: Harnessing Multi-Layer Synergy for ECG Foundation Models
by: Nguyen, Phu X., et al.
Published: (2025)
by: Nguyen, Phu X., et al.
Published: (2025)
Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging
by: Zimmer, Max, et al.
Published: (2023)
by: Zimmer, Max, et al.
Published: (2023)
Local Superior Soups: A Catalyst for Model Merging in Cross-Silo Federated Learning
by: Chen, Minghui, et al.
Published: (2024)
by: Chen, Minghui, et al.
Published: (2024)
SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF
by: Chegini, Atoosa, et al.
Published: (2024)
by: Chegini, Atoosa, et al.
Published: (2024)
State Soup: In-Context Skill Learning, Retrieval and Mixing
by: Pióro, Maciej, et al.
Published: (2024)
by: Pióro, Maciej, et al.
Published: (2024)
Objective Soups: Multilingual Multi-Task Modeling for Speech Processing
by: Saif, A F M, et al.
Published: (2025)
by: Saif, A F M, et al.
Published: (2025)
Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
by: Biggs, Benjamin, et al.
Published: (2024)
by: Biggs, Benjamin, et al.
Published: (2024)
VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models
by: Xu, Hefei, et al.
Published: (2026)
by: Xu, Hefei, et al.
Published: (2026)
Soup to go: mitigating forgetting during continual learning with model averaging
by: Kleiman, Anat, et al.
Published: (2025)
by: Kleiman, Anat, et al.
Published: (2025)
Flexible and Efficient Drift Detection without Labels
by: Tan, Nelvin, et al.
Published: (2025)
by: Tan, Nelvin, et al.
Published: (2025)
LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
by: Prabhakar, Akshara, et al.
Published: (2024)
by: Prabhakar, Akshara, et al.
Published: (2024)
SLADE: Detecting Dynamic Anomalies in Edge Streams without Labels via Self-Supervised Learning
by: Lee, Jongha, et al.
Published: (2024)
by: Lee, Jongha, et al.
Published: (2024)
Free Process Rewards without Process Labels
by: Yuan, Lifan, et al.
Published: (2024)
by: Yuan, Lifan, et al.
Published: (2024)
Multi-Label Plant Species Classification with Self-Supervised Vision Transformers
by: Gustineli, Murilo, et al.
Published: (2024)
by: Gustineli, Murilo, et al.
Published: (2024)
Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta
by: Tran, Quoc-Khang, et al.
Published: (2026)
by: Tran, Quoc-Khang, et al.
Published: (2026)
Galileo: Learning Global & Local Features of Many Remote Sensing Modalities
by: Tseng, Gabriel, et al.
Published: (2025)
by: Tseng, Gabriel, et al.
Published: (2025)
Rethinking Self-Distillation: Label Averaging and Enhanced Soft Label Refinement with Partial Labels
by: Jeong, Hyeonsu, et al.
Published: (2024)
by: Jeong, Hyeonsu, et al.
Published: (2024)
LookSharp: Attention Entropy Minimization for Test-Time Adaptation
by: Mali, Yash, et al.
Published: (2025)
by: Mali, Yash, et al.
Published: (2025)
Measuring Pre-training Data Quality without Labels for Time Series Foundation Models
by: Wen, Songkang, et al.
Published: (2024)
by: Wen, Songkang, et al.
Published: (2024)
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation
by: Zhou, Yujun, et al.
Published: (2025)
by: Zhou, Yujun, et al.
Published: (2025)
CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion
by: Wang, Yuan, et al.
Published: (2025)
by: Wang, Yuan, et al.
Published: (2025)
Ranking and Combining Latent Structured Predictive Scores without Labeled Data
by: Afshar, Shiva, et al.
Published: (2024)
by: Afshar, Shiva, et al.
Published: (2024)
Self-Directed Learning of Convex Labelings on Graphs
by: Sokolov, Georgy, et al.
Published: (2024)
by: Sokolov, Georgy, et al.
Published: (2024)
Similar Items
-
LookWhen? Fast Video Recognition by Learning When, Where, and What to Compute
by: Salamatian, Ali, et al.
Published: (2026) -
Self-Distillation of Hidden Layers for Self-Supervised Representation Learning
by: Lowe, Scott C., et al.
Published: (2026) -
A Closer Look at In-Distribution vs. Out-of-Distribution Accuracy for Open-Set Test-time Adaptation
by: Li, Zefeng, et al.
Published: (2026) -
Asymmetric Duos: Sidekicks Improve Uncertainty
by: Zhou, Tim G., et al.
Published: (2025) -
Thicker and Quicker: A Jumbo Token for Fast Plain Vision Transformers
by: Fuller, Anthony, et al.
Published: (2025)