:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Huang, Benhao, Geng, Zhengyang, Kolter, Zico
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Machine Learning
Online-Zugang:	https://arxiv.org/abs/2605.21488
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

One-Step Diffusion Distillation via Deep Equilibrium Models
von: Geng, Zhengyang, et al.
Veröffentlicht: (2023)

Mean Flows for One-step Generative Modeling
von: Geng, Zhengyang, et al.
Veröffentlicht: (2025)

Consistency Models Made Easy
von: Geng, Zhengyang, et al.
Veröffentlicht: (2024)

One-Step Diffusion Distillation through Score Implicit Matching
von: Luo, Weijian, et al.
Veröffentlicht: (2024)

Weight Ensembling Improves Reasoning in Language Models
von: Dang, Xingyu, et al.
Veröffentlicht: (2025)

Evaluating Language Model Reasoning about Confidential Information
von: Sam, Dylan, et al.
Veröffentlicht: (2025)

Finetuning CLIP to Reason about Pairwise Differences
von: Sam, Dylan, et al.
Veröffentlicht: (2024)

AcceleratedLiNGAM: Learning Causal DAGs at the speed of GPUs
von: Akinwande, Victor, et al.
Veröffentlicht: (2024)

Improved Mean Flows: On the Challenges of Fastforward Generative Models
von: Geng, Zhengyang, et al.
Veröffentlicht: (2025)

Why is SAM Robust to Label Noise?
von: Baek, Christina, et al.
Veröffentlicht: (2024)

FUSE-ing Language Models: Zero-Shot Adapter Discovery for Prompt Optimization Across Tokenizers
von: Williams, Joshua Nathaniel, et al.
Veröffentlicht: (2024)

Mimetic Initialization of MLPs
von: Trockman, Asher, et al.
Veröffentlicht: (2026)

Predicting the Performance of Black-box LLMs through Follow-up Queries
von: Sam, Dylan, et al.
Veröffentlicht: (2025)

Generative Posterior Networks for Approximately Bayesian Epistemic Uncertainty Estimation
von: Roderick, Melrose, et al.
Veröffentlicht: (2023)

Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression
von: Zhai, Runtian, et al.
Veröffentlicht: (2023)

Diffusing Differentiable Representations
von: Savani, Yash, et al.
Veröffentlicht: (2024)

Measuring Five-Nines Reliability: Sample-Efficient LLM Evaluation in Saturated Benchmarks
von: Kim, Eungyeup, et al.
Veröffentlicht: (2026)

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
von: Xu, Yixuan Even, et al.
Veröffentlicht: (2025)

Mimetic Initialization Helps State Space Models Learn to Recall
von: Trockman, Asher, et al.
Veröffentlicht: (2024)

The Mixing method: low-rank coordinate descent for semidefinite programming with diagonal constraints
von: Wang, Po-Wei, et al.
Veröffentlicht: (2017)

Context-Parametric Inversion: Why Instruction Finetuning Can Worsen Context Reliance
von: Goyal, Sachin, et al.
Veröffentlicht: (2024)

Massive Activations in Large Language Models
von: Sun, Mingjie, et al.
Veröffentlicht: (2024)

An Axiomatic Approach to Model-Agnostic Concept Explanations
von: Feng, Zhili, et al.
Veröffentlicht: (2024)

Rethinking Distance Metrics for Counterfactual Explainability
von: Williams, Joshua Nathaniel, et al.
Veröffentlicht: (2024)

Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search
von: Sokota, Samuel, et al.
Veröffentlicht: (2025)

A Simple and Effective Pruning Approach for Large Language Models
von: Sun, Mingjie, et al.
Veröffentlicht: (2023)

Looking beyond the next token
von: Thankaraj, Abitha, et al.
Veröffentlicht: (2025)

Forcing Diffuse Distributions out of Language Models
von: Zhang, Yiming, et al.
Veröffentlicht: (2024)

When Should We Introduce Safety Interventions During Pretraining?
von: Sam, Dylan, et al.
Veröffentlicht: (2026)

Understanding Hallucinations in Diffusion Models through Mode Interpolation
von: Aithal, Sumukh K, et al.
Veröffentlicht: (2024)

Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws
von: Jiang, Yiding, et al.
Veröffentlicht: (2024)

Provably Bounding Neural Network Preimages
von: Kotha, Suhas, et al.
Veröffentlicht: (2023)

From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers
von: Gurumurthy, Swaminathan, et al.
Veröffentlicht: (2024)

Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers
von: Williams, Joshua Nathaniel, et al.
Veröffentlicht: (2024)

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic
von: Goyal, Sachin, et al.
Veröffentlicht: (2024)

Predicting the Performance of Foundation Models via Agreement-on-the-Line
von: Saxena, Rahul, et al.
Veröffentlicht: (2024)

Bayesian Neural Networks with Domain Knowledge Priors
von: Sam, Dylan, et al.
Veröffentlicht: (2024)

Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-Line
von: Kim, Eungyeup, et al.
Veröffentlicht: (2023)

Understanding Optimization in Deep Learning with Central Flows
von: Cohen, Jeremy M., et al.
Veröffentlicht: (2024)

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning
von: Maini, Pratyush, et al.
Veröffentlicht: (2023)