:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Grazzi, Riccardo, Siems, Julien, Schrodi, Simon, Brox, Thomas, Hutter, Frank
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.03170
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products
by: Siems, Julien, et al.
Published: (2025)

Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer
by: Schrodi, Simon, et al.
Published: (2025)

Learning State-Tracking from Code Using Linear RNNs
by: Siems, Julien, et al.
Published: (2026)

Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
by: Grazzi, Riccardo, et al.
Published: (2024)

Concept Bottleneck Models Without Predefined Concepts
by: Schrodi, Simon, et al.
Published: (2024)

When and How Does CLIP Enable Domain and Compositional Generalization?
by: Kempf, Elias, et al.
Published: (2025)

Mamba4Cast: Efficient Zero-Shot Time Series Forecasting with State Space Models
by: Bhethanabhotla, Sathya Kamesh, et al.
Published: (2024)

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models
by: Schrodi, Simon, et al.
Published: (2024)

Simple LLM Baselines are Competitive for Model Diffing
by: Kempf, Elias, et al.
Published: (2026)

Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems
by: Hoffmann, David T., et al.
Published: (2023)

TempoPFN: Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting
by: Moroshan, Vladyslav, et al.
Published: (2025)

GAMformer: Bridging Tabular Foundation Models and Interpretable Machine Learning
by: Mueller, Andreas, et al.
Published: (2024)

What Drives Compositional Generalization? The Importance of Continuous Training Objectives in Visual Generative Models
by: Farid, Karim, et al.
Published: (2025)

Convergence Properties of Stochastic Hypergradients
by: Grazzi, Riccardo, et al.
Published: (2020)

Nonsmooth Implicit Differentiation: Deterministic and Stochastic Convergence Rates
by: Grazzi, Riccardo, et al.
Published: (2024)

OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization
by: Gadhikar, Advait, et al.
Published: (2025)

Early Stopping Tabular In-Context Learning
by: Küken, Jaris, et al.
Published: (2025)

Bayes' Power for Explaining In-Context Learning Generalizations
by: Müller, Samuel, et al.
Published: (2024)

Learning invariant representations of time-homogeneous stochastic dynamical systems
by: Kostic, Vladimir R., et al.
Published: (2023)

Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
by: Ging, Simon, et al.
Published: (2024)

Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities
by: Spiegelhalter, Urs, et al.
Published: (2025)

Do-PFN: In-Context Learning for Causal Effect Estimation
by: Robertson, Jake, et al.
Published: (2025)

Constrained Reinforcement Learning for Safe Heat Pump Control
by: Zhang, Baohe, et al.
Published: (2024)

Unlocking In-Context Learning for Natural Datasets Beyond Language Modelling
by: Bratulić, Jelena, et al.
Published: (2025)

Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks
by: Abed, Amal, et al.
Published: (2025)

Curve Your Enthusiasm: Concurvity Regularization in Differentiable Generalized Additive Models
by: Siems, Julien, et al.
Published: (2023)

Constrained Reinforcement Learning with Smoothed Log Barrier Function
by: Zhang, Baohe, et al.
Published: (2024)

In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter Optimization
by: Rakotoarison, Herilalaina, et al.
Published: (2024)

Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data
by: Helli, Kai, et al.
Published: (2024)

Investigation into In-Context Learning Capabilities of Transformers
by: Chandrupatla, Rushil, et al.
Published: (2026)

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
by: Park, Jongho, et al.
Published: (2024)

Can Mamba Learn In Context with Outliers? A Theoretical Generalization Analysis
by: Li, Hongkang, et al.
Published: (2025)

CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
by: Bhatt, Aditya, et al.
Published: (2019)

Self-Correcting Bayesian Optimization through Bayesian Active Learning
by: Hvarfner, Carl, et al.
Published: (2023)

c-TPE: Tree-structured Parzen Estimator with Inequality Constraints for Expensive Hyperparameter Optimization
by: Watanabe, Shuhei, et al.
Published: (2022)

Mamba Can Learn Low-Dimensional Targets In-Context via Test-Time Feature Learning
by: Oh, Junsoo, et al.
Published: (2025)

Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond
by: Galesso, Silvio, et al.
Published: (2024)

Levin Tree Search with Context Models
by: Orseau, Laurent, et al.
Published: (2023)

Eliciting associations between clinical variables from LLMs via comparison questions across populations
by: Kabus, Fabian, et al.
Published: (2026)

On the Learn-to-Optimize Capabilities of Transformers in In-Context Sparse Recovery
by: Liu, Renpu, et al.
Published: (2024)