:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Hosseini, Ryien, Simini, Filippo, Vishwanath, Venkatram, Willett, Rebecca, Hoffmann, Henry
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Machine Learning
Accesso online:	https://arxiv.org/abs/2503.01720
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Sketch-Augmented Features Improve Learning Long-Range Dependencies in Graph Neural Networks
di: Hosseini, Ryien, et al.
Pubblicazione: (2025)

A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation
di: Hosseini, Ryien, et al.
Pubblicazione: (2024)

Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving
di: Ding, Jianru, et al.
Pubblicazione: (2026)

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference
di: Chitty-Venkata, Krishna Teja, et al.
Pubblicazione: (2025)

Extending $μ$P: Spectral Conditions for Feature Learning Across Optimizers
di: Gupta, Akshita, et al.
Pubblicazione: (2026)

Swimba: Switch Mamba Model Scales State Space Models
di: Du, Zhixu, et al.
Pubblicazione: (2026)

Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling
di: Barwey, Shivam, et al.
Pubblicazione: (2024)

ReLU Neural Networks with Linear Layers are Biased Towards Single- and Multi-Index Models
di: Parkinson, Suzanna, et al.
Pubblicazione: (2023)

PreLoRA: Hybrid Pre-training of Vision Transformers with Full Training and Low-Rank Adapters
di: Thapa, Krishu K, et al.
Pubblicazione: (2025)

BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference
di: Gulhan, Ahmed Burak, et al.
Pubblicazione: (2025)

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference
di: Chitty-Venkata, Krishna Teja, et al.
Pubblicazione: (2025)

Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks
di: Barwey, Shivam, et al.
Pubblicazione: (2024)

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
di: Chitty-Venkata, Krishna Teja, et al.
Pubblicazione: (2025)

How do simple rotations affect the implicit bias of Adam?
di: DePavia, Adela, et al.
Pubblicazione: (2025)

Embed and Emulate: Contrastive representations for simulation-based inference
di: Jiang, Ruoxi, et al.
Pubblicazione: (2024)

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators
di: Chitty-Venkata, Krishna Teja, et al.
Pubblicazione: (2024)

Integrating Uncertainty Awareness into Conformalized Quantile Regression
di: Rossellini, Raphael, et al.
Pubblicazione: (2023)

Stabilizing black-box model selection with the inflated argmax
di: Adrian, Melissa, et al.
Pubblicazione: (2024)

Data Assimilation with Machine Learning Surrogate Models: A Case Study with FourCastNet
di: Adrian, Melissa, et al.
Pubblicazione: (2024)

Auto-differentiable data assimilation: Co-learning of states, dynamics, and filtering algorithms
di: Adrian, Melissa, et al.
Pubblicazione: (2026)

Building a stable classifier with the inflated argmax
di: Soloff, Jake A., et al.
Pubblicazione: (2024)

Bagging Provides Assumption-free Stability
di: Soloff, Jake A., et al.
Pubblicazione: (2023)

Depth Separation in Norm-Bounded Infinite-Width Neural Networks
di: Parkinson, Suzanna, et al.
Pubblicazione: (2024)

Learning Paths for Dynamic Measure Transport: A Control Perspective
di: Maurais, Aimee, et al.
Pubblicazione: (2025)

Training neural operators to preserve invariant measures of chaotic attractors
di: Jiang, Ruoxi, et al.
Pubblicazione: (2023)

Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting
di: Orlova, Elena, et al.
Pubblicazione: (2022)

A Model-Guided Neural Network Method for the Inverse Scattering Problem
di: Tsang, Olivia, et al.
Pubblicazione: (2025)

Solving Inverse Problems with Deep Linear Neural Networks: Global Convergence Guarantees for Gradient Descent with Weight Decay
di: Laus, Hannah, et al.
Pubblicazione: (2025)

Accelerating PDE Surrogates via RL-Guided Mesh Optimization
di: Meng, Yang, et al.
Pubblicazione: (2026)

Assumption-free stability for ranking problems
di: Liang, Ruiting, et al.
Pubblicazione: (2025)

Mean-Field Langevin Dynamics for Signed Measures via a Bilevel Approach
di: Wang, Guillaume, et al.
Pubblicazione: (2024)

Deep Stochastic Mechanics
di: Orlova, Elena, et al.
Pubblicazione: (2023)

Can a calibration metric be both testable and actionable?
di: Rossellini, Raphael, et al.
Pubblicazione: (2025)

Minimax Rates for Hyperbolic Hierarchical Learning
di: Rawal, Divit, et al.
Pubblicazione: (2026)

A Generalized Tikhonov Layer for Interpretable-by-design Graph Neural Networks
di: Tremblay, Nicolas, et al.
Pubblicazione: (2026)

Discount Model Search for Quality Diversity Optimization in High-Dimensional Measure Spaces
di: Tjanaka, Bryon, et al.
Pubblicazione: (2026)

Statistical Guarantees in Synthetic Data through Conformal Adversarial Generation
di: Vishwakarma, Rahul, et al.
Pubblicazione: (2025)

Hierarchical Implicit Neural Emulators
di: Jiang, Ruoxi, et al.
Pubblicazione: (2025)

Comparing Methods for Bias Mitigation in Graph Neural Networks
di: Hoffmann, Barbara, et al.
Pubblicazione: (2025)

Certified Guidance for Planning with Deep Generative Models
di: Giacomarra, Francesco, et al.
Pubblicazione: (2025)