:: Library Catalog

Bibliographic Details
Main Authors:	Abbott, Vincent; Zardini, Gioele
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2412.03317

Similar Items

Weaves, Wires, and Morphisms: Formalizing and Implementing the Algebra of Deep Learning
by: Abbott, Vincent, et al.
Published: (2026)

Diagrammatic Negative Information
by: Abbott, Vincent, et al.
Published: (2024)

FlashMask: Efficient and Rich Mask Extension of FlashAttention
by: Wang, Guoxia, et al.
Published: (2024)

Accelerating Machine Learning Systems via Category Theory: Applications to Spherical Attention for Gene Regulatory Networks
by: Abbott, Vincent, et al.
Published: (2025)

INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
by: Chen, Shimao, et al.
Published: (2024)

AMLA: MUL by ADD in FlashAttention Rescaling
by: Liao, Qichen, et al.
Published: (2025)

Functor String Diagrams: A Novel Approach to Flexible Diagrams for Applied Category Theory
by: Abbott, Vincent, et al.
Published: (2024)

FLASH-D: FlashAttention with Hidden Softmax Division
by: Alexandridis, Kosmas, et al.
Published: (2025)

FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
by: Shah, Jay, et al.
Published: (2024)

FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs
by: Lin, Haoran, et al.
Published: (2024)

Low-Cost FlashAttention with Fused Exponential and Multiplication Hardware Operators
by: Alexandridis, Kosmas, et al.
Published: (2025)

Vectorized FlashAttention with Low-cost Exponential Computation in RISC-V Vector Processors
by: Titopoulos, Vasileios, et al.
Published: (2025)

Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10
by: Zhu, Yifan, et al.
Published: (2026)

Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures
by: Abbott, Vincent
Published: (2024)

Causal Inference with the "Napkin Graph"
by: Guo, Anna, et al.
Published: (2025)

GRAND: Guidance, Rebalancing, and Assignment for Networked Dispatch in Multi-Agent Path Finding
by: Gaber, Johannes, et al.
Published: (2025)

Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning
by: Zhang, Runyu, et al.
Published: (2025)

FlashSinkhorn: IO-Aware Entropic Optimal Transport on GPU
by: Ye, Felix X.-F., et al.
Published: (2026)

Robo-taxi Fleet Coordination at Scale via Reinforcement Learning
by: Tresca, Luigi, et al.
Published: (2025)

Distributionally Robust Imitation Learning: Layered Control Architecture for Certifiable Autonomy
by: Gahlawat, Aditya, et al.
Published: (2025)

Representation Shift: Unifying Token Compression with FlashAttention
by: Choi, Joonmyung, et al.
Published: (2025)

Is Flash Attention Stable?
by: Golden, Alicia, et al.
Published: (2024)

Learning Orbitally Stable Systems for Diagrammatically Teaching
by: Zhi, Weiming, et al.
Published: (2023)

Flash Invariant Point Attention
by: Liu, Andrew, et al.
Published: (2025)

Block Sparse Flash Attention
by: Ohayon, Daniel, et al.
Published: (2025)

SystolicAttention: Fusing FlashAttention within a Single Systolic Array
by: Lin, Jiawei, et al.
Published: (2025)

FlashBias: Fast Computation of Attention with Bias
by: Wu, Haixu, et al.
Published: (2025)

KVBuffer: IO-aware Serving for Linear Attention
by: Zou, Longwei, et al.
Published: (2026)

H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
by: Alexandridis, Kosmas, et al.
Published: (2025)

AdaSplash: Adaptive Sparse Flash Attention
by: Gonçalves, Nuno, et al.
Published: (2025)

A Diagrammatic Approach to Improve Computational Efficiency in Group Equivariant Neural Networks
by: Pearce-Crump, Edward, et al.
Published: (2024)

Efficiently Dispatching Flash Attention For Partially Filled Attention Masks
by: Sharma, Agniv, et al.
Published: (2024)

Enhancing Training Efficiency Using Packing with Flash Attention
by: Kundu, Achintya, et al.
Published: (2024)

Distributional Uncertainty and Adaptive Decision-Making in System Co-design
by: Huang, Yujun, et al.
Published: (2026)

Where Should Robotaxis Operate? Strategic Network Design for Autonomous Mobility-on-Demand
by: Li, Xinling, et al.
Published: (2026)

Random-Subspace Sequential Quadratic Programming for Constrained Zeroth-Order Optimization
by: Zhang, Runyu, et al.
Published: (2026)

A general learning scheme for classical and quantum Ising machines
by: Schmid, Ludwig, et al.
Published: (2023)

GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory
by: Liu, Jiaxu, et al.
Published: (2025)

Instructing Robots by Sketching: Learning from Demonstration via Probabilistic Diagrammatic Teaching
by: Zhi, Weiming, et al.
Published: (2023)

DiagrammaticLearning: A Graphical Language for Compositional Training Regimes
by: Lary, Mason, et al.
Published: (2025)