| Main Authors: | Douze, Matthijs, Guzhva, Alexandr, Deng, Chengqi, Johnson, Jeff, Szilvasy, Gergely, Mazaré, Pierre-Emmanuel, Lomeli, Maria, Hosseini, Lucas, Jégou, Hervé |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2401.08281 |
Similar Items
Vector search with small radiuses
by: Szilvasy, Gergely, et al.
Published: (2024)
Inference-time sparse attention with asymmetric indexing
by: Mazaré, Pierre-Emmanuel, et al.
Published: (2025)
Self-Pruned Key-Value Attention: Learning When to Write by Predicting Future Utility
by: Szilvasy, Gergely, et al.
Published: (2026)
Short window attention enables long-term memorization
by: Cabannes, Loïc, et al.
Published: (2025)
Stochastic activations
by: Lomeli, Maria, et al.
Published: (2025)
evclust: Python library for evidential clustering
by: Soubeiga, Armel, et al.
Published: (2025)
Cross-Breed Pig Identification Using Auricular Vein Pattern Recognition: A Machine Learning Approach for Small-Scale Farming Applications
by: Nsengiyumvaa, Emmanuel, et al.
Published: (2025)
GUing: A Mobile GUI Search Engine using a Vision-Language Model
by: Wei, Jialiang, et al.
Published: (2024)
Can Vision-Language Models Handle Long-Context Code? An Empirical Study on Visual Compression
by: Zhong, Jianping, et al.
Published: (2026)
BLIP-FusePPO: A Vision-Language Deep Reinforcement Learning Framework for Lane Keeping in Autonomous Vehicles
by: Miangoleh, Seyed Ahmad Hosseini, et al.
Published: (2025)
Terrain characterisation for online adaptability of automated sonar processing: Lessons learnt from operationally applying ATR to sidescan sonar in MCM applications
by: Guerneve, Thomas, et al.
Published: (2024)
SWAN -- Enabling Fast and Mobile Histopathology Image Annotation through Swipeable Interfaces
by: Banerjee, Sweta, et al.
Published: (2025)
DD-CAM: Minimal Sufficient Explanations for Vision Models Using Delta Debugging
by: Khadka, Krishna, et al.
Published: (2026)
Foundation Models in Remote Sensing: Evolving from Unimodality to Multimodality
by: Hong, Danfeng, et al.
Published: (2026)
Technical Report for Argoverse2 Scenario Mining Challenges on Iterative Error Correction and Spatially-Aware Prompting
by: Chen, Yifei, et al.
Published: (2025)
CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent
by: Lin, Haojia, et al.
Published: (2025)
How Far Can VLMs Go for Visual Bug Detection? Studying 19,738 Keyframes from 41 Hours of Gameplay Videos
by: Lu, Wentao, et al.
Published: (2026)
Natural Adversaries: Fuzzing Autonomous Vehicles with Realistic Roadside Object Placements
by: Sun, Yang, et al.
Published: (2024)
Earth Embeddings as Products: Taxonomy, Ecosystem, and Standardized Access
by: Fang, Heng, et al.
Published: (2026)
Interpretable Gallbladder Ultrasound Diagnosis: A Lightweight Web-Mobile Software Platform with Real-Time XAI
by: Bhoyan, Fuyad Hasan, et al.
Published: (2025)
What to Test Next: Interpretable Coverage Gap Discovery in Driving VLMs
by: Aich, Abhishek, et al.
Published: (2026)
ROMAN: Reward-Orchestrated Multi-Head Attention Network for Autonomous Driving System Testing
by: Chi, Jianlei, et al.
Published: (2026)
Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach
by: Amini, Mohammad Hossein, et al.
Published: (2025)
Ear-Keeper: A Cross-Platform AI System for Rapid and Accurate Ear Disease Diagnosis
by: Lu, Feiyan, et al.
Published: (2023)
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents
by: Meng, Fanqing, et al.
Published: (2026)
Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems
by: Lambertenghi, Stefano Carlo, et al.
Published: (2025)
A Highly Efficient Diversity-based Input Selection for DNN Improvement Using VLMs
by: Abbasishahkoo, Amin, et al.
Published: (2026)
Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
by: Lyu, Yunbo, et al.
Published: (2025)
Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing
by: Mzoughi, Seif, et al.
Published: (2025)
TigAug: Data Augmentation for Testing Traffic Light Detection in Autonomous Driving Systems
by: Lu, You, et al.
Published: (2025)
VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
by: Chang, Zhiyuan, et al.
Published: (2024)
ARI3D: A Software for Interactive Quantification of Regions in X-Ray CT 3D Images
by: Albrecht, Jan Phillipp, et al.
Published: (2025)
A Plausibility Study of Using Augmented Reality in the Ventriculoperitoneal Shunt Operations
by: Dorji, Tandin, et al.
Published: (2024)
VideoGameBunny: Towards vision assistants for video games
by: Taesiri, Mohammad Reza, et al.
Published: (2024)
Open-source automatic pipeline for efficient conversion of large-scale point clouds to IFC format
by: Zbirovský, Slávek, et al.
Published: (2025)
DOne: Decoupling Structure and Rendering for High-Fidelity Design-to-Code Generation
by: Huang, Xinhao, et al.
Published: (2026)
Investigating Traffic Accident Detection Using Multimodal Large Language Models
by: Skender, Ilhan, et al.
Published: (2025)
A Retrieval-Augmented Generation Approach to Extracting Algorithmic Logic from Neural Networks
by: Khalid, Waleed, et al.
Published: (2025)
MVOS_HSI: A Python Library for Preprocessing Agricultural Crop Hyperspectral Data
by: Aggarwal, Rishik, et al.
Published: (2026)
ITKIT: Feasible CT Image Analysis based on SimpleITK and MMEngine
by: Zhang, Yiqin, et al.
Published: (2026)