Guardado en:
| Autores principales: | Nihal, Ragib Amin, Yen, Benjamin, Itoyama, Katsutoshi, Nakadai, Kazuhiro |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2408.04922 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
From Blurry to Brilliant Detection: YOLO-Based Aerial Object Detection with Super Resolution
por: Nihal, Ragib Amin, et al.
Publicado: (2024)
por: Nihal, Ragib Amin, et al.
Publicado: (2024)
Knowledge-Augmented Vision Language Models for Underwater Bioacoustic Spectrogram Analysis
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
Cross-Attention with Confidence Weighting for Multi-Channel Audio Alignment
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
Weakly Supervised Detection and Temporal Localization of Whale Calls in Long-Duration Bioacoustic Data
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
Pattern Enhanced Multi-Turn Jailbreaking: Exploiting Structural Vulnerabilities in Large Language Models
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
por: Nihal, Ragib Amin, et al.
Publicado: (2025)
Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?
por: Hiroe, Atsuo, et al.
Publicado: (2024)
por: Hiroe, Atsuo, et al.
Publicado: (2024)
Bird Vocalization Embedding Extraction Using Self-Supervised Disentangled Representation Learning
por: Shi, Runwu, et al.
Publicado: (2024)
por: Shi, Runwu, et al.
Publicado: (2024)
Ecologically-Constrained Task Arithmetic for Multi-Taxa Bioacoustic Classifiers Without Shared Data
por: Nihal, Ragib Amin, et al.
Publicado: (2026)
por: Nihal, Ragib Amin, et al.
Publicado: (2026)
A Comprehensive Dataset for Human vs. AI Generated Image Detection
por: Roy, Rajarshi, et al.
Publicado: (2026)
por: Roy, Rajarshi, et al.
Publicado: (2026)
3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment
por: Le, Nhut, et al.
Publicado: (2025)
por: Le, Nhut, et al.
Publicado: (2025)
Single-Channel Target Speech Extraction Utilizing Distance and Room Clues
por: Shi, Runwu, et al.
Publicado: (2025)
por: Shi, Runwu, et al.
Publicado: (2025)
A Benchmark Dataset for Spatially Aligned Road Damage Assessment in Small Uncrewed Aerial Systems Disaster Imagery
por: Manzini, Thomas, et al.
Publicado: (2025)
por: Manzini, Thomas, et al.
Publicado: (2025)
ChangeQuery: Advancing Remote Sensing Change Analysis for Natural and Human-Induced Disasters from Visual Detection to Semantic Understanding
por: Sun, Dongwei, et al.
Publicado: (2026)
por: Sun, Dongwei, et al.
Publicado: (2026)
OmniGround: A Comprehensive Spatio-Temporal Grounding Benchmark for Real-World Complex Scenarios
por: Gao, Hong, et al.
Publicado: (2025)
por: Gao, Hong, et al.
Publicado: (2025)
Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions
por: Dong, Yifei, et al.
Publicado: (2025)
por: Dong, Yifei, et al.
Publicado: (2025)
Towards Adaptive Human-centric Video Anomaly Detection: A Comprehensive Framework and A New Benchmark
por: Pazho, Armin Danesh, et al.
Publicado: (2024)
por: Pazho, Armin Danesh, et al.
Publicado: (2024)
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
por: Zhou, Baichuan, et al.
Publicado: (2024)
por: Zhou, Baichuan, et al.
Publicado: (2024)
MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats
por: Yuan, Shenghai, et al.
Publicado: (2024)
por: Yuan, Shenghai, et al.
Publicado: (2024)
DisasterVQA: A Visual Question Answering Benchmark Dataset for Disaster Scenes
por: Al-Mohannadi, Aisha, et al.
Publicado: (2026)
por: Al-Mohannadi, Aisha, et al.
Publicado: (2026)
BD-SAT: High-resolution Land Use Land Cover Dataset & Benchmark Results for Developing Division: Dhaka, BD
por: Paul, Ovi, et al.
Publicado: (2024)
por: Paul, Ovi, et al.
Publicado: (2024)
AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery
por: Zhou, Hangyu, et al.
Publicado: (2024)
por: Zhou, Hangyu, et al.
Publicado: (2024)
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
por: Wang, Kai, et al.
Publicado: (2024)
por: Wang, Kai, et al.
Publicado: (2024)
DVD: A Comprehensive Dataset for Advancing Violence Detection in Real-World Scenarios
por: Kollias, Dimitrios, et al.
Publicado: (2025)
por: Kollias, Dimitrios, et al.
Publicado: (2025)
A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario
por: Addy, Cyrus, et al.
Publicado: (2025)
por: Addy, Cyrus, et al.
Publicado: (2025)
Concept-based explanations of Segmentation and Detection models in Natural Disaster Management
por: Heydari, Samar, et al.
Publicado: (2026)
por: Heydari, Samar, et al.
Publicado: (2026)
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking
por: Rahman, Md. Mahfuzur, et al.
Publicado: (2024)
por: Rahman, Md. Mahfuzur, et al.
Publicado: (2024)
HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario
por: Saadatnejad, Saeed, et al.
Publicado: (2025)
por: Saadatnejad, Saeed, et al.
Publicado: (2025)
Benchmarking Large Vision-Language Models on CFMME: A Comprehensive Chinese Financial Multimodal Evaluation Dataset
por: Chen, Qian, et al.
Publicado: (2026)
por: Chen, Qian, et al.
Publicado: (2026)
MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding
por: Kou, Qian, et al.
Publicado: (2026)
por: Kou, Qian, et al.
Publicado: (2026)
Texture-AD: An Anomaly Detection Dataset and Benchmark for Real Algorithm Development
por: Lei, Tianwu, et al.
Publicado: (2024)
por: Lei, Tianwu, et al.
Publicado: (2024)
UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model
por: Jankovic, Branislava, et al.
Publicado: (2025)
por: Jankovic, Branislava, et al.
Publicado: (2025)
A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning
por: Jiang, Siyang, et al.
Publicado: (2025)
por: Jiang, Siyang, et al.
Publicado: (2025)
Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approach
por: Wang, Xincheng, et al.
Publicado: (2025)
por: Wang, Xincheng, et al.
Publicado: (2025)
LeafNet: A Large-Scale Dataset and Comprehensive Benchmark for Foundational Vision-Language Understanding of Plant Diseases
por: Quoc, Khang Nguyen, et al.
Publicado: (2026)
por: Quoc, Khang Nguyen, et al.
Publicado: (2026)
January Food Benchmark (JFB): A Public Benchmark Dataset and Evaluation Suite for Multimodal Food Analysis
por: Hosseinian, Amir, et al.
Publicado: (2025)
por: Hosseinian, Amir, et al.
Publicado: (2025)
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
por: Jiang, Xi, et al.
Publicado: (2024)
por: Jiang, Xi, et al.
Publicado: (2024)
SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities
por: Ashraf, Yasser, et al.
Publicado: (2025)
por: Ashraf, Yasser, et al.
Publicado: (2025)
Novel Anomaly Detection Scenarios and Evaluation Metrics to Address the Ambiguity in the Definition of Normal Samples
por: Saito, Reiji, et al.
Publicado: (2026)
por: Saito, Reiji, et al.
Publicado: (2026)
Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
por: Qin, Lixiong, et al.
Publicado: (2025)
por: Qin, Lixiong, et al.
Publicado: (2025)
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
por: Li, Zhang, et al.
Publicado: (2026)
por: Li, Zhang, et al.
Publicado: (2026)
Ejemplares similares
-
From Blurry to Brilliant Detection: YOLO-Based Aerial Object Detection with Super Resolution
por: Nihal, Ragib Amin, et al.
Publicado: (2024) -
Knowledge-Augmented Vision Language Models for Underwater Bioacoustic Spectrogram Analysis
por: Nihal, Ragib Amin, et al.
Publicado: (2025) -
Cross-Attention with Confidence Weighting for Multi-Channel Audio Alignment
por: Nihal, Ragib Amin, et al.
Publicado: (2025) -
Weakly Supervised Detection and Temporal Localization of Whale Calls in Long-Duration Bioacoustic Data
por: Nihal, Ragib Amin, et al.
Publicado: (2025) -
Pattern Enhanced Multi-Turn Jailbreaking: Exploiting Structural Vulnerabilities in Large Language Models
por: Nihal, Ragib Amin, et al.
Publicado: (2025)