:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Nihal, Ragib Amin, Yen, Benjamin, Itoyama, Katsutoshi, Nakadai, Kazuhiro
Formato:	Preprint
Publicado:	2024
Materias:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
Acceso en línea:	https://arxiv.org/abs/2408.04922
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

From Blurry to Brilliant Detection: YOLO-Based Aerial Object Detection with Super Resolution
por: Nihal, Ragib Amin, et al.
Publicado: (2024)

Knowledge-Augmented Vision Language Models for Underwater Bioacoustic Spectrogram Analysis
por: Nihal, Ragib Amin, et al.
Publicado: (2025)

Cross-Attention with Confidence Weighting for Multi-Channel Audio Alignment
por: Nihal, Ragib Amin, et al.
Publicado: (2025)

Weakly Supervised Detection and Temporal Localization of Whale Calls in Long-Duration Bioacoustic Data
por: Nihal, Ragib Amin, et al.
Publicado: (2025)

Pattern Enhanced Multi-Turn Jailbreaking: Exploiting Structural Vulnerabilities in Large Language Models
por: Nihal, Ragib Amin, et al.
Publicado: (2025)

Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?
por: Hiroe, Atsuo, et al.
Publicado: (2024)

Bird Vocalization Embedding Extraction Using Self-Supervised Disentangled Representation Learning
por: Shi, Runwu, et al.
Publicado: (2024)

Ecologically-Constrained Task Arithmetic for Multi-Taxa Bioacoustic Classifiers Without Shared Data
por: Nihal, Ragib Amin, et al.
Publicado: (2026)

A Comprehensive Dataset for Human vs. AI Generated Image Detection
por: Roy, Rajarshi, et al.
Publicado: (2026)

3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment
por: Le, Nhut, et al.
Publicado: (2025)

Single-Channel Target Speech Extraction Utilizing Distance and Room Clues
por: Shi, Runwu, et al.
Publicado: (2025)

A Benchmark Dataset for Spatially Aligned Road Damage Assessment in Small Uncrewed Aerial Systems Disaster Imagery
por: Manzini, Thomas, et al.
Publicado: (2025)

ChangeQuery: Advancing Remote Sensing Change Analysis for Natural and Human-Induced Disasters from Visual Detection to Semantic Understanding
por: Sun, Dongwei, et al.
Publicado: (2026)

OmniGround: A Comprehensive Spatio-Temporal Grounding Benchmark for Real-World Complex Scenarios
por: Gao, Hong, et al.
Publicado: (2025)

Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions
por: Dong, Yifei, et al.
Publicado: (2025)

Towards Adaptive Human-centric Video Anomaly Detection: A Comprehensive Framework and A New Benchmark
por: Pazho, Armin Danesh, et al.
Publicado: (2024)

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
por: Zhou, Baichuan, et al.
Publicado: (2024)

MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats
por: Yuan, Shenghai, et al.
Publicado: (2024)

DisasterVQA: A Visual Question Answering Benchmark Dataset for Disaster Scenes
por: Al-Mohannadi, Aisha, et al.
Publicado: (2026)

BD-SAT: High-resolution Land Use Land Cover Dataset & Benchmark Results for Developing Division: Dhaka, BD
por: Paul, Ovi, et al.
Publicado: (2024)

AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery
por: Zhou, Hangyu, et al.
Publicado: (2024)

Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
por: Wang, Kai, et al.
Publicado: (2024)

DVD: A Comprehensive Dataset for Advancing Violence Detection in Real-World Scenarios
por: Kollias, Dimitrios, et al.
Publicado: (2025)

A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario
por: Addy, Cyrus, et al.
Publicado: (2025)

Concept-based explanations of Segmentation and Detection models in Natural Disaster Management
por: Heydari, Samar, et al.
Publicado: (2026)

UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking
por: Rahman, Md. Mahfuzur, et al.
Publicado: (2024)

HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario
por: Saadatnejad, Saeed, et al.
Publicado: (2025)

Benchmarking Large Vision-Language Models on CFMME: A Comprehensive Chinese Financial Multimodal Evaluation Dataset
por: Chen, Qian, et al.
Publicado: (2026)

MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding
por: Kou, Qian, et al.
Publicado: (2026)

Texture-AD: An Anomaly Detection Dataset and Benchmark for Real Algorithm Development
por: Lei, Tianwu, et al.
Publicado: (2024)

UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model
por: Jankovic, Branislava, et al.
Publicado: (2025)

A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning
por: Jiang, Siyang, et al.
Publicado: (2025)

Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approach
por: Wang, Xincheng, et al.
Publicado: (2025)

LeafNet: A Large-Scale Dataset and Comprehensive Benchmark for Foundational Vision-Language Understanding of Plant Diseases
por: Quoc, Khang Nguyen, et al.
Publicado: (2026)

January Food Benchmark (JFB): A Public Benchmark Dataset and Evaluation Suite for Multimodal Food Analysis
por: Hosseinian, Amir, et al.
Publicado: (2025)

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
por: Jiang, Xi, et al.
Publicado: (2024)

SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities
por: Ashraf, Yasser, et al.
Publicado: (2025)

Novel Anomaly Detection Scenarios and Evaluation Metrics to Address the Ambiguity in the Definition of Normal Samples
por: Saito, Reiji, et al.
Publicado: (2026)

Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
por: Qin, Lixiong, et al.
Publicado: (2025)

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
por: Li, Zhang, et al.
Publicado: (2026)