Saved in:
| Main Authors: | Wang, Xiao, Wang, Fuling, Li, Yuehang, Ma, Qingchuan, Wang, Shiao, Jiang, Bo, Li, Chuanfu, Tang, Jin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.00379 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
R2GenCSR: Mining Contextual and Residual Information for LLMs-based Radiology Report Generation
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats
by: Chambon, Pierre, et al.
Published: (2024)
by: Chambon, Pierre, et al.
Published: (2024)
Pre-training on High Definition X-ray Images: An Experimental Study
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation
by: Wang, Xiao, et al.
Published: (2025)
by: Wang, Xiao, et al.
Published: (2025)
Sign Language Translation using Frame and Event Stream: Benchmark Dataset and Algorithms
by: Wang, Xiao, et al.
Published: (2025)
by: Wang, Xiao, et al.
Published: (2025)
Event Stream-based Sign Language Translation: A High-Definition Benchmark Dataset and A Novel Baseline
by: Wang, Shiao, et al.
Published: (2024)
by: Wang, Shiao, et al.
Published: (2024)
An Empirical Study of Mamba-based Pedestrian Attribute Recognition
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
CheXPO: Preference Optimization for Chest X-ray VLMs with Counterfactual Rationale
by: Liang, Xiao, et al.
Published: (2025)
by: Liang, Xiao, et al.
Published: (2025)
RGB-Event based Pedestrian Attribute Recognition: A Benchmark Dataset and An Asymmetric RWKV Fusion Framework
by: Wang, Xiao, et al.
Published: (2025)
by: Wang, Xiao, et al.
Published: (2025)
Multi-modal Fusion based Q-distribution Prediction for Controlled Nuclear Fusion
by: Wang, Shiao, et al.
Published: (2024)
by: Wang, Shiao, et al.
Published: (2024)
EMRRG: Efficient Fine-Tuning Pre-trained X-ray Mamba Networks for Radiology Report Generation
by: Zhang, Mingzheng, et al.
Published: (2025)
by: Zhang, Mingzheng, et al.
Published: (2025)
RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models
by: Xian, Xiaoyu, et al.
Published: (2026)
by: Xian, Xiaoyu, et al.
Published: (2026)
Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
Long-Term Visual Object Tracking with Event Cameras: An Associative Memory Augmented Tracker and A Benchmark Dataset
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network
by: Ma, Qingchuan, et al.
Published: (2024)
by: Ma, Qingchuan, et al.
Published: (2024)
Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
by: Zuo, Yuxin, et al.
Published: (2025)
by: Zuo, Yuxin, et al.
Published: (2025)
Semantics Guided Disentangled GAN for Chest X-ray Image Rib Segmentation
by: Huang, Lili, et al.
Published: (2024)
by: Huang, Lili, et al.
Published: (2024)
Vehicle-centric Perception via Multimodal Structured Pre-training
by: Wu, Wentao, et al.
Published: (2025)
by: Wu, Wentao, et al.
Published: (2025)
State Space Model for New-Generation Network Alternative to Transformers: A Survey
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
CheXPO-v2: Preference Optimization for Chest X-ray VLMs with Knowledge Graph Consistency
by: Liang, Xiao, et al.
Published: (2025)
by: Liang, Xiao, et al.
Published: (2025)
CM3AE: A Unified RGB Frame and Event-Voxel/-Frame Pre-training Framework
by: Wu, Wentao, et al.
Published: (2025)
by: Wu, Wentao, et al.
Published: (2025)
T2I-VeRW: Part-level Fine-grained Perception for Text-to-Image Vehicle Retrieval
by: Wang, Xiao, et al.
Published: (2026)
by: Wang, Xiao, et al.
Published: (2026)
R2GenKG: Hierarchical Multi-modal Knowledge Graph for LLM-based Radiology Report Generation
by: Wang, Futian, et al.
Published: (2025)
by: Wang, Futian, et al.
Published: (2025)
The FIX Benchmark: Extracting Features Interpretable to eXperts
by: Jin, Helen, et al.
Published: (2024)
by: Jin, Helen, et al.
Published: (2024)
APTT: An accuracy-preserved tensor-train method for the Boltzmann-BGK equation
by: Zhu, Zhitao, et al.
Published: (2024)
by: Zhu, Zhitao, et al.
Published: (2024)
CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction
by: Zou, Jing, et al.
Published: (2025)
by: Zou, Jing, et al.
Published: (2025)
Multi-modal Vision Pre-training for Medical Image Analysis
by: Rui, Shaohao, et al.
Published: (2024)
by: Rui, Shaohao, et al.
Published: (2024)
MedRepBench: A Comprehensive Benchmark for Medical Report Interpretation
by: Shang, Fangxin, et al.
Published: (2025)
by: Shang, Fangxin, et al.
Published: (2025)
Grounded Knowledge-Enhanced Medical Vision-Language Pre-training for Chest X-Ray
by: Deng, Qiao, et al.
Published: (2024)
by: Deng, Qiao, et al.
Published: (2024)
MedFILIP: Medical Fine-grained Language-Image Pre-training
by: Liang, Xinjie, et al.
Published: (2025)
by: Liang, Xinjie, et al.
Published: (2025)
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
by: Chen, Jianlyu, et al.
Published: (2024)
by: Chen, Jianlyu, et al.
Published: (2024)
CheX-GPT: Harnessing Large Language Models for Enhanced Chest X-ray Report Labeling
by: Gu, Jawook, et al.
Published: (2024)
by: Gu, Jawook, et al.
Published: (2024)
Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition
by: Wang, Xiao, et al.
Published: (2023)
by: Wang, Xiao, et al.
Published: (2023)
GPT-generated Text Detection: Benchmark Dataset and Tensor-based Detection Method
by: Qazi, Zubair, et al.
Published: (2024)
by: Qazi, Zubair, et al.
Published: (2024)
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
by: Heiman, Alice, et al.
Published: (2024)
by: Heiman, Alice, et al.
Published: (2024)
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
by: Cheng, Junlong, et al.
Published: (2024)
by: Cheng, Junlong, et al.
Published: (2024)
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
by: Wang, Shumin, et al.
Published: (2025)
by: Wang, Shumin, et al.
Published: (2025)
Similar Items
-
R2GenCSR: Mining Contextual and Residual Information for LLMs-based Radiology Report Generation
by: Wang, Xiao, et al.
Published: (2024) -
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats
by: Chambon, Pierre, et al.
Published: (2024) -
Pre-training on High Definition X-ray Images: An Experimental Study
by: Wang, Xiao, et al.
Published: (2024) -
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation
by: Wang, Xiao, et al.
Published: (2025) -
Sign Language Translation using Frame and Event Stream: Benchmark Dataset and Algorithms
by: Wang, Xiao, et al.
Published: (2025)