| Main Authors: | Yang, Shu; Cai, Zhiyuan; Luo, Luyang; Ma, Ning; Xu, Shuchang; Chen, Hao |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.20083 |
Similar Items
Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition
by: Yang, Shu, et al.
Published: (2024)
SemiVT-Surge: Semi-Supervised Video Transformer for Surgical Phase Recognition
by: Li, Yiping, et al.
Published: (2025)
SurgX: Neuron-Concept Association for Explainable Surgical Phase Recognition
by: Kim, Ka Young, et al.
Published: (2025)
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning
by: Li, Jiajie, et al.
Published: (2024)
SurgLQA: Scalable Long-Horizon Surgical Video Question Answering
by: Guo, Diandian, et al.
Published: (2026)
Thoracic Surgery Video Analysis for Surgical Phase Recognition
by: Mateen, Syed Abdul, et al.
Published: (2024)
SurgMotion: A Video-Native Foundation Model for Universal Understanding of Surgical Videos
by: Wu, Jinlin, et al.
Published: (2026)
HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation
by: Biagini, Diego, et al.
Published: (2025)
SurgOnAir: Hierarchy-Aware Real-Time Surgical Video Commentary
by: He, Jingyi, et al.
Published: (2026)
SurgFed: Language-guided Multi-Task Federated Learning for Surgical Video Understanding
by: Fang, Zheng, et al.
Published: (2026)
SurgPLAN++: Universal Surgical Phase Localization Network for Online and Offline Inference
by: Chen, Zhen, et al.
Published: (2024)
Intuitive Surgical SurgToolLoc and SurgVU Challenges Results: 2022-2025
by: Zia, Aneeq, et al.
Published: (2023)
Surgical Visual Understanding (SurgVU) Dataset
by: Zia, Aneeq, et al.
Published: (2025)
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis
by: Wei, Jianhui, et al.
Published: (2025)
LoViT: Long Video Transformer for Surgical Phase Recognition
by: Liu, Yang, et al.
Published: (2023)
Efficient Surgical Tool Recognition via HMM-Stabilized Deep Learning
by: Wang, Haifeng, et al.
Published: (2024)
Analysis of Transferability Estimation Metrics for Surgical Phase Recognition
by: Singh, Prabhant, et al.
Published: (2025)
Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin Representation
by: Ding, Hao, et al.
Published: (2024)
Stabilizing Temporal Inference Dynamics for Online Surgical Phase Recognition
by: Liu, Yang, et al.
Published: (2026)
MoSFormer: Augmenting Temporal Context with Memory of Surgery for Surgical Phase Recognition
by: Ding, Hao, et al.
Published: (2025)
SurgSora: Object-Aware Diffusion Model for Controllable Surgical Video Generation
by: Chen, Tong, et al.
Published: (2024)
Robust Surgical Phase Recognition From Annotation Efficient Supervision
by: Rubin, Or, et al.
Published: (2024)
SurgViVQA: Temporally-Grounded Video Question Answering for Surgical Scene Understanding
by: Drago, Mauro Orazio, et al.
Published: (2025)
SurgCheck: Do Vision-Language Models Really Look at Images in Surgical VQA?
by: Shin, Jongmin, et al.
Published: (2026)
SurgCoT: Advancing Spatiotemporal Reasoning in Surgical Videos through a Chain-of-Thought Benchmark
by: Wang, Gui, et al.
Published: (2026)
SurgRIPE challenge: Benchmark of Surgical Robot Instrument Pose Estimation
by: Xu, Haozheng, et al.
Published: (2025)
SurgLLM: A Versatile Large Multimodal Model with Spatial Focus and Temporal Awareness for Surgical Video Understanding
by: Chen, Zhen, et al.
Published: (2025)
Neural Finite-State Machines for Surgical Phase Recognition
by: Ding, Hao, et al.
Published: (2024)
SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis
by: Xu, Jiahao, et al.
Published: (2025)
Video Dataset for Surgical Phase, Keypoint, and Instrument Recognition in Laparoscopic Surgery (PhaKIR)
by: Rueckert, Tobias, et al.
Published: (2025)
SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model
by: Wang, Guankun, et al.
Published: (2025)
SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning
by: Boels, Maxence, et al.
Published: (2024)
SurfSurg6D: Geometry Consistent Dense Correspondence for Textureless Surgical Instrument Pose Estimation
by: Shen, Daiyun, et al.
Published: (2026)
SurgLaVi: Large-Scale Hierarchical Dataset for Surgical Vision-Language Representation Learning
by: Perez, Alejandra, et al.
Published: (2025)
Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling
by: He, Yufan, et al.
Published: (2025)
SurgTEMP: Temporal-Aware Surgical Video Question Answering with Text-guided Visual Memory for Laparoscopic Cholecystectomy
by: Li, Shi, et al.
Published: (2026)
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking
by: Liu, Haofeng, et al.
Published: (2025)
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition
by: Funke, Isabel, et al.
Published: (2023)
SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba
by: Zhang, Xiangning, et al.
Published: (2024)
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition
by: Yuan, Kun, et al.
Published: (2024)