Saved in:
| Main Authors: | Alapatt, Deepak, Murali, Aditya, Srivastav, Vinkle, Mascagni, Pietro, Consortium, AI4SafeChole, Padoy, Nicolas |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2312.05968 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer
by: Satyanaik, Siddhant, et al.
Published: (2024)
by: Satyanaik, Siddhant, et al.
Published: (2024)
Endoshare: A Publicly Available, Surgeons-Friendly Solution to De-Identify and Manage Surgical Videos
by: Arboit, Lorenzo, et al.
Published: (2025)
by: Arboit, Lorenzo, et al.
Published: (2025)
Adaptation of Multi-modal Representation Models for Multi-task Surgical Computer Vision
by: Walimbe, Soham, et al.
Published: (2025)
by: Walimbe, Soham, et al.
Published: (2025)
The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark
by: Murali, Aditya, et al.
Published: (2023)
by: Murali, Aditya, et al.
Published: (2023)
CycleSAM: Few-Shot Surgical Scene Segmentation with Cycle- and Scene-Consistent Feature Matching
by: Murali, Aditya, et al.
Published: (2024)
by: Murali, Aditya, et al.
Published: (2024)
SurgTEMP: Temporal-Aware Surgical Video Question Answering with Text-guided Visual Memory for Laparoscopic Cholecystectomy
by: Li, Shi, et al.
Published: (2026)
by: Li, Shi, et al.
Published: (2026)
Multi-modal Representations for Fine-grained Multi-label Critical View of Safety Recognition
by: Baby, Britty, et al.
Published: (2025)
by: Baby, Britty, et al.
Published: (2025)
Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures
by: Yuan, Kun, et al.
Published: (2023)
by: Yuan, Kun, et al.
Published: (2023)
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation
by: Srivastav, Vinkle, et al.
Published: (2024)
by: Srivastav, Vinkle, et al.
Published: (2024)
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
by: Yuan, Kun, et al.
Published: (2024)
by: Yuan, Kun, et al.
Published: (2024)
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition
by: Yuan, Kun, et al.
Published: (2024)
by: Yuan, Kun, et al.
Published: (2024)
CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition
by: Stilz, Florian, et al.
Published: (2026)
by: Stilz, Florian, et al.
Published: (2026)
Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis
by: Chen, Tingxuan, et al.
Published: (2025)
by: Chen, Tingxuan, et al.
Published: (2025)
Learning from Synchronization: Self-Supervised Uncalibrated Multi-View Person Association in Challenging Scenes
by: Chen, Keqi, et al.
Published: (2025)
by: Chen, Keqi, et al.
Published: (2025)
End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data
by: Zarin, Farahdiba, et al.
Published: (2025)
by: Zarin, Farahdiba, et al.
Published: (2025)
Overcoming Dimensional Collapse in Self-supervised Contrastive Learning for Medical Image Segmentation
by: Hassanpour, Jamshid, et al.
Published: (2024)
by: Hassanpour, Jamshid, et al.
Published: (2024)
Multi-view Video-Pose Pretraining for Operating Room Surgical Activity Recognition
by: Hamoud, Idris, et al.
Published: (2025)
by: Hamoud, Idris, et al.
Published: (2025)
Advancing Surgical VQA with Scene Graph Knowledge
by: Yuan, Kun, et al.
Published: (2023)
by: Yuan, Kun, et al.
Published: (2023)
Learning from Sparse Point Labels for Dense Carcinosis Localization in Advanced Ovarian Cancer Assessment
by: Zarin, Farahdiba, et al.
Published: (2025)
by: Zarin, Farahdiba, et al.
Published: (2025)
Self-Supervised Uncalibrated Multi-View Video Anonymization in the Operating Room
by: Chen, Keqi, et al.
Published: (2026)
by: Chen, Keqi, et al.
Published: (2026)
A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound Simulation
by: Srivastav, Vinkle, et al.
Published: (2025)
by: Srivastav, Vinkle, et al.
Published: (2025)
Surgical Text-to-Image Generation
by: Nwoye, Chinedu Innocent, et al.
Published: (2024)
by: Nwoye, Chinedu Innocent, et al.
Published: (2024)
Where are they looking in the operating room?
by: Chen, Keqi, et al.
Published: (2026)
by: Chen, Keqi, et al.
Published: (2026)
When do they StOP?: A First Step Towards Automatically Identifying Team Communication in the Operating Room
by: Chen, Keqi, et al.
Published: (2025)
by: Chen, Keqi, et al.
Published: (2025)
Artificial Intelligence for the Assessment of Peritoneal Carcinosis during Diagnostic Laparoscopy for Advanced Ovarian Cancer
by: Oliva, Riccardo, et al.
Published: (2025)
by: Oliva, Riccardo, et al.
Published: (2025)
Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement
by: Yuan, Kun, et al.
Published: (2025)
by: Yuan, Kun, et al.
Published: (2025)
fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models
by: Sharma, Saurav, et al.
Published: (2025)
by: Sharma, Saurav, et al.
Published: (2025)
SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation
by: Bhat, Aditya, et al.
Published: (2025)
by: Bhat, Aditya, et al.
Published: (2025)
UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation Datasets
by: Meyer, Adrien, et al.
Published: (2024)
by: Meyer, Adrien, et al.
Published: (2024)
SurgiTrack: Fine-Grained Multi-Class Multi-Tool Tracking in Surgical Videos
by: Nwoye, Chinedu Innocent, et al.
Published: (2024)
by: Nwoye, Chinedu Innocent, et al.
Published: (2024)
Early Operative Difficulty Assessment in Laparoscopic Cholecystectomy via Snapshot-Centric Video Analysis
by: Sharma, Saurav, et al.
Published: (2025)
by: Sharma, Saurav, et al.
Published: (2025)
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
by: Hu, Ming, et al.
Published: (2024)
by: Hu, Ming, et al.
Published: (2024)
CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation
by: Bose, Rupak, et al.
Published: (2025)
by: Bose, Rupak, et al.
Published: (2025)
Surgeons Awareness, Expectations, and Involvement with Artificial Intelligence: a Survey Pre and Post the GPT Era
by: Arboit, Lorenzo, et al.
Published: (2025)
by: Arboit, Lorenzo, et al.
Published: (2025)
DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging
by: Meyer, Adrien, et al.
Published: (2026)
by: Meyer, Adrien, et al.
Published: (2026)
Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model
by: Wang, Diwei, et al.
Published: (2024)
by: Wang, Diwei, et al.
Published: (2024)
CholecTrack20: A Multi-Perspective Tracking Dataset for Surgical Tools
by: Nwoye, Chinedu Innocent, et al.
Published: (2023)
by: Nwoye, Chinedu Innocent, et al.
Published: (2023)
State-Change Learning for Prediction of Future Events in Endoscopic Videos
by: Sharma, Saurav, et al.
Published: (2025)
by: Sharma, Saurav, et al.
Published: (2025)
On-the-Fly Point Annotation for Fast Medical Video Labeling
by: Adrien, Meyer, et al.
Published: (2024)
by: Adrien, Meyer, et al.
Published: (2024)
Information Extraction from Unstructured data using Augmented-AI and Computer Vision
by: Parikh, Aditya
Published: (2023)
by: Parikh, Aditya
Published: (2023)
Similar Items
-
Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer
by: Satyanaik, Siddhant, et al.
Published: (2024) -
Endoshare: A Publicly Available, Surgeons-Friendly Solution to De-Identify and Manage Surgical Videos
by: Arboit, Lorenzo, et al.
Published: (2025) -
Adaptation of Multi-modal Representation Models for Multi-task Surgical Computer Vision
by: Walimbe, Soham, et al.
Published: (2025) -
The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark
by: Murali, Aditya, et al.
Published: (2023) -
CycleSAM: Few-Shot Surgical Scene Segmentation with Cycle- and Scene-Consistent Feature Matching
by: Murali, Aditya, et al.
Published: (2024)