| Main Authors: | Singh, Nikita, Balian, Rob, Martinelli, Lukas |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.12875 |
Similar Items
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
by: Chen, Ying, et al.
Published: (2024)
AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval
by: Maniyar, Suyash, et al.
Published: (2025)
TRINS: Towards Multimodal Language Models that Can Read
by: Zhang, Ruiyi, et al.
Published: (2024)
Transcriptomics-guided Slide Representation Learning in Computational Pathology
by: Jaume, Guillaume, et al.
Published: (2024)
Your AI-Generated Image Detector Can Secretly Achieve SOTA Accuracy, If Calibrated
by: Yang, Muli, et al.
Published: (2026)
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
by: Jiao, Qirui, et al.
Published: (2025)
A Hybrid Machine Learning Model for Cerebral Palsy Detection
by: Singh, Karan Kumar, et al.
Published: (2026)
Your One-Stop Solution for AI-Generated Video Detection
by: Ma, Long, et al.
Published: (2026)
SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design
by: Tang, Wenxin, et al.
Published: (2025)
Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
by: Guo, Xuyang, et al.
Published: (2025)
Script-to-Slide Grounding: Grounding Script Sentences to Slide Objects for Automatic Instructional Video Generation
by: Suzuki, Rena, et al.
Published: (2026)
Can I Trust Your Answer? Visually Grounded Video Question Answering
by: Xiao, Junbin, et al.
Published: (2023)
Efficient AI-Driven Multi-Section Whole Slide Image Analysis for Biochemical Recurrence Prediction in Prostate Cancer
by: Cho, Yesung, et al.
Published: (2026)
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
by: Jia, Chengyou, et al.
Published: (2024)
Is My Data in Your AI? Membership Inference Test (MINT) applied to Face Biometrics
by: DeAlcala, Daniel, et al.
Published: (2024)
Can ChatGPT Perform Image Splicing Detection? A Preliminary Study
by: Nath, Souradip
Published: (2025)
TextCraftor: Your Text Encoder Can be Image Quality Controller
by: Li, Yanyu, et al.
Published: (2024)
Can Your Generative Model Detect Out-of-Distribution Covariate Shift?
by: Viviers, Christiaan, et al.
Published: (2024)
PathNavigate: A Training-Free Pathology Agent with Surprise-Guided Scan and Shared Slide Memory for Whole-Slide Image VQA
by: Yang, Chunze, et al.
Published: (2026)
Can AI Assistance Aid in the Grading of Handwritten Answer Sheets?
by: Sil, Pritam, et al.
Published: (2024)
Spatial Blindness in Whole-Slide Multiple Instance Learning
by: Li, Xiangyu, et al.
Published: (2026)
Hypergraph Mamba for Efficient Whole Slide Image Understanding
by: Lu, Jiaxuan, et al.
Published: (2025)
A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images
by: Umer, Rao Muhammad, et al.
Published: (2025)
AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid Nodules: A ChatGPT-Style Assistant
by: Yao, Jincao, et al.
Published: (2024)
SWAT: Sliding Window Adversarial Training for Gradual Domain Adaptation
by: Wang, Zixi, et al.
Published: (2025)
DesignLab: Designing Slides Through Iterative Detection and Correction
by: Yun, Jooyeol, et al.
Published: (2025)
Generating Narrated Lecture Videos from Slides with Synchronized Highlights
by: Holmberg, Alexander
Published: (2025)
Can ChatGPT Learn My Life From a Week of First-Person Video?
by: Harris, Keegan
Published: (2025)
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
by: Zhang, Ruiyi, et al.
Published: (2024)
Beyond the First Read: AI-Assisted Perceptual Error Detection in Chest Radiography Accounting for Interobserver Variability
by: Vutukuri, Adhrith, et al.
Published: (2025)
Is ChatGPT-5 Ready for Mammogram VQA?
by: Li, Qiang, et al.
Published: (2025)
PATHS: A Hierarchical Transformer for Efficient Whole Slide Image Analysis
by: Buzzard, Zak, et al.
Published: (2024)
TAKT: Target-Aware Knowledge Transfer for Whole Slide Image Classification
by: Xiong, Conghao, et al.
Published: (2023)
Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?
by: Malekzadeh, Milad, et al.
Published: (2025)
GPTDrawer: Enhancing Visual Synthesis through ChatGPT
by: Li, Kun, et al.
Published: (2024)
Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation
by: Carretero, Ilán, et al.
Published: (2024)
Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning
by: Afonso, Martim, et al.
Published: (2024)
Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis
by: Ling, Xitong, et al.
Published: (2024)
WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering
by: Chen, Pingyi, et al.
Published: (2024)
PVChat: Personalized Video Chat with One-Shot Learning
by: Shi, Yufei, et al.
Published: (2025)