:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Singh, Nikita, Balian, Rob, Martinelli, Lukas
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.12875
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
by: Chen, Ying, et al.
Published: (2024)

AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval
by: Maniyar, Suyash, et al.
Published: (2025)

TRINS: Towards Multimodal Language Models that Can Read
by: Zhang, Ruiyi, et al.
Published: (2024)

Transcriptomics-guided Slide Representation Learning in Computational Pathology
by: Jaume, Guillaume, et al.
Published: (2024)

Your AI-Generated Image Detector Can Secretly Achieve SOTA Accuracy, If Calibrated
by: Yang, Muli, et al.
Published: (2026)

DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
by: Jiao, Qirui, et al.
Published: (2025)

A Hybrid Machine Learning Model for Cerebral Palsy Detection
by: Singh, Karan Kumar, et al.
Published: (2026)

Your One-Stop Solution for AI-Generated Video Detection
by: Ma, Long, et al.
Published: (2026)

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design
by: Tang, Wenxin, et al.
Published: (2025)

Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
by: Guo, Xuyang, et al.
Published: (2025)

Script-to-Slide Grounding: Grounding Script Sentences to Slide Objects for Automatic Instructional Video Generation
by: Suzuki, Rena, et al.
Published: (2026)

Can I Trust Your Answer? Visually Grounded Video Question Answering
by: Xiao, Junbin, et al.
Published: (2023)

Efficient AI-Driven Multi-Section Whole Slide Image Analysis for Biochemical Recurrence Prediction in Prostate Cancer
by: Cho, Yesung, et al.
Published: (2026)

ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
by: Jia, Chengyou, et al.
Published: (2024)

Is My Data in Your AI? Membership Inference Test (MINT) applied to Face Biometrics
by: DeAlcala, Daniel, et al.
Published: (2024)

Can ChatGPT Perform Image Splicing Detection? A Preliminary Study
by: Nath, Souradip
Published: (2025)

TextCraftor: Your Text Encoder Can be Image Quality Controller
by: Li, Yanyu, et al.
Published: (2024)

Can Your Generative Model Detect Out-of-Distribution Covariate Shift?
by: Viviers, Christiaan, et al.
Published: (2024)

PathNavigate: A Training-Free Pathology Agent with Surprise-Guided Scan and Shared Slide Memory for Whole-Slide Image VQA
by: Yang, Chunze, et al.
Published: (2026)

Can AI Assistance Aid in the Grading of Handwritten Answer Sheets?
by: Sil, Pritam, et al.
Published: (2024)

Spatial Blindness in Whole-Slide Multiple Instance Learning
by: Li, Xiangyu, et al.
Published: (2026)

Hypergraph Mamba for Efficient Whole Slide Image Understanding
by: Lu, Jiaxuan, et al.
Published: (2025)

A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images
by: Umer, Rao Muhammad, et al.
Published: (2025)

AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid Nodules: A ChatGPT-Style Assistant
by: Yao, Jincao, et al.
Published: (2024)

SWAT: Sliding Window Adversarial Training for Gradual Domain Adaptation
by: Wang, Zixi, et al.
Published: (2025)

DesignLab: Designing Slides Through Iterative Detection and Correction
by: Yun, Jooyeol, et al.
Published: (2025)

Generating Narrated Lecture Videos from Slides with Synchronized Highlights
by: Holmberg, Alexander
Published: (2025)

Can ChatGPT Learn My Life From a Week of First-Person Video?
by: Harris, Keegan
Published: (2025)

LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
by: Zhang, Ruiyi, et al.
Published: (2024)

Beyond the First Read: AI-Assisted Perceptual Error Detection in Chest Radiography Accounting for Interobserver Variability
by: Vutukuri, Adhrith, et al.
Published: (2025)

Is ChatGPT-5 Ready for Mammogram VQA?
by: Li, Qiang, et al.
Published: (2025)

PATHS: A Hierarchical Transformer for Efficient Whole Slide Image Analysis
by: Buzzard, Zak, et al.
Published: (2024)

TAKT: Target-Aware Knowledge Transfer for Whole Slide Image Classification
by: Xiong, Conghao, et al.
Published: (2023)

Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?
by: Malekzadeh, Milad, et al.
Published: (2025)

GPTDrawer: Enhancing Visual Synthesis through ChatGPT
by: Li, Kun, et al.
Published: (2024)

Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation
by: Carretero, Ilán, et al.
Published: (2024)

Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning
by: Afonso, Martim, et al.
Published: (2024)

Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis
by: Ling, Xitong, et al.
Published: (2024)

WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering
by: Chen, Pingyi, et al.
Published: (2024)

PVChat: Personalized Video Chat with One-Shot Learning
by: Shi, Yufei, et al.
Published: (2025)