:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Parikh, Aditya
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2312.09880
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Information Extraction from Unstructured data using Augmented-AI and Computer Vision
by: Aditya N. Parikh
Published: (2022)

Fair Lung Disease Diagnosis from Chest CT via Gender-Adversarial Attention Multiple Instance Learning
by: Parikh, Aditya, et al.
Published: (2026)

Colour Extraction Pipeline for Odonates using Computer Vision
by: Rajaraman, Megan Mirnalini Sundaram, et al.
Published: (2026)

Who Does Your Algorithm Fail? Investigating Age and Ethnic Bias in the MAMA-MIA Dataset
by: Parikh, Aditya, et al.
Published: (2025)

Jumpstarting Surgical Computer Vision
by: Alapatt, Deepak, et al.
Published: (2023)

IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic
by: Parikh, Chirag, et al.
Published: (2024)

ImageHD: Energy-Efficient On-Device Continual Learning of Visual Representations via Hyperdimensional Computing
by: Arockiaraj, Jebacyril, et al.
Published: (2026)

Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision
by: Krishnan, Aditya, et al.
Published: (2024)

Organizing Unstructured Image Collections using Natural Language
by: Liu, Mingxuan, et al.
Published: (2024)

Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
by: Liu, Haoyang, et al.
Published: (2024)

Towards Fairness under Label Bias in Image Segmentation: Impact, Measurement and Mitigation
by: Parikh, Aditya, et al.
Published: (2026)

ConsensusDrop: Fusing Visual and Cross-Modal Saliency for Efficient Vision Language Models
by: Parikh, Dhruv, et al.
Published: (2026)

A Hierarchical Computer Vision Pipeline for Physiological Data Extraction from Bedside Monitors
by: Chau, Vinh, et al.
Published: (2025)

STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds
by: Li, Zikuan, et al.
Published: (2025)

ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents
by: Pala, Furkan, et al.
Published: (2024)

Investigating Label Bias and Representational Sources of Age-Related Disparities in Medical Segmentation
by: Parikh, Aditya, et al.
Published: (2025)

Kornia-rs: A Low-Level 3D Computer Vision Library In Rust
by: Riba, Edgar, et al.
Published: (2025)

Primitive-Driven Acceleration of Hyperdimensional Computing for Real-Time Image Classification
by: Parikh, Dhruv, et al.
Published: (2026)

Can Graphs Help Vision SSMs See Better?
by: Parikh, Dhruv, et al.
Published: (2026)

GraphLeap: Decoupling Graph Construction and Convolution for Vision GNN Acceleration on FPGA
by: Ramachandran, Anvitha, et al.
Published: (2026)

UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
by: Rai, Aashish, et al.
Published: (2025)

Learning Eigenstructures of Unstructured Data Manifolds
by: Velich, Roy, et al.
Published: (2025)

Target Prompting for Information Extraction with Vision Language Model
by: Medhi, Dipankar
Published: (2024)

LARE: Latent Augmentation using Regional Embedding with Vision-Language Model
by: Sakurai, Kosuke, et al.
Published: (2024)

Visual-Inertial SLAM for Unstructured Outdoor Environments: Benchmarking the Benefits and Computational Costs of Loop Closing
by: Schmidt, Fabian, et al.
Published: (2024)

Segmenting Wood Rot using Computer Vision Models
by: Kammerbauer, Roland, et al.
Published: (2024)

Match-and-Fuse: Consistent Generation from Unstructured Image Sets
by: Feingold, Kate, et al.
Published: (2025)

PAV: Personalized Head Avatar from Unstructured Video Collection
by: Caliskan, Akin, et al.
Published: (2024)

Computer Vision and Deep Learning for 4D Augmented Reality
by: Shivashankar, Karthik
Published: (2025)

See then Tell: Enhancing Key Information Extraction with Vision Grounding
by: Liu, Shuhang, et al.
Published: (2024)

From Structured to Unstructured:A Comparative Analysis of Computer Vision and Graph Models in solving Mesh-based PDEs
by: Decke, Jens, et al.
Published: (2024)

xAI-CV: An Overview of Explainable Artificial Intelligence in Computer Vision
by: Van Tu, Nguyen, et al.
Published: (2025)

RAVEN: Multitask Retrieval Augmented Vision-Language Learning
by: Rao, Varun Nagaraj, et al.
Published: (2024)

CM1 -- A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language Models
by: Wolf, Fabian, et al.
Published: (2025)

Enhancing Representation in Medical Vision-Language Foundation Models via Multi-Scale Information Extraction Techniques
by: Huang, Weijian, et al.
Published: (2024)

Digitization of Document and Information Extraction using OCR
by: Sinha, Rasha, et al.
Published: (2025)

Scaling Vision-and-Language Navigation With Offline RL
by: Bundele, Valay, et al.
Published: (2024)

Computational Imaging for Enhanced Computer Vision
by: Shaikh, Humera, et al.
Published: (2025)

Data Augmentation in Human-Centric Vision
by: Jiang, Wentao, et al.
Published: (2024)

Survey on Datasets for Perception in Unstructured Outdoor Environments
by: Mortimer, Peter, et al.
Published: (2024)