Saved in:
| Main Author: | Parikh, Aditya |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2312.09880 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Information Extraction from Unstructured data using Augmented-AI and Computer Vision
by: Aditya N. Parikh
Published: (2022)
by: Aditya N. Parikh
Published: (2022)
Fair Lung Disease Diagnosis from Chest CT via Gender-Adversarial Attention Multiple Instance Learning
by: Parikh, Aditya, et al.
Published: (2026)
by: Parikh, Aditya, et al.
Published: (2026)
Colour Extraction Pipeline for Odonates using Computer Vision
by: Rajaraman, Megan Mirnalini Sundaram, et al.
Published: (2026)
by: Rajaraman, Megan Mirnalini Sundaram, et al.
Published: (2026)
Who Does Your Algorithm Fail? Investigating Age and Ethnic Bias in the MAMA-MIA Dataset
by: Parikh, Aditya, et al.
Published: (2025)
by: Parikh, Aditya, et al.
Published: (2025)
Jumpstarting Surgical Computer Vision
by: Alapatt, Deepak, et al.
Published: (2023)
by: Alapatt, Deepak, et al.
Published: (2023)
IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic
by: Parikh, Chirag, et al.
Published: (2024)
by: Parikh, Chirag, et al.
Published: (2024)
ImageHD: Energy-Efficient On-Device Continual Learning of Visual Representations via Hyperdimensional Computing
by: Arockiaraj, Jebacyril, et al.
Published: (2026)
by: Arockiaraj, Jebacyril, et al.
Published: (2026)
Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision
by: Krishnan, Aditya, et al.
Published: (2024)
by: Krishnan, Aditya, et al.
Published: (2024)
Organizing Unstructured Image Collections using Natural Language
by: Liu, Mingxuan, et al.
Published: (2024)
by: Liu, Mingxuan, et al.
Published: (2024)
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
by: Liu, Haoyang, et al.
Published: (2024)
by: Liu, Haoyang, et al.
Published: (2024)
Towards Fairness under Label Bias in Image Segmentation: Impact, Measurement and Mitigation
by: Parikh, Aditya, et al.
Published: (2026)
by: Parikh, Aditya, et al.
Published: (2026)
ConsensusDrop: Fusing Visual and Cross-Modal Saliency for Efficient Vision Language Models
by: Parikh, Dhruv, et al.
Published: (2026)
by: Parikh, Dhruv, et al.
Published: (2026)
A Hierarchical Computer Vision Pipeline for Physiological Data Extraction from Bedside Monitors
by: Chau, Vinh, et al.
Published: (2025)
by: Chau, Vinh, et al.
Published: (2025)
STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds
by: Li, Zikuan, et al.
Published: (2025)
by: Li, Zikuan, et al.
Published: (2025)
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents
by: Pala, Furkan, et al.
Published: (2024)
by: Pala, Furkan, et al.
Published: (2024)
Investigating Label Bias and Representational Sources of Age-Related Disparities in Medical Segmentation
by: Parikh, Aditya, et al.
Published: (2025)
by: Parikh, Aditya, et al.
Published: (2025)
Kornia-rs: A Low-Level 3D Computer Vision Library In Rust
by: Riba, Edgar, et al.
Published: (2025)
by: Riba, Edgar, et al.
Published: (2025)
Primitive-Driven Acceleration of Hyperdimensional Computing for Real-Time Image Classification
by: Parikh, Dhruv, et al.
Published: (2026)
by: Parikh, Dhruv, et al.
Published: (2026)
Can Graphs Help Vision SSMs See Better?
by: Parikh, Dhruv, et al.
Published: (2026)
by: Parikh, Dhruv, et al.
Published: (2026)
GraphLeap: Decoupling Graph Construction and Convolution for Vision GNN Acceleration on FPGA
by: Ramachandran, Anvitha, et al.
Published: (2026)
by: Ramachandran, Anvitha, et al.
Published: (2026)
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
by: Rai, Aashish, et al.
Published: (2025)
by: Rai, Aashish, et al.
Published: (2025)
Learning Eigenstructures of Unstructured Data Manifolds
by: Velich, Roy, et al.
Published: (2025)
by: Velich, Roy, et al.
Published: (2025)
Target Prompting for Information Extraction with Vision Language Model
by: Medhi, Dipankar
Published: (2024)
by: Medhi, Dipankar
Published: (2024)
LARE: Latent Augmentation using Regional Embedding with Vision-Language Model
by: Sakurai, Kosuke, et al.
Published: (2024)
by: Sakurai, Kosuke, et al.
Published: (2024)
Visual-Inertial SLAM for Unstructured Outdoor Environments: Benchmarking the Benefits and Computational Costs of Loop Closing
by: Schmidt, Fabian, et al.
Published: (2024)
by: Schmidt, Fabian, et al.
Published: (2024)
Segmenting Wood Rot using Computer Vision Models
by: Kammerbauer, Roland, et al.
Published: (2024)
by: Kammerbauer, Roland, et al.
Published: (2024)
Match-and-Fuse: Consistent Generation from Unstructured Image Sets
by: Feingold, Kate, et al.
Published: (2025)
by: Feingold, Kate, et al.
Published: (2025)
PAV: Personalized Head Avatar from Unstructured Video Collection
by: Caliskan, Akin, et al.
Published: (2024)
by: Caliskan, Akin, et al.
Published: (2024)
Computer Vision and Deep Learning for 4D Augmented Reality
by: Shivashankar, Karthik
Published: (2025)
by: Shivashankar, Karthik
Published: (2025)
See then Tell: Enhancing Key Information Extraction with Vision Grounding
by: Liu, Shuhang, et al.
Published: (2024)
by: Liu, Shuhang, et al.
Published: (2024)
From Structured to Unstructured:A Comparative Analysis of Computer Vision and Graph Models in solving Mesh-based PDEs
by: Decke, Jens, et al.
Published: (2024)
by: Decke, Jens, et al.
Published: (2024)
xAI-CV: An Overview of Explainable Artificial Intelligence in Computer Vision
by: Van Tu, Nguyen, et al.
Published: (2025)
by: Van Tu, Nguyen, et al.
Published: (2025)
RAVEN: Multitask Retrieval Augmented Vision-Language Learning
by: Rao, Varun Nagaraj, et al.
Published: (2024)
by: Rao, Varun Nagaraj, et al.
Published: (2024)
CM1 -- A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language Models
by: Wolf, Fabian, et al.
Published: (2025)
by: Wolf, Fabian, et al.
Published: (2025)
Enhancing Representation in Medical Vision-Language Foundation Models via Multi-Scale Information Extraction Techniques
by: Huang, Weijian, et al.
Published: (2024)
by: Huang, Weijian, et al.
Published: (2024)
Digitization of Document and Information Extraction using OCR
by: Sinha, Rasha, et al.
Published: (2025)
by: Sinha, Rasha, et al.
Published: (2025)
Scaling Vision-and-Language Navigation With Offline RL
by: Bundele, Valay, et al.
Published: (2024)
by: Bundele, Valay, et al.
Published: (2024)
Computational Imaging for Enhanced Computer Vision
by: Shaikh, Humera, et al.
Published: (2025)
by: Shaikh, Humera, et al.
Published: (2025)
Data Augmentation in Human-Centric Vision
by: Jiang, Wentao, et al.
Published: (2024)
by: Jiang, Wentao, et al.
Published: (2024)
Survey on Datasets for Perception in Unstructured Outdoor Environments
by: Mortimer, Peter, et al.
Published: (2024)
by: Mortimer, Peter, et al.
Published: (2024)
Similar Items
-
Information Extraction from Unstructured data using Augmented-AI and Computer Vision
by: Aditya N. Parikh
Published: (2022) -
Fair Lung Disease Diagnosis from Chest CT via Gender-Adversarial Attention Multiple Instance Learning
by: Parikh, Aditya, et al.
Published: (2026) -
Colour Extraction Pipeline for Odonates using Computer Vision
by: Rajaraman, Megan Mirnalini Sundaram, et al.
Published: (2026) -
Who Does Your Algorithm Fail? Investigating Age and Ethnic Bias in the MAMA-MIA Dataset
by: Parikh, Aditya, et al.
Published: (2025) -
Jumpstarting Surgical Computer Vision
by: Alapatt, Deepak, et al.
Published: (2023)