Library Catalog

Bibliographic Details
Main Authors:	Kumar, Abhay; Jain, Nishant; Tripathi, Suraj; Singh, Chirag; Krishna, Kamal
Format:	Preprint
Published:	2019
Subjects:	Machine Learning; Artificial Intelligence; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/1908.08652

Similar Items

Curriculum for Crowd Counting -- Is it Worthy?
by: Khan, Muhammad Asif, et al.
Published: (2024)

Examining Common Paradigms in Multi-Task Learning
by: Elich, Cathrin, et al.
Published: (2023)

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024)

Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation
by: Landgraf, Steven, et al.
Published: (2024)

Bound Tightening Network for Robust Crowd Counting
by: Wu, Qiming
Published: (2024)

Generation of Indian Sign Language Letters, Numbers, and Words
by: Yadav, Ajeet Kumar, et al.
Published: (2025)

A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation
by: Kamal, Minhas, et al.
Published: (2026)

Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition
by: Tripathi, Suraj, et al.
Published: (2019)

A Comparative Study on Multi-task Uncertainty Quantification in Semantic Segmentation and Monocular Depth Estimation
by: Landgraf, Steven, et al.
Published: (2024)

Multimodal Crowd Counting with Pix2Pix GANs
by: Khan, Muhammad Asif, et al.
Published: (2024)

RCCFormer: A Robust Crowd Counting Network Based on Transformer
by: Liu, Peng, et al.
Published: (2025)

Real-Time Crowd Counting for Embedded Systems with Lightweight Architecture
by: Zhao, Zhiyuan, et al.
Published: (2025)

Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting
by: Anisha, Alabi Mehzabin, et al.
Published: (2026)

Mechanisms of Non-Monotonic Scaling in Vision Transformers
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)

Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)

Enhancing Multi-task Learning Capability of Medical Generalist Foundation Model via Image-centric Multi-annotation Data
by: Zhu, Xun, et al.
Published: (2025)

AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models
by: Kveton, Branislav, et al.
Published: (2026)

Bright 4B: Scaling Hyperspherical Learning for Segmentation in 3D Brightfield Microscopy
by: Khan, Amil, et al.
Published: (2025)

Enhancing Pedestrian Trajectory Prediction with Crowd Trip Information
by: Tamaru, Rei, et al.
Published: (2024)

PALADIN : Robust Neural Fingerprinting for Text-to-Image Diffusion Models
by: L, Murthy, et al.
Published: (2025)

CountCLIP -- [Re] Teaching CLIP to Count to Ten
by: Mestha, Harshvardhan, et al.
Published: (2024)

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
by: Zhang, Jianrui, et al.
Published: (2026)

VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
by: Lin, Han, et al.
Published: (2023)

Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
by: Miao, Yanting, et al.
Published: (2024)

Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
by: Lin, Han, et al.
Published: (2024)

Advanced Gesture Recognition for Autism Spectrum Disorder Detection: Integrating YOLOv7, Video Augmentation, and VideoMAE for Naturalistic Video Analysis
by: Singh, Amit Kumar, et al.
Published: (2024)

StruMPL: Multi-task Dense Regression under Disjoint Partial Supervision and MNAR Labels
by: Asiyabi, Reza M., et al.
Published: (2026)

Multi-task learning on partially labeled datasets via invariant/equivariant semi-supervised learning
by: Rabadán, Miquel Martí i, et al.
Published: (2026)

Genie 4D: Semantic-Prior-Guided 4D Dynamic Scene Reconstruction
by: Yang, Yiru, et al.
Published: (2026)

TCFormer: A 5M-Parameter Transformer with Density-Guided Aggregation for Weakly-Supervised Crowd Counting
by: Guo, Qiang, et al.
Published: (2025)

Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting
by: Yang, Maochen, et al.
Published: (2025)

Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
by: Chopra, Shivang, et al.
Published: (2024)

AsyCo: An Asymmetric Dual-task Co-training Model for Partial-label Learning
by: Li, Beibei, et al.
Published: (2024)

HyperGALE: ASD Classification via Hypergraph Gated Attention with Learnable Hyperedges
by: Arora, Mehul, et al.
Published: (2024)

Improve Academic Query Resolution through BERT-based Question Extraction from Images
by: Kamal, Nidhi, et al.
Published: (2024)

FusionINN: Decomposable Image Fusion for Brain Tumor Monitoring
by: Kumar, Nishant, et al.
Published: (2024)

Stepwise Credit Assignment for GRPO on Flow-Matching Models
by: Savani, Yash, et al.
Published: (2026)

Where Bits Matter in World Model Planning: A Paired Mixed-Bit Study for Efficient Spatial Reasoning
by: Ranganath, Suraj, et al.
Published: (2026)

EWGN: Elastic Weight Generation and Context Switching in Deep Learning
by: Sawant, Shriraj P., et al.
Published: (2025)

Foundations of a Developmental Design Paradigm for Integrated Continual Learning, Deliberative Behavior, and Comprehensibility
by: Erden, Zeki Doruk, et al.
Published: (2025)