Saved in:
| Main Authors: | Kumar, Abhay, Jain, Nishant, Tripathi, Suraj, Singh, Chirag, Krishna, Kamal |
|---|---|
| Format: | Preprint |
| Published: |
2019
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/1908.08652 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Curriculum for Crowd Counting -- Is it Worthy?
by: Khan, Muhammad Asif, et al.
Published: (2024)
by: Khan, Muhammad Asif, et al.
Published: (2024)
Examining Common Paradigms in Multi-Task Learning
by: Elich, Cathrin, et al.
Published: (2023)
by: Elich, Cathrin, et al.
Published: (2023)
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024)
by: Patni, Suraj, et al.
Published: (2024)
Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation
by: Landgraf, Steven, et al.
Published: (2024)
by: Landgraf, Steven, et al.
Published: (2024)
Bound Tightening Network for Robust Crowd Counting
by: Wu, Qiming
Published: (2024)
by: Wu, Qiming
Published: (2024)
Generation of Indian Sign Language Letters, Numbers, and Words
by: Yadav, Ajeet Kumar, et al.
Published: (2025)
by: Yadav, Ajeet Kumar, et al.
Published: (2025)
A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation
by: Kamal, Minhas, et al.
Published: (2026)
by: Kamal, Minhas, et al.
Published: (2026)
Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition
by: Tripathi, Suraj, et al.
Published: (2019)
by: Tripathi, Suraj, et al.
Published: (2019)
A Comparative Study on Multi-task Uncertainty Quantification in Semantic Segmentation and Monocular Depth Estimation
by: Landgraf, Steven, et al.
Published: (2024)
by: Landgraf, Steven, et al.
Published: (2024)
Multimodal Crowd Counting with Pix2Pix GANs
by: Khan, Muhammad Asif, et al.
Published: (2024)
by: Khan, Muhammad Asif, et al.
Published: (2024)
RCCFormer: A Robust Crowd Counting Network Based on Transformer
by: Liu, Peng, et al.
Published: (2025)
by: Liu, Peng, et al.
Published: (2025)
Real-Time Crowd Counting for Embedded Systems with Lightweight Architecture
by: Zhao, Zhiyuan, et al.
Published: (2025)
by: Zhao, Zhiyuan, et al.
Published: (2025)
Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting
by: Anisha, Alabi Mehzabin, et al.
Published: (2026)
by: Anisha, Alabi Mehzabin, et al.
Published: (2026)
Mechanisms of Non-Monotonic Scaling in Vision Transformers
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
Enhancing Multi-task Learning Capability of Medical Generalist Foundation Model via Image-centric Multi-annotation Data
by: Zhu, Xun, et al.
Published: (2025)
by: Zhu, Xun, et al.
Published: (2025)
AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models
by: Kveton, Branislav, et al.
Published: (2026)
by: Kveton, Branislav, et al.
Published: (2026)
Bright 4B: Scaling Hyperspherical Learning for Segmentation in 3D Brightfield Microscopy
by: Khan, Amil, et al.
Published: (2025)
by: Khan, Amil, et al.
Published: (2025)
Enhancing Pedestrian Trajectory Prediction with Crowd Trip Information
by: Tamaru, Rei, et al.
Published: (2024)
by: Tamaru, Rei, et al.
Published: (2024)
PALADIN : Robust Neural Fingerprinting for Text-to-Image Diffusion Models
by: L, Murthy, et al.
Published: (2025)
by: L, Murthy, et al.
Published: (2025)
CountCLIP -- [Re] Teaching CLIP to Count to Ten
by: Mestha, Harshvardhan, et al.
Published: (2024)
by: Mestha, Harshvardhan, et al.
Published: (2024)
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
by: Zhang, Jianrui, et al.
Published: (2026)
by: Zhang, Jianrui, et al.
Published: (2026)
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
by: Lin, Han, et al.
Published: (2023)
by: Lin, Han, et al.
Published: (2023)
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
by: Miao, Yanting, et al.
Published: (2024)
by: Miao, Yanting, et al.
Published: (2024)
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
by: Lin, Han, et al.
Published: (2024)
by: Lin, Han, et al.
Published: (2024)
Advanced Gesture Recognition for Autism Spectrum Disorder Detection: Integrating YOLOv7, Video Augmentation, and VideoMAE for Naturalistic Video Analysis
by: Singh, Amit Kumar, et al.
Published: (2024)
by: Singh, Amit Kumar, et al.
Published: (2024)
StruMPL: Multi-task Dense Regression under Disjoint Partial Supervision and MNAR Labels
by: Asiyabi, Reza M., et al.
Published: (2026)
by: Asiyabi, Reza M., et al.
Published: (2026)
Multi-task learning on partially labeled datasets via invariant/equivariant semi-supervised learning
by: Rabadán, Miquel Martí i, et al.
Published: (2026)
by: Rabadán, Miquel Martí i, et al.
Published: (2026)
Genie 4D: Semantic-Prior-Guided 4D Dynamic Scene Reconstruction
by: Yang, Yiru, et al.
Published: (2026)
by: Yang, Yiru, et al.
Published: (2026)
TCFormer: A 5M-Parameter Transformer with Density-Guided Aggregation for Weakly-Supervised Crowd Counting
by: Guo, Qiang, et al.
Published: (2025)
by: Guo, Qiang, et al.
Published: (2025)
Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting
by: Yang, Maochen, et al.
Published: (2025)
by: Yang, Maochen, et al.
Published: (2025)
Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
by: Chopra, Shivang, et al.
Published: (2024)
by: Chopra, Shivang, et al.
Published: (2024)
AsyCo: An Asymmetric Dual-task Co-training Model for Partial-label Learning
by: Li, Beibei, et al.
Published: (2024)
by: Li, Beibei, et al.
Published: (2024)
HyperGALE: ASD Classification via Hypergraph Gated Attention with Learnable Hyperedges
by: Arora, Mehul, et al.
Published: (2024)
by: Arora, Mehul, et al.
Published: (2024)
Improve Academic Query Resolution through BERT-based Question Extraction from Images
by: Kamal, Nidhi, et al.
Published: (2024)
by: Kamal, Nidhi, et al.
Published: (2024)
FusionINN: Decomposable Image Fusion for Brain Tumor Monitoring
by: Kumar, Nishant, et al.
Published: (2024)
by: Kumar, Nishant, et al.
Published: (2024)
Stepwise Credit Assignment for GRPO on Flow-Matching Models
by: Savani, Yash, et al.
Published: (2026)
by: Savani, Yash, et al.
Published: (2026)
Where Bits Matter in World Model Planning: A Paired Mixed-Bit Study for Efficient Spatial Reasoning
by: Ranganath, Suraj, et al.
Published: (2026)
by: Ranganath, Suraj, et al.
Published: (2026)
EWGN: Elastic Weight Generation and Context Switching in Deep Learning
by: Sawant, Shriraj P., et al.
Published: (2025)
by: Sawant, Shriraj P., et al.
Published: (2025)
Foundations of a Developmental Design Paradigm for Integrated Continual Learning, Deliberative Behavior, and Comprehensibility
by: Erden, Zeki Doruk, et al.
Published: (2025)
by: Erden, Zeki Doruk, et al.
Published: (2025)
Similar Items
-
Curriculum for Crowd Counting -- Is it Worthy?
by: Khan, Muhammad Asif, et al.
Published: (2024) -
Examining Common Paradigms in Multi-Task Learning
by: Elich, Cathrin, et al.
Published: (2023) -
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024) -
Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation
by: Landgraf, Steven, et al.
Published: (2024) -
Bound Tightening Network for Robust Crowd Counting
by: Wu, Qiming
Published: (2024)