Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Su, Yukun, Cao, Yiwen, Deng, Jingliang, Rao, Fengyun, Wu, Qingyao
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2401.08086
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913197756252160
author	Su, Yukun Cao, Yiwen Deng, Jingliang Rao, Fengyun Wu, Qingyao
author_facet	Su, Yukun Cao, Yiwen Deng, Jingliang Rao, Fengyun Wu, Qingyao
contents	A large amount of User Generated Content (UGC) is uploaded to the Internet daily and displayed to people world-widely through the client side (e.g., mobile and PC). This requires the cropping algorithms to produce the aesthetic thumbnail within a specific aspect ratio on different devices. However, existing image cropping works mainly focus on landmark or landscape images, which fail to model the relations among the multi-objects with the complex background in UGC. Besides, previous methods merely consider the aesthetics of the cropped images while ignoring the content integrity, which is crucial for UGC cropping. In this paper, we propose a Spatial-Semantic Collaborative cropping network (S2CNet) for arbitrary user generated content accompanied by a new cropping benchmark. Specifically, we first mine the visual genes of the potential objects. Then, the suggested adaptive attention graph recasts this task as a procedure of information association over visual nodes. The underlying spatial and semantic relations are ultimately centralized to the crop candidate through differentiable message passing, which helps our network efficiently to preserve both the aesthetics and the content integrity. Extensive experiments on the proposed UGCrop5K and other public datasets demonstrate the superiority of our approach over state-of-the-art counterparts. Our project is available at https://github.com/suyukun666/S2CNet.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_08086
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Spatial-Semantic Collaborative Cropping for User Generated Content Su, Yukun Cao, Yiwen Deng, Jingliang Rao, Fengyun Wu, Qingyao Computer Vision and Pattern Recognition A large amount of User Generated Content (UGC) is uploaded to the Internet daily and displayed to people world-widely through the client side (e.g., mobile and PC). This requires the cropping algorithms to produce the aesthetic thumbnail within a specific aspect ratio on different devices. However, existing image cropping works mainly focus on landmark or landscape images, which fail to model the relations among the multi-objects with the complex background in UGC. Besides, previous methods merely consider the aesthetics of the cropped images while ignoring the content integrity, which is crucial for UGC cropping. In this paper, we propose a Spatial-Semantic Collaborative cropping network (S2CNet) for arbitrary user generated content accompanied by a new cropping benchmark. Specifically, we first mine the visual genes of the potential objects. Then, the suggested adaptive attention graph recasts this task as a procedure of information association over visual nodes. The underlying spatial and semantic relations are ultimately centralized to the crop candidate through differentiable message passing, which helps our network efficiently to preserve both the aesthetics and the content integrity. Extensive experiments on the proposed UGCrop5K and other public datasets demonstrate the superiority of our approach over state-of-the-art counterparts. Our project is available at https://github.com/suyukun666/S2CNet.
title	Spatial-Semantic Collaborative Cropping for User Generated Content
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2401.08086

Similar Items