Fine-Grained Scene Graph Generation with Overlap Region and Geometrical Center

Zhao, Yong Qiang; Jin, Zhi; Zhao, Hai Yan; Zhang, Feng; Tao, Zheng Wei; Dou, Cheng Feng; Xu, Xin Hai; Liu, Dong Hong

dc.contributor.author	Zhao, Yong Qiang	en_US
dc.contributor.author	Jin, Zhi	en_US
dc.contributor.author	Zhao, Hai Yan	en_US
dc.contributor.author	Zhang, Feng	en_US
dc.contributor.author	Tao, Zheng Wei	en_US
dc.contributor.author	Dou, Cheng Feng	en_US
dc.contributor.author	Xu, Xin Hai	en_US
dc.contributor.author	Liu, Dong Hong	en_US
dc.contributor.editor	Umetani, Nobuyuki	en_US
dc.contributor.editor	Wojtan, Chris	en_US
dc.contributor.editor	Vouga, Etienne	en_US
dc.date.accessioned	2022-10-04T06:41:23Z
dc.date.available	2022-10-04T06:41:23Z
dc.date.issued	2022
dc.identifier.issn	1467-8659
dc.identifier.uri	https://doi.org/10.1111/cgf.14683
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14683
dc.description.abstract	Scene graph generation is the task of identifying the objects in an image and, in particular, the relationships between them. Existing scene graph generation methods generally use the bounding-box region features of objects to identify the relationships between objects. However, we argue that the overlap region features of two objects may play an important role in fine-grained relationship identification. In fact, some fine-grained relationships can only be obtained from the overlap region features of two objects. Therefore, we propose the Multi-Branch Feature Combination (MFC) module and the Overlap Region Transformer (ORT) module to comprehensively obtain the visual features contained in the overlap regions of two objects. Concretely, the MFC module uses deconvolution and multi-branch dilated convolution to obtain high-resolution, multi-receptive-field features in the overlap regions. The ORT module uses a vision transformer to apply self-attention to the overlap regions. Used jointly, the two modules combine the local connectivity of convolution with the global connectivity of attention. We also design a Geometrical Center Augmented (GCA) module to obtain the relative position of the geometric centers of two objects, because the scale of the overlap region alone cannot accurately capture the relationship between two objects. Experiments show that our model ORGC (Overlap Region and Geometrical Center), which combines the MFC, ORT, and GCA modules, improves the performance of fine-grained relation identification. On the Visual Genome dataset, our model outperforms the current state-of-the-art model by 4.4% on the R@50 evaluation metric, reaching a state-of-the-art result of 33.88.	en_US
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Computing methodologies → Artificial Intelligence; Neural Networks; Computer Vision
dc.subject	Computing methodologies → Artificial Intelligence
dc.subject	Neural Networks
dc.subject	Computer Vision
dc.title	Fine-Grained Scene Graph Generation with Overlap Region and Geometrical Center	en_US
dc.description.seriesinformation	Computer Graphics Forum
dc.description.sectionheaders	Image Detection and Understanding
dc.description.volume	41
dc.description.number	7
dc.identifier.doi	10.1111/cgf.14683
dc.identifier.pages	359-370
dc.identifier.pages	12 pages
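
Illustrative note on the abstract: the two geometric quantities it relies on, the overlap region of two object bounding boxes and the relative position of their geometric centers, can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' MFC, ORT, or GCA modules: the (x1, y1, x2, y2) box convention and the function names overlap_region, center, and center_offset are hypothetical and do not come from the paper.

```python
from typing import Optional, Tuple

# Assumed box convention (not taken from the paper): (x1, y1, x2, y2)
# with x1 < x2 and y1 < y2, in pixel coordinates.
Box = Tuple[float, float, float, float]


def overlap_region(a: Box, b: Box) -> Optional[Box]:
    """Return the intersection rectangle of two boxes, or None if they do not overlap."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    if x2 <= x1 or y2 <= y1:
        return None  # empty or degenerate intersection
    return (x1, y1, x2, y2)


def center(box: Box) -> Tuple[float, float]:
    """Geometric center of a box."""
    return ((box[0] + box[2]) / 2, (box[1] + box[3]) / 2)


def center_offset(subject: Box, obj: Box) -> Tuple[float, float]:
    """Offset of the object's center relative to the subject's center."""
    sx, sy = center(subject)
    ox, oy = center(obj)
    return (ox - sx, oy - sy)


if __name__ == "__main__":
    subject_box = (0.0, 0.0, 4.0, 4.0)
    object_box = (2.0, 1.0, 6.0, 3.0)
    print(overlap_region(subject_box, object_box))
    print(center_offset(subject_box, object_box))
```

In a pipeline of the kind the abstract describes, a rectangle like the one returned by overlap_region would presumably select the feature-map region passed to the convolutional and attention branches, while the center offset would supply the positional cue that the size of the overlap alone does not provide.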

Files in this item

Name: v41i7pp359-370.pdf
Size: 29.53Mb
Format: PDF

Name: appendix.pdf
Size: 75.03Kb
Format: PDF

Name: figures.zip
Size: 24.29Mb
Format: application/zip

This item appears in the following Collection(s)

41-Issue 7
Pacific Graphics 2022 - Symposium Proceedings