An Enhanced Object Detection Model for Scene Graph Generation

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

With computer vision improving, a higher level of understanding is needed to solve more complex problems such as semantic image retrieval, image captioning, and scene understanding. Scene understanding has been a long-studied problem due to its complexity and lack of proper data representation. A scene Graph is one of the most powerful data representations that can better understand the scene context. The task of a Scene Graph is to encode the objects presented in the scene, their attributes, as long as the relationships between these objects. With the scene Graph proving its capabilities in complicated tasks, the automation of scene graph generation became a must. Great research has been made to obtain accurate Scene Graphs using different deep learning architectures. The common module among those different architectures is the object detection module, where objects are firstly located in the input image. In this work, we propose using the most recent object detectors from the YOLOv5 family for the scene graph generation task. The proposed YOLOv5x6 achieved a State-Of-The-Art result of 32.7 mean average precision compared to previous works. Furthermore, the paper reviews the different object detectors used in literature for the scene graph generation.

Cite

CITATION STYLE

APA

Essam, M., Khattab, D., Shedeed, H. A., & Tolba, M. F. (2023). An Enhanced Object Detection Model for Scene Graph Generation. In Lecture Notes on Data Engineering and Communications Technologies (Vol. 152, pp. 333–343). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-20601-6_30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free