Body orientation estimation through graph representation: Expanding accuracy with Data Augmentation and Gradient Boosting

Authors

Keywords:

Computer vision, body orientation, XGBoost, OpenPose

Abstract

Body Orientation Estimation (BOE) is important for a wide array of applications, including robotics, surveillance and consumer analysis. Although multi-sensor approaches are effective, they are not a viable option for in the wild scenarios; the usual approach in such cases is to use single camera images, with imprecise results. Some applications that deal with people benefit from obtaining 2D human skeletons for gesture recognition, and these skeletons bring valuable information about the person's pose. It is proposed to build a 2D skeleton via OpenPose and using its data as training data on XGBoost to detect BOE. To evaluate predictions considering real situations based on a single camera, the TUD Multiview Pedestrian dataset is used and extended considering that a single person is originally considered in images where more people were often identified. It is compared the proposed approach against various state-of-the-art methods and our results indicate better performance. Finally, it is proved that our method is viable for BOE in real-time scenarios by presenting case studies on simulated scenes.

Downloads

Download data is not yet available.

Author Biographies

Pedro Victor Vieira de Paiva, Renato Archer IT Center, Campinas, São Paulo, 13069-901, Brazil

MSc. Pedro V. V. Paiva received the B.Sc. in Computer Science from Universidade Federal de Alagoas (2017), the M.Sc. in Computer Vision from Universidade Estadual de Campinas (2019). He is current fellow researcher at CTI Renato Archer. His research interests are within the fields of intelligent systems, image understanding and Human-Robot Interaction.

Murillo Rehder Batista, Center for Information Technology 'Renato Archer' (CTI)

Dr. Murillo R. Batista received the B.Sc., M.Sc. and PhD in Computer Science from Universidade de São Paulo. He is currently a fellow researcher at CTI Renato Archer. His research interests are Human-Robot Interaction, Social Navigation, and Multi-Robot Systems.

Josue Junior Guimaraes Ramos, Center for Information Technology 'Renato Archer' (CTI)

Dr. Josué Ramos holds a degree in Electrical Engineering from the Federal University of Santa Catarina - UFSC (1979). The Master's Degree in Electrical Engineering from the State University of Campinas (1986) and the Ph.D. in Electrical Engineering from UFSC (2002) had an emphasis on robotic systems. In 2004 and 2013 he was a visiting researcher at the Robotics Institute at Carnegie Mellon University, USA. Since 1983, he has been working in the area of Robotics at the Renato Archer Information Technology Center, and since 2013 the emphasis is on Human-Robot Interaction.

References

B. Lewandowski, D. Seichter, T. Wengefeld, L. Pfennig, H. Drumm, andH.-M. Gross, “Deep orientation: Fast and robust upper body orientationestimation for mobile robotic applications,” inwill be published onInternational Conference on Intelligent Robots and Systems (IROS),2019.[2] S. Müller, T. Wengefeld, T. Q. Trinh, D. Aganian, M. Eisenbach, andH.-M. Gross, “A multi-modal person perception framework for sociallyinteractive mobile service robots,”Sensors, vol. 20, no. 3, p. 722, 2020.[3] P. Liu, W. Liu, and H. Ma, “Weighted sequence loss based spatial-temporal deep learning framework for human body orientation estima-tion,” in2017 IEEE International Conference on Multimedia and Expo(ICME), pp. 97–102, 2017.[4] Z. Cao, G. Hidalgo, T. Simon, S.-E. Wei, and Y. Sheikh, “Openpose:realtime multi-person 2d pose estimation using part affinity fields,”arXivpreprint arXiv:1812.08008, 2018.[5] M. Andriluka, S. Roth, and B. Schiele, “Monocular 3d pose estimationand tracking by detection,” in2010 IEEE Computer Society Conferenceon Computer Vision and Pattern Recognition, pp. 623–630, IEEE, 2010.[6] A. Sebti and H. Hassanpour, “Body orientation estimation with theensemble of logistic regression classifiers,”Multimedia Tools and Ap-plications, vol. 76, no. 22, pp. 23589–23605, 2017.[7] T. Wengefeld, B. Lewandowski, D. Seichter, L. Pfennig, and H.-M.Gross, “Real-time person orientation estimation using colored point-clouds,” in2019 European Conference on Mobile Robots (ECMR),pp. 1–7, 2019.[8] M. M. Islam, A. Lam, H. Fukuda, Y. Kobayashi, and Y. Kuno, “A person-following shopping support robot based on human pose skeleton dataand lidar sensor,” inInternational Conference on Intelligent Computing,pp. 9–19, 2019.[9] T. Chen and C. Guestrin, “Xgboost: A scalable tree boosting system,”inProceedings of the 22nd acm sigkdd international conference onknowledge discovery and data mining, pp. 785–794, 2016.[10] H. W. Kuhn, “The hungarian method for the assignment problem,”Navalresearch logistics quarterly, vol. 2, no. 1-2, pp. 83–97, 1955.[11] D. Baltieri, R. Vezzani, and R. Cucchiara, “People orientation recogni-tion by mixtures of wrapped distributions on random trees,” inEuropeanconference on computer vision, pp. 270–283, Springer, 2012.[12] D. Heo, J. Y. Nam, and B. C. Ko, “Estimation of pedestrian pose ori-entation using soft target training based on teacher–student framework,”Sensors, vol. 19, no. 5, p. 1147, 2019.[13] C. J. Bray Jr, “An ordination of the upland forest communities ofsouthern wisconsin. ecol monogr. 27: 325–349,” 1957.[14] D. Zwillinger,CRC standard mathematical tables and formulae. CRCpress, 2002.[15] A. Sebti and H. Hassanpour, “Body orientation estimation with theensemble of logistic regression classifiers,”Multimedia Tools and Ap-plications, vol. 76, no. 22, pp. 23589–23605, 2017.[16] I. Ardiyanto and J. Miura, “Partial least squares-based human upper bodyorientation estimation with combined detection and tracking,”Image andVision Computing, vol. 32, no. 11, pp. 904–915, 2014.[17] M. Enzweiler and D. M. Gavrila, “Integrated pedestrian classificationand orientation estimation,” in2010 IEEE Computer Society Conferenceon Computer Vision and Pattern Recognition, pp. 982–989, IEEE, 2010.[18] T. Gandhi and M. M. Trivedi, “Image based estimation of pedestrianorientation for improving path prediction,” in2008 IEEE IntelligentVehicles Symposium, pp. 506–511, IEEE, 2008.[19] E. Rohmer, S. P. Singh, and M. Freese, “Coppeliasim (formerly v-rep):a versatile and scalable robot simulation framework,” inProceedingsof The International Conference on Intelligent Robots and Systems(IROS),(Tokyo). Available online at: www. coppeliarobotics. com, 2013.

Published

2022-08-08

How to Cite

Paiva, P. V. V. de, Batista, M. R., & Ramos, J. J. G. (2022). Body orientation estimation through graph representation: Expanding accuracy with Data Augmentation and Gradient Boosting. IEEE Latin America Transactions, 20(12), 2414–2420. Retrieved from https://latamt.ieeer9.org/index.php/transactions/article/view/6143

Issue

Section

Electronics