TY - JOUR
T1 - Attention-Enhanced Lightweight Hourglass Network for Human Pose Estimation
AU - Kappan, Marsha Mariya
AU - Sandoval, Eduardo Benitez
AU - Meijering, Erik
AU - Cruz, Francisco
N1 - Publisher Copyright:
© 2024 Australasian Robotics and Automation Association. All rights reserved.
PY - 2024
Y1 - 2024
N2 - Pose estimation is a critical task in computer vision with a wide range of applications from activity monitoring to human-robot interaction. However, most of the existing methods are computationally expensive or have complex architecture. Here we propose a lightweight attention based pose estimation network that utilizes depthwise separable convolution and Convolutional Block Attention Module on an hourglass backbone. The network significantly reduces the computational complexity (floating point operations) and the model size (number of parameters) containing only about 10% of parameters of original eight stack Hourglass network. Experiments were conducted on COCO and MPII datasets using a two stack hourglass backbone. The results showed that our model performs well in comparison to six other lightweight pose estimation models with an average precision of 72.07. The model achieves this performance with only 2.3M parameters and 3.7G FLOPs.
AB - Pose estimation is a critical task in computer vision with a wide range of applications from activity monitoring to human-robot interaction. However, most of the existing methods are computationally expensive or have complex architecture. Here we propose a lightweight attention based pose estimation network that utilizes depthwise separable convolution and Convolutional Block Attention Module on an hourglass backbone. The network significantly reduces the computational complexity (floating point operations) and the model size (number of parameters) containing only about 10% of parameters of original eight stack Hourglass network. Experiments were conducted on COCO and MPII datasets using a two stack hourglass backbone. The results showed that our model performs well in comparison to six other lightweight pose estimation models with an average precision of 72.07. The model achieves this performance with only 2.3M parameters and 3.7G FLOPs.
UR - https://www.scopus.com/pages/publications/85219579154
M3 - Conference article
AN - SCOPUS:85219579154
SN - 1448-2053
VL - 2024-November
JO - Australasian Conference on Robotics and Automation, ACRA
JF - Australasian Conference on Robotics and Automation, ACRA
T2 - 2024 Australasian Conference on Robotics and Automation, ACRA 2024
Y2 - 27 November 2024 through 29 November 2024
ER -