F sign-Net: Depth Sensor Aggregated Frame-based Fourier Network for Sign Word Recognition

Journal article


Authors/Editors


Strategic Research Themes


Publication Details

Author listSunusi Bala Abdullahi, Kosin Chamnongthai, Lubna A. Gabralla, and Haruna Chiroma

PublisherInstitute of Electrical and Electronics Engineers

Publication year2024

JournalIEEE Sensors Journal (1530-437X)

Start page1

End page16

Number of pages16

ISSN1530-437X

eISSN1558-1748

URLhttps://l.facebook.com/l.php?u=https%3A%2F%2Fieeexplore.ieee.org%2Fdocument%2F10555504%3Ffbclid%3DIwZXh0bgNhZW0CMTAAAR3_RaWjnnHIfxv5V7whWkwsbuibl7I6PruGC1pWZqmKqtPrLmBnFnVlZbI_aem_R3hSYmHauOARFsoa-sXfOw&h=AT32ji_xthbmZHMFgPi6GdPN-F7VbYstOGeau-LOHN5XOqdRnDfUP6xw2il4fMO3S9mFtDuxBbaNn0V5EutWRfFwqQPALqeK9twdzazd9ooD65mOSEzHhSuxAZ-lR6gLjI8Tr_YN2g&__tn__=-UK-R&c[0]=AT1xfe53K04csOra1HabmXlrpgFLbnZfYRRwzfhaWkKFQk5DBgTDeRryOQIJAGsZjYQSmCzMUfkKadv_B_3ADIIr8b6suW4Kf46FnGCbcBT99zL6-wFJAgZuW3IGRuyb60AEzgvl9rHcSP92FMAEE9KFEnIZ-F8chhkTCSctyZYUVPUMdgVzNI_tM52mMdEucpUn4AGRQOVMO8vKKtL8KlwcX8o

LanguagesEnglish-United States (EN-US)


View on publisher site


Abstract

Hand-tracking is a challenging problem during hand gesture recognition due to abnormal hand patterns

across depth signs and errors between normal pixels and backgrounds. In this article, we propose F sign-Net: Fourier pixel-wise approach based on the Fourier Convolution Neural Network (FCNN) and Time distributed-based Bi-directional Long-short-term Memory (BiLSTM). FCNNs have been widely researched, reaching state-of-the-art performance on spatial recognition tasks. However, it is still difficult for the Fourier model to learn the temporal patterns due to the chaotic nature of the hand motion data. F sign- Net aggregates time-based information from spatial and temporal modules to a given Fourier convolution in three stages: (1) Each depth frame is regarded as a window, and it is selected so that the aggregated sums of the pixels across the hand joints of the selected window are aligned, (2) a truncated pooling is applied that summarizes the generated featured map of the Fourier convolution to avoid overfitting, and (3) the long-term temporal dependencies among pixels for the Fourier convolution are captured using the shared Time-based BiLSTM layers. This allows the proposed model to learn hand patterns that are temporally oriented. Finally, the proposed Fnet-Sign is evaluated on depth sign language public data sets and demonstrates state-of-the-art performance. Simulation results proved that improved Fourier features are good features for the proposed hand-tracking approach.


Keywords

Augmented reality technology, hearing impairments, sign languagejoint applications of Laplace and Fourier transformsTracking


Last updated on 2024-16-10 at 00:00