Performance analysis of different deep learning architectures for Hand action recognition

dc.contributor.guideSathiesh Kumar, V
dc.coverage.spatialPerformance analysis of different deep learning architectures for Hand action recognition
dc.creator.researcherRubin Bose, S
dc.date.accessioned2023-02-16T05:24:57Z
dc.date.available2023-02-16T05:24:57Z
dc.date.awarded2022
dc.date.completed2022
dc.date.registered
dc.description.abstractRecognizing the hand actions in an unrestrained context is a challenging computer vision task. Computational cost, rapid movement, illumination changes, self-occlusion, uncertain environment, varying viewpoint, varying hand shape, size, and high degrees of freedom (DOF) are the factors that impact the performance of the hand action recognition system. To address the above specified challenges in the area of hand action recognition two different deep Convolution Neural Network (CNN) based approaches namely, multi-stage CNN and single-stage CNN are proposed and reported in this thesis. The existing standard hand action datasets do not consider most of the complexities or challenges as quoted earlier. Hence, a hand action dataset that can be used for real-time hand action recognition is collected and named MITI-HD . All the below mentioned contributions are evaluated using two standard datasets (NUSHP-II and Senz-3D) and a custom developed dataset (MITI-HD). Each model is trained using different Stochastic Gradient Descent Optimizers (Adam, Momentum, and RMSprop). The Faster R-CNN Inception-V2 is a multi-stage CNN approach utilized to perform a real-time hand action recognition. Inception-V2 is used as a backbone feature extraction network. The proposed model using Adam optimizer produces better performance (Average Precision (AP) = 99.10%, Average Recall (AR) = 96.78%, F1-Score = 97.98%, and Prediction time = 140 ms) than the other optimizers on the MITI-HD dataset. The single-stage CNN based six different deep learning models are evaluated in relation to real-time hand action recognition. newline
dc.description.note
dc.format.accompanyingmaterialNone
dc.format.dimensions21cm
dc.format.extentxvi.189p.
dc.identifier.urihttp://hdl.handle.net/10603/458438
dc.languageEnglish
dc.publisher.institutionFaculty of Information and Communication Engineering
dc.publisher.placeChennai
dc.publisher.universityAnna University
dc.relationp.179-188
dc.rightsuniversity
dc.source.universityUniversity
dc.subject.keywordEngineering and Technology
dc.subject.keywordComputer Science
dc.subject.keywordTelecommunications
dc.subject.keywordDeep learning
dc.subject.keywordHand action recognition
dc.subject.keywordConvolution Neural Network
dc.titlePerformance analysis of different deep learning architectures for Hand action recognition
dc.title.alternative
dc.type.degreePh.D.

Files

Original bundle

Now showing 1 - 5 of 12
Loading...
Thumbnail Image
Name:
01_title.pdf
Size:
1.35 MB
Format:
Adobe Portable Document Format
Description:
Attached File
Loading...
Thumbnail Image
Name:
02_prelim pages.pdf
Size:
2.45 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
03_content.pdf
Size:
379.91 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
04_abstract.pdf
Size:
134.01 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
05_chapter 1.pdf
Size:
687.83 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.79 KB
Format:
Plain Text
Description: