Contextual understanding on natural scene images for improved annotation using heuristics and methods

Contextual understanding on natural scene images for improved annotation using heuristics and methods

Files

01_title.pdf (235.33 KB)

02_prelim pages.pdf (4.88 MB)

03_content.pdf (197.39 KB)

04_abstract.pdf (590.71 KB)

05_chapter1.pdf (838.92 KB)

Abstract

Advancements in object or image annotation techniques have revolutionized computer vision and image analysis applications. By accurately labeling and marking objects or regions of interest within images, these advancements enable more efficient and accurate understanding of visual data in Natural Scene Images (NSI). Object detection and recognition have greatly benefited from improved annotation techniques, resulting in enhanced algorithms for identifying and classifying objects. Image annotation has played a vital role in training models for image captioning and understanding, as well as in augmented reality and virtual reality applications where virtual content needs to align seamlessly with the real world. In general, advancements in object or image annotation have greatly enhanced the accuracy and efficiency of computer vision algorithms, impacting diverse fields such as healthcare, transportation, assisting visually impaired, entertainment, and beyond. newlineFirst, this work presents an object detection scheme that utilizes the AlexNet deep learning model as its base. Different optimizers including SGDM, RMSProp, and Adam are employed during the execution of the AlexNet model, with performance evaluation conducted on the Flicker dataset. The results demonstrate that the Adam optimizer outperforms the others. In addition to object detection, this work introduces a context-based Hidden Markov Model (HIV IM) for improving the Image annotation based on heuristic attributes of objects and their inter-relationships. The HMM model enhances the understanding of the objects by generating annotation for the image that optimally matches the synonymous captions present in the dataset. newline

URI

http://hdl.handle.net/10603/566981

Collections

Faculty of Information and Communication Engineering

Full item page

Contextual understanding on natural scene images for improved annotation using heuristics and methods

Files

Date

item.page.authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced