Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving

dc.contributor.guide: Bhuyan, M K and Ahamed, Shaik Rafi
dc.creator.researcher: Mazhar, Saquib
dc.date.accessioned: 2025-07-08T07:10:01Z
dc.date.available: 2025-07-08T07:10:01Z
dc.date.awarded: 2025
dc.date.completed: 2025
dc.date.registered: 2018
dc.description.abstract: Semantic segmentation, which assigns a class label to each pixel in an image, is fundamental to autonomous driving systems that must perceive and understand complex environments in real time. Achieving high segmentation accuracy while maintaining computational efficiency remains a major challenge, particularly for embedded systems with limited resources. This thesis addresses these challenges by proposing novel methods that enhance both segmentation performance and efficiency. A key contribution is a composite loss function that integrates cross-entropy, boundary, and region-based losses to improve accuracy around object edges and better handle small and infrequent classes. This leads to improved segmentation results, especially for critical but underrepresented objects such as pedestrians and traffic lights. To support real-time operation, this work introduces lightweight network architectures. The Inverse Residual Dilation Pyramid Network (IRDPNet) employs efficient bottlenecks and multi-scale dilation to significantly reduce model size while preserving accuracy. The Block Attention Network (BANet) further enhances contextual understanding by integrating a modified attention mechanism that captures long-range dependencies with minimal overhead. Additionally, the Context-Guided Multi-scale Attention Network (CGMANet) is proposed to combine both low-level spatial features and high-level semantic cues through a hybrid attention mechanism. This architecture effectively balances detail preservation and context awareness, making it highly suitable for deployment on embedded platforms. Collectively, these contributions offer practical solutions for real-time, accurate semantic segmentation in autonomous driving scenarios.
dc.format.accompanyingmaterial: None
dc.identifier.uri: http://hdl.handle.net/10603/650800
dc.language: English
dc.publisher.institution: DEPARTMENT OF ELECTRONICS AND ELECTRICAL ENGINEERING
dc.publisher.place: Guwahati
dc.publisher.university: Indian Institute of Technology Guwahati
dc.rights: self
dc.source.university: University
dc.subject.keyword: Engineering
dc.subject.keyword: Engineering and Technology
dc.subject.keyword: Engineering Electrical and Electronic
dc.title: Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving
dc.type.degree: Ph.D.

Files

Original bundle

Name:
01_fulltext.pdf
Size:
4.92 MB
Format:
Adobe Portable Document Format
Description:
Attached File
Name:
04_abstract.pdf
Size:
104.38 KB
Format:
Adobe Portable Document Format
Name:
80_recommendation.pdf
Size:
103.72 KB
Format:
Adobe Portable Document Format

License bundle

Name:
license.txt
Size:
1.79 KB
Format:
Plain Text
Description: