Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving

Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving

dc.contributor.guide	Bhuyan, M K and Ahamed, Shaik Rafi
dc.creator.researcher	Mazhar, Saquib
dc.date.accessioned	2025-07-08T07:10:01Z
dc.date.available	2025-07-08T07:10:01Z
dc.date.awarded	2025
dc.date.completed	2025
dc.date.registered	2018
dc.description.abstract	Semantic segmentation, which assigns a class label to each pixel in an image, is fundamental to autonomous driving systems that must perceive and understand complex environments in real time. Achieving high segmentation accuracy while maintaining computational efficiency remains a major challenge, particularly for embedded systems with limited resources. This thesis addresses these challenges by proposing novel methods that enhance both segmentation performance and efficiency. A key contribution is a composite loss function that integrates cross-entropy, boundary, and region-based losses to improve accuracy around object edges and better handle small and infrequent classes. This leads to improved segmentation results, especially for critical but underrepresented objects such as pedestrians and traffic lights. To support real-time operation, this work introduces lightweight network architectures. The Inverse esidual Dilation Pyramid Network (IRDPNet) employs efficient bottlenecks and multi-scale dilation to significantly reduce model size while preserving accuracy. The Block Attention Network (BANet) further enhances contextual understanding by integrating a modified attention mechanism that captures long-range dependencies with minimal overhead. Additionally, the Context-Guided Multi-scale Attention Network (CGMANet) is proposed to combine both low-level spatial features and high-level semantic cues through a hybrid attention mechanism. This architecture effectively balances detail preservation and context awareness, making it highly suitable for deployment on embedded platforms. Collectively, these contributions offer practical solutions for real-time, accurate semantic segmentation in autonomous driving scenarios.
dc.format.accompanyingmaterial	None
dc.identifier.uri	http://hdl.handle.net/10603/650800
dc.language	English
dc.publisher.institution	DEPARTMENT OF ELECTRONICS AND ELECTRICAL ENGINEERING
dc.publisher.place	Guwahati
dc.publisher.university	Indian Institute of Technology Guwahati
dc.rights	self
dc.source.university	University
dc.subject.keyword	Engineering
dc.subject.keyword	Engineering and Technology
dc.subject.keyword	Engineering Electrical and Electronic
dc.title	Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving
dc.type.degree	Ph.D.