On the Development of HindiEnglish Code Switching Speech Recognition Systems and Corpus

dc.contributor.guideSinha, Rohit
dc.coverage.spatialElectronics and Electrical Engineering
dc.creator.researcherSreeram, Ganji
dc.date.accessioned2022-12-12T11:30:34Z
dc.date.available2022-12-12T11:30:34Z
dc.date.awarded2021
dc.date.completed2020
dc.date.registered2015
dc.description.abstractquotCode-switching refers to the alternate use of two or more languages (or dialects) during the conversation. This phenomenon has been observed in many multilingual communities across the globe. Therefore, handling code-switching by the spoken input systems is very much required for e cient human-machine interaction. However, due to the lack of domain-speci c resources, the research in this domain is somewhat limited compared to the monolingual case. This thesis aims to address the acoustic and language modeling challenges in code-switching automatic speech newlinerecognition (ASR) tasks. In addition to that, a Hindi-English code-switching corpus has been created towards addressing the data scarcity issue. newlineThe early works on code-switching ASR happen to employ the hybrid framework typically developed for the monolingual case. The created Hindi-English code-switching corpus is rst evaluated in the hybrid framework. The hybrid framework comprises of three sub-modules, namely, a pronunciation model, an acoustic model, and a language model. The end-to-end (E2E) framework has recently emerged as a viable alternative to the hybrid systems in the ASR domain. Unlike the hybrid framework, the E2E framework does not require the phonetically labeled training data, and also does not include any explicit pronunciation model. In the case of code-switching ASR, for multiple languages being involved, these attributes become more attractive. Motivated by that, in this thesis, the E2E framework has been explored for developing the code-switching ASR systems.quot
dc.description.noteNot Available
dc.format.accompanyingmaterialNone
dc.format.dimensionsNot Available
dc.format.extentNot Available
dc.identifier.urihttp://hdl.handle.net/10603/424827
dc.languageEnglish
dc.publisher.institutionDEPARTMENT OF ELECTRONICS AND ELECTRICAL ENGINEERING
dc.publisher.placeGuwahati
dc.publisher.universityIndian Institute of Technology Guwahati
dc.relationNot Available
dc.rightsself
dc.source.universityUniversity
dc.subject.keywordEngineering
dc.subject.keywordEngineering and Technology
dc.subject.keywordEngineering Electrical and Electronic
dc.titleOn the Development of HindiEnglish Code Switching Speech Recognition Systems and Corpus
dc.title.alternative
dc.type.degreePh.D.

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
01_fulltext.pdf
Size:
2.69 MB
Format:
Adobe Portable Document Format
Description:
Attached File
Loading...
Thumbnail Image
Name:
04_abstract.pdf
Size:
115.72 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
80_recommendation.pdf
Size:
268.59 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.79 KB
Format:
Plain Text
Description: