GitHub - akterTaslima/RhetoricalQuestion_Detection: This was the final project for the course Natural Language Processing that I took in Spring 2018.

Automatic Question Detection in Speech using Deep Neural Network

In this project, we describe our efforts towards the automatic detection of questions in speech. We analyze the utility of various features for this task, Spectrogram and Mel-frequency cepstral coefficients (MFCC). We have used IEEE corpus and self-prepared corpus of human-voice recorded audio files and trained the data on Recurrent Neural Networks (RNNs) and Convolutional Recurrent Neural Networks (CRNNs) and compared their performances. Our system, provides state-of-the-art results on the clean corpus and in noisy environments as well.

Dialogue Act Recognition is a challenging problem in dialogue interpretation which aims to attach semantic labels to utterance and characterize the speaker’s intention. The most frequent DA types are are statements and opinions, questions, back-channels. Our project focuses on detecting questions in the speech. Automatic Speech recognition is the versatile field of Computational Linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. The system analyses the person’s specific voice and and use it to detect the person’s speech with better accuracy. Questions in human dialogues is an important first step to automatically processing and understanding the natural speech. It can be viewed as a subtask of speech act or dialogue act tagging, which aims to label functions of utterances in conversations. The various types of questions are Yes-No, wh, Declarative, Rhetoric, echo, etc. It is useful for meeting indexing and summarization. Examples-

Yes-No: Have you looked at that?

Wh: What was the nature of the email?

Declarative: You are editing your slide?

Echo: He has undergone a surgery?

Rhetorical: Do you know that person?

The main motivation behind this project is to develop a model that will consider intonation, stress and other speech attributes to classify the group questions. The current speech recognizer examples such as Google Home mini and Alexa can understand speech and respond accordingly. But these techniques are not smart enough to detect echo questions, rhetorical questions, etc. For an instance, questions like - "What is your name?" is easily recognizable by the current systems. But if the question is of echo type - "Your name is abc?", then in this case the current systems will classify it as a normal sentence. But our model will classify these types also as "Questions" only unlike the current systems.

Report [LINK].

Full dataset is available on request.

Authors:

TASLIMA AKTER ([email protected]), Indiana University Bloomington, USA

KHANDOKAR MD. NAYEM, Indiana University Bloomington, USA

HASIKA MAHTTA, Indiana University Bloomington, USA

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Final_project.ipynb		Final_project.ipynb
NLP_changeFrequency.m		NLP_changeFrequency.m
NLP_createNoisySpeech.m		NLP_createNoisySpeech.m
NLP_data_generator.m		NLP_data_generator.m
Project_final-chunk-Copy1.ipynb		Project_final-chunk-Copy1.ipynb
Project_final_full_wav.ipynb		Project_final_full_wav.ipynb
README.md		README.md
generateMixture_v2.m		generateMixture_v2.m
knayem_Project_final_full_wav_CRNN.ipynb		knayem_Project_final_full_wav_CRNN.ipynb
nestedSortStruct.m		nestedSortStruct.m
nestedSortStruct2.m		nestedSortStruct2.m
sortStruct.m		sortStruct.m
sortStruct2.m		sortStruct2.m
takter,hmahtta,knayem_automatic-question-detection.pdf		takter,hmahtta,knayem_automatic-question-detection.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Question Detection in Speech using Deep Neural Network

Authors:

About

Releases

Packages

Languages

akterTaslima/RhetoricalQuestion_Detection

Folders and files

Latest commit

History

Repository files navigation

Automatic Question Detection in Speech using Deep Neural Network

Authors:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages