Automatic MCQ Generator Using Natural Language Techniques
https://doi.org/10.9790/0661-2304011115Abstract
Background: Multiple choice question (MCQ) generation from a text is a popular research area. They are widely used for a learner's knowledge assessment in the education sector. Manual generation of Mcq is hard, time-consuming and expensive. We have developed a system using various applications of Natural Language Processing Techniques. In this paper, we present a set of MCQs (Multiple Choice Questions) generated for any given input text along with a set of distractors and the correct answer to the question. In addition, The BERT(Bidirectional Encoder Representations from Transformers) model is used to generate a summary of the input text. The RAKE (Random Automatic Keyword Extraction) technique is used for extracting the keyword from the generated summary. This keyword is used as a parameter to generate questions as well as distractors for the MCQs.Distractors are the options that are generated apart from the correct answer. For the generation of MCQ, the wordnet and conceptnet algorithms are used. For the question generation, a T5 Transformer Model is trained using the encoder-decoder architecture. For the generation of MCQ, the wordnet and conceptnet algorithms are used. The final result has is a combination of questions with four options and a correct answer. Materials and Methods: In this study , The system will start with a prompt to provide an input file as a text document file .A description box will be available which will give information about the working of the software. Once the user enters the texts file , they will immediately get a list of multiple choice questions along with correct answer. The text file will be converted into a summary using the summarizer. The summary will be a single paragraph. The keywords will be extracted from the summary using RAKE. The Raw Automatic Keyword Extractor will generate mcq using the graph technique of natural language processing. Once the keyword is extracted the summary and the keyword will be provided as an input to the question Generation model. The question generation model will provide a set of questions on the basis of keywords and the summary provided. The keyword that is associated with the question will be used for distractor generation.The distractor generator will be done using conceptnet and wordnet. The final result will be a list of mcqs with correct answer under each question. Results: The final result will be a set of Multiple choice questions with a set of distractors and a correct answer to the question. Conclusion: The basic motivation of creating this desktop application were examinations attempted during Covid-19 pandemic which were inefficient. This system tends to overcome many flaws that the previous application sustained. The system summarizes the text and generates MCQs with distractors and right answer with probably higher accuracy.
FAQs
AI
What specific NLP techniques enhance distractor generation for MCQs?
The study utilizes WordNet and ConceptNet for generating relevant distractors, which helps improve the effectiveness of MCQs. For example, it employs hypernyms and hyponyms from WordNet to create contextually appropriate incorrect answer choices.
How does the T5 Transformer improve the question generation process?
The T5 Transformer is trained to treat the question generation as a text-to-text problem, allowing for questions to be generated effectively from keywords and summarised input. Its architecture enables high-quality output with reduced latency after applying compression techniques.
What impact does the ONNX technique have on model performance?
Applying the ONNX technique compresses the model's size by three times and improves inference speed by almost five times. This enhancement ensures the model remains functional in diverse software environments while maintaining question generation accuracy.
How are keywords extracted from the input text to support MCQ generation?
Keywords are extracted using the Rapid Automatic Keyword Extraction (RAKE) algorithm which analyzes co-occurrence within the summary of the input text. It effectively identifies crucial nouns, proper nouns, adjectives, and verbs to guide question formation.
What advantages does automated MCQ generation offer over manual methods?
Automated MCQ generation significantly reduces human error, such as spelling mistakes or incorrect distractor choices, and accelerates the process of assessment creation. This efficiency was particularly beneficial during the pandemic when standardized MCQ assessments became widespread.
References (6)
- Keyword Extraction Performance Analysis, 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)
- An Analysis on Different Document Keyword Extraction Methods, Proceedings of the Third International Conference on Computing Methodologies and Communication (ICCMC 2019)
- Extractive Text Summarization Using Sentence Ranking, J.N.Madhuri Dept. of Computer science and Engineering CHRIST (Deemed to be University) Bangalore, India, 2019.
- Topic level summary generation using BERT induced Abstractive Summarization Model, Proceedings of the International Conference on Intelligent Computing and Control Systems (ICICCS 2020)
- Automatic Question Generation for Intelligent Tutoring Systems-A survey(2019)
- Automatic Multiple Choice Question Generation From Text : A Survey, IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, VOL. 13, NO. 1, JANUARY-MARCH 2020 (2020)