A Rule-Based Automatic Question Generation Framework for Educational Text

Authors

  • Jeevan Pralhad Tonde Dr. G.Y. Pathrikar College of CS & IT MGM University, Chhatrapati Sambhaji Nagar, Maharashtra, India Author
  • Satish Sankaye Dr. G.Y. Pathrikar College of CS & IT MGM University, Chhatrapati Sambhaji Nagar, Maharashtra, India Author

DOI:

https://doi.org/10.32628/IJSRST2613127

Keywords:

Automatic Question Generation, Rule-Based NLP, Question Generation Algorithms, Linguistic Pattern Matching, Educational Technology, Natural Language Processing, Explain- able AI

Abstract

Automatic Question Generation (AQG) is essential in educational technology, enabling the creation of assessments, practice exercises, and intelligent tutoring systems. Although recent neural approaches have achieved promising results, they often lack interpretability and control, thereby limiting their use in structured educational contexts. To overcome these challenges, this study introduces a structurally modular, rule-based AQG framework that generates factoid questions from educational texts through explicit linguistic rules and deterministic transformations. The proposed framework draws on classical rule-based question generation methods and employs a two-stage algorithmic design that distinctly separates linguistic analysis from question generation. The first stage involves linguistic preprocessing, clause extraction, and auxiliary-verb determination to identify syntactically valid, question-worthy clauses. In the second stage, rule-based transformations generate various factoid question types, such as who, what, where, when, how many, and how much, followed by disambiguation, template-based surface real- ization, and validation. This separation enhances interpretability, extensibility, and control over rule execution. Unlike data-driven models, the proposed framework does not require annotated training data and avoids hallucination by relying on explicit syntactic and semantic patterns. The resulting system produces grammatically correct, semantically meaningful, and reproducible questions, making it well suited for educational and academic applications. The study demonstrates that carefully designed rule-based systems remain a viable and effective alternative for automatic question generation in controlled domains.

Downloads

Download data is not yet available.

References

X. Dong, X. Zhang, Z. Li, Q. Hou, J. Xue, and X. Li, “A literature review of research on question generation in education,” vol. 11, p. e3203. [Online]. Available: https://pmc.ncbi.nlm.nih.gov/articles/ PMC12453861/ DOI: https://doi.org/10.7717/peerj-cs.3203

N. Mulla and P. Gharpure, “Automatic question generation: A review of methodologies, datasets, evaluation metrics, and applications,” vol. 12, no. 1, pp. 1–32. [Online]. Available: https://doi.org/10.1007/s13748-023- 00295-9 DOI: https://doi.org/10.1007/s13748-023-00295-9

T. Desai, P. Dakle, and D. Moldovan, “Generating Questions for Reading Comprehension using Coherence Relations,” in Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications, Y.-H. Tseng, H.-H. Chen, V. Ng, and M. Komachi, Eds. Association for Computational Linguistics, pp.1–10. [Online]. Available: https://aclanthology.org/W18- 3701/ DOI: https://doi.org/10.18653/v1/W18-3701

A. Virani, R. Yadav, P. Sonawane, and S. Jawale, “Automatic Question Answer Generation using T5 and NLP,” in 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS). IEEE, pp. 1667–1673. [Online]. Available: https://ieeexplore.ieee.org/document/10169726/ DOI: https://doi.org/10.1109/ICSCSS57650.2023.10169726

R. Das, A. Ray, S. Mondal, and D. Das, “A rule based question generation framework to deal with simple and complex sentences,” in 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI). IEEE, pp. 542–548. [Online]. Available: https://ieeexplore.ieee.org/document/7732102/ DOI: https://doi.org/10.1109/ICACCI.2016.7732102

H. Zhang, H. Song, S. Li, M. Zhou, and D. Song, “A Survey of Controllable Text Generation Using Transformer-based Pre-trained Language Models,” vol. 56, no. 3, pp. 64:1–64:37. [Online]. Available: https://dl.acm.org/doi/10.1145/3617680 DOI: https://doi.org/10.1145/3617680

Y. Huang and L. He, “Automatic generation of short answer questions for reading comprehension assessment,” vol. 22, no. 3, pp. 457–489. [Online]. Available: https://www.cambridge.org/ core / journals / natural - language - engineering / article / abs / automatic - generation - of - short - answer- questions - for- reading - comprehension - assessment/701D7F67BE5C8DBEE60C7906C5E0E437 DOI: https://doi.org/10.1017/S1351324915000455

D. T. Vu and J. Blake, “Design and development of a question generator for learners of English,” vol. 102, p. 01011. [Online]. Available: https://www.shs- conferences.org/articles/shsconf/abs/2021/13/shsconf_etltc2021_01011/shsconf_etltc2021_01011.html DOI: https://doi.org/10.1051/shsconf/202110201011

V. Pyatkin, P. Roit, J. Michael, Y. Goldberg, R. Tsarfaty, and I. Dagan, “Asking It All: Generating Contextualized Questions for any Semantic Role,” in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, M.-F. Moens, X. Huang, L. Specia, and S. W.-t. Yih, Eds. Association for Computational Linguistics, pp. 1429–1441. [Online]. Available: https://aclanthology.org/2021.emnlp-main.108/ DOI: https://doi.org/10.18653/v1/2021.emnlp-main.108

T. Alsubait, B. Parsia, and U. Sattler, “Ontology-Based Multiple Choice Question Generation,” vol. 30, no. 2, pp. 183–188. [Online]. Available: https://doi.org/10.1007/s13218-015- 0405-9 DOI: https://doi.org/10.1007/s13218-015-0405-9

S. Al Faraby, A. Adiwijaya, and A. Romadhony, “Review on Neural Question Generation for Education Purposes,” vol. 34, no. 3, pp. 1008–1045. [Online]. Available: https://doi.org/10.1007/s40593-023- 00374-x [12] J. Li, H. Song, and J. Li, “Transformer-based Question Text Generation in the Learning System,” in Proceedings of the 2022 6th International Conference on Innovation in Artificial Intelligence, ser. ICIAI ’22. Association for Computing Machinery, pp. 50–56. [Online]. Available:https://dl.acm.org/doi/10.1145/3529466.3529484

A Review of Hybrid and Ensemble in Deep Learning for Natural Language Processing. [Online]. Available: https://arxiv.org/html/2312.05589v1

Downloads

Published

31-01-2026

Issue

Section

Research Articles

How to Cite

[1]
Jeevan Pralhad Tonde and Satish Sankaye, Trans., “A Rule-Based Automatic Question Generation Framework for Educational Text”, Int J Sci Res Sci & Technol, vol. 13, no. 1, pp. 188–197, Jan. 2026, doi: 10.32628/IJSRST2613127.