Innovative Data Science Model for Analysis on Pesticide Poisoning Using Supervised Learning
Published 2025-04-20
Keywords
- Data science,
- decision support system,
- machine learning,
- pesticide poisoning diagnosis
How to Cite
Copyright (c) 2025 M Venkata Ramana, Y Mahesh, A Keshava, G Keerthi Priya, R Sai Jyoshna, S Nayeem

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Abstract
In a Data Science project, assessing data relevance and identifying patterns that support decision-making based on domain-specific knowledge are critical. Additionally, establishing clear methodologies and comprehensive documentation is essential to guide the project from its initial stages to completion. This study introduces a structured Data Science model, covering the entire process from data collection to model training, aimed at enhancing knowledge discovery. The motivation behind this model stems from limitations in existing Data Science methodologies, particularly the absence of practical, step-by-step guidance for data preparation and deployment. The proposed model, called "Data Refinement Cycle with Supervised Machine Learning (DRC–SML)," was specifically designed to assist healthcare professionals in diagnosing pesticide poisoning among rural workers. The dataset for this project, based on scientific research, included 1027 samples containing toxicity biomarker data and clinical analyses. The model achieved an impressive 99.62% accuracy with only 28 decision rules, significantly improving healthcare practices and quality of life in rural areas. The results validate the effectiveness of the DRC–SML model, demonstrating its potential for enhancing predictive analytics in healthcare and other domains.
References
- Ahmed, S. T., Basha, S. M., Ramachandran, M., Daneshmand, M., & Gandomi, A. H. (2023). An edge-AI-enabled autonomous connected ambulance-route resource recommendation protocol (ACA-R3) for eHealth in smart cities. IEEE Internet of Things Journal, 10(13), 11497-11506.
- Ahmed, S. T., Fathima, A. S., Nishabai, M., & Sophia, S. (2024). Medical ChatBot assistance for primary clinical guidance using machine learning techniques. Procedia Computer Science, 233, 279-287.
- Carvalho, J. (2023). Cases of pesticide exposure. IEEE Dataport. https://doi.org/10.21227/kn78-jf53
- Cremin, C. J., Dash, S., & Huang, X. (2022). Big data: Historic advances and emerging trends in biomedical research. Current Research in Biotechnology, 4, 138–151.
- De Bie, T., De Raedt, L., Hernández-Orallo, J., Hoos, H. H., Smyth, P., & Williams, C. K. I. (2022). Automating data science: Prospects and challenges. Communications of the ACM, 65(2), 76–87.
- Jain, S. (2022, May). Comprehensive survey on data science, lifecycle, tools and its research issues. In Proceedings of the International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (Vol. 1, pp. 838–842).
- Kumar, S. S., Ahmed, S. T., Sandeep, S., Madheswaran, M., & Basha, S. M. (2022). Unstructured Oncological Image Cluster Identification Using Improved Unsupervised Clustering Techniques. Computers, Materials & Continua, 72(1).
- Kumari, N., & Acharjya, D. P. (2022). A decision support system for diagnosis of hepatitis disease using an integrated rough set and fish swarm algorithm. Concurrency and Computation: Practice and Experience, 34(21), e7107. https://doi.org/10.1002/cpe.7107
- Laturiuw, A. K., & Singgalen, Y. A. (2023). Sentiment analysis of Raja Ampat tourism destination using CRISP-DM: SVM, NBC, DT, and k-NN algorithm. Journal of Information Systems and Informatics, 5(2), 518–535.
- Pasha, A., Ahmed, S. T., Painam, R. K., Mathivanan, S. K., Mallik, S., & Qin, H. (2024). Leveraging ANFIS with Adam and PSO optimizers for Parkinson's disease. Heliyon, 10(9).
- Periasamy, K., Periasamy, S., Velayutham, S., Zhang, Z., Ahmed, S. T., & Jayapalan, A. (2022). A proactive model to predict osteoporosis: An artificial immune system approach. Expert Systems, 39(4), e12708.
- Saltz, J., & Krasteva, I. (2022). Current approaches for executing big data science projects: A systematic literature review. PeerJ Computer Science, 8, e862.
- Sarveshwaran, V., Jayakanth, J. J., & Renu, Y. (2023, October). Comparative analysis of diverse classification algorithms of machine learning by using various quality metrics. In Proceedings of the IEEE 5th International Conference on Cybernetics, Cognition and Machine Learning Applications (ICCCMLA) (pp. 551–556).
- Sreedhar, K. S., Ahmed, S. T., & Sreejesh, G. (2022, June). An Improved Technique to Identify Fake News on Social Media Network using Supervised Machine Learning Concepts. In 2022 IEEE World Conference on Applied Intelligence and Computing (AIC) (pp. 652-658). IEEE.
- Yao, Y. (2022). Symbols-Meaning-Value (SMV) space as a basis for a conceptual model of data science. International Journal of Approximate Reasoning, 144, 113–128.