Published 2025-04-20
Keywords
- KYC,
- Blockchain,
- deepfake,
- customer identification
How to Cite
Copyright (c) 2025 M Jyothi, N Haji bablu, M Pavani, K Kalyan Kumar, S Mohammed Jabeer

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Abstract
The growing sophistication of deepfake technology poses a significant challenge to remote identity verification systems, particularly in electronic Know Your Customer (eKYC) applications. Many existing deepfake detection datasets lack the necessary features to assess eKYC systems effectively, as they do not include essential factors like head movements and facial verification protocols. To address this gap, we introduce eKYC-DF, a large-scale dataset comprising over 228,000 high-quality synthetic and real videos, representing diverse demographics. This dataset is designed to facilitate the development and evaluation of eKYC systems by incorporating various head poses, facial expressions, and verification benchmarks. Additionally, our dataset provides protocols for both deepfake detection and facial recognition assessments, making it a valuable resource for enhancing identity-proofing security. The eKYC-DF dataset, along with evaluation tools and pre-trained models, is publicly available to researchers for further study and development.
References
- Serengil, S. I., & Ozpinar, A. (2021, October). HyperExtended LightFace: A facial attribute analysis framework. In Proceedings of the International Conference on Engineering and Emerging Technologies (ICEET) (pp. 1–4).
- Carta, K., Barral, C., El Mrabet, N., & Mouille, S. (2022, March). Video injection attacks on remote digital identity verification solution using face recognition. In 13th International Multi-Conference on Complexity, Informatics and Cybernetics (IMCIC) (pp. 92–97).
- Nanda, A., Shah, S. W. A., Jeong, J. J., Doss, R., & Webb, J. (2023). Towards higher levels of assurance in remote identity proofing. IEEE Consumer Electronics Magazine, 13(1), 1–8.
- Do, T.-L., Tran, M.-K., Nguyen, H. H., & Tran, M.-T. (2022). Potential attacks of DeepFake on eKYC systems and remedy for eKYC with DeepFake detection using two-stream network of facial appearance and motion features. Social Networking and Computational Science, 3(6), 1–17.
- Dagar, D., & Vishwakarma, D. K. (2022). A literature review and perspectives in deepfakes: Generation, detection, and applications. International Journal of Multimedia Information Retrieval, 11(3), 219–289.
- Rössler, A., Cozzolino, D., Verdoliva, L., Riess, C., Thies, J., & Niessner, M. (2019, October). FaceForensics++: Learning to detect manipulated facial images. In IEEE/CVF International Conference on Computer Vision (ICCV) (pp. 1–11).
- Akhtar, Z. (2023). Deepfakes generation and detection: A short survey. Journal of Imaging, 9(1), 18.
- Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022, June). High-resolution image synthesis with latent diffusion models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 10674–10685).
- Ahmed, S. T., Kumar, V. V., Singh, K. K., Singh, A., Muthukumaran, V., & Gupta, D. (2022). 6G enabled federated learning for secure IoMT resource recommendation and propagation analysis. Computers and Electrical Engineering, 102, 108210.
- Ahmed, S. T., Singh, D. K., Basha, S. M., Abouel Nasr, E., Kamrani, A. K., & Aboudaif, M. K. (2021). Neural network based mental depression identification and sentiments classification technique from speech signals: A COVID-19 Focused Pandemic Study. Frontiers in public health, 9, 781827.
- Ahmed, S. T., Vinoth Kumar, V., Mahesh, T. R., Narasimha Prasad, L. V., Velmurugan, A. K., Muthukumaran, V., & Niveditha, V. R. (2024). FedOPT: federated learning-based heterogeneous resource recommendation and optimization for edge computing. Soft Computing, 1-12.
- Chen, R., Chen, X., Ni, B., & Ge, Y. (2020, October). SimSwap: An efficient framework for high fidelity face swapping. In Proceedings of the 28th ACM International Conference on Multimedia (pp. 2003–2011).
- Dwaram, J. R., & Madapuri, R. K. (2022). Crop yield forecasting by long short‐term memory network with Adam optimizer and Huber loss function in Andhra Pradesh, India. Concurrency and Computation: Practice and Experience, 34(27). https://doi.org/10.1002/cpe.7310
- Fathima, A. S., Basha, S. M., Ahmed, S. T., Khan, S. B., Asiri, F., Basheer, S., & Shukla, M. (2025). Empowering consumer healthcare through sensor-rich devices using federated learning for secure resource recommendation. IEEE Transactions on Consumer Electronics
- Fathima, A. S., Basha, S. M., Ahmed, S. T., Mathivanan, S. K., Rajendran, S., Mallik, S., & Zhao, Z. (2023). Federated learning based futuristic biomedical big-data analysis and standardization. Plos one, 18(10), e0291631.
- Fathima, A. S., Reema, S., & Ahmed, S. T. (2023, December). ANN based fake profile detection and categorization using premetric paradigms on instagram. In 2023 Innovations in Power and Advanced Computing Technologies (i-PACT) (pp. 1-6). IEEE.
- Kumar, A., Satheesha, T. Y., Salvador, B. B. L., Mithileysh, S., & Ahmed, S. T. (2023). Augmented Intelligence enabled Deep Neural Networking (AuDNN) framework for skin cancer classification and prediction using multi-dimensional datasets on industrial IoT standards. Microprocessors and Microsystems, 97, 104755.
- Madapuri, R. K., & Senthil Mahesh, P. C. (2017). HBS-CRA: Scaling impact of change request towards fault proneness: Defining a heuristic and biases scale (HBS) of change request artifacts (CRA). Cluster Computing, 22(S5), 11591–11599. https://doi.org/10.1007/s10586-017-1424-0
- Reddy, B. S. H. (2025). Deep learning-based detection of hair and scalp diseases using CNN and image processing. Milestone Transactions on Medical Technometrics, 3(1), 145–5. https://doi.org/10.5281/zenodo.14965660
- Reddy, B. S. H., Venkatramana, R., & Jayasree, L. (2025). Enhancing apple fruit quality detection with augmented YOLOv3 deep learning algorithm. International Journal of Human Computations & Intelligence, 4(1), 386–396. https://doi.org/10.5281/zenodo.14998944
- Ren, J., Xu, C., Chen, H., Qin, X., Li, C., & Zhu, L. (2023). Towards flexible, scalable, and adaptive multi-modal conditioned face synthesis. arXiv preprint arXiv:2312.16274.