Computational Efficiency Optimization in Python: Benchmarking High-Performance Frameworks for Data Science and Machine Learning Workflows
Abstract
In recent years, Python has become the go-to language for data science and machine learning app developers. It is important to note, however, that data scientists are not always programming experts. In spite of the fact that Python enables them to rapidly develop their algorithms, when they are operating at scale, the need for efficient computation becomes unavoidable. Thus, optimizing high-performance hardware, such as graphics processing units and multi-core processors, to achieve their maximum potential is no simple feat.
It is possible to consider the current narrative survey to be a reference document for practitioners of the Python language, which will assist them in navigating the vast array of tools and approaches that are accessible for use with the Python programming language. The user scenarios that are the focus of our document are designed to cover the majority of the possible circumstances that users may encounter. Tool developers may also find this document useful; by reading it, they will be able to see where our work could be lacking in current tools and be more motivated to fill such gaps. We feel that this information may be of service to tool developers.
Keywords
Downloads
Full Research Paper
Complete research article with detailed methodology, results, and references.
How to Cite
APA Style:
Kapoor, P. (2025). Computational Efficiency Optimization in Python: Benchmarking High-Performance Frameworks for Data Science and Machine Learning Workflows. International Journal of Advanced Research in Engineering and Related Sciences, 1(4), 2.
IEEE Style:
P. Kapoor, "Computational Efficiency Optimization in Python: Benchmarking High-Performance Frameworks for Data Science and Machine Learning Workflows," International Journal of Advanced Research in Engineering and Related Sciences, vol. 1, no. 4, paper 2, 2025.
References
- Van Rossum, G., & Drake, F. L. (2009). Python 3 Reference Manual. CreateSpace.
- McKinney, W. (2010). Data structures for statistical computing in Python. Proceedings of the 9th Python in Science Conference, 51-56.
- Harris, C. R., Millman, K. J., van der Walt, S. J., et al. (2020). Array programming with NumPy. Nature, 585(7825), 357-362.
- Abadi, M., Agarwal, A., Barham, P., et al. (2016). TensorFlow: Large-scale machine learning on heterogeneous systems. arXiv preprint arXiv:1603.04467.
- Pedregosa, F., Varoquaux, G., Gramfort, A., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825-2830.
- Hunter, J. D. (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3), 90-95.
- Kluyver, T., Ragan-Kelley, B., Pérez, F., et al. (2016). Jupyter Notebooks-a publishing format for reproducible computational workflows. Positioning and Power in Academic Publishing: Players, Agents and Agendas, 87-90.
- Lam, S. K., Pitrou, A., & Seibert, S. (2015). Numba: A LLVM-based Python JIT compiler. Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, 1-6.