Alt Text: Scaling Machine Learning Models with GKE and TensorFlow
Title: Scaling Machine Learning Models with Google Kubernetes Engine and TensorFlow
Caption: Efficiently scaling ML models using GKE and TensorFlow
Description: This article explores how to efficiently scale machine learning models using Google Kubernetes Engine and TensorFlow.
International Journal of Computer Techniques – Volume 10 Issue 1, January 2023
Tulasiram Yadavalli
Role: Senior Software Engineer, USA
Abstract
Scaling machine learning (ML) models for large-scale deployment is a challenging task. Containerization using tools like Google Kubernetes Engine (GKE) and TensorFlow has emerged as a solution. This article looks at how to efficiently scale ML models using GKE alongside TensorFlow. It discusses containerized machine learning workloads, optimization strategies, and practical code-based examples. The use of GKE allows seamless orchestration of containerized applications, while TensorFlow offers powerful tools for training and inference. Together, they provide an efficient, scalable platform for deploying large ML models. The article also highlights common issues and practical solutions to improve performance and efficiency.
Keywords
machine learning, Google Kubernetes engine, TensorFlow, containerization, large-scale deployment, optimization.
References
- Olson, Randal S., and Jason H. Moore. “TPOT: A tree-based pipeline optimization tool for automating machine learning.” In Workshop on automatic machine learning, pp. 66-74. PMLR, 2016.
- P. Carlisle, “Machine Learning for Games at Scale,” in Game AI Uncovered, vol. 1, Florida, US, CRC Press, N.D, p. 11.
- Ghasemi, Arezoo, and Abolfazl Toroghi Haghighat. “A multi-objective load balancing algorithm for virtual machine placement in cloud data centers based on machine learning.” Computing 102, no. 9 (2020): 2049-2072.
- Asghar, Muhammad Zeeshan, Furqan Ahmed, and Jyri Hämäläinen. “Artificial intelligence enabled self-healing for mobile network automation.” In 2021 IEEE Globecom Workshops (GC Wkshps), pp. 1-6. IEEE, 2021.
- I. A. &. M. G. A. Mahmoud Imdoukh, “Machine learning-based auto-scaling for containerized applications,” Neural Computing and Applications, vol. 32, no. 1, p. 9745–9760, 2019.
- Ramos, Felipe, Eduardo Viegas, Altair Santin, Pedro Horchulhack, Roger R. dos Santos, and Allan Espindola. “A machine learning model for detection of docker-based APP overbooking on kubernetes.” In ICC 2021-IEEE International Conference on Communications, pp. 1-6. IEEE, 2021.
- A. Choudhury, “Introduction to Kubeflow & Kubernetes Cloud Architecture,” in Continuous Machine Learning with Kubeflow: Performing Reliable MLOps with Capabilities of TFX, Sagemaker and Kubernetes (English Edition, Noida, BPB Publications, 2021, pp. 7-28.
How to Cite
Independent Researcher, “Scaling Machine Learning Models with Google Kubernetes Engine and TensorFlow,” International Journal of Computer Techniques, Volume 10 Issue 1, January 2023. ISSN 2394-2231.
Opportunities to Collaborate
We invite you to become part of our esteemed reviewer board or submit your research paper for publication.
Join Our Reviewer Board Submit Your Research Paper