Responsibilities
Manage HGX nodes (OS, drivers, GPU allocation) Set up and manage OpenShift/K8s clusters Deploy models to inference servers (Triton, TensorRT, etc.) Handle CI/CD for models (training serving) Develop basic scripting (Python/Bash) for operations automation Work Locations
Additional Responsibilities
Manage artifacts (model checkpoints, fine-tuned versions) Validate fine-tuned models (accuracy, fairness, drift) Alert on anomalies Skills + Experience
OpenShift (bonus) DevOps (CI/CD) Python Torch/TensorFlow familiarity Triton Server or similar deployment tool #J-18808-Ljbffr
MNCJobsGulf.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.