Build reliable ingestion pipelines Clean, transform, and store training/fine-tuning datasets Manage high-volume storage efficiently (object storage like MinIO or NFS for fine-tuning datasets) Work Locations UAE - Abu Dhabi Responsibilities
Version datasets for training reproducibility Implement monitoring dashboards (Prometheus/Grafana/Kibana) Skills + Experience
Strong Python/SQL skills ETL tools (Airflow, dbt optional) Data versioning (DVC or LakeFS optional) Basic familiarity with handling large datasets Tokenization, dataset curation Vector storage (e.g., FAISS) Chunking Prompt design Apply Online #J-18808-Ljbffr
MNCJobsGulf.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.