Job Summary (for the Data Engineer position):
- Design, build, and maintain scalable data pipelines using Python, PySpark, and Apache Airflow.
- Develop and optimize ETL workflows on Cloudera Data Platform (CDP).
- Implement data quality checks, monitoring, and alerting mechanisms.
- Ensure data security, governance, and compliance in all data pipelines.
- Collaborate with cross-functional teams to understand data requirements and deliver solutions.
- Troubleshoot and resolve issues in production data pipelines.
- Contribute to the architecture and design of the data platform.
- Work with engineering teams and analysts on AI/ML and Generative AI use cases.
- Automate deployment and monitoring of data workflows using DevOps tools and practices.
- Stay current with emerging trends in data engineering, AI/ML, and Generative AI technologies.