The team in Singapore is currently looking for a Machine Learning Engineer with strong expertise in handling PySpark pipelines, in order to support our client activities.
The high level job scope and skills required are:
Help our team to connect to and from the Data Lake environment, take over Python based data pipelines, work with team to find and data prep for analytics. Productionize models with Data Scientists.
Prioritized skill keywords: Python, PySpark, SKLearn, ML Software Engineering, SQL, Unix
- Work closely with Data Scientists on feature engineering and productionizing models to be robust and scalable
- Focus on solutions that provide data analytics for omnichannel/marketing/sales domains
- Identify data engineering needs for advanced analytics teams
- Support the Data and Platforms Group to build foundational infrastructure, data governance, and master data management in support of analytics services
- Use expertise to improve the quality of data models, data flows and data processing of data systems, reporting, analytics solutions
Qualifications and Profile
Overall 4/5+ years of experience in Information technology with following skills,
- 4+ Data Engineering Experience
- Strong knowledge in Machine Learning tools such as SKLearn, Spark, Databricks, Docker, EMR, Kubernetes
- Knowledge of Data Management Platforms (SQL, NoSQL, Cloud, ETL, Datawarehousing)
- Knowledge of MLOps and productionization and scaling of models and managing the ML ecosystem