Data Engineer for Machine Learning Projects
Parallel Domain
Job Description
Join Parallel Domain as a Data Engineer focused on Machine Learning, enhancing the Replica platform's data processing capabilities. Your role will involve constructing efficient data pipelines and ensuring data quality. You will be responsible for shaping the data landscape for autonomous systems by building and validating data pipelines.
This role emphasizes your ability to collaborate with ML engineers, ensuring that high-quality, curated datasets are available for effective model training and evaluation. Your engineering skills are vital to advancing our simulation technology. Key Responsibilities: โข Build and maintain data pipelines for diverse datasets โข Define data standards and quality metrics โข Develop curation tools for dataset management โข Provide quality data feeds for ML workflows Requirements: โข Experience in scalable data engineering โข Understanding of data's role in ML training โข Knowledge of 3D concepts and computer vision โข Proficiency in Python for data-intensive applications โข Experience collaborating with cross-functional teams Utilize your data expertise at Parallel Domain to innovate in machine learning and robotics. #J-18808-Ljbffr