Senior Data Engineer (6+ yrs)
MindBrain
Job Description
Senior Data Engineer – Graph Database & Data Intelligence Position: Senior Data Engineer Experience Required: 6+ Years Location: Remote Employment Type: Contract Job Summary We are seeking a highly skilled Senior Data Engineer with expertise in Graph Databases, Data Modeling, and modern backend frameworks. The ideal candidate should have strong experience in building scalable data solutions using Neo4j, Cypher Query Language, PostgreSQL, and FastAPI. Experience in entity resolution and data linking using Splink will be a strong advantage.
Key Responsibilities Design, develop, and maintain scalable data pipelines and graph-based data architectures. Build and optimize Graph Database solutions using Neo4j. Develop complex queries using Cypher Query Language (CypherQL).
Implement Graph Data Modeling techniques for relationship-driven datasets. Work with PostgreSQL databases for structured data management and optimization. Develop APIs and backend services using FastAPI.
Implement entity resolution and probabilistic matching solutions using Splink. Integrate multiple data sources and ensure data consistency and quality. Collaborate with cross-functional teams for analytics, reporting, and business intelligence solutions.
Optimize database performance, query execution, and data workflows. Ensure data governance, security, and scalability standards are maintained. Required Skills Strong experience with Neo4j Graph Database.
Expertise in Graph Database concepts and Graph Data Modeling. Hands-on experience with Cypher Query Language (CypherQL). Strong knowledge of PostgreSQL.
Experience developing REST APIs using FastAPI. Hands-on experience with Splink for entity resolution and record linkage. Strong SQL and Python programming skills.
Experience with ETL/ELT pipeline development. Familiarity with cloud platforms like AWS, Azure, or GCP. Knowledge of data engineering best practices and scalable architectures.
Preferred Skills Experience with Apache Spark or PySpark. Knowledge of Docker, Kubernetes, and CI/CD pipelines. Experience with data governance and metadata management.
Exposure to machine learning data pipelines is a plus. Qualification Bachelor’s/Master’s degree in Computer Science, IT, Engineering, or related field.