โšก New

Data Engineer

HireAlpha

ChennaiFull-timeMid LevelOn-site

Job Description

JOB DESCRIPTION: Senior Data Engineer Experience: 7 โ€“ 10 years of relevant professional experience Location: Remote (Pune Based Preferred) Employment Type: Contract Industry Preference: Any (Healthcare preferred) About the Role We are looking for a Senior Data Engineer with deep expertise in streaming architectures, graph database platforms, and large-scale data pipeline engineering. This is a high-ownership, hands-on role that sits at the intersection of real-time data infrastructure, entity resolution, and multi-database system design. You will architect and build pipelines that drive a complex, multi-layered data platform โ€” ingesting from diverse upstream sources, resolving entities at scale, and keeping graph, relational, search, and caching layers in sync.

You will work closely with data architects, AI engineers, and product teams to deliver reliable, high-performance data infrastructure that powers downstream analytics and intelligent applications across any domain. Key Responsibilities Build and manage real-time data pipelines using Apache Kafka and CDC pipelines Design scalable ETL/ELT workflows using Spark/PySpark Work with graph databases such as Neo4j, Amazon Neptune, or TigerGraph Develop and optimize data models, graph queries, and ingestion pipelines Implement entity resolution and deduplication logic for large datasets Integrate relational, graph, search, and caching databases Build data lakehouse solutions using Iceberg/Parquet/Delta Lake Ensure data quality, observability, monitoring, and governance Collaborate with engineering, architecture, and product teams Required Skills Strong hands-on experience with Apache Kafka and streaming pipelines Expertise in Python and advanced SQL Experience with Spark/PySpark and large-scale data processing Hands-on experience with Graph Databases (Neo4j preferred) Experience with PostgreSQL, Elasticsearch, and Redis Knowledge of AWS/Azure, Docker, Kubernetes, and Terraform Experience with CDC pipelines and real-time architectures Exposure to entity resolution / deduplication systems is highly preferred Preferred Healthcare domain experience preferred Experience in large-scale distributed systems and data platforms

Posted Today

Related Jobs

Related Searches

Apply Now