Hackerrank

Lead Data Engineer

Hybrid in Bangalore, India Full Time

HackerRank helps thousands of companies like OpenAI, NVIDIA, Amazon hire developers based on their skills vs pedigree, and also nurtures a community of millions of developers to upskill themselves to become the next-gen developers.

The people at HackerRank care deeply about their work and have an extremely intense work ethic. In many companies, speed & quality is a tradeoff. At HackerRank, it’s not -- we expect you to ship in about half the time that most competent people think it’s possible while maintaining a standard of quality you’d proudly sign your name on. The only way to make this happen is if you truly love your craft and are deeply committed to growth.

About the role:

As a Data Engineer, you’ll design and scale the infrastructure that powers modern data pipelines, build scalable models, and turn massive datasets into actionable insights. Beyond coding, your work will influence product, engineering, and business decisions, helping HackerRank leverage data to accelerate innovation worldwide. You’ll collaborate with a tight-knit, high-ownership team in a supportive, low-noise environment.

What you’ll do

  • Design and maintain scalable streaming and batch data pipelines using AWS-native and open-source technologies.
  • Architect and evolve our modern data lakehouse, ensuring reliability, security, and performance.
  • Develop data models that power analytics and enable self-service reporting across teams.
  • Evaluate emerging technologies, run POCs, and implement innovative solutions for complex data challenges.
  • Partner with engineering, analytics, and product teams to deliver data systems that drive real business impact.
  • Mentor junior engineers, guiding them on best practices in data design, quality, and infrastructure.

Who you are

  • 5+ years of experience building data engineering or BI solutions at scale.
  • Strong in data modeling and ETL pipeline design using tools like Apache Airflow or MageAI.
  • Hands-on with Spark (PySpark or Scala) and comfortable optimizing queries on Redshift, Trino, or similar.
  • Fluent in SQL and experienced with performance tuning for large datasets.
  • Skilled at solving problems of scalability, reliability, and performance across distributed systems.
  • Collaborative communicator who thrives in fast-moving, cross-functional teams.

Even better if you have

  • Experience with Kafka or other real-time data streaming tools.
  • Exposure to BI platforms such as Redash, Looker, or Metabase
  • Familiarity with data governance tools like Ranger, observability tools like Grafana, or cost-optimization on AWS.

You will thrive in this role if

  • You love designing systems that scale elegantly and perform flawlessly.
  • Enjoy turning complex data into clear, actionable insights.
  • Take pride in building with precision, ensuring reliability and maintainability at every layer.
  • Thrive in an environment that values ownership, curiosity, and craftsmanship over noise.

Want to learn more about HackerRank? Check out HackerRank.com to explore our products, solutions and resources, and dive into our story and mission here.

HackerRank is a proud equal employment opportunity and affirmative action employer. We provide equal opportunity to everyone for employment based on individual performance and qualification. We never discriminate based on race, religion, national origin, gender identity or expression, sexual orientation, age, marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines. 

Linkedin | X | Blog | Instagram | Life@HackerRank|

Notice to prospective HackerRank job applicants:

  • Our Recruiters use @hackerrank.com email addresses.
  • We never ask for payment or credit check information to apply, interview, or work here.