Business Area: Engineering
Seniority Level: Mid-Senior level
Job Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
Remote Role
At Cloudera, our Data Services Pillar is the heart of data innovation. We don’t just work with technology; we build it. Our mission is to empower data practitioners by creating seamless, enterprise-grade experiences for data engineering, warehousing, streaming, operational databases, and AI.
Cloudera is looking for an exceptional and passionate software engineer to join the Data Warehouse engineering team. The technology stack includes popular open source query engines (Apache Hive, Impala, and Trino) and table formats like Apache Iceberg, so there is ample scope for collaboration and contribution to open source. This is an exciting opportunity to work on products that handle complex SQL query workloads on public or private clouds as part of the Cloudera Data Platform (CDP).
As a Senior Software Engineer, you will:
Work on large-scale, distributed systems to help drive Hive innovation and build additional components around it to enhance the Hive ecosystem.
Design and develop features for parallel and distributed query engines to help drive innovation in CDP.
Focus on query optimization, performance and scalability of SQL queries.
Write design documentation for key features and capabilities.
Improve code quality through writing tests, automation, and code reviews.
Understand the customer’s workload and provide effective technical solutions.
We are excited about you if you have:
Bachelor’s or Master’s degree in Computer Science or equivalent, and 6 years of experience.
Experience with query optimization using tools like Apache Calcite.
Clean coding habits, attention to detail, and a focus on quality.
Hands-on programming experience with strong data structures and algorithms skills. Java experience is desired.
Good understanding of database internals, query processing and SQL query optimization.
Strong oral and written communication skills.
Ability to work effectively on cross-functional projects.
You may also have:
Experience contributing to open-source Apache projects such as Hive, Impala, or Calcite, or to an RDBMS.
Experience with the Hadoop ecosystem and file formats like Parquet, ORC.
Experience with public cloud infrastructures such as Microsoft Azure, Amazon Web Services and Google Cloud Platform.
Recognized contributions to open source projects.
Why this role matters:
This is your opportunity to build cloud-native solutions that are deployable anywhere, whether in massive clusters on any cloud provider or in private data centers. You’ll work with cutting-edge technologies like Trino, Spark, Airflow, and advanced AI inferencing systems to shape the future of analytics. Your code will directly influence how data engineers, analysts, and developers worldwide find value in their data.
We believe in the power of open source. You’ll collaborate with project committers, contributing upstream to keep technologies like Apache Hive and Impala evolving. You’ll harden these engines for rock-solid security, optimize them for peak performance, and make them run effortlessly across all environments. Join us and help build the trusted, cloud-native platform that powers insights for the most data-intensive companies on the planet.
What you can expect from us:
Generous PTO Policy
Support for work-life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Employee Resource Groups
EEO/VEVRAA