Business Area:
EngineeringSeniority Level:
Mid-Senior levelJob Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
Cloudera is a leader in the fast-growing big data platforms market. This is a rare chance to make a name for yourself in the industry and in the Open Source world. The candidate will be responsible for Apache Hive and CDW projects.
We are looking for a candidate who would like to work on these projects upstream and downstream. If you are curious about the project and code quality you can check the project and the code at the following link. You can start the development before you join. This is one of the beauties of the OSS world.
As a Senior Software Engineer you will:
Build robust and scalable data infrastructure software
Design and create services and system architecture for your projects
Improve code quality through writing unit tests, automation, and code reviews
The candidate would write Java code and/or build several services in the Cloudera Data Platform.
Worked with a team of engineers who reviewed each other's code/designs and held each other to an extremely high bar for the quality of code/designs
The candidate has to understand the basics of Kubernetes.
Build out the production and test infrastructure.
Develop automation frameworks to reproduce issues and prevent regressions.
Work closely with other developers providing services to our system.
Help to analyze and to understand how customers use the product and improve it where necessary.
We are excited if you have:
Deep familiarity with Java programming language.
Hands-on experience with distributed systems.
Knowledge of database concepts, RDBMS internals.
Has experience working in a distributed team.
Has 3+ years of experience in software development.
Deep knowledge of distributed systems, query optimization, and columnar storage formats (Parquet, ORC).
You might also have:
Experience in open source development and knowledge of Git, JIRA and Jenkins etc
Experience with containerised environments
Experience with the Hadoop ecosystem is a great plus
Experience with distributed file systems / databases is a plus
Familiarity with the internals of RDBMS, SQL, JDBC is a plus
Experience with cloud infrastructure is a great plus
Why this role matters:
This is your opportunity to build cloud-native solutions that are deployable anywhere, whether in massive clusters on any cloud provider or in private data centers. You’ll work with cutting-edge technologies like Trino, Spark, Airflow, and advanced AI inferencing systems to shape the future of analytics. Your code will directly influence how data engineers, analysts, and developers worldwide find value in their data.
We believe in the power of open source. You’ll collaborate with project committers, contributing upstream to keep technologies like Apache Hive and Impala evolving. You’ll harden these engines for rock-solid security, optimize them for peak performance, and make them effortlessly run across all environments. Join us and help build the trusted, cloud-native platform that powers insights for the most data-intensive companies on the planet.
What you can expect from us:
Generous PTO Policy
Support work life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Employee Resource Groups
EEO/VEVRAA
#LI-ZC1
#LI-REMOTE