Business Area:
EngineeringSeniority Level:
Entry levelJob Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance.
The Replication Manager team is looking for passionate developers to join our growing engineering team. The team is responsible for building out the storage, metadata, permissions and lineage replication support for the Cloudera Data Platform. The team's mission is to provide a seamless experience for our customers for moving the data and all entities associated with that to achieve migration, replication as well as disaster recovery use cases.
Replication Manager enables the customers to replicate data across data centers or to/from the cloud. Replication scenarios can include data stored in HDFS, Ozone, or public cloud buckets; data stored in Hive tables, Hive metastore, HBase or Iceberg table data; Ranger permissions and Atlas lineage. The datasets can range from terabytes to petabytes of data with some additional challenges like millions of directories/Ozone keys, individual file sizes ranging in gigabytes, near real time HBase WAL replication.
How we work:
We are a distributed team that values deep technical work and a sustainable, long-term focus. Our culture is built on psychological safety, trust, and respect for an engineer's time.
As a Software Engineer I, you will:
We’re excited about you if you have:
You may also have:
Why this role matters:
You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.
Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.
This role is not eligible for immigration sponsorship.
What you can expect from us:
Generous PTO Policy
Support work life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Employee Resource Groups
EEO/VEVRAA
#LI-RB1
#LI-HYBRID