NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Over the past five years, GPU-accelerated data processing has moved from proof of concept to production deployments. Many enterprises recognize that accelerated computing is necessary to handle their large data processing needs. Multi-node GPU deployments will reduce cloud computing costs and lower latency in batch ETL workloads. At NVIDIA, we are invested in accelerating Apache Spark, providing an open source plugin to make data processing fly. Apache Spark is the most popular data processing engine in data centers. We strive to accelerate Spark applications on GPUs without any code changes.
What you'll be doing:
Improve coverage of the RAPIDS Spark plugin to enable more operators and execs from Apache Spark to be GPU accelerated
Enable fast I/O on table layout formats like Delta and Apache Iceberg
Profile code to identify and implement performance improvements
Work on native code (C++) implementations of Apache Spark functionality
Work with open source communities to enhance RAPIDS through technical discussion and code contributions
What we need to see:
9+ years of experience in software development, with the majority in data processing
5+ years of hands-on experience with data platform development
BS/MS/PhD in computer science or a related field (or equivalent experience)
Proficiency in Scala, Java, and SQL; solid understanding of C++ and Python
Familiarity with working on the internals of the open source data platform ecosystem (Apache Spark, Presto, Apache Flink, Apache Arrow, Apache DataFusion, Apache Iceberg, Delta Lake, etc.). Code contributions to one or more of these platforms are a plus.
Experience working on cloud platforms
Experience supporting enterprise customers
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.