Distinguished Engineer - Apache Spark

Source: remoteok

About the Role

NVIDIA is seeking a Distinguished Engineer for the Apache Spark Acceleration group. The role focuses on accelerating Apache Spark applications on GPUs. You will serve as a hands-on architect, working with a team of distributed systems engineers and engaging in open-source projects.

Responsibilities

Lead the architecture, design, and implementation of accelerated Apache Spark and related big-data frameworks
Engage open-source communities (including Apache Spark, RAPIDS, Apache Iceberg, Delta Lake, and UCX) for technical discussion and contribution, and engage new communities where we may not have a strong presence yet
Work with NVIDIA partners to deploy GPU-enabled data analytics solutions in public cloud or on-premises clusters
Present technical solutions at industry conferences and meetups
Collaborate with distributed systems teams to define solutions to distributed processing challenges at large scale
Provide recommendations and feedback to teams on decisions about topics such as infrastructure, continuous integration, and testing strategy
Build, test, and optimize CUDA/C++ libraries across different platforms
Build automation and tools that will increase the productivity of teams developing distributed systems
Mentor members of the engineering team

Requirements

BS, MS, or PhD in Computer Science, Computer Engineering, or a closely related field (or equivalent experience)
17+ years of work or research experience in software development
Prior experience in delivering complex software projects as a lead architect
Outstanding technical skills in designing and implementing high-quality distributed systems
Excellent programming skills in C++, Java, and/or Scala
Highly motivated with strong interpersonal and communication skills
5+ years of working experience with key open-source big-data projects as a contributor or committer to Apache Spark, Apache Hadoop, Apache Flink, Apache Kafka, Apache Hive, Apache Arrow, or Delta Lake
Excellent knowledge of distributed system schedulers: Kubernetes, Hadoop YARN, Apache Spark
Able to delve into a new area and quickly come up to speed
Able to work with teams across boundaries and geographies

Ways to Stand Out

Working experience in designing and developing columnar query engines would be a huge plus
Committership at major open-source projects (such as Apache Spark, Apache Hadoop, Apache Flink) is a big plus
Working experience with acceleration libraries (CUDA, RAPIDS, UCX) is helpful

Compensation

Base Salary: $320,000 USD - $488,750 USD
Eligibility for equity and benefits

Additional Information

Applications accepted at least until January 13, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is an equal opportunity employer.
Learn more about NVIDIA: https://www.nvidia.com/