AI-enhanced for better readability
Distinguished Engineer - Apache Spark
Source: remoteok
About the Role
NVIDIA is seeking a Distinguished Engineer for the Apache Spark Acceleration group. The role focuses on accelerating Apache Spark applications on GPUs. You will serve as a hands-on architect, working with a team of distributed system engineers and engaging in open-source projects.
Responsibilities
- Lead the architecture, design, and implementation of accelerated Apache Spark and related big-data frameworks
- Engage open-source communities (including Apache Spark, RAPIDS, Apache Iceberg, Delta Lake, and UCX) for technical discussion and contribution, and engage new communities where we may not have a strong presence yet
- Work with NVIDIA partners to deploy GPU-enabled data analytics solutions in public cloud or on-premises clusters
- Present technical solutions at industry conferences and meetups
- Collaborate with distributed systems teams to define solutions to distributed processing problems challenges at large scale
- Provide recommendations and feedback to teams regarding decisions surrounding topics such as infrastructure, continuous integration, and testing strategy
- Build, test, and optimize CUDA/C++ libraries across different platforms
- Build automation and tools that will increase the productivity of teams developing distributed systems
- Mentor members of the engineering team
Requirements
- BS, MS, or PhD in Computer Science, Computer Engineering, or a closely related field (or equivalent experience)
- 17+ years of work or research experience in software development
- Prior experience in delivering complex software projects as a lead architect
- Outstanding technical skills in designing and implementing high-quality distributed systems
- Excellent programming skills in C++, Java, and/or Scala
- Highly motivated with strong interpersonal and communication skills
- 5+ years working experience with key open-source big-data projects as a contributor or committer to Apache Spark, Apache Hadoop, Apache Flink, Apache Kafka, Apache Hive, Apache Arrow, Delta Lake
- Excellent knowledge about distributed system schedulers: Kubernetes, Hadoop YARN, Apache Spark
- Able to delve into a new area and quickly come up to speed
- Able to work with teams across boundaries and geographies
Ways to Stand Out
- Working experience in designing and developing columnar query engines would be a huge plus
- Committership at major open-source projects (such as Apache Spark, Apache Hadoop, Apache Flink) is a big plus
- Working experience with acceleration libraries (CUDA, RAPIDS, UCX) is helpful
Compensation
- Base Salary: $320,000 USD - $488,750 USD
- Eligibility for equity and benefits
Additional Information
- Applications accepted at least until January 13, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer.
- Learn more about NVIDIA: https://www.nvidia.com/