Senior GPU Software Engineer, System Software, AI Infrastructure
Indexed description
- Bachelor’s degree or equivalent practical experience.
- 5 years of experience with software development in one or more programming languages.
- 3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.
- 3 years of experience working with embedded operating systems.
- Master's degree or PhD in Computer Science or related technical field.
- 3 years of experience in designing and developing device drivers for peripherals such as GPUs, Switches, and connectivity buses such as USB, PCIe, etc.
- Ability to build tools to enable automation of tests to ship high quality systems.
- Experience in large-scale and secure fleet management of servers and components.
- Experience in software or hardware integration, system bring-up, and diagnostics.
- Experience in modularized servers of high performance and reliability
Platforms Engineering performs research, design and development for the hardware, software and networking technologies that power all of Google's products and services (including Google Cloud Platform, Tensor Processing Units, Storage and Networking services). Work on solutions not only to deliver the next generation compute, storage and networking products for Google data centers, but also to counter the implications of Moore’s Law.
In this role, you will be responsible for building top quality GPU compute solutions that power various Google services like Google Cloud, YouTube, Deep Mind, etc. Maintain the systems deployed in the data centers with the reliability monitoring services, kernel rollouts, firmware and driver upgrades.
The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide.
We're the driving team behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
Responsibilities
- Design, develop and maintain the system software stack for Graphics Processing Unit (GPU) system software.
- Provide technical leadership to help formulate and drive software development plans.
- Help identify dependencies in cross-functional teams and drive New Product Introduction (NPI) execution with a focus on development velocity and quality.
- Set technical direction and priorities for a small number of adjacent engineers within and across teams, recommend solutions that consider both short and long-term goals.
- Drive System Software integration to enable next generation GPU Accelerators for Google Data Center.
Create a free Caio profile to unlock the full index and keep your job-search signal for future recommendations.
Unlock free search