Technical Program Manager, ML Fleet
- linkCopy link
- emailEmail a friend
Minimum qualifications:
- Bachelor's degree in a relevant technical or engineering field (e.g., Hardware, Computer Systems, Electrical Engineering, or Software Engineering) or equivalent practical experience.
- 8 years of experience in technical program management, including leading the launch of cross-functional programs involving hardware/systems.
- Experience in machine learning infrastructure or program execution.
- Experience with capacity management, supply chain, or demand forecasting processes in a technology context.
Preferred qualifications:
- MBA or Master's degree in a technical field.
- 8 years of experience managing cross-functional or cross-team projects.
- Experience with Machine Learning infrastructure, accelerators (TPUs/GPUs), or managing AI/ML workloads at scale.
- Experience in defining and implementing governance frameworks or policies for technical resources.
- Experience presenting to executive-level audiences with, excellent communication skills.
- Ability to navigate ambiguity, influence without direct authority, and drive consensus across various technical and non-technical teams.
About the job
A problem isn’t truly solved until it’s solved for all. That’s why Googlers build products that help create opportunities for everyone, whether down the street or across the globe. As a Technical Program Manager at Google, you’ll use your technical expertise to lead complex, multi-disciplinary projects from start to finish. You’ll work with stakeholders to plan requirements, identify risks, manage project schedules, and communicate clearly with cross-functional partners across the company. You're equally comfortable explaining your team's analyses and recommendations to executives as you are discussing the technical tradeoffs in product development with engineers.
In this role, you will help in driving the governance, operations, and optimization of Alphabet's Machine Learning infrastructure capacity. As ML investments continue to rapidly scale, you will be instrumental in ensuring the efficient allocation, utilization, and agile redistribution of scarce ML resources (accelerators and auxiliary infrastructure) across all product areas (PAs). You will thrive in a changing environment, possessing a technical background in infrastructure, excellent program management skills, and the ability to influence cross-functional stakeholders at all levels. You will contribute to the foundational infrastructure supporting Google's most critical AI/ML advancements.
Responsibilities
- Lead cross-functional programs related to ML Fleet capacity management, including the design, update, and maintenance of ML Fleet's cluster-level allocation plan of record.
- Drive the development, implementation, and ongoing maintenance of fleet-wide accelerator and auxiliary resource usage metrics, policies, and governance frameworks.
- Identify gaps and drive initiatives to improve existing tooling and processes, enhancing the efficiency, agility, and responsiveness of ML capacity allocation and management.
- Partner closely with key stakeholders including ML strategy and allocation, product area resource management teams, capital engineering, supply teams, tooling engineering, and system infrastructure site reliability engineers (SREs).
- Manage communications and escalations related to ML resource allocation, performance, and shifts for product areas and other partners.
Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.
Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.
If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.
Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.
To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.