job details

Back to jobs search

Jobs search results

3,068 jobs matched
Back to jobs search

Principal Engineer, Borg Control Plane

GoogleSunnyvale, CA, USADirector+

Minimum qualifications:

  • Bachelor's degree in Computer Science, a similar technical field, or equivalent practical experience.
  • 15 years of experience as a software engineer or 13 years of experience with an advanced degree
  • Experience working with large-scale distributed systems.
  • Experience with ML infrastructure.

Preferred qualifications:

  • 20 years of experience in software development or similar field.
  • Experience with container orchestration systems.
  • Experience working with advanced ML developers.
  • Ability to work cross-functionally, partnering with groups such as Sales, Engineering, Product Management, Product Marketing, UX and UI, brokering trade offs with stakeholders and understanding their needs.

About the job

Google’s Core Machine Learning (ML) team aims to build AI platforms, services, and tools behind Google’s AI research and AI-empowered products. We are owners of the Google AI framework (e.g., TensorFlow, JAX), AI performance and efficiency, AI training and inference platforms, AI compilers (e.g. XLA), AI software/hardware co-design (e.g. TPU), and AI developer experience (e.g., Model Hub, Eval Hub). Further, we are the driver to launch LMs (e.g. large models such as Gemini) from research to production and enable all Google products to adopt LMs to improve end user experiences. The Core ML team is responsible for planning and optimizing ML capacity for all Google products and driving performance and efficiency work to ensure we maximize return on investment for all our AI investments. We look across Google’s AI products and AI research to build central solutions, break down technical barriers, and strengthen existing systems.

As a Principal Engineer leading the Machine Learning strategy for the Borg Control Plane team, you will shape the capabilities and infrastructure necessary to advance Google's ML roadmap. Collaborating with platform, storage, data center, networking, and resource management teams, you will drive new capabilities and support the growth and efficient usage of Google's fleet. You will partner with leads from Google product areas, such as Deepmind, Search, Ads, and YouTube, to accelerate the transition of research innovations to production, with focus on developer experience and acceleration of experimentation and productionization time. You will also contribute to delivering GPUs and Google’s advanced internal technology, TPUs, to external customers via Google’s Cloud Compute Platform.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

The US base salary range for this full-time position is $294,000-$414,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Responsibilities

  • Collaborate and engage with ML practitioners, partner teams in Google's ML infrastructure stack, and leads across product areas at Google, to gather requirements and identify opportunities for efficiency initiatives.
  • Define the strategy and roadmap anticipating the needs in support of Google’s ML mission, with focus on a unified infrastructure and well-lit customer user journeys.
  • Lead complex programs with leads across organizations.
  • Lead in-depth designs and delivery of capabilities in the Borg Control Plane infrastructure, working with Engineers in the team.
  • Ensure production excellence, helping own and evolve the Borg Control Plane stack.

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.

Google apps
Main menu