Careers
Careers

job details

Back to jobs search

Jobs search results

2,766 jobs matched
Back to jobs search

Staff Software Engineer, Cloud ML Compute Services

GoogleTaipei, Taiwan

Minimum qualifications:

  • Bachelor’s degree in Computer Science or equivalent practical experience.
  • 8 years of experience in software development, and with full stack development, across back-end such as Python, Java, C++, or GO codebases.
  • 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture.
  • 5 years of experience leading ML design and optimizing ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
  • 5 years of experience with one or more of the following: reinforcement learning, ML infrastructure, or specialization in another ML field.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical field.
  • Experience with Block Storage or cloud storage systems.
  • Experience with Generative AI, Large Language Models (LLM), or Machine Learning infrastructure, including model deployment, performance optimization, profiling, and debugging.
  • Experience with distributed computing leveraging GPUs or TPUs.
  • Ability to collaborate with cross-functional and cross-regional teams.
  • Ability to grow in a fluid environment.

About the job

As a software engineer in Cloud ML Compute Services, you will focus on delivering growth in the AI infrastructure space. The team manages the challenges by optimizing ML workload performance at every layer across the technical stack from networking and data storage to ML models, designing custom ML solutions from prototype to production, and providing technical guidance to top customers throughout previews, proof-of-concepts, onboarding, and production phases.

You will advance AI infrastructure, support the cross-team collaboration for customer success, and are passionate about improving the performance of AI technologies. This role offers opportunities for both contributions and growth.Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Measure and enhance performance on Google Cloud across the technical stack, including storage, networking, and model throughput.
  • Conduct performance profiling, debugging, and troubleshooting of AI/ML training and inference workloads.
  • Partner with cross-functional, cross-regional teams to ensure our AI/ML infrastructure delivers exceptional value and drives success for our customers.
  • Identify and resolve performance bottlenecks, ensuring our infrastructure operates at the capacity.
  • Support the future of our AI/ML infrastructure by identifying gaps in the existing products and recommending enhancements.Stay informed of the Artificial Intelligence and Machine Learning technologies and contribute learned expertise to foster collective team growth.

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.

Google apps
Main menu