Tech Job Finder - Find Software, Technology Sales and Product Manager Jobs.

Software Engineer, Data Center Infrastructure Management Lifecycle

at Alphabet

Industry not specified

Junior | No visa sponsorship | C/C++/C#

Posted 4 hours ago

Compensation
$141,000 – $202,000 USD

Currency: $ (USD)

City
Not specified
Country
United States

Join Google's Data Center Infrastructure Management (DCIM) Lifecycle team to design, develop, and deploy large-scale distributed monitoring systems for tape health in Google's data centers. You will work on telemetry collection from tape libraries, drives, and robotics, implement fault-detection algorithms, and build dashboards and monitoring tools. The role involves collaborating with hardware engineers, contributing across the full software lifecycle, and occasional on-site data center visits. Base US salary ranges from $141,000 to $202,000 plus bonus, equity, and benefits.

Google | Sunnyvale, CA, USA
Mid

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with coding in C++.
  • 1 year of experience with distributed computing.
  • 1 year of experience with debugging, troubleshooting and monitoring systems.

Preferred qualifications:

  • Master's degree or PhD in Computer Science, or a related technical field.
  • 2 years of experience in unit testing, integration testing, and continuous deployment.
  • 2 years of experience in SQL.

About the job

The Data Center Infrastructure Management (DCIM) Lifecycle team operates one of the largest-scale monitoring systems at Google, reading telemetry from millions of devices in every Google data center. Our challenges include managing the rapid growth and diversification of the Google fleet and hardware, supporting new use cases for critical monitoring of third-party facilities, and retiring technical debt. Google is bringing tape libraries back to its data centers to support several critical requirements, including a new cold storage tier, better total cost of ownership (TCO), and contingency for HDD/SSD shortages caused by unprecedented AI/ML capacity demand. In this role, you will design and deliver Tape Health at Google scale, with a focus on reliability.

In this role, you will work with your teammates to design, code, and put into production very large-scale distributed monitoring systems, and work with your team and partner teams to enable new use cases for large-scale telemetry gathering. You will also create system monitoring dashboards, define service level objectives (SLOs), and write documentation and playbooks. You will have the opportunity to take onsite trips to one or more of Google's data centers each year to work with new systems and data center technical staff in person.

The US base salary range for this full-time position is $141,000-$202,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Responsibilities

  • Design, develop, and maintain software services for collecting and analyzing telemetry data from tape libraries, drives, and robotic components.
  • Implement algorithms and rules to detect, diagnose, and predict hardware failures.
  • Integrate tape health systems with Google's data center health monitoring infrastructure (e.g., system health, network doctor) and automated repair workflows (e.g., surgeon, silk roads).
  • Collaborate with hardware engineers and vendors to understand failure modes and improve diagnostic capabilities.
  • Develop dashboards and tools to provide visibility into the health and status of the tape hardware fleet.
  • Participate in the full software development lifecycle, including requirements gathering, design, coding, testing, deployment, and operation.

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit, is subject to Google's Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.
