LOG IN
SIGN UP
Tech Job Finder - Find Software, Technology Sales and Product Manager Jobs.
Sign In
OR continue with e-mail and password
E-mail address
Password
Don't have an account?
Reset password
Join Tech Job Finder
OR continue with e-mail and password
E-mail address
First name
Last name
Username
Password
Confirm Password
How did you hear about us?
By signing up, you agree to our Terms & Conditions and Privacy Policy.

Site Reliability Engineer

at Microsoft

Back to all Data Engineering jobs
Microsoft logo
Industry not specified

Site Reliability Engineer

at Microsoft

JuniorNo visa sponsorshipData Engineering

Posted 11 hours ago

No clicks

Compensation
$100,600 – $199,000 USD

Currency: $ (USD)

City
San Francisco, New York City
Country
United States

Join the Azure Data Fabric platform team as a Site Reliability Engineer to ensure reliability, scalability, and performance of high-throughput, multi-tenant data services. You will bridge development and IT operations, automate processes, manage incidents, and participate in on-call rotations, with a focus on operational excellence through metrics and policy controls. You will design and implement solutions in collaboration with Product Management and partner teams, and contribute to postmortems and continuous improvement. Strong scripting (PowerShell, Python) and automation experience are valued, with opportunities for growth within a data-first, AI-enabled cloud platform.

Overview

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.

Microsoft’s Azure Data engineering team is looking to hire a Site Realiability Engineer. The team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.

​​Within Azure Data, the Microsoft Fabric platform team builds and maintains the operating system and provides customers a unified data stack to run an entire data estate. The platform provides a unified experience, unified governance, enables a unified business model and a unified architecture. ​

​​This team (SRE) ensures the reliability, scalability, and performance of systems and services. By integrating software engineering with IT operations, the team automates processes, manages incidents, and enhances system resilience. Acting as a bridge between development and operations, SREs help organizations maintain highly reliable and efficient systems while enabling fast and seamless software delivery.

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.



Responsibilities
  • Work with all aspects of a high throughput and multi-tenant service
  • Collaborate effectively within the team and with partner teams across Microsoft.
  • Be part of the on-call rotation for maintaining service health.
  • Design, implement, and refine chosen solutions in close partnership with Product Management and partner teams.
  • Champion operational excellence via established metrics, process governance, and policy controls for regular assessment and improvement.
  • Document and define existing data engineering processes, data and technology, while evaluating them for optimization.

    Core responsibilities breakdown includes:

    • System Reliability & Uptime – Ensuring high availability of services.

    • Incident Management – Detecting, responding to, and mitigating system failures.

    • Performance Monitoring – Tracking system health and resolving bottlenecks.

    • Automation & Tooling – Reducing manual work through scripts and automation.

    • Capacity Planning – Scaling infrastructure efficiently to handle demand.

    • Postmortems & Continuous Improvement – Analyzing failures to prevent recurrence.

    • Embody our culture and values



Qualifications

Required Qualifications:

  • Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
    • OR equivalent experience

Other Requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:
    • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • 5+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration.

  • 4+ years technical experience in software engineering, network engineering, or systems administration OR bachelor's degree in computer science, Information Technology, or related field AND 2+ year(s) technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field.

  • 2+ years’ experience with scripting languages such as PowerShell, Python etc.

  • Experience writing code to automate day-to-day tasks.

#azdat #azuredata #sre #fabric #powerbi​



Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay


This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.



Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Site Reliability Engineer

at Microsoft

Back to all Data Engineering jobs
Microsoft logo
Industry not specified

Site Reliability Engineer

at Microsoft

JuniorNo visa sponsorshipData Engineering

Posted 11 hours ago

No clicks

Compensation
$100,600 – $199,000 USD

Currency: $ (USD)

City
San Francisco, New York City
Country
United States

Join the Azure Data Fabric platform team as a Site Reliability Engineer to ensure reliability, scalability, and performance of high-throughput, multi-tenant data services. You will bridge development and IT operations, automate processes, manage incidents, and participate in on-call rotations, with a focus on operational excellence through metrics and policy controls. You will design and implement solutions in collaboration with Product Management and partner teams, and contribute to postmortems and continuous improvement. Strong scripting (PowerShell, Python) and automation experience are valued, with opportunities for growth within a data-first, AI-enabled cloud platform.

Overview

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.

Microsoft’s Azure Data engineering team is looking to hire a Site Realiability Engineer. The team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.

​​Within Azure Data, the Microsoft Fabric platform team builds and maintains the operating system and provides customers a unified data stack to run an entire data estate. The platform provides a unified experience, unified governance, enables a unified business model and a unified architecture. ​

​​This team (SRE) ensures the reliability, scalability, and performance of systems and services. By integrating software engineering with IT operations, the team automates processes, manages incidents, and enhances system resilience. Acting as a bridge between development and operations, SREs help organizations maintain highly reliable and efficient systems while enabling fast and seamless software delivery.

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.



Responsibilities
  • Work with all aspects of a high throughput and multi-tenant service
  • Collaborate effectively within the team and with partner teams across Microsoft.
  • Be part of the on-call rotation for maintaining service health.
  • Design, implement, and refine chosen solutions in close partnership with Product Management and partner teams.
  • Champion operational excellence via established metrics, process governance, and policy controls for regular assessment and improvement.
  • Document and define existing data engineering processes, data and technology, while evaluating them for optimization.

    Core responsibilities breakdown includes:

    • System Reliability & Uptime – Ensuring high availability of services.

    • Incident Management – Detecting, responding to, and mitigating system failures.

    • Performance Monitoring – Tracking system health and resolving bottlenecks.

    • Automation & Tooling – Reducing manual work through scripts and automation.

    • Capacity Planning – Scaling infrastructure efficiently to handle demand.

    • Postmortems & Continuous Improvement – Analyzing failures to prevent recurrence.

    • Embody our culture and values



Qualifications

Required Qualifications:

  • Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
    • OR equivalent experience

Other Requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:
    • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • 5+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration.

  • 4+ years technical experience in software engineering, network engineering, or systems administration OR bachelor's degree in computer science, Information Technology, or related field AND 2+ year(s) technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field.

  • 2+ years’ experience with scripting languages such as PowerShell, Python etc.

  • Experience writing code to automate day-to-day tasks.

#azdat #azuredata #sre #fabric #powerbi​



Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay


This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.



Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

SIMILAR OPPORTUNITIES

No similar jobs available at the moment.