Senior Systems Operations Engineer

at Wells Fargo

Mid Level | No visa sponsorship | AWS/GCP/Azure DevOps

Posted 17 hours ago


Compensation: Not specified (USD)

City: Not specified
Country: United States

Wells Fargo is seeking a Senior Systems Operations Engineer to transform traditional operations into a modern SRE model, building reliability by design, defining and improving SLIs/SLOs, and reducing toil through automation. The role involves leading or contributing to the management of installed systems and infrastructure, driving observability, incident and problem management, and mentoring Ops/Dev teams to adopt SRE practices at scale. You will collaborate with vendors and technical staff to resolve issues, lead high-severity incidents, and develop self-service reliability tooling and runbooks to improve system availability and resilience.

About this role:

Wells Fargo is seeking a Senior Systems Operations Engineer.

Transform traditional operations into a modern SRE model: build reliability in by design, improve SLIs/SLOs, automate toil, define and enable critical monitoring, templatize observability for business-critical applications and define their critical user journeys, and mature incident and problem management. You'll be hands-on while also mentoring Ops/Dev teams to adopt SRE practices at scale.


In this role, you will:

  • Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area

  • Contribute to increasing system efficiencies and reducing human intervention time on related tasks

  • Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability

  • Work with vendors and other technical personnel for problem resolution

  • Lead the team in meeting technical deliverables while leveraging a solid understanding of technical process controls or standards

  • Collaborate with vendors and other technical personnel to resolve technical issues and achieve the highest levels of systems and infrastructure availability


Required Qualifications:

  • 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education

Desired Qualifications:

  • Strong experience in large-scale distributed systems; 5+ years hands-on SRE/DevOps/Platform Engineering.
  • Cloud: One or more of AWS / Azure / GCP (certifications a plus).
  • IaC & Automation: Terraform, Ansible/Chef; solid Git practices (GitOps 
  • Observability: Prometheus, Grafana, OpenTelemetry, ThousandEyes, AppDynamics, Aternity.
  • CI/CD: Azure DevOps, GitHub Actions, Jenkins, or GitLab CI; artifact management and environment promotions.
  • Programming: One of Python/Go/Java (scripting + API integrations).
  • Reliability Practices: SLIs/SLOs, error budgets, capacity planning, canary/blue-green, chaos/DR testing.
  • Processes: Incident/Problem/Change, blameless postmortems, runbook design, on-call good practices. Strong documentation and communication skills.

Job Expectations:

  • Define and implement SLIs/SLOs and error budgets for critical services; drive SLO adoption across teams.
  • Build and tune observability (metrics/logs/traces) with golden signals (latency, traffic, errors, saturation).
  • Partner with Performance Engineering to run load/stress/soak tests and remove performance bottlenecks.
  • Platform & Automation: Eliminate toil; generate AI-based observability assessments and maturity scorecards for all applications.
  • Create self-service reliability tooling (runbooks, bots, reliability checks, golden paths).
  • Incident, Problem & Change: Lead high-severity incidents (Major/SEV1), facilitate blameless postmortems, and track corrective actions.
  • Culture & Enablement: Coach product and ops teams on SRE principles; define maturity models and track adoption.
  • Build documentation (runbooks, dashboards, readiness checklists, and reliability reviews) and keep it always current.

    Posting End Date: 4 Mar 2026

    *Job posting may come down early due to volume of applicants.

    We Value Equal Opportunity

    Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

    Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.

    Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.

    Applicants with Disabilities

    To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.

    Drug and Alcohol Policy

    Wells Fargo maintains a drug-free workplace. Please see our Drug and Alcohol Policy to learn more.

    Wells Fargo Recruitment and Hiring Requirements:

    a. Third-Party recordings are prohibited unless authorized by Wells Fargo.

    b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.

