LOG IN
SIGN UP
Tech Job Finder - Find Software, Technology Sales and Product Manager Jobs.
Sign In
OR continue with e-mail and password
E-mail address
Password
Don't have an account?
Reset password
Join Tech Job Finder
OR continue with e-mail and password
E-mail address
First name
Last name
Username
Password
Confirm Password
How did you hear about us?
By signing up, you agree to our Terms & Conditions and Privacy Policy.

Site Reliability Engineer

at Cencora

Back to all Cloud & DevOps jobs
C
Industry not specified

Site Reliability Engineer

at Cencora

Mid LevelNo visa sponsorshipAWS/GCP/Azure DevOps

Posted 16 hours ago

No clicks

Compensation
Not specified

Currency: Not specified

City
Not specified
Country
Italy

We are seeking a production support Site Reliability Engineer (SRE) with strong automation skills to join our dynamic team. The role focuses on ensuring reliability, availability and performance of production systems while driving automation and operational excellence. Responsibilities include day-to-day production support, developing and maintaining automation scripts with Bash, Python and Ansible, monitoring performance and leading incident response. You will collaborate across development, QA and infrastructure teams, participate in on-call rotations, and contribute to CI/CD, IaC and documentation.

About the role:
We are seeking a production support Site Reliability Engineer (SRE) with strong automation skills to join our dynamic team. The ideal candidate will be responsible for ensuring the reliability, availability and performance of our production systems, while driving automation and operational excellence.

Key responsibilities:

  • Provide day-to-day operational support for production environments, ensuring high availability and reliability of critical services.
  • Develop, maintain and enhance automation scripts and tools using Bash, Python and Ansible to streamline operational tasks and incident response.
  • Monitor system performance, proactively identify issues and implement solutions to prevent service disruptions.
  • Collaborate with development, QA and infrastructure teams to implement best practices for deployment, monitoring and incident management.
  • Participate in on-call rotation and respond to production incidents, performing root cause analysis and driving resolution.
  • Maintain and improve configuration management, CI/CD pipelines and infrastructure as code practices.
  • Document operational processes, troubleshooting steps and automation workflows.

Required skills and experience:

  • Proven experience in a production support or SRE role within a complex, high-availability environment.
  • Strong automation skills with proficiency in Bash, Python and Ansible.
  • Experience with monitoring and alerting tools (e.g. Prometheus, Grafana, Elastic stack, Datadog).
  • Solid understanding of Linux/Unix systems administration and troubleshooting.
  • Familiarity with cloud platforms (e.g. AWS) and containerisation technologies (e.g. Docker, Kubernetes).
  • Experience with configuration management and infrastructure as code tools (e.g. Terraform, CloudFormation).
  • Knowledge of networking fundamentals, security best practices and incident management processes.
  • Excellent problem-solving skills, attention to detail and ability to work under pressure.
  • Strong communication and collaboration skills.

Desirable skills:

  • Experience with version control systems (e.g. Git).
  • Familiarity with Agile methodologies and DevOps culture.
  • Exposure to database administration and troubleshooting (e.g. MySQL, PostgreSQL, Oracle).
  • Scripting or automation experience with other languages (e.g. Go, Ruby).

Site Reliability Engineer

at Cencora

Back to all Cloud & DevOps jobs
C
Industry not specified

Site Reliability Engineer

at Cencora

Mid LevelNo visa sponsorshipAWS/GCP/Azure DevOps

Posted 16 hours ago

No clicks

Compensation
Not specified

Currency: Not specified

City
Not specified
Country
Italy

We are seeking a production support Site Reliability Engineer (SRE) with strong automation skills to join our dynamic team. The role focuses on ensuring reliability, availability and performance of production systems while driving automation and operational excellence. Responsibilities include day-to-day production support, developing and maintaining automation scripts with Bash, Python and Ansible, monitoring performance and leading incident response. You will collaborate across development, QA and infrastructure teams, participate in on-call rotations, and contribute to CI/CD, IaC and documentation.

About the role:
We are seeking a production support Site Reliability Engineer (SRE) with strong automation skills to join our dynamic team. The ideal candidate will be responsible for ensuring the reliability, availability and performance of our production systems, while driving automation and operational excellence.

Key responsibilities:

  • Provide day-to-day operational support for production environments, ensuring high availability and reliability of critical services.
  • Develop, maintain and enhance automation scripts and tools using Bash, Python and Ansible to streamline operational tasks and incident response.
  • Monitor system performance, proactively identify issues and implement solutions to prevent service disruptions.
  • Collaborate with development, QA and infrastructure teams to implement best practices for deployment, monitoring and incident management.
  • Participate in on-call rotation and respond to production incidents, performing root cause analysis and driving resolution.
  • Maintain and improve configuration management, CI/CD pipelines and infrastructure as code practices.
  • Document operational processes, troubleshooting steps and automation workflows.

Required skills and experience:

  • Proven experience in a production support or SRE role within a complex, high-availability environment.
  • Strong automation skills with proficiency in Bash, Python and Ansible.
  • Experience with monitoring and alerting tools (e.g. Prometheus, Grafana, Elastic stack, Datadog).
  • Solid understanding of Linux/Unix systems administration and troubleshooting.
  • Familiarity with cloud platforms (e.g. AWS) and containerisation technologies (e.g. Docker, Kubernetes).
  • Experience with configuration management and infrastructure as code tools (e.g. Terraform, CloudFormation).
  • Knowledge of networking fundamentals, security best practices and incident management processes.
  • Excellent problem-solving skills, attention to detail and ability to work under pressure.
  • Strong communication and collaboration skills.

Desirable skills:

  • Experience with version control systems (e.g. Git).
  • Familiarity with Agile methodologies and DevOps culture.
  • Exposure to database administration and troubleshooting (e.g. MySQL, PostgreSQL, Oracle).
  • Scripting or automation experience with other languages (e.g. Go, Ruby).

SIMILAR OPPORTUNITIES

No similar jobs available at the moment.