Site Reliability Engineer
at Cencora
Posted 16 hours ago
No clicks
- Compensation
- Not specified
- City
- Not specified
- Country
- Italy
Currency: Not specified
We are seeking a production support Site Reliability Engineer (SRE) with strong automation skills to join our dynamic team. The role focuses on ensuring reliability, availability and performance of production systems while driving automation and operational excellence. Responsibilities include day-to-day production support, developing and maintaining automation scripts with Bash, Python and Ansible, monitoring performance and leading incident response. You will collaborate across development, QA and infrastructure teams, participate in on-call rotations, and contribute to CI/CD, IaC and documentation.
About the role:
We are seeking a production support Site Reliability Engineer (SRE) with strong automation skills to join our dynamic team. The ideal candidate will be responsible for ensuring the reliability, availability and performance of our production systems, while driving automation and operational excellence.
Key responsibilities:
- Provide day-to-day operational support for production environments, ensuring high availability and reliability of critical services.
- Develop, maintain and enhance automation scripts and tools using Bash, Python and Ansible to streamline operational tasks and incident response.
- Monitor system performance, proactively identify issues and implement solutions to prevent service disruptions.
- Collaborate with development, QA and infrastructure teams to implement best practices for deployment, monitoring and incident management.
- Participate in on-call rotation and respond to production incidents, performing root cause analysis and driving resolution.
- Maintain and improve configuration management, CI/CD pipelines and infrastructure as code practices.
- Document operational processes, troubleshooting steps and automation workflows.
Required skills and experience:
- Proven experience in a production support or SRE role within a complex, high-availability environment.
- Strong automation skills with proficiency in Bash, Python and Ansible.
- Experience with monitoring and alerting tools (e.g. Prometheus, Grafana, Elastic stack, Datadog).
- Solid understanding of Linux/Unix systems administration and troubleshooting.
- Familiarity with cloud platforms (e.g. AWS) and containerisation technologies (e.g. Docker, Kubernetes).
- Experience with configuration management and infrastructure as code tools (e.g. Terraform, CloudFormation).
- Knowledge of networking fundamentals, security best practices and incident management processes.
- Excellent problem-solving skills, attention to detail and ability to work under pressure.
- Strong communication and collaboration skills.
Desirable skills:
- Experience with version control systems (e.g. Git).
- Familiarity with Agile methodologies and DevOps culture.
- Exposure to database administration and troubleshooting (e.g. MySQL, PostgreSQL, Oracle).
- Scripting or automation experience with other languages (e.g. Go, Ruby).

