
Site Reliability Engineer [Multiple Positions Available]
at J.P. Morgan
Posted a month ago
Compensation: Not specified
City: Dallas
Country: United States
Currency: Not specified
Seeking Site Reliability Engineers to configure and manage SLOs/SLAs for microservices and support production deployments. The role requires hands-on experience with AWS and GCP, CI/CD pipelines, Terraform/Ansible, Kubernetes, Kafka, and database systems (MySQL/Postgres/MongoDB). Candidates will perform Python coding, Linux administration, on-call rotation, debugging production incidents, and automation to improve reliability and operational metrics.
Location: Plano, TX, United States
DESCRIPTION:
Duties: Configure SLOs (service level objectives) and SLAs (service level agreements) for microservices. Work with AWS and CloudFormation tools. Work with CI/CD pipelines. Write code in Python. Perform Linux administration. Support applications running on Google Cloud Platform. Participate in a 24/7 on-call rotation. Actively debug production issues and work toward resolving them. Improve operational metrics using Kafka. Use BigQuery for data analysis. Deploy code to production. Perform automation using Terraform.
QUALIFICATIONS:
Minimum education and experience required: Master's degree in Information Systems Technologies-Information Assurance, Computer Science, Electrical Engineering or related field of study plus 3 years of experience in the job offered or as Site Reliability Engineer, Senior AWS DevOps Engineer, DevOps Engineer, Programmer Analyst, Configuration Management Developer, or related occupation.
Skills Required: This position requires experience with the following: Architecting, building and evolving site reliability capabilities using Ansible and Terraform; Improving reliability and stability of applications and platforms using Groovy, Ruby and Python; Developing and automating large-scale, high-performance data processing systems using Kafka; Implementing and maintaining database systems including MySQL, PostgreSQL and MongoDB that store and retrieve data for applications; Streamlining and accelerating the software development lifecycle using CI/CD (continuous integration and continuous delivery) pipelines; Deploying production code in Amazon Web Services (AWS) and Google Cloud Platform (GCP); Designing, deploying and managing Kubernetes clusters to ensure high availability and disaster recovery; Working in and supporting Linux environments including CentOS and RHEL.
Job Location: 8181 Communications Pkwy, Plano, TX 75024





