
Lead Site Reliability Engineer [Multiple Positions Available]
at J.P. Morgan
Posted 8 hours ago
Compensation: Not specified (USD)
City: Not specified
Country: United States
Lead a team of Site Reliability Engineers focused on reliability, observability, production support, and application maintenance across multiple applications. Troubleshoot, maintain, escalate, and resolve complex application issues; enable telemetry and alerts for proactive monitoring; develop tools and accelerators to reduce toil. Design and implement observability and reliability architectures; apply site reliability principles daily and drive adoption across projects while managing risk and lifecycle practices. Collaborate with cross-functional teams and ensure production changes align with best practices. The position is full-time and based in Plano, TX; it requires a Bachelor's degree and 5+ years of experience.
Location: Plano, TX, United States
DESCRIPTION:
Duties: Identify, troubleshoot, escalate, and resolve application issues, and maintain the affected applications. Enable telemetry and alerts for complex enterprise applications to support proactive monitoring. Develop tools and accelerators to reduce toil and improve processes. Ensure that production changes are made in light of best practices, lifecycle methodology, and overall risk. Partner with multiple teams on application performance or functional issues, troubleshooting, infrastructure service support, and change management. Collaborate with others to create and implement observability and reliability designs for complex systems that are robust, stable, and do not incur additional toil or technical debt. Apply site reliability principles and practices every day and drive adoption of site reliability across multiple applications. Lead a team of Site Reliability Engineers (SREs), overseeing the product portfolio focused on reliability, observability, production support, and application maintenance.
QUALIFICATIONS:
Minimum education and experience required: Bachelor's degree in Information Technology, Computer Science, or related field of study plus 5 years of experience in the job offered or as Lead Site Reliability Engineer, Infrastructure Engineer, IT Project Manager, Test Lead, IT Consultant, or related occupation.
Skills Required: This position requires experience with the following: troubleshooting application issues in distributed IT infrastructure; implementing end-to-end monitoring using AppDynamics and Dynatrace; implementing automated solutions using Python scripts; analyzing non-functional requirements to identify target response times as Service Level Objectives (SLOs); log analysis and visualization using Splunk; performance testing using Performance Center, JMeter, and BlazeMeter; analyzing and reporting performance testing results; defect tracking and analysis using Quality Center; Amazon Web Services (AWS) platform.
Job Location: 8181 Communications Pkwy, Plano, TX 75024.
Full-Time.

