
Software Engineer III - Python, Athena Platform Engineer
at J.P. Morgan
Posted 14 days ago
No clicks
- Compensation
- Not specified
- City
- Mumbai
- Country
- India
Currency: Not specified
Platform-focused Software Engineer responsible for designing, evolving, and operating the Python-based Athena platform within Asset & Wealth Management. The role emphasizes platform and cloud engineering, SRE practices, infrastructure-as-code, automation, observability, and reliability for hybrid on‑prem and public cloud environments. Key responsibilities include managing Python runtime environments, building GitOps/IaC pipelines, performance and capacity optimization, incident response, and defining SLIs/SLOs. Position is based in Mumbai and supports large-scale analytics and compute workloads at JPMorgan Chase.
Location: Mumbai, Maharashtra, India
We have an exciting and rewarding opportunity for you to take your software engineering career to the next level.
As a Software Engineer III at JPMorgan Chase within the Asset & Wealth Management, you serve as a seasoned member of an agile team to design and deliver trusted market-leading technology products in a secure, stable, and scalable way. You are responsible for carrying out critical technology solutions across multiple technical areas within various business functions in support of the firm’s business objectives.
Join the Athena Platform team at JPMorgan Chase to design, evolve, and operate the core infrastructure that powers large‑scale analytics and execution across the firm. This role emphasizes platform and cloud engineering—compute, storage, networking, security, observability, and automation—prioritizing reliability, performance, and secure operations over direct business feature delivery. You will apply SRE practices to ensure resilience and safe change, while focusing primarily on building and running the Python‑based Athena platform at scale. You’ll also have the opportunity to work with a fast‑growing derivatives business in Asset Management, learn, and contribute to the growth of this product and asset class.
Job responsibilities
- Architects, implement, and operate core Athena platform services and Python runtime environments across hybrid on‑prem and public cloud.
- Leads end‑to‑end platform changes (OS/kernel upgrades, Python versioning, dependency governance, grid/compute backbone enhancements) with safe rollout, canary, and rollback strategies.
- Builds and maintain infrastructure as code and GitOps pipelines for repeatable, auditable changes (e.g., Terraform, Ansible, Helm), supported by Python automation and tooling.
- Optimizes capacity, performance, and cost across compute and storage tiers; contribute to DR, backup, and multi‑zone/region resilience.
- Defines SLIs/SLOs and error budgets for platform services; instrument telemetry and create actionable dashboards and alerts aligned to reliability goals.
- Leads incident response and blameless postmortems; implement reliability improvements and reduce operational toil through automation.
- Establishes performance/load testing and capacity modeling; tune Linux, networking, and runtime layers for latency‑sensitive workloads.
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 3+ years applied experience
- Hands‑on experience with Athena/Quartz or similar large‑scale Python platforms.
- Experience in platform/cloud engineering or SRE supporting distributed systems at scale.
- Strong Python skills for automation, tooling, and systems programming in production.
- Experience with grid/compute backbone or container/orchestration platforms; understanding of scheduling, queues, resource isolation, and workload placement.
- Infrastructure as code (e.g., Terraform, Ansible) and CI/CD for platform changes with policy, approvals, and GitOps workflows.
- Observability expertise: instrumentation, telemetry design, metrics/logging/tracing, and alerting aligned to SLIs/SLOs.
- Networking and security for production systems: DNS, load balancing, TLS/mTLS, proxies, firewalls, identity and role‑based access control.
- Proven incident management and root‑cause analysis experience with demonstrable reliability improvements.
Preferred qualifications, capabilities, and skills
- Familiarity with JPMorgan’s Compute Backbone or analogous HPC/grid scheduling systems. Linux fundamentals: system internals, performance profiling, networking, filesystems, and container primitives.
- Exposure to major cloud platforms (e.g., AWS/Azure): private networking, IAM, security groups, KMS, and secret stores.
- FinOps/capacity practices to optimize platform cost and efficiency.
- Working knowledge of Java/C++ for cross‑stack debugging (primary language remains Python).




