Senior Backend Engineer - Service Monitoring & Observability
at Binance
Posted 6 hours ago
No clicks
- Compensation
- Not specified
- City
- Not specified
- Country
- Not specified
Currency: Not specified
Binance is seeking a Senior Backend Engineer focusing on service monitoring and observability. You will lead or participate in the design and development of monitoring modules, and own deployment, tuning, and stability of observability toolchains. You will build and integrate custom metrics and dashboards using Prometheus, Grafana, and Spring Boot Actuator, and analyze large-scale monitoring data to identify bottlenecks. Collaborate with backend, SRE, and platform teams to enhance reliability and real-time monitoring coverage across services, and contribute to internal tooling and best-practice guidelines to elevate observability standards.
Responsibilities
- Lead or participate in the design and development of service monitoring modules, including log management, performance metrics collection, and automated anomaly alerting systems.
- Build and optimize observability solutions using Prometheus, Grafana, Spring Boot Actuator, and related monitoring toolchains; own deployment, tuning, and stability improvement.
- Develop customized monitoring metrics (e.g., JVM memory usage, thread pool health, API response latency) and integrate them into Prometheus/Grafana for visualization and alert rule configuration.
- Analyze large-scale monitoring data to identify performance bottlenecks such as database slow queries, latency spikes, or resource contention, and drive end-to-end optimization.
- Collaborate with backend, SRE, and platform engineering teams to enhance system reliability, scalability, and real-time monitoring coverage across services.
- Contribute to internal tooling, automation frameworks, and best-practice guidelines to elevate observability standards across the engineering organization.
Requirements
- Solid backend engineering experience with strong proficiency in Java (Spring Boot / microservices). multithreading, and JVM performance tuning.
- Strong proficiency in Java, with solid understanding of microservices architecture, multithreading, and JVM performance tuning.
- Experience with mainstream monitoring ecosystems such as Prometheus, Grafana, Spring Boot Actuator, and hands-on deployment and configuration in production environments.
- Practical experience building custom metrics, dashboards, alerting rules, and troubleshooting end-to-end system performance issues.
- Familiarity with database and middleware performance diagnostics (e.g., slow SQL, Redis/Kafka latency, connection pool tuning).
- Good understanding of system-level performance concepts—CPU, memory, I/O, GC, thread pools, network stack, etc.
- Experience with Go or Rust is a strong plus.
- Strong analytical mindset, ownership, and ability to work in a fast-paced, highly distributed environment.

