DevOps Engineer (AWS/Tencent/Ali Cloud)
at Binance
Posted 8 hours ago
- Compensation: Not specified
- City: Not specified
- Country: Not specified
- Currency: Not specified
Binance is hiring a DevOps Engineer to design, deploy, monitor, and optimize cloud infrastructure across AWS and Tencent/Ali Cloud. You will handle production incidents, manage Kafka or Redis clusters in production, and collaborate with development teams for seamless deployments. The role focuses on performance, cost, and reliability of cloud infrastructure and includes building DevOps platforms and exploring AI-driven operational insights. This is a 100% remote role.
Responsibilities
- Handle production incidents and conduct post-mortem analysis to improve system stability
- Design, deploy, monitor, and troubleshoot Kafka or Redis clusters in production, ensuring optimal performance and reliability
- Work closely with development teams to ensure seamless deployment of applications or systems
- Manage and optimize cloud infrastructure for performance, cost, and reliability
- Develop DevOps platforms such as online load testing and change management systems
- Continuously explore and integrate AI-driven insights into operational processes to improve reliability, reduce noise, and empower engineering teams with intelligent decision-making
Requirements
- 2-8 years of hands-on experience operating Kafka or Redis in large-scale production environments, with the ability to work with developers to optimize code
- Proficient with at least one public cloud: AWS, Ali Cloud, or Tencent Cloud
- Proficient in at least one of Python, Go, or Java, and in SQL
- Hands-on experience with containerization and orchestration (Kubernetes)
- Strong experience with CI/CD tools such as GitHub Actions, Ansible, and Terraform
- Proficient in both English and Chinese for efficient cross-team collaboration
Bonus
- Experience leveraging LLMs or AI frameworks (OpenAI, Dify, Agno, LangChain) to enhance automation in DevOps infrastructure operations, including intelligent alert triage, root cause analysis (RCA), and chat-based operations (ChatOps)
- Practical experience building or operating AIOps systems (anomaly detection, alert correlation, automated healing, or RCA)
- Familiarity with LLM-based DevOps automation (e.g., building chat-based ops assistants or AI-driven observability workflows)

