Cloud Machine Learning LLM Serving Staff Engineer
at Qualcomm
Posted 5 hours ago
- Compensation: Not specified (currency: USD)
- City: Not specified
- Country: Not specified
This role focuses on deploying and maintaining cloud-based machine learning models and large language models (LLMs) at scale, with low latency and high availability. You will design and implement serving infrastructure, monitoring, and reliability improvements for ML workloads, and collaborate with ML researchers, platform engineers, and product teams to deliver robust end-to-end ML services. This is a senior/lead-level individual contributor role with ownership of critical ML serving systems.

