Data Scientist Lead - LLM (Chatbot)
at Binance
Posted 7 hours ago
No clicks
- Compensation
- Not specified
- City
- Not specified
- Country
- Not specified
Currency: Not specified
Own the full LLM pipeline from data preparation to production in customer service scheduling. Design and optimize prompts, build and maintain Retrieval-Augmented Generation (RAG) QA systems, and deploy multi-agent LLM architectures. Develop evaluation pipelines for prompt and model quality, bias, and hallucination rates; collaborate with product and CS teams to integrate AI models into conversational Chatbot scenarios; stay at the forefront of research and improve architecture.
Responsibilities:
- Own the full LLM pipeline from data preparation to production real case usage.
- Design, iterate and optimize prompts (zero-/few-shot, chain-of-thought, tool-calling, etc.) to maximize model utility and safety across products and languages.
- Build and maintain Retrieval-Augmented Generation (RAG) QA/search systems that connect to multi-source knowledge bases.
- Familiar with vLLM/SGLang inference architectures and have proven experience deploying and operating LLM services on multi‑GPU or cluster environments.
- Design, implement and operate multi‑agent LLM architectures (e.g. LangGraph, CrewAI, AutoGen) including task decomposition, agent orchestration, memory sharing and tool‑calling workflows.
- Develop evaluation pipelines (automatic metrics & human feedback) to measure prompt and model quality, bias, and hallucination rates.
- Collaborate with product and CS teams to integrate AI models into conversational Chatbot in different scenarios.
- Track cutting-edge research, author tech blogs, and keep improve current architecture.
Qualifications:
- Master’s degree or higher in Computer Science, Data Science or related field..
- 2+ years of deep-learning/NLP experience, including 1+ year practical LLM work (SFT, DPO, RAG, quantization, inference optimization, etc.).
- Demonstrated prompt engineering & tuning expertise (few-shot design, structured prompting, prefix-/p-tuning, reward re-ranking, safety filtering).
- Practical experience building and deploying multi‑agent LLM workflows, with understanding of agent‑orchestrator patterns, shared memory, long‑horizon planning and guard‑rail design.
- Clean coding practices, good English communication skills, and a passion for rapid learning.
- Excellent self-driven and ownership with good deliverables.
- Eager to learn, be curious about AI new technologies
- Good communication and collaboration skills

