AI Benchmarking Specialist - Chinese, International Seller Growth
at Amazon
Posted 10 hours ago
As an AI Benchmarking Associate on the Seller AI team within International Seller Services, you will help evaluate AI/LLM systems by designing and executing benchmarking and audit activities to measure quality, compliance, robustness, and fairness for Amazon sellers worldwide. You will assist in planning benchmarking exercises, define test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability, and review datasets and model outputs for privacy and regulatory risk. You will prepare clear audit and benchmarking reports with root-cause analysis and recommendations, maintain organized documentation and benchmarking datasets, and contribute to senior stakeholder presentations. You will also help improve AI audit methodologies, checklists, and test frameworks and explore automation opportunities as practices evolve.
Key job responsibilities
As part of your role, you will have the opportunity to:
• Assist in planning and executing benchmarking exercises for AI models, including defining test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability
• Support content accuracy, relevancy, and privacy checks by reviewing datasets, model outputs, and data handling practices, escalating potential regulatory risks
• Validate data based on specific annotation guidelines, ensuring the accuracy and quality of the collected information
• Prepare clear audit and benchmarking reports, including error ratings, root-cause analysis, and recommendations, and contribute to presentations for senior stakeholders
• Maintain organized audit documentation, evidence, and benchmarking datasets to support internal review
• Work closely with your team members and managers to drive process efficiencies and explore opportunities for automation
• Enhance the productivity and effectiveness of data generation by contributing to the development and continuous improvement of AI audit methodologies, checklists, and test frameworks as regulations and best practices evolve
About the team
Millions of small and medium businesses across international stores such as India, LatAm, Europe, the Middle East, and Japan sign up as sellers on Amazon. Our primary focus is handling annotations for training, measuring, and improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior experience to sellers worldwide.
Basic Qualifications
- Chinese citizen with native Mandarin proficiency
Preferred Qualifications
- Bachelor's degree or above
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

