The AI Evaluation team is the main line of defense in ensuring customer safety. Working alongside our AI team, you will design metrics that utilize fleet data and run on large inference clusters to help drive key decisions about end-to-end model architecture, data integrity, and exported model performance. Your work will directly impact FSD v12 customers. A strong candidate will have a computer science, ML or robotics engineering background, strong attention to detail, self-driven attitude, and passion to make an impact.