Collaborated with research teams to analyze model outputs and effectiveness as an educational tool, identifying headroom and edge cases to improve performance and identify trends. Analyzed and evaluated large datasets to refine Large Language Models’ educational outputs through Reinforcement Learning from Human Feedback for clients like OpenAI and Google. Collaborated effectively using internal tools, ranking in the top 5% of a 60+ member team for accuracy, productivity, and self-initiative
WIP