Freelance AI Trainer & LLM Evaluation Specialist April 2016 – Present
Partnering with Global AI Firms (TELUS International AI, Appen, Outlier, Alignerr )
- April 2016 – Present Evaluate, critique, and rank AI-generated textual responses for reasoning capability, factual precision, coherence, context-adherence, and instruction completeness.
- Perform complex prompt evaluation and hallucination detection to align LLM behaviors with safety and operational standards.
- Execute advanced automation workflows using Python and Selenium for technical processes, including optical character recognition (OCR) and automated data extraction from complex PDFs.
- Conduct multilingual transcription QA, linguistic auditing, and voice AI analysis utilizing strict "Sonar" guidelines to filter audio static, false wake triggers, and intent discrepancies.
- Oversee high-precision image and video annotation tasks, including bounding boxes and semantic segmentation, to improve ground truth datasets for computer vision models.
- Consistently achieve top-tier metrics and quality audits across multi-platform distributed projects by carefully tracking technical workflow instructions