Greater Philadelphia Area  ·  Open to Opportunities

Steve.
Rizzo.

AI Model Evaluator IT Professional Full Stack Developer
About

Stephen T. Rizzo Jr.

Stephen T. Rizzo Jr.

I'm an AI model evaluator and full-stack developer based in the Greater Philadelphia Area. I work as a prompt designer and output analyst for AI models, focusing on structured data labeling and evaluation of LLM responses for correctness, reasoning quality, and edge-case behavior.

My evaluation work spans domains including mathematics and physics for standardized testing, audio radio traffic for official agency-style communications, virtual receptionist workflows, and insurance claim generation. I support this with programming experience in Java, Python, and SQL databases, along with Linux system administration and version control.

Location

Greater Philadelphia Area, PA

Education

B.S. — West Chester University

Expertise

Core Skill Areas


Specialized in AI/ML evaluation, IT & software development, and customer-facing roles.

AI & Machine Learning

LLM output evaluation, prompt design, adversarial testing, structured data labeling, and model benchmarking across diverse domains including mathematics, physics, audio traffic analysis, and industry workflows.

IT & Development

Full-stack applications in Java, Python, PHP, HTML/CSS/JS, Flask, and FastAPI. Scalable database design with MySQL and PostgreSQL. Linux server administration, web hosting, and GitHub version control.

Customer Service

Team leadership in high-volume retail environments, front-desk customer relations, issue resolution, safety compliance, interdepartmental communication, and professional stakeholder management.

Java Python PHP HTML / CSS / JS macOS/Linux/Windows FastAPI MySQL PostgreSQL Linux GitHub LLM Evaluation Prompt Design Data Labeling Premiere Pro NSA CAE-CD
Resume

Experience & Education

Work Experience

AI Model Trainer & Evaluator Alignerr Oct 2024 – Present
  • Evaluated large language models on Python code generation tasks, assessing correctness, efficiency, and edge-case handling.
  • Reviewed and validated model-generated code for logical accuracy, runtime performance, and adherence to software engineering best practices.
  • Identified and documented recurring failure patterns, including hallucinations, incomplete implementations, and suboptimal algorithms.
  • Trained and fine-tuned AI models across diverse domains: mathematics and physics (standardized testing), audio/radio traffic analysis, virtual receptionist workflows, and insurance claim generation.
AI Model Evaluator Pareto AI Sep 2024 – Present
  • Conducted adversarial testing of LLMs to identify failure modes, edge cases, and reasoning breakdowns, improving overall model robustness.
  • Evaluated and benchmarked model outputs for correctness, logical consistency, and performance using structured scoring frameworks.
US AI Rater Telus International — Remote Jun 2023 – Present
  • Evaluated the accuracy and relevance of text, web, and image content for major search platforms using internal rating tools.
  • Provided structured feedback on guidelines and training processes to improve evaluation quality, consistency, and operational efficiency.
Full Stack Developer Self-Employed Jan 2014 – Present
  • Developed and deployed full-stack applications and gaming community systems using Java and modern web technologies (HTML, CSS, JavaScript, Flask, FastAPI, PHP).
  • Designed and implemented scalable MySQL/PostgreSQL database systems for cross-server data synchronization, player tracking, and performance optimization.
  • Collaborated using GitHub and Maven for version control and project management, building custom features and data-driven interfaces for user engagement.
Customer Service Representative Acme Markets Jul 2017 – Mar 2018
  • Promoted from Cashier to front-desk customer service within 4 months.
  • Maintained safety standards across departments, enhancing workplace safety and compliance.
  • Led a team in a high-volume retail environment, improving efficiency and team collaboration.

Education

Bachelor of Science

West Chester University of Pennsylvania — West Chester, PA

2017 – 2021

NSA CAE-CD Certificate
Contact

Get In Touch


Open to opportunities in AI/ML evaluation, IT, and customer service. Reach out directly.

Location

Greater Philadelphia Area, PA