About the Role

A structured AI evaluation initiative focused on improving the reliability, reasoning accuracy, and clarity of conversational systems in software engineering contexts. The work emphasizes how models generate, interpret, and explain Ruby-based solutions across a wide range of programming tasks and system-level challenges.

This opportunity is ideal for experienced Ruby engineers with strong problem-solving ability and a deep understanding of modern development practices. It suits individuals who can independently validate code, identify subtle issues, and assess the quality of technical explanations.

The work involves reviewing AI-generated Ruby code, executing and validating outputs, and providing structured feedback on correctness and clarity, where precision and consistency are essential to improving system performance.

What You'll Do

Evaluate AI-generated responses to coding and software engineering problems
Execute and validate Ruby code to ensure correctness and performance
Identify logical errors, inefficiencies, and edge case failures
Annotate outputs with detailed feedback on strengths and weaknesses
Assess code readability, maintainability, and architectural soundness
Perform fact-checking using reliable technical references
Apply standardized evaluation frameworks and scoring criteria
Ensure outputs align with expected engineering and conversational standards

Requirements

5+ years of professional experience in software engineering or related fields
Strong expertise in Ruby programming language
Ability to solve medium to hard algorithmic problems independently
Experience executing, testing, and debugging production-level code
Strong understanding of data structures, algorithms, and system design principles
High attention to detail in reviewing technical reasoning and outputs
Fluent English communication skills
Experience using LLMs in coding workflows and understanding their limitations
Ability to follow structured evaluation frameworks and guidelines
Bachelor’s degree or higher in Computer Science or related discipline
Experience contributing to open-source projects with accepted contributions
Familiarity with additional programming languages or ecosystems (preferred)
Experience in model evaluation, RLHF, or data annotation (preferred)
Background in web frameworks such as Ruby on Rails or similar ecosystems (preferred)
Experience reviewing code in production environments (preferred)
Ability to explain complex technical concepts clearly to varied audiences (preferred)

Senior Ruby Software Engineer (AI Evaluation)

About the Role

What You'll Do

Requirements

Explore Similar Global AI Roles

Tier 3 Network Support Engineer – Autonomous Systems Data Operations

Data Partnerships Lead – AI Workflow Data & Strategic Alliances

MLOps Engineer – AI Model Training Infrastructure