Labs

Built in the open.
Documented with intent.

Each lab is a working prototype built around a real problem — Agentic AI, RAG evaluation, AI-assisted engineering. Not demos. Not tutorials. Systems that had to actually work.

RAG Evaluation Harness

Active

A structured evaluation framework for Retrieval-Augmented Generation pipelines — measuring faithfulness, answer relevance, and context groundedness against a golden dataset.

RAGEvaluationLLM

Agentic Workflow Orchestrator

Active

A multi-step AI agent that automates engineering delivery workflows — ticket triage, context gathering, draft generation, and review routing — using a tool-calling loop.

Agentic AIAutomationTool Use

AI-Assisted Code Review Pipeline

Prototype

An LLM-augmented code review system that surfaces architectural concerns, security patterns, and test coverage gaps — integrated into a GitHub Actions workflow.

Code ReviewGitHub ActionsLLM

Want something built for your situation?

Labs are proof-of-concept. Engagements are production. If a lab resonates, let's talk about what it would take to build it for your context.

Book a strategy conversation

Built in the open.Documented with intent.

RAG Evaluation Harness

Agentic Workflow Orchestrator

AI-Assisted Code Review Pipeline

Want something built for your situation?

Built in the open.
Documented with intent.