Labs
Each lab is a working prototype built around a real problem in agentic AI, RAG evaluation, or AI-assisted engineering. Not demos. Not tutorials. Systems that had to actually work.
A structured evaluation framework for Retrieval-Augmented Generation pipelines — measuring faithfulness, answer relevance, and context groundedness against a golden dataset.
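To make the idea concrete, here is a minimal sketch of scoring one pipeline output against a golden example. It is not the lab's implementation. The names `GoldenExample`, `PipelineOutput`, `overlap`, and `evaluate` are hypothetical, and the token-overlap scoring is a deliberately crude stand-in for whatever judge (typically an LLM or an embedding model) a real framework would use. The three metric definitions below are assumptions about what each name means.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass
class GoldenExample:
    """One row of a hypothetical golden dataset."""
    question: str
    reference_answer: str


@dataclass
class PipelineOutput:
    """What the RAG pipeline returned for that question."""
    answer: str
    contexts: list[str]


def _tokens(text: str) -> set[str]:
    # Lowercased alphanumeric tokens; a real judge would be far richer.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap(candidate: str, reference: str) -> float:
    """Fraction of candidate tokens that also appear in reference."""
    cand = _tokens(candidate)
    if not cand:
        return 0.0
    return len(cand & _tokens(reference)) / len(cand)


def evaluate(example: GoldenExample, output: PipelineOutput) -> dict[str, float]:
    context = " ".join(output.contexts)
    return {
        # Assumed meaning: is the answer supported by the retrieved context?
        "faithfulness": overlap(output.answer, context),
        # Assumed meaning: does the answer cover the golden reference answer?
        "answer_relevance": overlap(example.reference_answer, output.answer),
        # Assumed meaning: does the retrieved context contain the reference facts?
        "context_groundedness": overlap(example.reference_answer, context),
    }
```

Running `evaluate` over every row of the golden dataset and averaging each score gives a per-metric baseline that can be tracked as the retriever or prompt changes.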
A multi-step AI agent that automates engineering delivery workflows — ticket triage, context gathering, draft generation, and review routing — using a tool-calling loop.
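The shape of a tool-calling loop can be sketched in a few lines. This is an illustration under stated assumptions, not the lab's code: the four tools mirror the steps named above, but their bodies are toy placeholders, and `choose_next_tool` is a hypothetical rule-based stand-in for the model call that would normally pick the next tool.

```python
from typing import Callable, Dict, Optional

State = Dict[str, str]


def triage(state: State) -> State:
    # Toy rule: anything mentioning an outage is high priority.
    priority = "high" if "outage" in state["ticket"].lower() else "normal"
    return {"priority": priority}


def gather_context(state: State) -> State:
    # Placeholder for lookups in docs, code, or past tickets.
    return {"context": f"notes for: {state['ticket']}"}


def draft(state: State) -> State:
    return {"draft": f"[{state['priority']}] response based on {state['context']}"}


def route_review(state: State) -> State:
    reviewer = "on-call" if state["priority"] == "high" else "team-queue"
    return {"reviewer": reviewer}


# Registry mapping tool names to callables, as a model would see them.
TOOLS: Dict[str, Callable[[State], State]] = {
    "triage": triage,
    "gather_context": gather_context,
    "draft": draft,
    "route_review": route_review,
}

# Each tool and the state key it is expected to produce.
PLAN = [
    ("triage", "priority"),
    ("gather_context", "context"),
    ("draft", "draft"),
    ("route_review", "reviewer"),
]


def choose_next_tool(state: State) -> Optional[str]:
    """Stand-in for the model: pick the first tool whose output is missing."""
    for name, key in PLAN:
        if key not in state:
            return name
    return None  # Nothing left to do.


def run_agent(ticket: str, max_steps: int = 10) -> State:
    state: State = {"ticket": ticket}
    for _ in range(max_steps):  # Hard cap so the loop always terminates.
        name = choose_next_tool(state)
        if name is None:
            break
        state.update(TOOLS[name](state))  # Feed the tool result back into state.
    return state
```

The step cap and the explicit tool registry are the two design choices worth keeping in any real version: the first bounds cost and runaway loops, the second limits the agent to actions that have been deliberately exposed.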
An LLM-augmented code review system that surfaces architectural concerns, security patterns, and test coverage gaps — integrated into a GitHub Actions workflow.
Labs are proofs of concept. Engagements are production work. If a lab resonates, let's talk about what it would take to build it for your context.