Available for high-impact software engineering roles

CTO / Software Engineer / ML Engineer

Alex Ndungu

Building intelligent systems, multi-tenant operating systems, and machine learning-powered execution layers for real-world products.

View Projects Download Resume

Years coding

GitHub projects

30+

ML records processed

45K+

Live systems pulse

How I think when building production software

System-aware engineering

Operating model

Realtime

RBAC + tenant isolation active

Akoko execution context loaded

Ledger-safe workflows online

Search + retrieval pipeline ready

Architecture loop

Model

Control

Observe

Refine

Selected systems

Production-minded work

Shipping

Harlem Manage

Built the full backend of a multi-tenant proptech platform: 46-tool role-based AI agents, real-time M-Pesa WebSocket payment flows, a 3-queue Celery task system, and a PII-scrubbing observability stack.

CodePinion

An open-source developer Q&A platform built across 3 collaboration modes â€” async threads, real-time chat, and integrated video calls â€” moving knowledge sharing from static forum searches to live problem solving.

Catalog-Point

A full-stack library operations system with 5 core relational models, 2 user role types, a borrowing transaction engine with date-based cost calculation, and a deployment-ready Django stack.

E-Commerce Backend System

A backend commerce API built across 3 service domains (catalog, cart, order) with JWT-authenticated role-aware authorization, a relational schema optimized for checkout and order lifecycle workflows, and a separate React frontend â€” 2 public repos.

ML Search Engine

A 2-stage hybrid retrieval system trained on 45,000 StackOverflow records â€” SGD-based tag prediction for query expansion feeding into TF-IDF vectorization with cosine similarity ranking.

Movie Recommendation System

A 3-stage content-based recommendation pipeline â€” metadata extraction, vector representation, and cosine similarity scoring â€” that generates explainable suggestions with no user interaction data required.

AI agent tools across 4 user roles

Celery queue types over RabbitMQ

20+

PII field patterns scrubbed from error reports

45K+

ML records processed end-to-end

Years coding, since 2020

30+

GitHub projects across 3 domains

Why me

Production engineering discipline with ML systems depth.

I focus on the kind of engineering that holds up under real use: multi-tenant architecture, robust data flows, role-aware systems, and software that can be reasoned about, debugged, and improved over time.

Current domain

Proptech operating systems

Leading Harlem Manage as a Kenya-first multi-tenant real estate operating system with embedded AI execution.

Production background

Health + fintech workflows

Experience in structured, reliability-sensitive systems that shaped how I approach financial correctness and backend design.

Engineering style

Systems thinking, scalable backend design, and AI that behaves like a governed product subsystem instead of a bolt-on feature.

Featured projects

Selected work with strong backend, product, and AI systems signals.

These projects highlight how I approach architecture, domain problems, and production-ready implementation.

Multi-Tenant Real Estate Operating System

Harlem Manage

Harlem Manage is a Kenya-first, multi-tenant real estate operating system built for landlords, agencies, and property firms. It combines property workflows, tenant and lease management, financial reconciliation, communication channels, and Akoko, a deployed operational intelligence layer that adapts by role.

Next.jsDjangoDjango REST FrameworkPostgreSQLM-PESAWhatsApp/SMSAkoko AI

AI agent tools

RBAC user roles

Celery queue types

Architecture highlights

- Role-based AI agent system (GPT-4.1 Responses API) with dynamic tool selection across 4 roles and 46 tools â€” full audit trail per message for compliance
- M-Pesa C2B STK Push with OAuth2, field-level encrypted credentials (per organisation + property), and live payment status streamed via JWT-authenticated WebSocket consumers over Redis

Deep divePrivate Repo

Real-Time Developer Collaboration Platform

CodePinion

CodePinion is a developer Q&A platform designed to close the gap between the person asking and the person best positioned to help. Rather than forcing developers through slow async threads, it layers real-time chat and video calling on top of a persistent Q&A base so problems can be worked through in context.

JavaScriptHTMLCSSPythonNode.jsReal-time communicationVideo integration

Collaboration modes

Open source

Public GitHub repo

Full-stack JS

Frontend + backend

Architecture highlights

- 3 collaboration modes in one product: persistent Q&A threads, real-time chat, and integrated video calling
- Real-time signaling layer for WebSocket-based chat and video session coordination

Deep dive GitHub

Library Management System

Catalog-Point

A full-stack library operations system with 5 core relational models, 2 user role types, a borrowing transaction engine with date-based cost calculation, and a deployment-ready Django stack.

Catalog-Point is a Django-based library management system covering the full operational surface of a real library: inventory tracking, category management, borrowing workflows, cost calculation, approval states, return handling, and user activity history â€” for both librarians and members.

DjangoPythonPostgreSQLHTMLCSSJavaScriptDjango AllauthGunicorn

Core relational models

User role types

Deployed

Gunicorn + WhiteNoise

Architecture highlights

- 5 relational models: profiles, categories, books, costs, and transactions â€” covering the full operational surface
- 2 user role types with separated workflows: librarian administration and member-facing catalog access

Deep dive GitHub

Production Backend Engineering Project

E-Commerce Backend System

A backend-first commerce platform focused on clear domain separation, predictable API behavior, and a schema that supports catalog, cart, and order lifecycles without coupling everything into a single service layer. Paired with a public React frontend repo.

JavaSpring BootPostgreSQLJWT AuthReactREST APIs

Service domains

JWT + RBAC

Auth layer

2 repos

Frontend + backend

Architecture highlights

- 3 service domains (catalog, cart, order) separated into composable, independently testable layers
- JWT-authenticated role-aware authorization enforced at the API boundary

Deep dive Frontend Backend

Hybrid ML Search & Retrieval System

ML Search Engine

A 2-stage hybrid retrieval system trained on 45,000 StackOverflow records â€” SGD-based tag prediction for query expansion feeding into TF-IDF vectorization with cosine similarity ranking.

A machine learning search system built on ~45,000 StackOverflow records. The key insight was that a single retrieval technique misses intent â€” so the pipeline runs in 2 stages: classify the query to predict missing context tags, then use those enriched tags to improve the similarity search.

PythonPandasscikit-learnTF-IDFNLP preprocessingCosine similarity

45K+

Training records

2-stage

Hybrid retrieval pipeline

SGD + TF-IDF

Model combination

Architecture highlights

- 45,000 StackOverflow records processed through an HTML cleaning, normalization, and tokenization pipeline
- 2-stage hybrid pipeline: SGD tag prediction for query expansion â†’ TF-IDF + cosine similarity retrieval

Deep dive GitHub

Content-Based Recommendation Engine

Movie Recommendation System

A content-based recommender that processes movie metadata through 3 explicit pipeline stages: feature extraction, vector representation, and similarity scoring. The design prioritizes explainability â€” every suggestion is traceable to specific shared metadata signals rather than opaque collaborative filtering.

PythonPandasscikit-learnFeature engineeringSimilarity modeling

Pipeline stages

Content-based

No user data needed

Explainable

Traceable recommendations

Architecture highlights

- 3-stage pipeline: metadata cleaning + feature extraction â†’ vector representation â†’ cosine similarity scoring
- Content-based approach requiring zero user interaction data â€” recommendations driven purely by metadata signals

Deep dive GitHub

Experience snapshot

Work shaped by product ownership, enterprise structure, and operational rigor.

The combination of CTO-level product building, informatics systems, and fintech experience informs how I think about data integrity, workflows, and software reliability.

Current

Harlem Manage

Backend Engineer / Full-Stack Python Engineer

Explore role

Current

Savannah Informatics

Software Engineer

Explore role

3 months

E&M Tech

Software Engineer Intern / Contract

Explore role

Capabilities

Backend foundations, operational AI, and a product sense for what matters.

Backend Engineering

REST API Design / Systems Design / Django / PostgreSQL / MySQL / Redis / RabbitMQ

ML / Data Science

TF-IDF & NLP / Classification / Similarity Models / scikit-learn / TensorFlow / OpenAI API

Programming

Data Structures & Algorithms / Python / TypeScript / JavaScript / Java / React / Next.js / HTML / CSS

Cloud & Infrastructure

AWS / EC2 / PostgreSQL RDS / S3 / Object Storage / Linux Server Administration / Nginx / Gunicorn / Daphne / SSL & Server Hardening / systemd / CI/CD & Deployment Automation / Grafana & Loki

Tools

Git / Linux / Docker / Grafana / Sentry / GlitchTip / Codex / Claude Code

View skill breakdown

A GPT-5 assistant that knows the whole portfolio.

Meta can talk through Alex's experience, ongoing flagship work, technical projects, stack decisions, and the best way to get in touch.

AI Assistant

Meet Meta

Ask Meta about Harlem Manage, backend work, ML projects, skills, or how to contact Alex.