Gemma 3 from Scratch: I pre-trained a Gemma 3 architecture model from scratch: a 164M-parameter language model built entirely from zero and trained on the TinyStories dataset to generate coherent children’s stories.
This journey has been a deep dive into transformer internals, from implementing Multi-Query Attention and RoPE embeddings to watching a model learn language one training step at a time…
Building BUD: From Zero to Production AI Agent
I built BUD (Build, Understand, Deploy), a self-hosted multi-platform AI assistant from scratch, serving 5 platforms simultaneously with 24+ MCP tools, a 5-layer memory system, and real-time voice I/O.
This journey has been a deep dive into AI agent architecture, multi-platform engineering, and production-grade deployment, all without LangChain or any agent framework…

Firogo Ltd

Contributing to Open-Source AI with Scala Center
I was selected as a Google Summer of Code 2025 Contributor with the Scala Center, where I worked on enhancing LLM4S, an open-source project focused on tracing and monitoring large language model workflows.
This journey has been a deep dive into LLM tracing, open-source collaboration, and real-world AI development…