AI / ML Engineer β’ Computer Vision β’ Generative AI
Building practical AI systems, one focused iteration at a time.
Building intelligent systems that see, understand, and create.
I am a Machine Learning Engineer focused on Computer Vision and Agentic AI, with a strong foundation in scalable backend systems. My engineering philosophy revolves around translating complex research papers into optimized, production-ready code.
- π― Focus: Bypassing computational bottlenecks in high-resolution (4K) object detection using Explainable AI (XAI).
- π€ AI Engineering: Building local LLM agents that seamlessly interact with third-party ecosystems (Google APIs, etc.).
- βοΈ Infrastructure: Architecting robust database migrations and building backend profilers.
- π‘ Goal: I build systems that are not just intelligent, but fast, scalable, and resilient.
β‘ PixelQueue
A sleek, dark-themed control panel designed for decoupled ML microservices and robust task queues, eliminating UX bottlenecks with pure speed and instantaneous rendering. Key Innovations:
|
A novel coarse-to-fine computer vision pipeline designed for efficient small object detection in high-resolution (2K/4K) aerial imagery. Tackles the critical trade-off between resolution and latency in drone forensics. Key Innovations:
|
π¨ Neural Canvas
A fast neural style transfer implementation that generates stylized images using a feed-forward CNN trained with perceptual loss. Performs instant stylization in a single forward pass. Key Features:
|
|
π§ pygog (Google CLI Agent) |
π Depth Estimation + Semantic Seg. |
I actively contribute to the broader developer ecosystem, with recent merged work spanning agent frameworks, AI infrastructure, developer tooling, and performance-focused ML apps. I am also an active Collaborator at SynapseKit organization:
- SynapseKit/SynapseKit: Shipped 109 PRs covering native observability, VoiceAgent audio pipelines, graph-builder tooling, benchmark suites, CronTrigger scheduling, self-healing cost-aware agents, persistent agent memory, multimodal RAG ingestion, knowledge graph retrievers, Discord automation, cloud/data loaders, and local/self-hosted model integrations across 15+ LLM providers.
- lancedb/lancedb: Updated LanceDB's Python Gemini embedding provider to the newer google-genai SDK and opened a fix for async event loop blocking in AsyncTable.add embeddings.
- Nikolaev3Artem/fastapi-silk: Merged 5 PRs adding per-endpoint database trigger counters, multi-version compatibility matrix, SQLite+Alembic setup, SQL profiler tests, and comprehensive README documentation.
- Bessouat40/RAGLight: Submitted 3 PRs adding MCP server configuration CLI support (in review) and Docling-based high-fidelity PDF ingestion (open).
- pydantic/pydantic-ai: Merged Anthropic code execution tool upgrade



