Software Engineer & ML Systems Builder
// Open to Remote · US
I'm a software engineer focused on machine learning systems and backend development, with a Bachelor's in Computer Science from NYU and hands-on experience building production-ready inference servers, data pipelines, and agent workflows.
I completed an AI externship in collaboration with Wayfair through Extern, building AI agents and a live competitor intelligence dashboard for Wayfair's category team. I'm pursuing a Master's in Computer Science at ASU starting Fall 2026, with a focus on Machine Learning and Systems Programming.
I build things from scratch to understand them deeply — from custom HTTP inference daemons to multimodal model pipelines. My target is AI/ML engineering at the intersection of systems, infrastructure, and applied machine learning.
A local LLM/VLM inference server for Apple Silicon — no cloud, no telemetry, no data leaving your machine. Exposes three simultaneous API surfaces (OpenAI-compatible, Anthropic-compatible, Ollama-compatible) from a single FastAPI server, requiring careful cross-API type mapping and schema normalization across divergent API contracts. Supports streaming, tool calling, structured JSON output, vision inputs, KV cache quantization, and speculative decoding. Benchmarked at 2.4× faster prompt processing vs. Ollama (201 t/s vs. 85 t/s) and 2.2× faster tool-call generation (41 t/s vs. 19 t/s).