Machine Learning · Systems · Backend Engineering

Tarek
Noiem

Software Engineer & ML Systems Builder

// Open to Remote  ·  US

NYU
BS Comp Sci
ASU
MSCS · Fall '26
AI
Focused Stack
PythonRust MLX · Apple SiliconFastAPI LLM InferenceGo AI AgentsPyTorch VLMsREST APIs DockerPostgreSQL PythonRust MLX · Apple SiliconFastAPI LLM InferenceGo AI AgentsPyTorch VLMsREST APIs DockerPostgreSQL

System Brief

Building at the Edge of ML

I'm a software engineer focused on machine learning systems and backend development, with a Bachelor's in Computer Science from NYU and hands-on experience building production-ready inference servers, data pipelines, and agent workflows.

I completed an AI externship in collaboration with Wayfair through Extern, building AI agents and a live competitor intelligence dashboard for Wayfair's category team. I'm pursuing a Master's in Computer Science at ASU starting Fall 2026, with a focus on Machine Learning and Systems Programming.

I build things from scratch to understand them deeply — from custom HTTP inference daemons to multimodal model pipelines. My target is AI/ML engineering at the intersection of systems, infrastructure, and applied machine learning.

NYU
BA Comp Sci
ASU
MSCS · Fall '26
AI
Extern · Wayfair
MLX
Inference Engine
ML
Focus Area
ARM
Apple Silicon

Field Record

Work Experience

AI Extern
Extern AI Externship in Collaboration with Wayfair
Oct 2025 – Dec 2025
Remote
  • Designed and deployed multiple AI agents using n8n to automate category management workflows for one of the world's largest home retailers, operating across a catalog of 30M+ products.
  • Built a competitor intelligence pipeline continuously tracking rival product launches, pricing updates, and marketing campaigns, surfacing real-time benchmarks to inform category strategy.
  • Automated marketing content generation via AI agents, enabling the category team to act on trend signals before they peaked.
  • Delivered a live, auto-updating dashboard consolidating trend signals, competitor benchmarks, and AI-generated content suggestions for direct use by category managers.

Deployed Systems

Featured Project

PROJECT // 01
mlx-nim

A local LLM/VLM inference server for Apple Silicon — no cloud, no telemetry, no data leaving your machine. Exposes three simultaneous API surfaces (OpenAI-compatible, Anthropic-compatible, Ollama-compatible) from a single FastAPI server, requiring careful cross-API type mapping and schema normalization across divergent API contracts. Supports streaming, tool calling, structured JSON output, vision inputs, KV cache quantization, and speculative decoding. Benchmarked at 2.4× faster prompt processing vs. Ollama (201 t/s vs. 85 t/s) and 2.2× faster tool-call generation (41 t/s vs. 19 t/s).

Python FastAPI MLX mlx-lm mlx-vlm OpenAI API Anthropic API KV Cache Speculative Decoding

Tech Stack

Skills & Tools

Languages
Python Rust Go SQL
Frameworks & Libraries
FastAPI PyTorch MLX scikit-learn NumPy pandas Tokio Serde
Developer Tools & Infrastructure
Git Docker Google Cloud PostgreSQL Cargo uv
AI & ML
LLM / VLM Inference AI Agent Orchestration REST API Design Model Context Protocol OpenAI / Anthropic API

Academic Record

Education

Master's in Computer Science
Arizona State University
Expected May 2028 · Tempe, AZ
Focus: Machine Learning & Systems Programming. Relevant coursework: Data Mining, Parallel Computing, Artificial Intelligence, Programming Languages.
Bachelor's in Computer Science
New York University
Graduated May 2025 · New York, NY
Relevant coursework: Artificial Intelligence, Data Structures, Algorithms, Computer Systems, Operating Systems, Compilers, Computer Security.

Open Channel

Connect

Actively seeking entry-level ML engineering and software engineering roles. Open to MLOps, backend, and systems positions — remote or on-site.