Machine Learning · Systems · Backend Engineering

Tarek
Noiem

Software Engineer & ML Systems Builder

// Open to Remote · US

System Brief

Building at the Edge of ML

I'm a software engineer focused on machine learning systems and backend development, with a Bachelor's in Computer Science from NYU and hands-on experience building production-ready inference servers, data pipelines, and agent workflows.

I completed an AI externship in collaboration with Wayfair through Extern, building AI agents and a live competitor intelligence dashboard for Wayfair's category team. I'm pursuing a Master's in Computer Science at ASU starting Fall 2026, with a focus on Machine Learning and Systems Programming.

I build things from scratch to understand them deeply — from custom HTTP inference daemons to multimodal model pipelines. My target is AI/ML engineering at the intersection of systems, infrastructure, and applied machine learning.

NYU

BA Comp Sci

ASU

MSCS · Fall '26

Extern · Wayfair

MLX

Inference Engine

Focus Area

ARM

Apple Silicon

Field Record

Work Experience

AI Extern

Extern AI Externship in Collaboration with Wayfair

Oct 2025 – Dec 2025
Remote

Designed and deployed multiple AI agents using n8n to automate category management workflows for one of the world's largest home retailers, operating across a catalog of 30M+ products.
Built a competitor intelligence pipeline continuously tracking rival product launches, pricing updates, and marketing campaigns, surfacing real-time benchmarks to inform category strategy.
Automated marketing content generation via AI agents, enabling the category team to act on trend signals before they peaked.
Delivered a live, auto-updating dashboard consolidating trend signals, competitor benchmarks, and AI-generated content suggestions for direct use by category managers.

Deployed Systems

Featured Project

PROJECT // 01

View on GitHub ↗

mlx-nim

A local LLM/VLM inference server for Apple Silicon — no cloud, no telemetry, no data leaving your machine. Exposes three simultaneous API surfaces (OpenAI-compatible, Anthropic-compatible, Ollama-compatible) from a single FastAPI server, requiring careful cross-API type mapping and schema normalization across divergent API contracts. Supports streaming, tool calling, structured JSON output, vision inputs, KV cache quantization, and speculative decoding. Benchmarked at 2.4× faster prompt processing vs. Ollama (201 t/s vs. 85 t/s) and 2.2× faster tool-call generation (41 t/s vs. 19 t/s).

Python FastAPI MLX mlx-lm mlx-vlm OpenAI API Anthropic API KV Cache Speculative Decoding

Tech Stack

Skills & Tools

Languages

Python Rust Go SQL

Frameworks & Libraries

FastAPI PyTorch MLX scikit-learn NumPy pandas Tokio Serde

Developer Tools & Infrastructure

Git Docker Google Cloud PostgreSQL Cargo uv

AI & ML

LLM / VLM Inference AI Agent Orchestration REST API Design Model Context Protocol OpenAI / Anthropic API

Academic Record

Education

Master's in Computer Science

Arizona State University

Expected May 2028 · Tempe, AZ

Focus: Machine Learning & Systems Programming. Relevant coursework: Data Mining, Parallel Computing, Artificial Intelligence, Programming Languages.

Bachelor's in Computer Science

New York University

Graduated May 2025 · New York, NY

Relevant coursework: Artificial Intelligence, Data Structures, Algorithms, Computer Systems, Operating Systems, Compilers, Computer Security.

Open Channel

Connect

Actively seeking entry-level ML engineering and software engineering roles. Open to MLOps, backend, and systems positions — remote or on-site.

GitHub LinkedIn