About

Marco Punio

I'm an AI Infrastructure Engineer building toward the systems layer of modern AI.

I care about what happens after the model exists: how it runs, how fast it serves, how GPUs get used, where bottlenecks appear, and how cloud infrastructure turns experiments into real workloads.

This portfolio is my proof-of-work: projects built to show performance intuition across GPU inference, ML systems, Kubernetes, and cloud infrastructure.

GPU utilizationLLM inferenceDistributed Systems / ComputeCloud Infrastructure