
About
Marco Punio
I'm an AI Infrastructure Engineer building toward the systems layer of modern AI.
I care about what happens after the model exists: how it runs, how fast it serves, how GPUs get used, where bottlenecks appear, and how cloud infrastructure turns experiments into real workloads.
This portfolio is my proof-of-work: projects built to show performance intuition across GPU inference, ML systems, Kubernetes, and cloud infrastructure.
GPU utilizationLLM inferenceDistributed Systems / ComputeCloud Infrastructure