Welcome!
I'm Davide Rutigliano, a Senior Platform Engineer building GPU-accelerated Kubernetes platforms for AI/HPC workloads. I focus on inference observability (vLLM, TTFT) and cluster lifecycle operations, and I contribute to the open-source projects Kubernetes, Kueue, and KubeAI.
What Is This Site?
This is my personal corner of the internet: a place to share my views, document what I learn, and connect with others in the field. Browse my portfolio for featured projects, read my blog for thoughts on platform engineering, or check out my notes for quick technical references.
What I Do
I specialize in building internal platforms and developer tools that scale. My work spans Kubernetes, virtualization, observability, and HPC/GPU infrastructure, with a focus on production readiness, efficiency and cost optimization.
Recent Highlights
- vLLM & GenAI Observability: Engineered OpenTelemetry connectors to instrument vLLM inference (TTFT, KPIs), enabling on-call triage for a multi-tenant GPU inference platform
- High-performance GPU Monitoring: Engineered a GPU observability solution for Kubernetes/KubeVirt (NVIDIA MIG/vGPU), unlocking a 40%+ gain in HPC efficiency
- 62% infrastructure cost reduction ($100K+ annual savings) by architecting Kubernetes cluster autoscaling with Cluster API across AWS, GCP, and on-prem
- Built the SUSE Observability MCP Server from idea to MVP, embedding LLM-driven analysis directly into the alerting pipeline; recognized by senior leadership for production hardening
- Designed VM migration orchestration with a Kubernetes operator, enabling the migration of 100+ VMs from KVM to Harvester
- Architected federated observability migration to SUSE Observability (StackState), cutting troubleshooting time by 25%
Skills
AI & GPU Infra
Observability
Reliability
Cloud Native
Development
Let's Connect
I'm always interested in discussing platform engineering, cloud architecture, and innovative solutions. Check out my portfolio for featured projects, or view my full CV.
