Senior Performance Engineer required to lead performance, reliability, and capacity engineering across business-critical distributed platforms.
Key skills: OpenTelemetry, Gatling, k6, Prometheus, Grafana, AWS (ECS, EKS, Lambda, autoscaling), Kubernetes, CI/CD (GitHub Actions), SLI/SLO, distributed tracing, capacity planning, C#/.NET, MCP/Agentic AI
You'll embed performance engineering practice across product teams, defining observability standards, load and stress testing strategies (Gatling, k6, JMeter), and capacity forecasting. You'll instrument distributed systems using OpenTelemetry and integrate performance checks into CI/CD pipelines, enabling teams to validate releases and troubleshoot independently.
Experience with cloud capacity planning on AWS and an understanding of ECS, EKS, and autoscaling are essential. Familiarity with MCP-based agentic AI for automated telemetry analysis and anomaly detection is a strong advantage.
You'll coach engineers, lead cross-team performance reviews, and own the Performance Engineering roadmap. PostgreSQL and DynamoDB experience is beneficial.