Site Reliability Engineer
Link Group
Remote
21 000 - 24 000 PLN (B2B)
Requirements
- SRE
- Azure DevOps
- Kubernetes
- Datadog
- Azure
- CD
- AI (nice to have)
- Grafana (nice to have)
Job description
About the project: We are looking for a Senior Site Reliability Engineer who will take end-to-end ownership of reliability for AI-driven applications and pipelines. This is a hands-on engineering role, not a coordination or ticket-driven position. The ideal candidate actively diagnoses and resolves production issues and automates their remediation, rather than only designing solutions.
Requirements:
- 5+ years of experience as an SRE, Production Engineer, or Platform Engineer
- Strong incident management and root cause analysis (RCA) experience
- Hands-on experience with Azure DevOps, Kubernetes, Datadog, Azure, and CI/CD
- Proactive, self-driven, with an ownership mindset
- Experience in production environments
- Nice to have: AI/LLM pipelines, Grafana
Daily tasks:
- Build and maintain monitoring, alerting, and dashboards
- Lead incident response and root cause analysis
- Ensure reliability and performance of AI pipelines
- Standardize telemetry (latency, failures, throughput)
- Optimize CI/CD and release quality
- Work with engineering teams to reduce recurring incidents