JustJoin.IT Remote work Mid

Customer Success Engineer

Awareson Sp. z o.o.

⚲ Warszawa

160 - 210 PLN/h net (B2B)

Requirements

  • Azure
  • Site Reliability Engineering (SRE)
  • Grafana
  • Prometheus
  • Observability

Job description

Customer Success Engineer (Observability / Platform)

We are looking for a Customer Success Engineer to support the adoption and effective use of an internal observability and event management platform. This role requires a solid technical background and focuses on working closely with internal teams to design, implement, and optimize monitoring and alerting strategies across distributed systems.

Responsibilities:

  • Support onboarding of applications and services into the observability platform (logs, metrics, alerting, event management)
  • Help define and implement monitoring strategies, including dashboards, alerting rules, and SLOs / SLIs
  • Analyze system behavior using telemetry data (logs, metrics, traces) to support troubleshooting and improve visibility
  • Translate business and operational requirements into technical observability solutions
  • Work with engineering teams to ensure proper instrumentation and integration with monitoring tools
  • Handle incident analysis and troubleshooting (L1/L2 support) using observability data
  • Contribute to improving platform processes, documentation, and onboarding standards
  • Support adoption of best practices in observability, reliability, and event management

Requirements:

  • Experience in DevOps, SRE, Platform Engineering, or Technical Support (L2/L3)
  • Solid understanding of observability concepts: logs, metrics, traces, alerting, SLO/SLI
  • Hands-on experience with tools such as Grafana, Datadog, Prometheus (or similar)
  • Understanding of distributed systems and application architectures
  • Ability to troubleshoot issues using monitoring and telemetry data
  • Experience working with cloud environments (AWS, Azure, or GCP)
  • Experience interacting with stakeholders and translating requirements into technical solutions
  • Strong problem-solving skills and willingness to work on operational tasks

Nice to have:

  • Experience with event/incident management platforms
  • Exposure to CI/CD pipelines or infrastructure automation
  • Familiarity with instrumentation practices (e.g. OpenTelemetry)
  • Experience in internal platform or developer tooling environment