NoFluffJobs Praca zdalna Senior

Senior Site Reliability Engineer

SmartRecruiters Inc.

⚲ Kraków

18 500 - 30 000 PLN (PERMANENT)

Wymagania

  • Java
  • Microservices
  • Docker
  • Kubernetes
  • Distributed systems
  • Python (nice to have)
  • Node.js (nice to have)
  • Jenkins (nice to have)
  • Bash (nice to have)
  • Golang (nice to have)

Opis stanowiska

O projekcie: 🚀 SmartRecruiters transforms hiring for the world’s leading enterprises. We deliver an AI-powered hiring platform built for global scale, automating and optimizing the entire talent acquisition process. More than 4,000 companies, including LinkedIn, McDonald's, VISA, CD Projekt Red, Allegro rely on SmartRecruiters to build winning teams.  🚀 In 2025, SmartRecruiters joined SAP, the global leader in enterprise applications. Together, we are accelerating the reinvention of hiring by combining AI innovation with the scale and resources of SAP’s ecosystem. We designed our R&D structure based on the empowered product teams model. It means our teams are responsible for business outcomes and have autonomy in solving problems in a way that “customers love yet work for the business”. Job Description The SmartRecruiters Internal Engineering Team is looking for a Senior Site Reliability Engineer to our reliability initiatives and help us strengthen the reliability and observability of our platform at scale. If you are passionate about cloud, networking, observability, and partnering with product teams to curate reliability practices, we have a spot with your name on it! Important: the position is available only under a standard contract of employment with 80% of tax deductible cost. You may be located anywhere in Poland and work remotely or out of our Cracow office. Wymagania: - While not strictly required, we see most of our Senior Engineers have 5+ years of professional experience - Working knowledge of SRE and observability industry standards and best practices (SLIs/SLOs, error budgets, incident management, on‑call) - Engineering experience in JVM stack - Experience with AWS (or other cloud provider), Kubernetes, and IaC tools and practices, including running and troubleshooting distributed applications - Proven track record of delivering solutions for reliability, monitoring, and container management - Deep knowledge of the Linux operating system, with a focus on system hardening and troubleshooting performance issues - Very good scripting skills (Bash, Golang or Python) - Experience managing and troubleshooting database systems, both SQL and NoSQL is a plus - Solid understanding of networking standards, including TCP/IP, DNS, VPN and load balancing is a plus - Comfortable partnering with teams to design resilient data access and use database observability to prevent and resolve incidents - Strong communication skills, with a good understanding of English, both verbal and written, and the ability to coach and influence other engineers Codzienne zadania: - Cooperate closely with other Platform and Engineering teams on strategic reliability and observability initiatives across SmartRecruiters - Improve, automate and grow SmartRecruiters observability and reliability tooling (metrics, logs, traces, alerting) - Respond to production incidents and client threats, lead remediation, and drive follow‑up improvements - Partner with product engineers working in Java, Node.js, and Python to design, instrument, and operate services for failure, owning SLIs/SLOs and error budgets together - Create reusable building blocks (dashboards, alerts, libraries and IaC modules) that can be rolled out company‑wide - Mentor members of the engineering team and act as an advocate for modern SRE and observability practices - Document standards, best practices, and policies for monitoring, alerting, incident response, and reliability - Conduct capacity planning and performance testing of platform