Bulldogjob Stacjonarnie Senior

Senior Site Reliability Engineer, ML Platform

Vinted

From 5 100 EUR (UoP)

Wymagania

  • Kubernetes
  • Go
  • Terraform
  • Kafka
  • GCP

Opis stanowiska

Brief info about Vinted Our mission is to make second-hand the first choice, and we're looking for people who want to help us get there. Every day, we work together to help our members buy and sell pre-loved clothing and lifestyle items, giving each piece a second life – or even a third. The Vinted Group is made up of three business units that support this mission: Vinted Marketplace is Europe’s leading platform for second-hand fashion and a go-to destination for all kinds of pre-loved items, with a growing range of categories. Our platform connects millions of members across 20+ markets, helping great items find a new life. Vinted Go enhances the shipping experience with a vast network of over 500,000 pick-up and drop-off points, partnering with more than 60 carriers across Europe, with added services like item verification for peace of mind on high-value pieces. Vinted Pay is the newest part of the Vinted Group, dedicated to bringing secure, reliable payments to buyers and sellers across Europe. Seamlessly integrated into the Vinted app, it helps keep every transaction safe, efficient, and easy for our members. Founded in 2008 in Lithuania, Vinted began as a way for friends to find new homes for clothes they no longer needed. In 2019, we became Lithuania's first unicorn! Today, our headquarters remain in Vilnius, and we've grown with offices across Europe, supported by a team of over 2,000 people. Our backers include Accel, EQT Growth, Insight Partners, Lightspeed Venture Partners, Sprints, and TPG. Information about the position As Senior Site Reliability Engineer, you will be responsible for the reliability, scalability, observability, and operational excellence of Vinted’s ML Platform services including SLOs/alerting, incident response, capacity planning, automation, and on-call for critical services such as the Vinted Feature Store. You’ll join the ML Platform team, which owns the tooling for ML/LLM development, deployment and other platform capabilities that increase ML/AI delivery speed at Vinted. You will be working closely with ML platform users, Data Infrastructure and Production Engineering teams. Here are some of the technologies we use: Kubernetes, Terraform, Chef, Google Cloud Platform, Go, Kafka, Vespa, Redis, Vitess. In this position, you’ll - Work with MLOps engineers to deliver deployment solutions that enable Data Scientists to seamlessly deploy any ML model to production. - Monitor and improve the Vinted Feature Store to reliably support 240k feature-read RPS while meeting SLOs. - Participate in deploying and maintaining self-hosted Langfuse for LLM evaluation and observability (upgrades, automation, security hardening). - Lead reliability work for ML Platform services (SLOs, alerting, runbooks, postmortems, toil reduction). - Build up observability and operational tooling for model deployment and platform services. - Assist with integrating deployment workflows with automated experimentation (A/B, shadow). - Communicate with stakeholders about incidents, planned changes, risks, and reliability trade-offs.