JustJoin.IT Praca zdalna Senior New

AI Senior DevOps Engineer

ITMAGINATION

⚲ Warszawa, Kraków, Wrocław, Poznań, Gdańsk

19 375 - 23 250 PLN netto (B2B)

Wymagania

  • GitLab CI/CD
  • Jenkins
  • Prometheus
  • AI/ML
  • Docker
  • GitHub Actions
  • Terraform
  • Kubernetes
  • Azure DevOps
  • MLOps

Opis stanowiska

This is a remote position. Virtusa is seeking an AI Senior DevOps Engineer to bridge the gap between AI development and production-grade operations. You will be responsible for building the automated "highways" that allow ML models to flow from training to deployment seamlessly. This role requires a strong DevOps foundation combined with an understanding of the unique challenges of MLOps, such as GPU resource management, model versioning, and performance monitoring. Key Responsibilities: • CI/CD & MLOps Pipelines: Build and maintain automated pipelines for ML models using Azure DevOps, GitHub Actions, or Jenkins. • Workflow Automation: Automate model validation, packaging, and deployment workflows to ensure rapid iteration cycles. • Infrastructure as Code (IaC): Use Terraform or CloudFormation to provision and manage cloud-native infrastructure, focusing on high-availability and scalability. • Monitoring & Observability: Set up comprehensive monitoring for infrastructure (CPU/GPU/Memory) and model performance (latency and drift) using Prometheus and Grafana. • DevSecOps Implementation: Integrate security into the heart of the pipeline, including secret management, IAM role configuration, and vulnerability scanning. • Collaboration: Work closely with Data Scientists and AI Engineers to enable a self-service platform for model deployment. Requirements • 6–8 years of experience in DevOps/SRE roles, with a minimum of 2 years focused on MLOps or supporting AI/ML workloads. • Deep expertise in Jenkins, GitHub Actions, or GitLab CI/CD. • Hands-on proficiency with Azure DevOps and Terraform (CloudFormation is a strong plus). • Advanced knowledge of Docker and Kubernetes for managing distributed AI applications. • Proven experience setting up Prometheus and Grafana dashboards for technical and model-specific metrics. • Practical experience managing or connecting to MySQL, PostgreSQL, or MongoDB. • Familiarity with Ansible, Chef, or Puppet for automated environment setup. • Hands-on experience with SAST/DAST tools and automated vulnerability scanning within CI/CD pipelines. • Experience with versioning tools for datasets and models (e.g., DVC or similar pipeline versioning logic). • Professional English (C1) for seamless interaction with global delivery teams. Benefits • Professional training programs  • Work with a team that’s recognized for its excellence. We’ve been featured in the Deloitte Technology Fast 50 & FT 1000 rankings. We’ve also received the Great Place To Work® certification for five years in a row

🔍 Dekoder Ogłoszenia

🔴
bridge the gap between AI development and production-grade operations
Będziesz musiał tłumaczyć potrzeby zespołów AI na język techniczny i odwrotnie, często rozwiązując problemy wynikające z braku spójności między tymi obszarami.
🔴
building the automated "highways" that allow ML models to flow from training to deployment seamlessly
Oczekuje się od Ciebie stworzenia i utrzymania złożonych, zautomatyzowanych procesów, które mogą być podatne na błędy i wymagać ciągłego dopracowywania.
🔴
enable a self-service platform for model deployment
Twoim zadaniem będzie stworzenie narzędzi i procesów, które pozwolą Data Scientistom samodzielnie wdrażać modele, co może oznaczać konieczność radzenia sobie z ich różnym poziomem wiedzy technicznej.
🟡
focusing on high-availability and scalability
Oznacza to, że systemy muszą być odporne na awarie i zdolne do obsługi rosnącego obciążenia, co wymaga zaawansowanych umiejętności projektowania i implementacji.
🟡
minimum of 2 years focused on MLOps or supporting AI/ML workloads
Wymagane jest doświadczenie w konkretnej, często nowej i szybko rozwijającej się dziedzinie, co może oznaczać potrzebę szybkiego uczenia się i adaptacji.