Kubernetes & Cloud Infrastructure Engineer – AI Platform
⚲ Wrocław
40,000–60,000 PLN / month (depending on contract type)
Requirements
- Kubernetes
- Helm
- Kustomize
- AWS
- Python
- Git
Job description
Our requirements:
4+ years of experience in Cloud, DevOps, or Platform Engineering.
Strong expertise in Kubernetes, including cluster operations, multi-tenancy, GPU scheduling, and Helm/Kustomize.
Hands-on experience with AWS cloud services: networking, IAM, EKS, EC2, and cost management.
Proficient in Python and infrastructure-as-code tools such as Terraform.
Proven ability to build and operate CI/CD and GitOps workflows.
Skilled in defining and managing infrastructure SLOs.
Pragmatic, ownership-driven approach; comfortable working in ambiguous environments.
Nice to have:
Experience supporting LLM inference workloads, model serving platforms, or GPU-backed infrastructure.
Knowledge of RAG systems, vector databases, or AI-related data platforms.
Background in high-performance or latency-sensitive environments.
Relevant certifications such as CKA, CKAD, Solutions Architect, or similar.
About the project:
As a Kubernetes & Cloud Infrastructure Engineer – AI Platform, you will be working for our client, a leader in AI innovation, building and managing critical infrastructure that supports advanced AI models and large language model workloads. Your work will directly impact the delivery, scalability, and security of cutting-edge AI solutions, empowering teams across the firm to harness AI's full potential. Join us and be part of shaping the future of intelligent technologies!
Unleash the power of cloud-native infrastructure — revolutionize AI platforms with your expertise!
Wrocław-based opportunity with an on-site work model.
Responsibilities:
Build and operate scalable Kubernetes clusters supporting multi-tenancy, GPU workloads, and model serving.
Manage AWS infrastructure, including networking, IAM, security, and cost optimization.
Develop and maintain infrastructure as code utilizing tools like Terraform, Helm, and Kustomize.
Implement and maintain CI/CD and GitOps workflows to streamline deployment pipelines.
Build observability solutions for system health, utilization, latency, and platform performance monitoring.
Automate scaling and capacity management, and enforce security and governance policies across cloud and on-premise environments.
Define and monitor infrastructure Service Level Objectives (SLOs) to ensure reliability and performance.
Collaborate closely with AI Platform Engineers, Data Scientists, and Research teams to support model inference and deployment infrastructure.
Enable internal teams through platform tooling, onboarding, and self-service portals.
We offer:
Stable and long-term cooperation with very good conditions
Enhance your skills and develop your expertise in the financial industry
Work on the most strategic projects available in the market
Define your career roadmap and develop yourself in the best and fastest possible way by delivering strategic projects for different clients of ITDS over several years
Participate in social events and training, and work in an international environment
Access to an attractive medical package
Access to the Multisport program
Access to Pluralsight
Flexible hours
🔍 Job Ad Decoder
🔴
comfortable working in ambiguous environments
May indicate a lack of clear processes, uncertainty about the project's direction, or frequently changing requirements.
🔴
Pragmatic, ownership-driven approach
You are expected to work independently and take full responsibility for assigned tasks, even when clear guidelines are missing.
🔴
building and managing critical infrastructure
May mean working under high pressure, with demanding availability and reliability requirements.
🟡
directly impact the delivery, scalability, and security
Your work will be of key importance, which may come with considerable responsibility and pressure.
🟡
empowering teams across the firm to harness AI's full potential
May mean having to support many different teams with varying needs and levels of technical maturity.