NoFluffJobs Remote work Senior New

Senior DevOps Engineer (AWS+Ruby)

Square One Resources

⚲ Warsaw

23 520 - 28 560 PLN (B2B)

Requirements

  • Kubernetes
  • AWS
  • Ruby
  • Docker
  • RabbitMQ
  • Kafka
  • SQS
  • MySQL
  • PostgreSQL
  • Ruby on Rails (nice to have)
  • Java (nice to have)
  • Spring Boot (nice to have)

Job description

About the project:

You will join a team building a large-scale mobility platform enabling taxi services, payments and real-time ride management. The system integrates multiple distributed components, including ride processing, payment infrastructure (PCI DSS compliant), mobile apps and third-party services such as payment gateways and communication tools.

Requirements:

  • 7+ years of DevOps / SRE experience
  • Strong expertise in Kubernetes (EKS preferred)
  • Strong hands-on experience with AWS (EC2, EKS, RDS, S3, IAM, VPC)
  • Solid programming skills in Ruby
  • Experience with Docker and containerized environments
  • Experience with CI/CD and Infrastructure as Code (Terraform, CloudFormation, etc.)
  • Experience with messaging systems (RabbitMQ, Kafka, SQS)
  • Strong knowledge of relational databases (MySQL/PostgreSQL)
  • Experience with distributed systems and microservices
  • Familiarity with observability tools (Prometheus, Grafana, ELK, Datadog)
  • Strong Linux and networking fundamentals
  • Experience with SRE practices (SLO/SLI/SLA, incident management)
  • Strong communication skills and ownership mindset

Optional:

  • Experience with Ruby on Rails or Java/Spring Boot
  • Experience with high-traffic SaaS platforms
  • Cloud cost optimization experience
  • Chaos engineering practices
  • Mentoring experience

Daily tasks:

  • Design, build and operate scalable, highly available distributed systems
  • Improve system performance, reliability and resilience
  • Automate infrastructure, CI/CD pipelines and operational workflows
  • Troubleshoot and resolve production issues
  • Lead system upgrades and migrations with minimal downtime
  • Participate in on-call rotations and incident response
  • Collaborate closely with development teams to improve system operability
  • Implement monitoring, alerting and capacity planning best practices
  • Drive automation to reduce operational overhead
  • Contribute to incident management, post-mortems and disaster recovery strategies