Pracuj.pl · Remote work · Senior

Senior Data Engineer (PySpark, NoSQL)

EPAM Systems (Poland) sp. z o.o.

⚲ Kraków, Grzegórzki

Requirements

  • PySpark
  • Python
  • Microsoft Azure
  • Cosmos DB
  • Cassandra
  • DynamoDB
  • MongoDB

Job description

Our requirements:

  • 5+ years of experience as a Data Engineer or in a similar role
  • Strong hands-on experience with PySpark in production
  • Proven experience in data modeling, partitioning, indexing, and performance tuning in NoSQL systems
  • Strong programming skills in Python
  • Experience building and operating production-grade pipelines in the cloud (Azure)
  • Experience with distributed NoSQL databases (e.g., Cosmos DB, Cassandra, DynamoDB, MongoDB)
  • Strong understanding of distributed systems and performance optimization
  • Experience with CI/CD, monitoring, troubleshooting, and production support
  • Strong analytical and communication skills (English B2+)

Nice to have:

  • Experience with real-time / streaming data
  • Exposure to Data Science workflows
  • Knowledge of Big Data ecosystems
  • Experience with financial data
  • Familiarity with AI-assisted development or LLM tools

About the project:

We are seeking a Senior Data Engineer with strong expertise in Azure and PySpark, skilled in designing, implementing, and maintaining robust data processing solutions. This role focuses on building scalable, production-grade data systems, ensuring reliability, and optimizing performance in distributed environments.

Responsibilities:

  • Design and optimize large-scale data pipelines using PySpark
  • Build and maintain scalable ETL/ELT workflows in Azure
  • Troubleshoot production issues related to performance, latency, and availability
  • Work with distributed NoSQL technologies (e.g., Cosmos DB, Cassandra, DynamoDB, MongoDB, or similar)
  • Optimize Spark jobs (partitioning, execution plans, resource usage)
  • Implement best practices for scalability, security, and reliability
  • Collaborate with cross-functional teams on data-driven solutions
  • Contribute to automation, CI/CD, and operational improvements

We offer:

  • Engineering community of industry professionals
  • Friendly team and enjoyable working environment
  • Flexible schedule and opportunity to work remotely within Poland
  • Chance to work abroad for up to 60 days annually
  • Business-driven relocation opportunities
  • Outstanding career roadmap
  • Leadership development, career advising, soft skills, and well-being programs
  • Certification (GCP, Azure, AWS)
  • Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
  • English language classes
  • Stable income (Employment Contract or B2B)
  • Participation in the Employee Stock Purchase Plan
  • Benefits package (health insurance, multisport, shopping vouchers)
  • Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
  • Referral bonuses
  • Corporate, social and well-being events

🔍 Job Ad Decoder

🔴
Strong hands-on experience with PySpark in production
The candidate is expected to independently troubleshoot and optimize existing PySpark solutions, not just build new ones.
🔴
Proven experience in data modeling, partitioning, indexing, and performance tuning in NoSQL systems
This may mean that the NoSQL systems the candidate will work with are already complex and require advanced optimization.
🔴
Experience building and operating production-grade pipelines in cloud (Azure)
The candidate must be ready to independently deploy, monitor, and maintain running systems in the cloud, which often involves on-call duty.
🔴
Troubleshoot production issues related to performance, latency, and availability
Part of the job will involve resolving problems in a live system, which can be stressful and time-consuming.
🟡
Strong analytical and communication skills (English B2+)
Beyond technical skills, the candidate is expected to take an active part in technical discussions and communicate clearly, including in English.