Senior Data Engineer (PySpark, NoSQL)
EPAM Systems (Poland) sp. z o.o.
⚲ Kraków, Grzegórzki
Wymagania
- PySpark
- Cosmos DB
- Cassandra
- DynamoDB
- Spark
- Microsoft Azure
Opis stanowiska
Nasze wymagania: 5+ years of experience as a Data Engineer or similar role Strong hands-on experience with PySpark in production (transformations, tuning, optimization) Solid experience with distributed NoSQL databases (e.g., Cosmos DB, Cassandra, DynamoDB, or similar) Strong understanding of Spark architecture and performance optimization Experience building scalable data pipelines in Azure Knowledge of NoSQL data modeling and cloud-based troubleshooting Strong analytical and problem-solving skills Ability to work effectively in high-availability environments Strong communication skills (English B2+) Mile widziane: Experience with code generation, including non-AI and AI-assisted approaches Exposure to Data Science workflows Experience with Big Data platforms and distributed systems Knowledge of financial instruments and financial services data Hands-on experience with industry-standard LLMs (including GPT, Claude, or similar) O projekcie: We are seeking a Senior Data Engineer with strong expertise in cloud-based NoSQL databases, Azure, and PySpark, skilled in designing, implementing, and maintaining robust data processing solutions. This role focuses on ensuring platform reliability, optimizing performance, and enabling scalable and efficient data operations within a production-grade environment. Zakres obowiązków: Provide end-to-end support for NoSQL databases, including monitoring, troubleshooting, and performance tuning Design and optimize large-scale data pipelines using PySpark Work with distributed NoSQL technologies (e.g., Cosmos DB, Cassandra, DynamoDB, or other production-grade systems) Build and maintain scalable ETL/ELT workflows in Azure environments Troubleshoot and resolve production issues related to performance, latency, and availability Optimize Spark jobs (partitioning, execution plans, resource usage) Implement best practices for scalability, security, and data reliability Collaborate with cross-functional teams to support data-driven solutions Contribute to automation and operational improvements Maintain documentation and participate in production support rotations Oferujemy: Engineering community of industry professionals Friendly team and enjoyable working environment Flexible schedule and opportunity to work remotely within Poland Chance to work abroad for up to 60 days annually Business-driven relocation opportunities Outstanding career roadmap Leadership development, career advising, soft skills, and well-being programs Certification (GCP, Azure, AWS) Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru English language classes Stable income (Employment Contract or B2B) Participation in the Employee Stock Purchase Plan Benefits package (health insurance, multisport, shopping vouchers) Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more Referral bonuses Corporate, social and well-being events