NoFluffJobs Stacjonarnie Mid

Data Engineer (Databricks)

Xebia sp. z o.o.

⚲ Warszawa

18 500 - 26 700 PLN (B2B)

Wymagania

  • Spark
  • Databricks
  • Airflow
  • Cloud
  • Python
  • Unity Catalog (nice to have)

Opis stanowiska

O projekcie: Who We Are While Xebia is a global tech company, our journey in CEE started with two Polish companies – PGS Software, known for world-class cloud and software solutions, and GetInData, a pioneer in Big Data. Today, we’re a team of 1,000+ experts delivering top-notch work across cloud, data, and software. And we’re just getting started. What We Do We work on projects that matter – and that make a difference. From fintech and e-commerce to aviation, logistics, media, and fashion, we help our clients build scalable platforms, data and AI solutions, and cutting-edge applications to shape the future of tech. Our clients include McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, InPost, and many, many more. We value smart tech, real ownership, and continuous growth. We use modern, open-source stacks, and we’re proud to be trusted partners of Databricks, dbt, Snowflake, Azure, GCP, and AWS. Fun fact: we were the first AWS Premier Partner in Poland! Beyond Projects What makes Xebia special? Our community. We support tech communities, organize meetups (Software Talks, Data Tech Talks), and have a culture that actively support your growth via Guilds, Labs, and personal development budgets — for both tech and soft skills. It’s not just a job. It’s a place to grow. What sets us apart?  Our mindset. Our vibe. Our people. And while that’s hard to capture in text – come visit us and see for yourself. Wymagania: Your profile: - 2–4+ years of professional experience in Data Engineering, Software Engineering, or Operational Engineering,- experience with Databricks and PySpark for large-scale data processing,- strong proficiency in Python, including building and debugging data pipelines and automation scripts,- hands-on experience with Apache Airflow (DAG development, operators, troubleshooting),- very good knowledge of SQL, including complex joins, window functions, and JSON-based data,- experience working with cloud platforms (AWS and/or GCP),- upper-intermediate English,- readiness to work in a hybrid setup (in the Warsaw office once per week). Nice to have: - experience with Unity Catalog,- experience with database migrations and schema/version management,- comfort working in environments with frequent production support and delivery deadlines,- experience building agentic or AI-driven automation workflows. Work from the European Union region and a work permit are required Recruitment Process: CV review – HR Interview – Technical Interview - Client Interview – Decision Codzienne zadania: - designing, building, and maintaining end-to-end data pipelines for client-facing measurement reports and licensed datasets, - operating and troubleshooting Apache Airflow DAGs supporting scheduled and on-demand data deliveries, - managing push-based delivery workflows (cloud storage, file transfers, delivery verification) Investigating and resolving production incidents across distributed systems (Airflow, databases, cloud storage), - implementing automation and AI-driven agents to streamline operational processes and data validation, - supporting custom delivery requests, including matching files, cross-reference datasets, and bespoke client configurations, - developing data quality and validation tooling to ensure accuracy before client delivery, - writing and maintaining database migrations for delivery configurations and client setups, - collaborating with product, engineering, measurement science, and client-facing teams, - documenting operational processes, runbooks, and delivery workflows.

🔍 Dekoder Ogłoszenia

🔴
We work on projects that matter – and that make a difference.
Może oznaczać projekty o dużym wpływie biznesowym, ale też projekty o charakterze bardziej społecznym lub badawczym, które niekoniecznie są komercyjnie dochodowe.
🔴
We value smart tech, real ownership, and continuous growth.
„Real ownership” może oznaczać dużą odpowiedzialność i samodzielność, ale też konieczność samodzielnego rozwiązywania problemów bez wsparcia.
🔴
Our community. We support tech communities, organize meetups (Software Talks, Data Tech Talks), and have a culture that actively support your growth via Guilds, Labs, and personal development budgets — for both tech and soft skills.
Duży nacisk na aktywność w społeczności i rozwój może oznaczać oczekiwanie zaangażowania w te obszary poza standardowymi obowiązkami projektowymi.
🔴
It’s not just a job. It’s a place to grow.
Podkreśla rozwój, ale może sugerować, że praca może być mniej stabilna lub przewidywalna niż w tradycyjnym modelu zatrudnienia, skupiając się na nauce.
🔴
Our mindset. Our vibe. Our people. And while that’s hard to capture in text – come visit us and see for yourself.
Jest to bardzo ogólne stwierdzenie, które może maskować brak konkretnych informacji o kulturze pracy, narzędziach czy codziennych obowiązkach.