Pracuj.pl Hybrydowo Senior

Data Engineer (DBX,dbt), Senior

Webellian Sp. z o.o.

⚲ Warszawa, Mokotów

Do uzgodnienia

Wymagania

  • Python
  • PySpark
  • SQL
  • dbt
  • GitHub

Opis stanowiska

Nasze wymagania:
Strong experience with Databricks (DBX)
Advanced knowledge of Python (experience with libraries such as Pydantic is a plus; PydanticAI is a strong advantage)
Solid experience in building and optimizing ETL/ELT pipelines
Very good knowledge of SQL and relational databases
Experience with PySpark
Familiarity with dbt is a plus
Experience with Azure Data Services is an advantage
Knowledge of CI/CD practices and tools (e.g. GitHub)
General understanding of infrastructure, orchestration, and IT security principles
Proven experience in Data Engineering (Senior level)
Bachelor’s or Master’s degree in a technical field (e.g. Computer Science, Engineering) or equivalent experience
Fluent English (written and spoken)
DevOps mindset (“you build it, you run it”)
Ability to understand complex requirements and translate them into actionable solutions
Strong communication skills and ability to work with cross-functional teams
Attention to detail, especially regarding data quality and business logic
Proactive attitude, ownership, and willingness to learn new technologies

Mile widziane:
Familiarity with dbt
Experience in the insurance domain

O projekcie:
We are looking for a Senior Data Engineer to join a new, innovative project for one of our key clients in the insurance industry. You will work in a hybrid model with teammates based in Poland and collaborate with global stakeholders, including direct interaction with business users. This project focuses on building an advanced solution leveraging Large Language Models (LLMs) to scan documents and support automated decision-making processes.

Zakres obowiązków:
Contribute by building a training dataset for document processing and decision support
Design and implement end-to-end data pipelines (ingestion, transformation, storage, and consumption)
Prepare and maintain high-quality training datasets for machine learning models
Work with large-scale data on a modern cloud data platform (Databricks)
Apply best practices in data engineering, testing, and deployment
Collaborate closely with data scientists, engineers, and business stakeholders
Continuously improve performance, reliability, and automation of data workflow

Oferujemy:
Contract under Polish law: B2B or Umowa o Pracę
Benefits such as private medical care, group insurance, Multisport card
There are English classes available
Hybrid work (at least 1 day/week on-site) in Warsaw (Mokotów)
Opportunity to work with excellent professionals
High standards of work and focus on the quality of code
New technologies in use
Continuously learning and growth
International team
Pinball, PlayStation & much more (on-site)

🔍 Dekoder Ogłoszenia

🔴
DevOps mindset (“you build it, you run it”)
Oczekuje się, że będziesz odpowiedzialny za pełen cykl życia kodu, od tworzenia po utrzymanie i monitorowanie w produkcji.
🔴
Ability to understand complex requirements and translate them into actionable solutions
Może oznaczać, że wymagania będą niejasne lub sprzeczne, a Ty będziesz musiał je doprecyzować i samodzielnie je rozwiązać.
🔴
Proactive attitude, ownership, and willingness to learn new technologies
Oczekuje się, że będziesz samodzielnie identyfikował problemy i proponował rozwiązania, a także szybko przyswajał nowe narzędzia i technologie.
🔴
Strong experience with Databricks (DBX)
Chociaż DBX jest wymienione, może to oznaczać, że firma dopiero zaczyna z tym narzędziem i oczekuje, że kandydat pomoże w jego implementacji i rozwoju.
🟡
Bachelor’s or Master’s degree in a technical field (e.g. Computer Science, Engineering) or equivalent experience
Określenie "equivalent experience" może być używane do obniżenia wymagań formalnych, ale w praktyce może oznaczać, że formalne wykształcenie jest preferowane.