Data Engineer (DBX, dbt)
Webellian Sp.z o o
⚲ Warszawa
Wymagania
- Databricks
- Python
- PySpark
- ELT/ETL
- Pydentic
- PydenticAI
- Azure Data Services
Opis stanowiska
About the Webellian Webellian is a well-established Digital Transformation and IT consulting company committed to creating a positive impact for our clients. We strive to make a meaningful difference in diverse sectors such as insurance, banking, healthcare, retail, and manufacturing. Our passion for cutting-edge and disruptive technologies, as well as our shared values and strong principles, are what motivate us. We are a community of engineers and senior advisors who work with our clients across industries, playing a deep and meaningful role in accelerating and realizing their vision and strategy. About the position We are looking for a Regular Data Engineer to join a new, innovative project for one of our key clients in the insurance industry. You will work in a hybrid model with teammates based in Poland and collaborate with global stakeholders, including direct interaction with business users. This project focuses on building an advanced solution leveraging Large Language Models (LLMs) to scan documents and support automated decision-making processes. Goals and challenges • Contribute by building a training dataset for document processing and decision support • Design and implement end-to-end data pipelines (ingestion, transformation, storage, and consumption) • Prepare and maintain high-quality training datasets for machine learning models • Work with large-scale data on a modern cloud data platform (Databricks) • Apply best practices in data engineering, testing, and deployment • Collaborate closely with data scientists, engineers, and business stakeholders • Continuously improve performance, reliability, and automation of data workfl Hard skills we are looking for • Strong experience with Databricks (DBX) • Advanced knowledge of Python (experience with libraries such as Pydantic is a plus; PydanticAI is a strong advantage) • Solid experience in building and optimizing ETL/ELT pipelines • Very good knowledge of SQL and relational databases • Experience with PySpark • Familiarity with dbt is a plus • Experience with Azure Data Services is an advantage • Knowledge of CI/CD practices and tools (e.g. GitHub) • General understanding of infrastructure, orchestration, and IT security principles Soft Skills & Experience • Proven experience in Data Engineering (Senior level) • Bachelor’s or Master’s degree in a technical field (e.g. Computer Science, Engineering) or equivalent experience • Fluent English (written and spoken) • DevOps mindset (“you build it, you run it”) • Ability to understand complex requirements and translate them into actionable solutions • Strong communication skills and ability to work with cross-functional teams • Attention to detail, especially regarding data quality and business logic • Proactive attitude, ownership, and willingness to learn new technologies • Experience in the insurance domain is a plus What we offer • Contract under Polish law: B2B or Umowa o Pracę • Benefits such as private medical care, group insurance, Multisport card • There are English classes available • Hybrid work (at least 1 day/week on-site) in Warsaw (Mokotów) • Opportunity to work with excellent professionals • High standards of work and focus on the quality of code • New technologies in use • Continuously learning and growth • International team • Pinball, PlayStation & much more (on-site) Join a growing team of dedicated professionals! We love to pass on the knowledge to grow excellence, speak our minds without playing politics, and just enjoy hanging around together. If you share our passions - we want to meet you! Please include the following statement: “I hereby authorize Webellian Poland Sp. z o.o. to process my personal and store data included in my job application for the needs of following and future recruitment processes (in accordance with the Personnel Protection Act 29.08.1997 no 133 position 883)”.