Data Science Engineer – pharmaceutical industry (f/m/x)
Sii Sp. z o.o.
⚲ Białystok, Centrum, Bydgoszcz, Gdańsk, Oliwa, Katowice, Kraków, Podgórze, Lublin, Łódź, Śródmieście, Piła, Poznań, Wilda, Rzeszów, Szczecin, Toruń, Warszawa, Mokotów, Wrocław, Fabryczna
Wymagania
- Python
- GitHub
- GitLab
- AWS
- Grafana
- Datadog
- GenAI
- RAG
Opis stanowiska
Nasze wymagania: Minimum 5 years of experience in Python and working knowledge of JavaScript Understanding of version control systems (Git) and CI/CD pipelines Strong foundation in algorithms, data structures, and graph theory, including AST concepts Background in data engineering, including working with large codebases in on-prem and cloud (AWS) environments Familiarity with tools such as Artifactory, Prometheus, and Splunk Knowledge of AI/ML concepts, particularly GenAI, Graph RAG, and vector databases Ability to produce clear technical documentation and operate with an independent, exploratory problem-solving mindset Fluent Polish required Residing in Poland required O projekcie: We are looking for a Data Scientist with a strong engineering mindset to build the data foundation for software portfolio efficiency. This role focuses on extracting, structuring, and analysing data from code repositories, artifact management systems, and observability platforms. You will work at the intersection of data engineering, software architecture, and AI, leveraging modern approaches such as graph-based modelling and GenAI to map complex dependencies and improve visibility across a large-scale ecosystem of products and platform services. Zakres obowiązków: Extract and analyze data from code repositories (GitHub, GitLab) to build data-driven software metrics Use Abstract Syntax Trees (AST) and static analysis techniques to understand code structure, dependencies, and usage patterns Design and implement graph database solutions to model complex relationships across services and platforms Develop and enforce metadata standards (e.g., via Backstage/DevHub) to ensure consistency across systems Integrate data from observability and telemetry platforms (Grafana, DataDog, Prometheus, Splunk) to track adoption and performance Build algorithms to calculate software efficiency metrics based on defined criteria Apply AI/GenAI techniques (e.g., Graph RAG, vector databases) to enhance data retrieval, dependency mapping, and insights generation Oferujemy: Great Place to Work since 2015 - it’s thanks to feedback from our workers that we get this special title and constantly implement new ideas Employment stability - revenue of PLN 2.1BN, no debts, since 2006 on the market We share the profit with Workers - over PLN 76M has already been allocated for this aim since 2022 Attractive benefits package - private healthcare, benefits cafeteria platform, car discounts and more Comfortable workplace – class A offices or remote work Dozens of fascinating projects for prestigious brands from all over the world – you can change them thanks to Job Changer application PLN 1 000 000 per year for your ideas - with this amount, we support the passions and voluntary actions of our workers Investment in your growth – meetups, webinars, training platform and technology blog – you choose Fantastic atmosphere created by all Sii Power People