Data Scientist Catalog Quality and Selection
Allegro
⚲ Warszawa, Wola
Wymagania
- Python
Opis stanowiska
Nasze wymagania: Have experience in data analysis using SQL and building machine learning solutions in production environments, especially with end-to-end experience in designing ML models Want to translate business challenges into ML problems Can clearly communicate with business units - from formulating an analytical problem to a clear and intuitive presentation of results Program in Python and efficiently use basic development tools Understand statistical and machine learning methods Embrace modern tooling, specifically possessing practical and theoretical knowledge of Generative AI and Agentic AI principles (coding tools, prompting, validation). The ideal candidate enthusiastically leverages GenAI coding tools (like Copilot) to boost and automate workflows, remaining up-to-date and critically engaged with the newest GenAI tooling. Graduated with a degree strongly related to statistical/mathematical modeling, such as Mathematics, Physics, Economics, Computer Science (or similar major) or job experience in data science / ML positions. O projekcie: Our team drives the logic that powers a trustworthy marketplace. Composed of Data Scientists, Data Engineers and Analysts we develop statistical models and rules to measure and improve Products Catalog and Product Selection. We oversee the quality and correctness of all product components: titles, descriptions, images, and parameters. Our scope also covers ensuring consistency with sellers offers, detecting product duplicates, building selection models and more. We work in close collaboration with developers and business teams, participating in the entire project lifecycle - from ideation through to implementation. Our ultimate goal is to create a seamless experience where Partners can rely on fair visibility rules and Customers can trust every product detail they see on the platform. Zakres obowiązków: Co-creating projects on each stage - from concept to productization - to deliver insights and models solving business problems from different areas Using a wide range of model types, including boosting models, Bayesian methods, causal inference, optimization methods, deep learning and Generative AI solutions including LLMs and Agentic AI Processing terabytes of data using Google Cloud Platform solutions Working not only with tabular data, but also with spatial data, natural language texts, images and time series Taking part in implementations of off-line and on-line models Brainstorm, provide mentorship, share knowledge Cooperation with other teams: business stake-holders, analytics, data engineers Oferujemy: Flexible working hours in the hybrid model (4/1) - working hours start between 7:00 a.m. and 10:00 a.m. We also have 30 days of occasional remote work. Annual bonus based on your annual performance and company results. Well-located offices (with e.g. fully equipped kitchens, bicycle parking, terraces full of greenery) and excellent work tools (e.g., raised desks, ergonomic chairs, interactive conference rooms). A 16" or 14" MacBook Pro or corresponding Dell with Windows (if you don't like Macs) and all the necessary accessories. A wide selection of fringe benefits in a cafeteria plan - you choose what you like (e.g., medical, sports or lunch packages, insurance, purchase vouchers). English classes that we pay for related to the specific nature of your job. A training budget, inter-team tourism (see more here), hackathons, and an internal learning platform where you will find multiple trainings. An additional day off for volunteering, which you can use alone, with a team, or with a larger group of people connected by a common goal. Social events for Allegro people - Spin Kilometers, Family Day, Fat Thursday, Advent of Code, and many other occasions we enjoy.