Data Engineer (AI/Schema Engineering)
Acaisoft
⚲ Warszawa
23 520 - 29 400 PLN (B2B)
Wymagania
- Python
- SQL
Opis stanowiska
O projekcie: i there! If you’re looking for a high-impact position in an ambitious software house, we’ve got a match for you! Currently, we are searching for a Data Engineer to cooperate with our US client - the company integrates production-grade, governed AI workflows into complex enterprise systems by addressing common friction points like poor integration and lack of domain expertise. The organization combines deep domain knowledge with cutting-edge capabilities to accelerate the deployment of reliable AI products. Working hours: generally flexible; however, availability is required daily between 6:00 PM and 9:00 PM CEST due to regular meetings with the client’s team. Wymagania: - 5+ years of industry experience, including some experience with AI/ML systems- Strong proficiency in Python and SQL for data processing, analysis, and pipeline development- Solid understanding of data schema design, dataset generation, and data quality standards for AI applications - Experience building and maintaining scalable data and evaluation pipelines in production environments Nice to have: - Hands-on experience with AI/LLM frameworks, particularly LangChain- Some experience with AI models (benchmarking, test set creation, and performance analysis) Codzienne zadania: - Design, implement, and maintain data schema definitions for AI platform inputs, outputs, and intermediate representations - Ensure schema compatibility with APIs, databases, and downstream systems, including versioning as the platform evolves - Build and maintain benchmarking pipelines, run systematic evaluations, and deliver structured performance reports - Identify performance gaps and collaborate with cross-functional teams to prioritize model improvements - Generate synthetic and curated datasets, applying data augmentation and labeling techniques while ensuring data quality, diversity, compliance, and proper documentation