Senior Data Scientist – Generative AI / LLM Specialist
SquareOne
⚲ Remote
26 880 - 30 240 PLN (B2B)
Wymagania
- GenAI
- NLP
- multimodal models
- Python
- Azure
- GCP
- Deep learning
- NumPy
- PyTorch
- HuggingFace
- LangChain
- LangGraph
- GenAI API
- OpenAI
- Gemini
- MCP Servers
- Databricks (nice to have)
- Microservices architecture (nice to have)
- Code repositories (nice to have)
- Code assistant (nice to have)
Opis stanowiska
O projekcie: We are looking for a Data Scientist to join our AI team and support clients in building end-to-end Talk-To-Data (TTD) solutions powered by LLMs and GenAI models. The AI team focuses on cutting-edge aspects of Generative AI and LLMs with applications such as RAG, summarization, multi-agent workflows, and model fine-tuning. With deep, PhD-level expertise in GenAI, NLP, and Computer Vision, this is an excellent environment for professionals eager to push their skills to new heights. We are seeking candidates for long-term engagement in GenAI and related domains. Wymagania: - Solid understanding of deep learning concepts. - Experience in Machine Learning, particularly in Generative AI (LLM/LMM), with focus on NLP or multimodal models. - Experience gathering business requirements and translating them into technical plans, data processing, feature engineering, model evaluation, hypothesis testing, and model deployment. - Strong Python and object-oriented programming skills; working knowledge of SQL and vector databases. - Experience with Azure or GCP cloud platforms. - Knowledge of Deep Learning and GenAI libraries: NumPy, PyTorch, HuggingFace, LangChain, LangGraph, and GenAI APIs (OpenAI, Gemini). - Hands-on experience designing or operating MCP servers/clients for LLM agents. Nice to Have - Experience working with Databricks. - Proven commercial experience in Generative AI, NLP, or Computer Vision projects. - Familiarity with microservice architectures. - Experience with code repositories and code assistants. - Strong business acumen and ability to propose creative solutions for client problems. - Ability to mentor and lead junior team members. Codzienne zadania: - Develop end-to-end GenAI applications such as chatbots, voicebots, and Talk-to-Data systems, including data ingestion, retrieval layers, orchestration (e.g., LangChain, LlamaIndex, LangGraph), API/backend, and simple UI where needed. - Design and implement RAG pipelines with vector databases, hybrid search, rerankers, query transformation, and evaluation frameworks for relevance and robustness. - Perform model selection, prompting strategies, and fine-tuning (LoRA/QLoRA/SFT) for text, code, and multimodal models, including guardrails, output evaluation, and A/B testing. - Design, integrate, and optimize LLM interactions with external tools, APIs, and data sources using Model Context Protocol (MCP) connectors. - Understand business requirements and translate them into technical goals, define success metrics, audit data feasibility, and align stakeholder expectations. - Support project delivery and pre-sales initiatives.