JustJoin.IT Praca zdalna Mid

QA Engineer

Andersen

⚲ Warszawa, Belgrad, Chisinau, Prague, Vilnus

Wymagania

  • JSON
  • AI/ML Testing
  • Swagger
  • Postman
  • LangFuse

Opis stanowiska

Andersen is hiring a QA Engineer to ensure quality and stability of a secure cloud platform with collaboration and analytics features, supporting continuous improvements and reliable product delivery.  The customer is an international company delivering professional and technology-enabled solutions that support effective collaboration, structured communication, and operational efficiency for organizations. It operates in a fast-growing environment, focusing on scalability, security, and continuous improvement while developing digital platforms used by diverse clients worldwide.  The project is focused on enhancing a secure, cloud-based board management platform with intuitive meeting tools, real-time collaboration, and advanced analytics. It also includes building and maintaining scalable AI infrastructure, orchestration patterns, and observability to ensure reliable and intelligent platform performance.  Responsibilities:  - Designing test cases for non-deterministic AI systems.  - Executing evaluation runs against AI workflows using LangFuse.   - Validating AI outputs against acceptance criteria and business rules.   - Building and maintaining regression test suites for AI features.   - Identifying edge cases and failure patterns in AI behavior.   - Triaging and documenting accuracy/quality issues.   - Monitoring production quality metrics and flag degradation.   - Partnering with AI Engineer on test data curation and quality thresholds.  Must-have:  - Experience as a QA Engineer or in a similar role for 3+ years.  - Experience testing AI/ML or other non‑deterministic systems, with an understanding of probabilistic outputs and variability in model behavior.   - Solid knowledge of QA methodologies adapted for AI workflows, including evaluation‑based testing, scenario testing, and semantic validation.   - Hands‑on experience designing test cases for systems without fixed expected outputs, using acceptance criteria, heuristics, and quality thresholds. Experience executing evaluation runs with  - AI observability or evaluation tools (experience with LangFuse is a strong advantage).   - Ability to validate LLM outputs against business rules, prompt requirements, safety guidelines, and product acceptance criteria.   - Experience building and maintaining regression suites for AI features (prompt regressions, dataset-based regressions, workflow regressions).   - Ability to identify edge cases, emergence patterns, and failure modes in AI behavior (e.g., hallucinations, inconsistency, bias, context loss).   - Experience documenting issues with detailed repro steps, evaluation evidence, logs, and accuracy/quality metrics.   - Ability to work with JSON, API tools (Postman, Swagger), and logs from AI systems.  - Level of English – from Upper-Intermediate and above.  Nice to have:  - Exposure to evaluation frameworks (LangFuse evals, Ragas, TruLens, or internal evaluators).    - Understanding of key LLM quality metrics such as accuracy, precision/recall, semantic similarity scores, or custom evaluation metrics.   - Familiarity with production monitoring dashboards or telemetry tools to detect quality degradation.   - Basic understanding of LLMs, embeddings, RAG workflows, and prompt-based systems.   - Experience with test data curation, labeling, or annotation processes.   Reasons why this job would be interesting to you:  - Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others. Andersen cooperates with such businesses as Samsung, Siemens, Johnson & Johnson, BNP Paribas, Ryanair, Mercedes, TUI, Verivox, Allianz, T-Systems, etc..  - The opportunity to change the project and/or develop expertise in an interesting business domain.  - Job conditions – you can work both fully remotely and from the office or can choose a hybrid variant.  - Guarantee of professional, financial, and career growth! The company has introduced systems of mentoring and adaptation for each new employee.  - The opportunity to earn an additional up to 1,000 USD per month by participating in the company's activities.  - Access to the corporate training portal, where the entire knowledge base of the company is collected and which is constantly updated.  - Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies).  - Certification compensation (AWS, PMP, etc).  - Referral program.  - English courses.  - Private health insurance and compensation for sports activities.    Join us!  Your personal data is protected in accordance with GDPR regulations. Learn more: https://andersenlab.com/privacy-policy/pl   https://people.andersenlab.com/