Pracuj.pl Praca zdalna Mid New

T-Hub - Data Engineer

T-Mobile

⚲ Warszawa, Mokotów

Opis stanowiska

Nasze wymagania: GCP: Strong with Dataflow, BigQuery performance features (partitioning, clustering), Pub/Sub, Dataproc, use Cloud Monitoring/Logging for ops. Nice to have: IAM, VPC basics, service accounts; Apache Beam: Advanced transforms, windowing/triggering, side inputs/outputs; state/timers for streaming; resource sizing and tuning on Dataflow. Spark: Optimize joins/shuffles/partitioning; handle schema evolution; job debugging with metrics. Proficiency in Java: Solid grasp of concurrency/immutability; profiling; write reusable libraries; enforce code quality and testing standards. Apache Flink: Implement streaming pipelines. Airflow: Build complex DAGs SQL: Advanced queries, performance tuning, incremental patterns; define data quality checks and acceptance criteria. Willingness to travel at least 4 times a year Proven ownership of production pipelines. Mile widziane: CI/CD for data (GitHub Actions/Cloud Build), Terraform for infrastructure-as-code. Docker/Kubernetes for job packaging (especially for Spark/Flink). Architect batch and streaming systems; select technologies and patterns; define standards and guardrails. Provide technical leadership and architecture for data platforms. Drive strategy, build reusable frameworks, and ensure reliability, scalability, security, and cost efficiency across teams. Proficiency in Java on an Expert Level; System design and API design; memory/concurrency profiling; performance-critical code; establish coding standards and review practices. Build platform components (libraries, templates, governance, quality frameworks) to accelerate teams. Apache Beam: Deep expertise in high-throughput/low-latency pipelines; advanced state/timers; custom IOs; portability and cross-language considerations; performance debugging at runner level. 6+ years in data engineering with proven architectural leadership. Track record of delivering large-scale data systems and influencing cross-team outcomes. Sets architectural direction, builds frameworks, ensures security/compliance, drives measurable platform improvements, influences stakeholders. Zakres obowiązków: Design and implement batch and streaming pipelines; select Beam/Spark/Flink based on requirements. Data transformation ops for analytics and downstream services. Optimize jobs (performance, cost) and implement robust testing and observability. Build and manage Airflow orchestration. Troubleshoot production issues, lead incident resolution, and drive root-cause fixes. Contribute to team standards, templates, and reusable libraries. Using Java and Apache Beam as the main programming language and library for the streaming pipelines. Designs and ships a new pipeline or major refactor with measurable latency/cost improvements Hands-on, close cooperation with the Data Engineering Lead, resulting in quick onboarding and successful code base understanding Introduces or improves testing/observability patterns adopted by the team. Oferujemy: Working at T Hub will offer you an unique and highly rewarding experience on IT market. As a leader in the telecommunications industry, we do not only provide a platform to hone your technical skills but also empower you to be a catalyst for innovation. You'll have the opportunity to work at the forefront of modern technologies, from 5G to IoT and AI, shaping the future of connectivity.