Senior Databricks Architect / Data Engineering Expert
HyLogic
⚲ Warsaw
Requirements
- Databricks
- PySpark
- SQL
- Python
- Azure Data Services
- Data Lake
- Data Factory
- Airflow
- Power BI
- Machine Learning
Job description
About the Role
We’re looking for a Senior Databricks Architect / Data Engineering Expert to lead the design and implementation of scalable, high-performance data architectures. You’ll define the Databricks platform blueprint, optimize compute and storage layers, and ensure enterprise-grade performance, security, and cost efficiency. This role focuses on architectural design, platform governance, and advanced Databricks optimization to support analytics, AI, and data product development across the organization.
Location: Warsaw, Poland
Work type: Hybrid
Key Responsibilities
• Define and implement the overall Databricks Lakehouse architecture, including Delta Lake design, security models, and data governance standards.
• Architect and oversee the development of large-scale, reusable data pipelines serving as the foundation for analytics, AI, and BI platforms.
• Design and implement advanced optimization strategies (e.g., cluster sizing, job parallelization, Delta optimization, caching, partitioning).
• Lead the integration of real-time and batch data streams using Databricks, Delta Live Tables, and Azure data services.
• Establish and enforce data governance, lineage, and observability frameworks across the Databricks ecosystem.
• Collaborate with cloud, AI/ML, and BI teams to align platform strategy with data product needs.
• Provide technical leadership, architecture reviews, and best practices for Databricks usage across teams.
• Drive cost and performance tuning initiatives to ensure efficient compute resource utilization and scalability.
Required Skills & Qualifications
• Deep architectural expertise in Databricks and the Lakehouse paradigm (clusters, jobs, Delta Lake, Unity Catalog, governance).
• Expert-level proficiency in PySpark, SQL, and Python for large-scale distributed data processing and pipeline design.
• Proven ability to design and optimize complex batch and streaming architectures (Lambda, Kappa patterns).
• Solid understanding of Azure Data Services (Data Lake, Data Factory, Event Hubs) and orchestration frameworks (e.g., Airflow).
• Demonstrated success in Databricks platform tuning, performance audits, and cost optimization.
• Experience integrating Databricks with machine learning and BI ecosystems (Power BI, MLflow).
• Strong communication and leadership skills to influence data strategy and cross-functional architecture decisions.
Nice-to-Have
• Experience implementing Unity Catalog, advanced lineage, and access control frameworks.
• Hands-on experience with CI/CD for Databricks assets (GitHub Actions, Azure DevOps).
• Broad knowledge of data governance and metadata management (Purview, Collibra, etc.).
• Understanding of enterprise-level data architecture standards and cross-platform data interoperability.
What We Offer
• A chance to shape the data foundation of an evolving analytics landscape.
• Hands-on experience with modern data technologies and streaming architectures.
• A collaborative and forward-thinking environment that values innovation and technical excellence.
Reasons to Join Our Team:
• Tangible impact on the company you work for.
• Collaboration with professionals from the IT and Supply Chain industry in a fun environment.
• Exposure to cutting-edge technologies and business frameworks.
• “We Get Things Done”: we value work in an agile, high-growth company. This can be achieved with the right balance of planning and pragmatism.