
Lead Data Engineer
- Barcelona
- Permanente
- Tiempo completo
- Collaborate closely with the business teams, experienced data engineers, data scientists, and stakeholders to capture complex data needs in but not limited to domains like bioinformatics, omics, clinical data, and other relevant domains and translate them into robst and scalable data engineering solutions
- Lead and guide the design, build, and operations of robust data infrastructure on AWS Cloud, driving our data mesh architecture
- Automate, optimize, and fine-tune platform provisioning, scaling, and maintenance tasks to boost operational efficiency, performance, scalability, and cost control
- Lead the development and optimization of data pipelines, leveraging your expertise in data integration, ETL/ELT, advanced tooling and AWS cloud to deliver cutting-edge solutions
- Work hand-in-hand with cross-functional agile teams to architect and implement hybrid-cloud solutions to ensure seamless and high-performance data processing
- Lead the implementation of data monitoring and alerting systems, and partner with DevOps teams to proactively identify and resolve platform issues
- Ensure data security, compliance, and governance at every stage of the data platform, following global standards and best practices
- Establish and enforce global data engineering standards, ensuring alignment with data architecture, platform, quality, and governance principles
- Demonstrate deep expertise in implementing data warehouses, data lakes, and distributed processing technologies—such as Spark, Hadoop, and Kafka—in production environments
- Showcase your advanced proficiency with SQL (preferably Snowflake) and both relational and non-relational databases to optimize complex queries and data manipulation
- Exhibit mastery in programming languages (like Python, Shell scripting, and Scala/Java) for the development of sophisticated data engineering solutions
- Work within cross-functional agile teams to architect and deploy hybrid-cloud solutions and automated pipelines, ensuring seamless and high-performance data processing
- Act as a mentor and leader, offering guidance and support to junior engineers, and fostering a culture of collaboration and growth within the team
- Engage actively within the data engineering community by sharing insights, best practices, and innovations that contribute to broader industry progress
- Bachelor's/Master's in STEM or a relevant field with 5-7 years of experience in data engineering, with a strong preference for experience in the life ciences/pharmaceutical industry
- Extensive background in designing, developing, and optimizing data and cloud solutions, including data pipelines, service-oriented architectures
- Proven expertise in data integration technologies, ETL/ELT, and modern data engineering technologies, with experience in implementing or supporting Data Mesh architectures
- Experience with multimodal data systems and architectures, including batch, near real-time, and streaming data
- Proven experience designing distributed architectures for large-scale data processing with high performance, scalability, and fault tolerance (AWS, Snowflake, Spark, Hadoop, Kafka)
- Advanced knowledge of SQL, relational/non-relational databases, and data query optimization. Proficiency in programming languages such as Python, Shell scripting, and Scala/Java
- Expertise in managing cloud-native systems following IaC and DataOps principles (terraform, CI/CD, Orchestration, Actions)
- Extensive experience with agile development processes and concepts
- Exceptional problem-solving skills and attention to detail
- Excellent communication, presentation, and interpersonal skills
- Ability to lead teams effectively and collaborate with stakeholders at all levels
- Curiosity and a commitment to continuous learning and improvement
- Experience in the life sciences/pharmaceutical industry
- Familiarity with Data Mesh concepts such as data as a product, domain-driven design, and federated computational governance
- Familiarity with visualization tools (PowerBI, Tableau) and project management tools (JIRA, Confluence)