About the Role

Director of Data Engineering

locations

Remote

time type

Full time

job requisition id

JR104934

Marathon Health is a leading provider of advanced primary care in the U.S., serving 2.5 million eligible patients through approximately 630 employer and union-sponsored clients. Our comprehensive services include advanced primary care, mental health, occupational health, musculoskeletal, and pharmacy services, delivered through our 680+ health centers across 41 states. We also offer virtual primary care and mental health services accessible in all 50 states. Transforming healthcare delivery with a patient-first approach, we prioritize convenient access to both in-person and virtual care, resulting in improved health outcomes and significant cost savings. Committed to inclusivity and collaboration, we foster a positive work environment and recruit exceptional talent to ensure expertise and compassion in healthcare delivery. Marathon has been recognized as a five-time Modern Healthcare Best Places to Work in Healthcare winner and a six-time Best in KLAS award winner for employer-sponsored healthcare services.

ABOUT THE JOB

The Director of Data Engineering leads our dedicated team of data professionals in maintaining our databases, developing and managing data pipelines and supporting all client data integration activities with strong focus on Data Quality Management principles. In addition, the Director oversees all aspects of data ingestion, extraction, and integration processes to ensure seamless data exchanges between Marathon Health, clients, third-party vendors, and internal systems.  The Director of Data Engineering has extensive experience with and leading teams using AWS, PostgreSQL, SQL Server, Databricks, Snowflake and enterprise ETL tools. A strong foundation in agile methodologies is required to drive the development and optimization of our data infrastructure. This leadership role involves strategic planning, technical oversight, and ensuring the accurate availability of data into our enterprise environment.

ESSENTIAL DUTIES & RESPONSIBILITIES

  • Drive initiatives to scale the development of data pipelines, integration of client data (claims, eligibility, etc.), operations related to supporting client and vendor integration and implementation of data quality patterns into all solution development.
  • Lead and manage a team of data engineers, data operators and database administrators in the maintenance of databases and the development and support of complex data pipelines and integrations.
  • Design and implement strategies for database management, data extraction, transformation, and loading (ETL) using enterprise tools.
  • Manage the operations related to integration of client-supplied data into our data environment with a focus on automation, quality engineering and unit cost scale of data integration operations.
  • Oversee the integration of data from various sources to our platform, ensuring high availability, scalability, quality and security of the data infrastructure.
  • Manage the development of integration solutions between Marathon Health and clients, vendors and other integration partners in addition to internal analytics solutions like Snowflake.
  • Champion agile methodologies within the team, utilizing tools such as Jira for sprint planning and project tracking.
  • Collaborate with other technology and business units to identify data needs, integrate systems, and optimize data flows to support business objectives.
  • Monitor and analyze data pipelines and database performance, making adjustments as necessary to improve efficiency and reliability.
  • Stay abreast of the latest in data engineering technologies and practices, leading the adoption of innovations that can enhance data processing and analysis capabilities.

QUALIFICATIONS

Bachelor’s degree and 10+ years of experience in data engineering or a related field, with at least 3 years in a leadership role managing data engineering, data integration and data operations teams or equivalent combination of education and experience. AWS Certification a plus.

  • Expertise in managing databases (PostgreSQL, SQL Server) and implementing data pipelines in a cloud environment, particularly AWS.
  • Experience with large scale analytics services like Databricks and Snowflake in the integration of data and development of pipelines into analytics-based data structures.
  • Experience in developing progressive coding patterns using message-based frameworks (Kafka, Event Hub, etc.), pub-sub models, microservice architectures, etc.

DESIRED ATTRIBUTES

  • Proficiency with enterprise ETL tools and a solid understanding of ETL best practices.
  • Demonstrated experience in working with common healthcare files including claims, eligibility and biometrics (lab results, for example) data.
  • Demonstrated success in applying agile methodologies to data engineering projects, with experience using project management tools like Jira.
  • A track record of successful collaboration across departments to support data-driven decision-making and system integrations.
  • In-depth knowledge of database management, data integration, and ETL processes, with hands-on experience in PostgreSQL, SQL Server, AWS, and enterprise ETL tools.
  • In-depth knowledge and experience of developing solutions on large cloud-based analytics platforms like Databricks and Snowflake.
  • Strong knowledge and experience in integrating data quality management into data pipelines and data integration solutions.
  • Strong leadership skills, capable of managing and developing a high-performing data engineering team.
  • Excellent problem-solving and analytical skills, with the ability to oversee complex data systems and pipelines.
  • Proficient in agile project management, with experience in using tools like Jira for effective team collaboration and project tracking.
  • Strong communication and interpersonal skills, with the ability to work collaboratively with other technology teams and business stakeholders.

APPLY HERE