Job Description

Lead Data Engineer

US-Based Remote / Engineering / Remote

Health care is all about conversations, with over 2B spoken conversations each year between patients and their care teams in the United States alone. However, people forget up to 80% of those conversations, leading to worse patient outcomes. And doctors are burning out writing notes in their EMRs instead of focusing on their patients. That’s where Abridge comes in: our audio-based standalone and integrated solutions record and summarize medical conversations, anywhere care happens.

We’re looking for a Lead Data Engineer to help us design, build, and operate the data platform required to scale our business, which operates at the forefront of generative AI in healthcare. You’ll work alongside our growing team of engineers, machine learning scientists, and business teams to transform our data pipeline and warehouse, fuel our machine learning models, and meet our growing business intelligence needs.

What You’ll Do:

  • Lead the end-to-end design, development, testing and deployment of our data pipelines and warehouse — ensuring scalability, efficiency and data quality
  • Evaluate, recommend and implement the technologies, tools and processes we need
  • Collaborate closely with machine learning scientists to understand data requirements and translate them into technical solutions that you will implement
  • Help ML teams establish secure and compliant access protocols
  • Ensure the security, compliance and privacy of data through implementation of necessary protocols and best practices
  • Stay up to date with industry trends and advancements in data engineering, and bring innovative ideas to continuously improve our data processes
  • Monitor and troubleshoot data pipeline and warehouse quality and performance issues, ensuring smooth operation and proactively addressing potential bottlenecks
  • Document the architecture, design decisions, and processes, enabling knowledge sharing within the team and across the organization

What You’ll Bring:

  • 5+ years of Data Engineering experience in a cloud-first organization
  • Experience in building and operating data pipelines and warehouses. You have probably built an organization’s first data pipeline or have been the lead on a project to overhaul or replace a pipeline.
  • Knowledge of governance issues and how to secure sensitive information such as PHI, PII or financial data without limiting its utility to analysts and scientists
  • Up-to-date expertise in industry best practices and tools, plus a passion for learning new things. You’re savvy with tooling like Airbyte, Fivetran, dbt, Snowflake and BigQuery.
  • Excitement to take a hands-on approach in a fast-moving, productive and supportive environment
  • Willingness to pitch in wherever needed — as a fast-moving startup we need to do great work, quickly