About the Role
Title: Senior Data Engineer
Type; Remote
Location: Remote – United States
Job Description:
Datavant is a data platform company and the world’s leader in health data exchange. Our vision is that every healthcare decision is powered by the right data, at the right time, in the right format.
Our platform is powered by the largest, most diverse health data network in the U.S., enabling data to be secure, accessible and usable to inform better health decisions. Datavant is trusted by the world’s leading life sciences companies, government agencies, and those who deliver and pay for care.
By joining Datavant today, you’re stepping onto a high-performing, values-driven team. Together, we’re rising to the challenge of tackling some of healthcare’s most complex problems with technology-forward solutions. Datavanters bring a diversity of professional, educational and life experiences to realize our bold vision for healthcare.
The Site Connect delivery data engineers are masters of data ETL. We create and maintain integrations for over 70+ EHR’s powering every patient use case you can think of. Our team is full of go getters that work in a matrixed 100% remote environment on agile teams that support customers from go live to support for the live of the integration.
You will:
- Create and maintain ELT/ETL processes for existing and new systems.
- Collaborate with development and business teams to understand requirements and define source system data flows
- Develop and maintain ETL/ETL specifications for data integration development
- Define and deliver consistent data modeling and data architecture standards, methodologies, guidelines and techniques
- Document, implement and maintain the data pipeline architecture and related business processes
- Serve as a source of knowledge of industry practices and processes.
- Participate in the development of enterprise standards and guidelines for data model quality and accuracy
- Audit project level data model quality deliverables to ensure that practices and standards are met
- Analyze information and data requirements and understand effects of data inconsistencies
- Identify inefficiencies in current architecture and processes and communicate solutions in a manner that gets support from the teams involved
- Perform cost and sizing estimates for projects
- Collaborate with the project coordinator and the rest of the agile team to identify epics, stories and estimate effort
- Create and maintain data dictionary documents, table and data lineage models and produce artifacts to support project development and communicate project information to customers
What you will bring to the table:
- Bachelors in Computer Science or other engineering degree equivalent
- 5+ years of hands-on experience in building Data pipeline (ETL/ELT) in a cloud platform
- 5+ years of experience with relational DBMS, SQL Server/T-SQL, stored procedures, functions. Including experience optimizing database performance using T-SQL and PL/SQL for efficient data retrieval
- 5+ years of experience with Python
- Experience with using AWS
Bonus points if:
- Healthcare data experience 2+ years
- Knowledge of clinical systems (e.g. Cerner, Epic, Meditech, etc.) and standard Acute/Ambulatory workflows.
- Preferred AWS Cloud Practitioner or above certification