Our client is a fastgrowing tech startup headquartered in Los Angeles CA looking for talented dynamic team members who want to help shape our groundbreaking artificial intelligence platform.
We are transforming and accelerating clinical trials to help get lifesaving treatments to patients faster and accelerate innovation in healthcare. To that end we build a cuttingedge software suite that connects all clinical research stakeholders from research teams to treating physicians patients and study sponsors on a realtime realworld data SaaS platform powered by AI.
They are on a mission to revolutionize healthcare’s clinical trials process through innovative AI and ML solutions. Our software mines realtime clinical data to precisionmatch patients to clinical trials utilizing cuttingedge AI and ML techniques.
We are looking for a
Head of Data Engineering to spearhead the continued development and application of our data engineering platform. This is both a visionary and handson role.
This role presents an opportunity for an exceptional handson data engineering leader to build a team of data and platform engineers while overseeing the use of AI and ML to drive strategic initiatives.
As the Head of Data Engineering you will work closely with stakeholders to continuously innovate our platform ensuring that it stays ahead of the curve in supporting the mission of the organization. You will be tasked with scaling the data ingestion and comprehension of one of the largest and densest sets of rich unstructured clinical data.
What You'll Do
-
- Lead the continued development and enhancement of our clinical data ingestion and comprehension pipeline.
- Drive the utilization of core principles in development including observability scalability and endtoend control.
- Support the utilization of advanced AI/ML research through the exposure of raw canonicalized and comprehended data in analytics platforms.
- Establish and maintain key performance metrics to track the effectiveness of data engineering initiatives.
- Foster collaboration with internal and external stakeholders to gather feedback and drive continuous improvement.
- Enhance the AI's reputation as a leader in AIdriven clinical trials acceleration through thought leadership and industry recognition.
About You
-
- Strong playercoach mentality with an ability to balance the handson needs of leading two groups: data platform (data engineering) and search/enrichment (ML engineering).
- Proven record of accomplishment leading successful largescale clinical data initiatives within a productfocused environment.
- Strong background in applied data engineering and data pipelines with handson experience developing and deploying productionready models.
- Understanding of Software Development Life Cycle and data product development
- Experience working with healthcare data especially HL7 and FHIR.
- Deep understanding of streaming data ingestion and ETL processes
- Conceptual understanding of ML techniques particularly NLP (e.g. NER BERT).
- Conceptual understanding of AI techniques like large language models (LLMs) selflearning models (SLMs) and other stateoftheart approaches.
- Experience with database technologies especially Elasticsearch PostgreSQL Amazon Aurora and DynamoDB.
- Demonstrated passion for staying up to date with the latest data engineering and pipeline trends along with a record of accomplishment of driving innovation.
Preferred Qualifications
-
- Cloud Services: Experience with cloudbased data processing and storage services (AWS).
- Infrastructure as Code: Proficiency with infrastructure as code tools (CDK).
Technologies We Use: While specific expertise in our tech stack is beneficial we value adaptability and a willingness to learn. Our current stack includes:
- AWS Cloud Services (e.g. EC2 ECS RDS Aurora DynamoDB Lambda)
- Java (Kotlin) Python TypeScript
- Kubernetes Docker
- FHIR Servers (e.g. HAPI Health Samurai AidBox)
- Elasticsearch and Elastic Cloud
- CI/CD: GitHub Actions
- Monitoring: OpenTelemetry AWS XRay AWS Cloudwatch Datadog Pendo