Back to Jobs

[Remote] AI Data Engineer – Scientific Data Platforms (Remote)

Remote, USA Full-time Posted 2026-06-21

Note: The job is a remote job and is open to candidates in USA. Astrix is a leading global biotechnology and pharmaceutical organization focused on innovation and access to healthcare. They are seeking an AI Data Engineer to scale AI models for drug discovery by building automated data ingestion and curation pipelines for genomics data.

Responsibilities

  • Build an agentic data ingestion pipeline and move beyond bespoke steps toward agents that teams can reliably use as a shared, deployed service
  • Triage and prioritize incoming requests to ingest specific datasets. Clean and organize data, building the first-pass cleaning and organization steps into the agentic flow
  • Validate cross-modal linkage. Add automated checks that catch when ingested data does not connect correctly and flag low-quality or mismatched records
  • Version every dataset, retaining and making prior versions addressable. Preserve raw data and provenance, ensuring agent workflows log validation and transformation steps so lineage is fully traceable
  • Partner with AI, software engineering, and computational biology groups to co-define data standards and conventions

Skills

  • Demonstrated experience building multi-agent workflows or LLM workflows using tools/frameworks such as LangGraph or LlamaIndex, including tool/function calling and asynchronous task execution
  • Strong Python skills for data manipulation, working with APIs and databases, and handling heterogeneous data formats
  • Familiarity with dataset versioning approaches (e.g., DVC, lakeFS, or equivalent)
  • Comfortable with or showing a strong willingness to learn common omics data formats like AnnData, H5AD, and TileDB
  • No deep bioinformatics expertise required; just a basic conceptual understanding of different modalities (e.g., RNA-seq vs. scRNA-seq vs. WES; genomics vs. transcriptomics vs. proteomics vs. metabolomics)
  • Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible
  • Degree in a technical field or equivalent practical experience
  • Must be Authorized to work in the United States without Sponsorship
  • Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints)
  • Exposure to cloud platforms (AWS, GCP) and containerization (Docker)
  • Familiarity with scientific workflow managers such as Nextflow or Snakemake

Benefits

  • Plus benefits

Company Overview

  • Astrix is the global leader in delivering innovative strategies and solutions to the life sciences industry. It was founded in 1995, and is headquartered in Red Bank, New Jersey, USA, with a workforce of 501-1000 employees. Its website is http://astrixinc.com.
  • Apply To This Job

    Similar Jobs

    [Remote] Customer Support Manager

    Remote, USA Full-time

    [Remote] Healthcare Cost Reporting/Reimbursement Manager - Remote Eligible

    Remote, USA Full-time

    [Remote] Lead Data Scientist, Stars Analytics

    Remote, USA Full-time

    [Remote] Growth Agency COO & Client Lead

    Remote, USA Full-time

    [Remote] Senior Product Manager (Healthcare Supply Chain)

    Remote, USA Full-time

    [Remote] Federal Account Executive - AI Observability & Networking Start-up

    Remote, USA Full-time

    [Remote] Head of Clinical Data Management

    Remote, USA Full-time

    [Remote] Social Media Marketing Assistant

    Remote, USA Full-time

    [Remote] Senior Director, Clinical Research & Development

    Remote, USA Full-time

    [Remote] Salesforce Data Cloud/Marketing Cloud Solution Developer

    Remote, USA Full-time

    Software Engineer, Data Infrastructure & Acquisition - Phoenix, AZ, USA

    Remote, USA Full-time

    Part-Time Remote Data Entry Specialist – Flexible Schedule, $32/hr – Home‑Based Administrative Support

    Remote, USA Full-time

    Remote Grants Writing Specialist

    Remote, USA Full-time

    Jr. Packaging Production Designer, rhode

    Remote, USA Full-time

    [Remote] Sr. Software Systems Engineer (.NET Developer)

    Remote, USA Full-time

    Experienced Customer Support Representative – Remote Part-Time Opportunity at arenaflex

    Remote, USA Full-time

    Inside Sales Manager - Club

    Remote, USA Full-time

    Experienced Customer Service Specialist – Delivering Exceptional Experiences for arenaflex Clients

    Remote, USA Full-time

    Urgently Require Certified Personal Trainer Stretch Therapist in Pasadena, CA

    Remote, USA Full-time

    Remote Entry-Level Virtual Customer Service Representative – E-Commerce Support & Client Success (Work From Home)

    Remote, USA Full-time