Staff Software Engineer

Share this job

Canada

Staff Software Engineer, AI Engineering

Our client is a well-funded healthtech company serving over 140,000 independent medical practices across the US — think the obstetrician's clinic, the physiotherapy practice, the small surgical group. They achieved unicorn status in 2022 and closed a $250M funding round in late 2024, with a significant portion of that mandated specifically to accelerate their AI transformation. The engineering culture is strong — average tenure on the team is six to seven years, which says a lot — and this is a rare opportunity to build AI-native infrastructure from the ground up in one of the most complex and high-stakes domains in the US economy.

Our client is building a new class of AI-native systems where intelligence is embedded directly into application workflows — not bolted on as a standalone feature. These systems combine large language models, machine learning models, deterministic logic, event-driven architectures, and stateful workflow execution to automate some of the most complex operational processes in US healthcare.

They're looking for an AI Engineer to design and build production-grade AI systems that can reason, make decisions, and take action within real clinical and billing workflows.

What you'll work on

You'll be operating primarily in the AI orchestration layer, building the intelligence infrastructure that powers their next-generation platform. This includes:

Designing and implementing multi-step, stateful workflows that coordinate models, tools, and APIs — including long-running execution, failure recovery and retries, partial execution and resumption
Building LLM-powered applications for reasoning, extraction, and decision support — structured outputs, tool usage, prompt-driven workflows, both real-time and asynchronous
Designing hybrid AI architectures that combine LLMs (reasoning and language understanding), ML models (prediction and scoring), and deterministic logic (validation and enforcement) — using each appropriately based on the task
Building event-driven systems with asynchronous processing, idempotent execution, retry and failure handling, and state change-triggered workflows
Developing retrieval and context-aware systems — vector search, embedding-based approaches, RAG pipelines, internal and external data integration
Ensuring systems are production-ready: logging, tracing, monitoring, resilience to failures and edge cases

Specific capabilities you'll build

AI orchestration workflows using LangGraph, LangChain, AutoGen, or custom orchestration frameworks
Tool-calling and agent architectures integrated with APIs, databases, knowledge bases, and RAG pipelines
Reasoning workflows covering planning, tool execution, and validation loops
RAG systems including vector databases, embeddings pipelines, and retrieval workflows
AI safety and guardrails — prompt safety, hallucination mitigation, deterministic validators
AI-driven workflows for claim generation, note summarization, decision support, and automated actions

What they're looking for

5–10+ years of software engineering experience
Strong backend development experience — Python preferred
Experience building distributed and asynchronous systems
Hands-on experience with RAG pipelines and production LLM systems
Experience with workflow orchestration concepts and event-driven architectures
System design experience for reliability and scale

What will make you stand out

Production experience with LangGraph or LangChain orchestration frameworks
Experience with vector databases (Pinecone, ChromaDB, or similar)
Experience deploying models on Vertex AI or equivalent ML platforms
Streaming systems experience — Kafka, Pub/Sub, or similar
Prior work building AI-first or ML-driven applications

What makes this role different

This is not an AI integration role. Our client is building AI systems as core product infrastructure — systems that execute multi-step workflows driven by AI, maintain state across complex processes, combine multiple forms of intelligence into a single system, and continuously improve through data and feedback.

Candidates whose experience is limited to prompt engineering, chatbot integrations, or traditional REST backend development are unlikely to be the right fit.

Tech stack

Python, OpenAI/Gemini or similar LLMs, LangGraph/LangChain, Vertex AI, Kafka/Pub/Sub, Pinecone/ChromaDB, Docker, Kubernetes, cloud data warehouses, object storage

Location & model

Fully remote. Canadian candidates

Compensation:

$180,000-$220,000 CAD + Benefits

Apply for this job