Lead Full Stack Software Engineer

Manulife
Toronto, CA; US

Job Description

The Lead Full Stack Software Engineer brings deep expertise across both front-end and back-end development, playing a key role in building next-generation products and services. This individual will lead the design, development, testing, and implementation of new features that enable scalable, high-quality software solutions while setting technical direction and best practices.

Position Responsibilities:

  • Designs, builds, and maintains the technology platform's features and infrastructure, including hardware, software, and network components.
  • Implement and integrate technology solution supporting AI governance (model governance) for orchestration, feature engineering, model deployment controls, approval workflows, and audit-ready evidence capture.
  • Design and implement policy-as-code guardrails (access, usage, data handling, deployment standards) and enforcement points across the AI lifecycle.
  • Build capabilities for model lineage, traceability, and monitoring (metadata, evaluations, drift/quality signals) to support risk management and regulatory needs.
  • Build scalable microservices and event-driven pipelines for model training/inference using Akka Streams and Cluster Sharding.
  • Integrate AdaptiveML workflows for continuous/online learning, feature stores, model registries, and A/B experimentation.
  • Develop reusable reference patterns, inner-source components that meet reliability, security, and compliance standards.
  • Implement shared runtimes for multi agent coordination, state management, memory persistence, and messaging.
  • Design interoperable APIs/SDKs used by data scientists and developers to build agent powered applications.
  • Maintain and improve CI/CD pipelines and developer toolchains for AI services to enable rapid, compliant delivery.
  • Evaluate emerging AI/ML infrastructure capabilities; prototype and introduce tools that improve developer productivity and reliability.
  • Develop and operate scalable backend services supporting high traffic agent interactions, retrieval operations, and real time execution flows.
  • Use cloud native technologies (containers, orchestration, IaC, CI/CD) to deliver reliable, cost-efficient services.
  • Optimize runtime performance across CPU/GPU/accelerator workloads.
  • Monitors and resolves persistent platform issues when surfaced by technical support teams such as bottlenecks, connectivity problems, and system failures.
  • Considers compliance and regulatory requirements throughout the platform lifecycle. Implements security measures, such as access controls, encryptions, and vulnerability assessments when applicable.
  • Partners with architects and business leaders to design and build robust platforms across all Global AI Platform capability layers.

Required Qualifications:

  • 7–10+ years in software engineering; 3+ years leading teams/projects in AI/ML or distributed systems.
  • Hands-on experience building or integrating technology supporting AI governance (model governance) and MLOps capabilities for model lifecycle management (registry, approvals, monitoring), orchestration, and continuous learning (e.g., AI Foundry, AdaptiveML, or equivalent).
  • Proficiency in Scala or Java (Akka ecosystem), plus Python for ML tooling.
  • Experience with stream processing and data pipelines.
  • Solid MLOps background: model registries, feature stores, CI/CD for ML, containerization (Docker), orchestration (Kubernetes).
  • Cloud proficiency (AWS/Azure), Terraform or IaC, and secrets/IAM.
  • Deep understanding of distributed systems, observability stack and resilience patterns.
  • Strong communication, documentation, and stakeholder management skills.

Preferred Qualifications

  • Experience with online learning, reinforcement learning, or active learning in production.
  • Knowledge of responsible AI frameworks, model risk management, and fairness/bias assessment.
  • Performance optimization for low-latency inference; GPU/accelerator utilization.
  • Experience in regulated industries (e.g., financial services/insurance) with audit and governance requirements.
  • ModelOp (ModelOps) and Dynamo implementation experience supporting AI governance / model governance and model lifecycle management.

When you join our team:

  • We’ll empower you to learn and grow the career you want.
  • We’ll recognize and support you in a flexible environment where well-being and inclusion are more than just words.
  • As part of our global team, we’ll support you in shaping the future you want to see.

#LI-Hybrid

The role being advertised is an existing vacancy.

About Manulife and John Hancock

Manulife Financial Corporation is a leading international financial services provider, helping people make their decisions easier and lives better. To learn more about us, visit https://www.manulife.com/en/about/our-story.html.

Manulife is an Equal Opportunity Employer

At Manulife/John Hancock, we embrace our diversity. We strive to attract, develop and retain a workforce that is as diverse as the customers we serv

Skills & Requirements

Technical Skills

ScalaJavaPythonAkka ecosystemDockerKubernetesAwsAzureTerraformIacSecrets/iamCloud native technologiesContainersOrchestrationCi/cdMlopsModel registriesFeature storesCi/cd for mlStream processingData pipelinesModel governanceAi governanceAdaptivemlAkka streamsCluster shardingModel lineageTraceabilityMonitoringMetadataEvaluationsDrift/quality signalsMicroservicesEvent-driven pipelinesModel training/inferenceA/b experimentationShared runtimesMulti agent coordinationState managementMemory persistenceMessagingApis/sdksAi/ml infrastructureDeveloper productivityReliabilitySecurityComplianceAccess controlsEncryptionVulnerability assessmentsTechnical supportBottlenecksConnectivity problemsSystem failuresLeadershipCommunicationAiMlDistributed systemsSoftware engineeringCloud computingData scienceEngineering

Level

lead

Posted

4/17/2026

Apply Now

You will be redirected to Manulife's application portal.