Reliability Engineer (Contractor)

Oct 4, 2025 · Remote — Asia-Pacific preferred · Contract (full-time hours) · USD $2,000–$4,000 per month (DOE)

Help design and maintain the infrastructure behind Workmind’s AI-powered automation platform for service franchises.

About Workmind

Workmind builds AI-powered automation for American home-service franchises. Founded in July 2025 by experienced AI professionals, we’re focused on robust, reliable systems that turn operational friction into revenue. We don’t fine-tune models; our value is systems design, orchestration, and evaluation that hold up under real-world abuse.

Role: Reliability Engineer (Contractor)

You’ll work directly with the CTO to design, deploy, and maintain the infrastructure powering Workmind’s core platform. The focus is on Infrastructure-as-Code, observability, and CI/CD using Google-native services.

This is a hands-on role where you’ll define and evolve the foundation that makes our automation products resilient, observable, and secure.

What you’ll do

Implement and maintain infrastructure using Infrastructure as Code (IaC).
Manage multi-environment deployment pipelines (dev / stg / prod) with Temporal.
Integrate Secret Manager for CI/CD secrets and runtime security.
Set up and maintain observability pipelines using OpenTelemetry, Managed Prometheus, and Cloud Monitoring / Logging.
Create alerting policies and dashboards in Google Cloud to track system health.
Maintain SOC 2–aligned controls, including IAM least privilege, audit logging, and key rotation.
Build and scan distroless container images, enforce Artifact Registry policies, and apply vulnerability scanning standards.
Document infrastructure decisions, diagrams, and processes in Markdown and version control.

Our stack

Languages: Go (for supporting tools and tests)
Cloud: Google Cloud Platform (Cloud Run, Cloud SQL, Firestore, Artifact Registry, Secret Manager, Cloud Logging/Monitoring, Managed Prometheus)
IaC: Google Cloud Go Libraries, Temporal
CI/CD: Cloud Build (Git-based triggers)
Observability: OpenTelemetry + GCP dashboards and alerts
Containers: Distroless images, least-privilege runtime

Must-haves

2+ years in infrastructure, reliability, or DevOps roles
Strong experience with Google Cloud Platform services
Experience with serverless/containerized environments (Cloud Run, Cloud Functions, etc.)
Familiarity with monitoring and alerting (Prometheus, Cloud Monitoring)
Practical knowledge of CI/CD and secure deployment pipelines
One of the following Google Cloud certifications (or equivalent hands-on experience):
- Professional Cloud DevOps Engineer
- Professional Cloud Security Engineer
- Professional Cloud Architect
- Associate Cloud Engineer

Nice-to-haves

Experience with Temporal durable execution workflows
Familiarity with SOC 2 audit processes and compliance documentation
Knowledge of serverless workers, vulnerability scanning, and supply chain security
Background with AWS or Azure (for comparison and migration perspective)

Location & hours

Remote. Preference for Southeast Asia (Indonesia, Malaysia, Philippines, Singapore, Thailand, Vietnam). Applicants elsewhere are welcome but must work ASEAN hours with at least 4 hours daily overlap.

Compensation & contract

Contract role; you handle your own taxes/insurance per local law
Base USD $2,000–$4,000/month (DOE)
Initial 90-day contract with potential extension
Fully remote; flexible hours with core overlap

How to apply

Email your CV to careers@workmind.cloud. Include:

Links to your GitHub or infrastructure-as-code repositories
A brief note answering one:
1. What’s the hardest part of maintaining reliable infrastructure at scale?
2. How have you implemented or audited infrastructure-as-code in production?
3. What does “observability” mean to you beyond monitoring?

Interview process

CV review
90-minute video call with lightweight pair-programming
Offer

Apply now