Reliability Engineer (Contractor)
Oct 4, 2025 · Remote — Asia-Pacific preferred · Contract (full-time hours) · USD $2,000–$4,000 per month (DOE)
Help design and maintain the infrastructure behind Workmind’s AI-powered automation platform for service franchises.
About Workmind
Workmind builds AI-powered automation for American home-service franchises. Founded in July 2025 by experienced AI professionals, we’re focused on robust, reliable systems that turn operational friction into revenue. We don’t fine-tune models; our value is systems design, orchestration, and evaluation that hold up under real-world abuse.
Role: Reliability Engineer (Contractor)
You’ll work directly with the CTO to design, deploy, and maintain the infrastructure powering Workmind’s core platform. The focus is on Infrastructure-as-Code, observability, and CI/CD using Google-native services.
This is a hands-on role where you’ll define and evolve the foundation that makes our automation products resilient, observable, and secure.
What you’ll do
- Implement and maintain infrastructure using Terraform or OpenTofu (IaC).
- Manage multi-environment deployments (dev / stg / prod) with Cloud Build.
- Integrate Secret Manager for CI/CD secrets and runtime security.
- Set up and maintain observability pipelines using OpenTelemetry, Managed Prometheus, and Cloud Monitoring / Logging.
- Create alerting policies and dashboards in Google Cloud to track system health.
- Maintain SOC 2–aligned controls, including IAM least privilege, audit logging, and key rotation.
- Build and scan distroless container images, enforce Artifact Registry policies, and apply vulnerability scanning standards.
- Document infrastructure decisions, diagrams, and processes in Markdown and version control.
Our stack
- Languages: Go (for supporting tools and tests)
- Cloud: Google Cloud Platform (Cloud Run, Cloud SQL, Firestore, Artifact Registry, Secret Manager, Cloud Logging/Monitoring, Managed Prometheus)
- IaC: Terraform / OpenTofu
- CI/CD: Cloud Build (Git-based triggers)
- Observability: OpenTelemetry + GCP dashboards and alerts
- Containers: Distroless images, least-privilege runtime
Must-haves
- 2+ years in infrastructure, reliability, or DevOps roles
- Strong experience with Google Cloud Platform services
- Proficiency in Terraform or OpenTofu for IaC
- Experience with serverless/containerized environments (Cloud Run, Cloud Functions, etc.)
- Familiarity with monitoring and alerting (Prometheus, Cloud Monitoring)
- Practical knowledge of CI/CD and secure deployment pipelines
- One of the following Google Cloud certifications (or equivalent hands-on experience):
- Professional Cloud DevOps Engineer
- Professional Cloud Security Engineer
- Professional Cloud Architect
- Associate Cloud Engineer
Nice-to-haves
- Experience writing Go-based infrastructure tooling or operators
- Familiarity with SOC 2 audit processes and compliance documentation
- Knowledge of serverless workers, vulnerability scanning, and supply chain security
- Background with AWS or Azure (for comparison and migration perspective)
Location & hours
Remote. Preference for Southeast Asia (Indonesia, Malaysia, Philippines, Singapore, Thailand, Vietnam). Applicants elsewhere are welcome but must work ASEAN hours with at least 4 hours daily overlap.
Compensation & contract
- Contract role; you handle your own taxes/insurance per local law
- Base USD $2,000–$4,000/month (DOE)
- Initial 90-day contract with potential extension
- Fully remote; flexible hours with core overlap
How to apply
Email your CV to careers@workmind.cloud. Include:
Links to your GitHub or infrastructure-as-code repositories
A brief note answering one:
- What’s the hardest part of maintaining reliable infrastructure at scale?
- How have you implemented or audited infrastructure-as-code in production?
- What does “observability” mean to you beyond monitoring?
Interview process
- CV review
- 90-minute video call with lightweight pair-programming
- Offer