Platform Operations and Maintenance Engineer

Moledao

$5-16K [monthly salary]
Remote | 5-10 years of experience | No degree requirement | Full-time

Remote Work Details

Open to candidates in: Hong Kong | Indonesia | Malaysia | Philippines | Singapore | Taiwan | Thailand | United Arab Emirates | United States | Vietnam

Language requirements: English | Simplified Chinese

Job Description

Optional Base Locations: Singapore, Malaysia, Abu Dhabi


The Business Operations Engineer (BOE) is a technical role dedicated to business delivery. By leveraging platformization and automation, you enable development and business teams to deploy and operate faster, more reliably, and more securely. You will act as a hybrid of internal platform product and operations engineer, rather than a traditional on-call operator.


Responsibilities

  • Delivery and Release System Development: Enhance CI/CD, environment management, and release strategies (canary releases, rollbacks, change control) to improve delivery efficiency and control.
  • Infrastructure Automation / IaC: Drive Infrastructure as Code, standardized resource templates, and one-click deployments to reduce manual operations and configuration drift.
  • Observability and Operational Data: Build and optimize monitoring, logging, and alerting systems (metrics, logs, tracing), implement alert noise reduction, and accelerate incident diagnosis and recovery.
  • Capacity Planning and Cost Management: Plan capacity based on business growth, continuously optimize cloud resources and costs (resource reclamation, sizing optimization, policy governance).
  • Reliability and Incident Management: Develop and maintain runbooks, drill plans, and post-incident review processes (RCAs) to drive engineering improvements that make incidents recoverable and preventable.
  • Cloud-Native Platform Governance: Participate in Kubernetes platform operations and governance, support ingress/gateway solutions (Ingress, Nginx) and service governance components (e.g., Envoy, service mesh).
  • Security and Compliance Implementation: Enforce least-privilege access controls (AWS IAM, Kubernetes RBAC), network policies, vulnerability remediation processes, and incident response mechanisms.


Qualifications

  • 5+ years of experience in Linux / DevOps / SRE / platform engineering, with expertise in troubleshooting and managing distributed systems.
  • Familiar with AWS core services (networking, compute, storage, security), with experience in architecture, operations, and cost optimization.
  • Proven Kubernetes production experience: cluster governance, common incident troubleshooting, stability, and performance tuning.
  • Hands-on experience with CI/CD, IaC, and automation scripting; proficient in at least one language (Go, Python, or Shell) for tool development.
  • Experience with observability ecosystems (e.g., Prometheus, ELK), capable of closing the loop on “metric → alert → diagnosis → recovery.”
  • Strong security awareness, understanding of common system, network, and application security issues and mitigation strategies (access control, vulnerabilities, incident response).


Preferred Qualifications

  • Experience as a platform owner or leading cross-team standardization and platform initiatives.
  • Deep expertise in observability (systematic Prometheus, Grafana, ELK implementations).
  • AWS or Kubernetes certifications.
  • Chinese language skills are a plus.

Dorothy Mole

HR Officer, Moledao

Posted on 26 December 2025

Moledao

Fewer than 50 employees

DAO
