Logo

Senior Staff Engineer - Public Cloud Operations (AWS/Alibaba)

OKX
OKX seeks a Senior Staff Engineer for their Singapore team to manage enterprise-level hybrid cloud infrastructure. Responsibilities include full lifecycle management of AWS and Alibaba Cloud resources, ensuring high availability, performance, and compliance.

Overview

Department

IT

Job type

Full time

Compensation

Salary not specified

Location

Singapore, Southeast Asia

Resume Assistance

See how well your resume matches this job role with our AI-powered score. By uploading your resume, you agree to our Terms of Service

Ready to apply?

You're one step away - it takes less than a minute to upload your resume

Requirements

  • Mastery of core services (compute/storage/network/security) on AWS or Alibaba Cloud, with familiarity in the other platform.
  • Proficient in Linux/Windows system operations and automation tools (Shell/Python/Ansible).
  • Hands-on experience with containerized operations (Kubernetes, ECS/EKS, ACK) and cloud-native technologies (e.g., Service Mesh).
  • 5+ years of operations experience, with at least 3 years focused on public cloud (AWS/Alibaba Cloud) environments managing 100+ instances.
  • Experience in building cloud platforms from scratch, hybrid cloud architecture design, or large-scale migration projects (e.g., IDC-to-cloud) is preferred.
  • Strong problem-solving skills with the ability to handle high-pressure operational challenges.
  • Excellent communication skills to collaborate with development, testing, and security teams.
  • AWS Certified SysOps Administrator or Alibaba Cloud ACP/ACE certifications are preferred.
  • Bachelor’s degree or higher in Computer Science, Network Engineering, or related fields.
  • Responsibilities

  • Plan, deploy, monitor, and maintain AWS services (EC2, S3, VPC, Lambda, EKS, etc.) and Alibaba Cloud services (ECS, OSS, VPC, Function Compute, ACK, etc.).
  • Design highly available, auto-scaling cloud architectures, optimizing network (e.g., Alibaba Cloud CEN, AWS Direct Connect), storage, and compute resource configurations.
  • Implement full-stack monitoring and alerting using cloud-native tools (AWS CloudWatch, Alibaba Cloud CloudMonitor) and open-source solutions (Prometheus+Grafana, ELK).
  • Lead critical incident response, perform root cause analysis, and implement preventive measures (e.g., resource contention, misconfigurations, network latency).
  • Analyse cloud resource usage, reduce costs via reserved instances, auto-scaling, and storage lifecycle policies (e.g., AWS S3 Intelligent-Tiering, Alibaba Cloud OSS Archive).
  • Establish resource quota management strategies to prevent waste and overspending.
  • Implement cloud security baselines (security groups, IAM policies, Alibaba Cloud RAM permissions, AWS Security Hub), conduct regular security audits, and remediate vulnerabilities.
  • Design granular access controls using AWS IAM and Alibaba Cloud RAM, and enforce database auditing (e.g., AWS CloudTrail + Alibaba Cloud DAS).
  • Collaborate with development teams to optimize application architectures and provide cloud-native solutions (Server-less, Microservices).
  • Document operational procedures (SOP manuals) and lead internal technical training sessions.
  • Benefits

  • Competitive total compensation package
  • L&D programs and Education subsidy for employees' growth and development
  • Various team building programs and company events
  • Wellness and meal allowances
  • Comprehensive healthcare schemes for employees and dependants
  • © All rights reserved.