Logo

Staff Cloud Infrastructure Engineer (Kubernetes)

OKX
San Jose, California, United States
Full time
$ 240k - $ 280k

About this job

Job category

IT

Job type

Full time

Salary range

$ 240k - $ 280k

Location

San Jose, California, United States

Company size

Mature [ 50+ employess ]

Apply now

Don't miss out on this opportunity. Apply now and take the first step toward success.

Resume Assistance

See how well your resume matches this job role with our AI-powered score. By uploading your resume, you agree to our Terms of Service

Job Description

OKX seeks a driven Cloud Infrastructure Engineer to spearhead innovative Kubernetes and container technology implementations, boosting platform stability and scalability. Responsibilities include maintaining AWS and Alibaba Cloud products and Kubernetes environments.

Responsibilities

  • Maintain and configure AWS and Alibaba Cloud products and services
  • Investigate new Kubernetes features and provide guidance and suggestions for current systems
  • Maintain service access, cost optimization, etc. for each Kubernetes environment
  • Prepare relevant documentation for Kubernetes operation, maintenance, and specifications
  • Architect, deploy, and manage Kubernetes environments to ensure high availability, scalability, and security
  • Monitor and optimize the performance of containerized applications and Kubernetes clusters
  • Develop and maintain infrastructure as code (IaC) using tools like Terraform or Helm
  • Collaborate with development teams to ensure seamless integration and deployment of new features

Requirements

  • Bachelor's degree or above in Computer Science or relevant field with 6+ years of DevOps, SRE, or related experience
  • Familiarity with Linux OS, TCP/IP, and basic computer knowledge
  • Relevant developer experience and familiarity with at least one scripting language (Shell/Python/Go)
  • Proficiency in Kubernetes (k8s) administration (deployment, scaling, management of containerized applications)
  • Familiarity with Kubernetes management, scheduling, operation, safety features
  • Familiarity with Kubernetes extensions (Operator/CRD/CSI/CNI/CRI) and relevant operational/maintenance or development experience
  • Strong engineering skills in at least one area: public cloud networking, SRE, DevOps, or cloud-native application
  • Familiarity with O&M work content and processes, process driving, and understanding of system concepts
  • Solid Linux platform O&M and debugging capabilities; proficient in troubleshooting, configuration tuning, and performance analysis
  • Experience with cloud platforms (AWS, Google Cloud, Azure) and Kubernetes services (EKS, GKE, AKS)
  • Experience with monitoring, logging, and alerting solutions for Kubernetes environments

Benefits

  • Competitive total compensation package
  • L&D programs and Education subsidy for employees' growth and development
  • Various team building programs and company events
© All rights reserved.