Logo

Senior Data Engineer

OKX
Hong Kong
Full time

Overview

Department

Data Science & Analytics

Job type

Full time

Compensation

Salary not specified

Location

Hong Kong

Company size

Mature [ 50+ employess ]

Ready to apply?

You're one step away - it takes less than a minute to upload your resume

Resume Assistance

See how well your resume matches this job role with our AI-powered score. By uploading your resume, you agree to our Terms of Service

Design, develop, and maintain real-time and batch data processing pipelines using Flink. Collaborate with data scientists to support feature engineering and develop feature stores for machine learning models in production.

Requirements

  • Deep expertise in Apache Flink for real-time data streaming and processing, with experience in designing and developing streaming architecture.
  • Familiarity with data platforms such as Alibaba Cloud Hologres, Doris/Clickhouse, and Spark.
  • Experience in feature engineering and feature store development to support machine learning applications in production environments.
  • At least 10 years of relevant work experience, especially in large enterprises or international Fintech companies
  • Background in FinTech or blockchain companies.
  • Strong English communication skills to work effectively in a cross-national team
  • Responsibilities

  • Design, develop, and maintain real-time and batch data processing pipelines using Flink, ensuring high performance and scalability.
  • Responsible for the architecture design and development of Flink streaming systems, tailoring solutions to complex business requirements.
  • Collaborate with data scientists to support feature engineering and develop feature stores for machine learning models in production.
  • Work closely with cross-functional teams, including AI, compliance, risk, and engineering, to provide robust and scalable data solutions.
  • Troubleshoot and resolve performance bottlenecks and technical challenges in real-time stream processing, ensuring system stability and scalability.
  • Improve system performance and data integrity, addressing complex issues in real-time data pipelines.
  • © All rights reserved.