resume
Cheng-En (Blues) Li
Senior DevOps / SRE Lead | Hands-on Platform Engineering
📧 blueslee076@gmail.com | 📱 +886-983-597-076
LinkedIn | GitLab
📧 blueslee076@gmail.com | 📱 +886-983-597-076
LinkedIn | GitLab
SUMMARY
Hands-on DevOps/SRE Lead with 10+ years of engineering experience across backend development, cloud infrastructure, platform engineering, and production reliability. Over the past 4+ years, focused on global-scale DevOps for high-traffic financial trading platforms, with strong ownership of APP platform reliability, Kubernetes operations, CI/CD automation, cloud migration, observability, and infrastructure governance.
Proven track record of delivering $38K+/month in cloud cost savings, executing zero-downtime cloud migration, building GitOps-based deployment automation, and operating production workloads across multi-region, multi-tenant environments. Currently exploring MCP, AI Agents, and time-series foundation models to improve incident diagnosis and infrastructure capacity forecasting.
TECHNICAL SKILLS
Category
Tools / Experience
Cloud & Network
AWS, Global Accelerator, CloudFront, Serverless, Private Network Acceleration
Container & Orchestration
Kubernetes, EKS, EKS Auto Mode, Karpenter, KEDA, Helm, Docker, Kustomize
CI/CD & IaC
GitOps, Jenkins, AWS CodeBuild, Bitbucket, GitLab CI, Drone, Terraform, Pulumi
Monitoring & Observability
Prometheus, Grafana, CloudWatch, SkyWalking, Loki, OpenSearch, Kafka, FluentBit
Programming Languages
Go, Python, Java, Shell Script, PHP
Backend Frameworks
Gin, Fiber, Django, Flask
Database & MQ
Aurora MySQL, PostgreSQL, Redis, Kafka, RabbitMQ
AI / AIOps
MCP, AI Agent, Genkit, Time-Series Foundation Model
Architecture & Reliability
SLI, SLO, RPO, RTO, Microservices, Incident Response, Capacity Planning
PROFESSIONAL EXPERIENCE
HYTECH S TECHNOLOGY PTY LTD | Taipei, Taiwan
Mar 2022 – Present
Joined Hytech as a Senior DevOps Engineer and progressively grew into a hands-on technical lead role, taking ownership of APP platform reliability, infrastructure automation, deployment governance, and production operations for a high-traffic, multi-tenant financial trading platform.
APP Platform Reliability
Owned DevOps/SRE responsibilities for APP-related services, including mobile backend services, API/H5 infrastructure, production monitoring, deployment automation, incident response, and cross-region operational stability.
Kubernetes Operations & FinOps
Managed production Kubernetes workloads on AWS EKS with EKS Auto Mode, Karpenter, and KEDA-based autoscaling. Delivered $38K+/month in cloud cost savings while maintaining reliability and SLI/SLO baselines under high-concurrency workloads.
Zero-Downtime Cloud Migration
Led key execution areas of a zero-downtime migration from Alibaba Cloud to AWS across multi-tenant production environments. Covered infrastructure provisioning, DNS/LB cutover, CI/CD adjustment, observability validation, rollback planning, and post-migration stabilization.
GitOps & CI/CD Automation
Designed and standardized Bitbucket → CodeBuild → Jenkins/Helm GitOps pipelines for APP and production workloads. Improved deployment automation, release consistency, rollback safety, and operational governance.
Trading Infrastructure & Network
Designed and operated secure, low-latency infrastructure for financial trading systems, APP workloads, MT5-related services, and third-party trading platform integrations. Improved cross-border network performance using AWS Global Accelerator, private network acceleration, and CDN edge distribution to support platform stability SLAs.
Observability at Scale
Built and operated observability capabilities with Prometheus, Grafana, SkyWalking, Loki, FluentBit, Kafka, and OpenSearch, supporting high-volume logs, metrics, dashboards, alerting, and incident investigation workflows.
Infrastructure as Code & Automation
Managed multi-region infrastructure using Terraform and Pulumi. Developed internal automation tools and serverless API gateways in Go and Python to support infrastructure operations, deployment workflows, and troubleshooting.
AIOps Prototype
Built a hands-on AIOps prototype integrating MCP and AI Agents to accelerate incident root-cause diagnosis, reduce repetitive investigation work, and improve cross-team troubleshooting efficiency.
Technical Leadership
Grew from an individual contributor into a hands-on technical lead, establishing CI/CD practices, engineering standards, incident response workflows, review processes, and operational SOPs while continuing direct implementation and production troubleshooting.
Feversocial Corporation | Taipei, Taiwan
Mar 2019 – Sep 2021
Maintained and optimized CI/CD pipelines using GitLab CI/CD, Drone, and Shell scripting.
Developed and maintained backend features and automation scripts in Python, PHP, and Go.
Managed Kubernetes clusters with Helm and Kustomize, providing infrastructure support for development teams.
Conducted platform security scans based on OWASP Top 10 and supported security remediation workflows.
Built centralized logging systems using the EFK stack.
Developed AWS Lambda functions in Python and Node.js for event-driven automation tasks.
e-fun Technology Corporation | Taipei, Taiwan
Jun 2016 – Feb 2018
Built and maintained development environments, including Redmine, GitLab, and lab servers.
Developed system features using ConceptWave, Python, and Java.
Containerized applications with Docker to reduce environment-specific deployment issues.
Executed test cases and created automated testing workflows using Robot Framework.
Touch-Idea Corporation | Taipei, Taiwan
Apr 2014 – Feb 2016
TaiwanMobile — SessionTrace & IPRAN Projects
Apr 2014 – Dec 2015
Built and maintained project servers, deployment environments, and codebases.
Designed rollback and backup methodologies for disaster recovery.
Deployed IPRAN POC environments using Juniper routers.
Developed automation scripts in Java and Python.
FarEasTone — ALM Project
Aug 2014 – Nov 2014
Developed Python scripts to parse CSV files into the ALM system.
Evergreen — TOSPro Project
Jan 2015 – Feb 2016
Developed UI features using ExtJS based on client requirements.
Assisted technical leads in debugging framework-level issues and delivering client-specific features.
EDUCATION
Taipei College of Maritime Technology | Taipei, Taiwan
Associate's Degree, Department of Computer and Communication Engineering
Sep 2008 – Jun 2012
Associate's Degree, Department of Computer and Communication Engineering
Sep 2008 – Jun 2012