Ke Wang

Senior Software Engineer · Cloud & Kubernetes

I build the systems that keep clusters healthy — auto-remediation frameworks, control-plane reliability, and LLM-powered tooling across distributed infrastructure at scale.

01

Stack & Skills

Languages
Golang · Java · JavaScript · TypeScript · React · Node.js · Python · SQL · Shell · PHP · HTML / CSS
Cloud Native & Infra
Kubernetes · ETCD · Docker · Linux · Service Mesh / Istio · Prometheus · Grafana · ELK · GitOps · Cloud Computing
Foundations
Distributed Systems · Operating Systems · Data Structures & Algorithms · Databases · Cryptography · Networking · Cybersecurity · Design Patterns
AI / LLM
LLM Systems · RAG · Agentic Tooling · Prompt & Eval · LLM-assisted Dev

Efficient self-learner and creative problem-solver with a broad, deep technical range and a discerning taste for the right solution. Full-stack capable; mentors new hires and writes good code and docs. Strong English: GRE 331, TOEFL 110, CET-6 632.

02

Work Experience

ByteDance — Volcano Engine

Jun 2023 — Present
Senior Software Engineer 2-2 · Cloud & Kubernetes
  • Cloud Auto-Remediation Framework — architected a configurable, extensible system for issue detection, policy definition and automated remediation; owned core components: API & models, anomaly ingestion, policy engine, remediation engine and reporting.
  • LLM-based Cloud Assistant — built an AI assistant that answers questions via RAG, performs tasks like lookups and machine reboots, and runs diagnostics on cloud resources using LLM-generated scripts from runbooks and engineer corrections.
  • Developer Experience for Diagnostics Platform — delivered tooling for efficient DSL development in TypeScript with full type info, code generation, custom compilation, local/remote debugging and LLM-based auto-i18n.
  • Internal Diagnostics Platform — led the API and architecture redesign; contributed performance work, bug fixes and diagnostic scripts.
  • Dynamic Configuration Service — overhauled API, architecture and performance; added cloud-native (IP→label) features, robust monitoring, and GitOps-based config management.
  • K8S & ETCD Reliability — drove the "adaptive throttling" initiative, hardened ETCD backups, migrated large objects out of ETCD, and played a key role in APIServer incident recovery and post-mortems.

eBay — China Center of Excellence

May 2020 — Mar 2023
Member of Technical Staff · Cloud, Kubernetes & Infra Engineering
  • Network Micro-segmentation — K8S-based platform translating DSL policies into enforcement across K8S node iptables, Istio, OVN and Juniper. Drove stability, observability and scalability, led data migration and K8S rebasing, and designed a new architecture that was approved.
  • VM on Kubernetes — designed and shipped running VMs inside eBay's internal clusters on virtlet + libvirt (incl. GPU pass-through), integrated with eBay's PaaS (metadata, onboarding, quota).
  • Cluster Reliability — APIServer tuning with creative adaptive throttling, Node "NotReady" root-cause analysis & remediation, and an ETCD incremental backup solution.
  • Site-wide K8S Upgrade — rebased eBay's internal fork from 1.15 → 1.18, ran regression with all stakeholders, and executed the upgrade with minimal downtime.
  • Node Provisioning & CLI — maintained node provisioning (auto-remediation of bad nodes) and built a highly efficient, SSO-enabled, scriptable internal K8S CLI that was popular with its users.

Ant Financial

Jul 2018 — May 2020
Software Engineer · Cloud Native & PaaS
  • Serverless / FaaS on Knative — reduced cold-start latency, built reliable runtime autoscaling (0→1, n→m, 1→0), integrated service mesh, and maintained multi-language runtimes.
  • Kubernetes PaaS for Alipay — implemented workflow-based complex operation orders, runtime patrols for mission-critical apps, and improved observability, monitoring and alerting.

Ant Financial

Jun 2017 — Jul 2018
Software Engineer Intern · Full-stack
  • Built a proprietary workflow execution engine from scratch — React + JointJS frontend, Spring backend, and a Service Provider Interface spanning both.

03

Beyond Work

Open Source

treebox

Interactive TreeMap visualization, widely used inside eBay to build dashboards.

KevinWang15/treebox

Open Source

notelix

Browser note-taking & highlighter software, plus its core engine web-marker.

notelix/notelix · notelix/web-marker

Speaker · Nov 2019

KubeCon NA 2019

Serverless Platform for Large-Scale Mini-apps: from Knative to Production.

Academia · Jun 2018

Top-10% Dissertation

A New Paradigm for Mobile Apps — Environment-based App Discovery. B.Sc. CS, Fudan University (rank 22/70).

04

Off the Clock

01 Photography
02 Homelab & Servers
03 Electronic Gadgets
04 AI Tools
05 Reading
06 Video Gaming
07 Psychology
08 Graphic Design
09 Cooking & Coffee