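A Full-Stack Guide to Business Latency Optimization: Systematic Latency Reduction from Hardware to Application
An in-depth look at systematically reducing business latency across hardware, network, operating system, application architecture, and business logic, with practical configurations, code examples, and best practices.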
Experienced DevOps engineer with expertise in cloud infrastructure, automation, and site reliability engineering. Passionate about building scalable, reliable systems and implementing best practices for continuous delivery and monitoring.
DevOps Engineer (Unemployed)
Remote/Shanghai/Beijing (Expected)
Renmin University of China
2016/03 — 2019/07
Computer Science major with focus on system administration and network engineering.
Limited experience in leading large teams and managing complex team dynamics
No experience with large-scale software development projects and complex codebases
Limited formal education background as I did not receive full-time university education
I have vitiligo (a skin condition with white patches) which may be noticeable in onsite work environments and some colleagues might be uncomfortable with it
Expertise in building and maintaining highly available systems
Strong focus on infrastructure automation and CI/CD
Comprehensive monitoring and alerting solutions
My professional journey in software development and product management
Tubi
Contributed to infrastructure automation and site reliability engineering initiatives as an individual contributor. Managed Kubernetes clusters, implemented monitoring solutions, and supported improvements in system reliability and performance. Led AWS network architecture design, optimized VPC traffic flow, and implemented secure, scalable networking solutions for high-traffic environments.
Bitmain
Built and maintained cloud infrastructure, implemented CI/CD pipelines, and ensured high availability of production systems. Reduced deployment time by 70% and improved system uptime to 99.9%.
Flipboard China
Designed and implemented cloud-native solutions using AWS and Azure. Managed multi-cloud environments, implemented security best practices, and optimized infrastructure costs by 30%.
CmsTop
Managed Linux servers and network infrastructure. Implemented monitoring systems, automated deployment processes, and provided technical support for development teams.
Successfully delivered projects on time and within budget
Led cross-functional teams and managed complex DevOps projects
Mastered technologies and tools
A selection of my recent work showcasing technical skills and problem-solving abilities (videos and repositories coming soon)
A collection of production-ready Terraform modules for AWS infrastructure. Includes modules for VPC, EC2, EKS, SSO, Organizations, Lightsail, and Terraform state backend management. Comprehensive Makefile with formatting, validation, linting, security scanning, and documentation generation.
Designed and deployed a production-ready Bitcoin node infrastructure with comprehensive monitoring, automated failover, and real-time blockchain analytics. Implemented multi-region deployment with 99.9% uptime, handling 1000+ transactions per second with sub-second latency.
Built a scalable AIGC platform integrating DeepSeek LLM and Stable Diffusion (ComfyUI) for enterprise use cases. Implemented GPU orchestration, model serving, and real-time inference with load balancing. Achieved 50x faster inference compared to baseline, supporting 100+ concurrent users with intelligent resource allocation.
Designed and deployed a comprehensive homelab environment using Proxmox VE for virtualization and containerization. Implemented automated backup strategies, network segmentation, and monitoring solutions. Created a self-hosted development environment with multiple VMs and LXC containers for testing, development, and learning purposes.
A modern, locally deployable API management platform designed to provide efficient, user-friendly, and powerful interface management services. Built with React 18 + TypeScript + Koa + MongoDB, supporting Mock services, automated testing, and comprehensive API lifecycle management.
An open-source Identity and Access Management (IAM) platform providing enterprise-grade Single Sign-On (SSO), Multi-Factor Authentication (MFA), user management, and application integration capabilities. Supports OAuth 2.0/OIDC, SAML 2.0, and LDAP protocols.
Sharing my thoughts on technology, career development, and the latest trends in software development
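An in-depth look at systematically reducing business latency across hardware, network, operating system, application architecture, and business logic, with practical configurations, code examples, and best practices.
A comprehensive walkthrough of planning and deploying hybrid cloud architecture in production, covering multi-cloud strategy, network architecture, security design, cost optimization, and monitoring and operations.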
Master the kubectl apply workflow, from YAML files to running pods in production, with in-depth analysis of core concepts, common issues, and best practices, illustrated with detailed Mermaid sequence diagrams
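An in-depth analysis of the three Kubernetes probes (Liveness, Readiness, Startup): their differences, use cases, and behavior on probe success and failure, with best practices and real configuration examples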
Test Mermaid diagram display effects in Marp slides
Featured
Get notified when I publish new articles about technology, career development, and the latest trends in software development.
I'm always open to discussing new opportunities, interesting projects, or just having a chat about technology.
Location
Remote/Shanghai/Beijing (Expected)