Job Responsibilities:
Responsible for the deployment, operation, and maintenance of application systems, middleware, and databases, including monitoring and optimization, fault diagnosis, and troubleshooting to ensure the stable operation of business systems.
Experience in cloud environment operations and maintenance. Responsible for organizing and writing operations-related documentation.
Responsible for maintaining the company's physical and virtual machines, and on this basis, building and maintaining k8s clusters.
Job Requirements:
Bachelor's degree or above, 4~6 years of system operations experience.
Familiar with Linux system operations, and experience in maintaining and managing MySQL, MongoDB, Kafka, etc.
Familiar with application system monitoring frameworks, proficient in at least one monitoring system's deployment, configuration, and maintenance, such as zabbix, prometheus, etc.
Proficient in at least one scripting language (shell, python, etc.), capable of independently developing tool scripts.
Proficient in system operations workflows, including deployment and release, data maintenance, script programming, fault handling, etc.
Proficient in using ELK tools for daily maintenance, and knowledgeable about DevOps.
Proficient in Kubernetes principles and application deployment.