Principal responsibilities:
-Design, implement, maintain highly available, scalable infrastructure solutions, leveraging automation to streamline operations.
-Monitsystem performance, proactively identify potential issues, drive incident response root cause analysis.
-Collaborate with cross-functional teams (development, product, security) to integrate reliability best practices the entire software lifecycle.
-Develop manage automation scripts, CI/CD pipelines, infrastructure-as-code (IaC) frameworks to enhance efficiency reduce manual intervention.
-Optimize cloud resources, cost management, disaster recovery strategies to ensure business continuity.
Qualifications :
-Experience: Minimum 5 years in IT operations Site Reliability Engineering, with a focus on infrastructure management system optimization.
-Technical Skills: Proficiency in operation control tools such as Ansible, Puppet, Chef, Terraform, Prometheus, Grafana, ELK Stack.
-Strong scripting skills in Python, Shell, similar languages.
Cloud Competency: Solid experience with majcloud platforms (AWS, Azure, GCP), including services like EC2, Lambda, Kubernetes, containerization.
-Problem-Solving: Proven ability to troubleshoot complex issues across distributed systems, networks, applications.
-Communication: Excellent written verbal communication skills, with the ability to collaborate effectively in a fast-paced, dynamic environment.
Preferred Qualifications:
-3+ years of dedicated experience in cloud service operations, with expertise in cloud-native architectures microservices.
-Certifications in AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect, equivalent.
-Experience with service mesh technologies (e.g., Istio) observability tools (e.g., Jaeger).
-Familiarity with DevOps culture practices, including agile methodologies continuous improvement frameworks.
-Bonus: Proven experience in developing IT operation maintenance tools using Python, demonstrating the ability to automate complex workflows solve real - world problems.
更新于 2025-12-16
查看更多崗位職責
Support Role:
1. Hands-on Java/J2EE programming diagnostic experience;
2. Configuration support of enterprise software applications;
3. Experience of application servers such as Tomcat, Jboss, Websphere;
4. Experience of Unix server operating system including the Shell Script Development;
5. Experience ofacle, including the ability to write complex SQL queries & PL/SQL;
6. Experience of DevOps tool, like Jenkins , Jira , Confluence , Ansible,docker;
7. Good at English writing & listening
8. Fluent English commutation skill
9. Team management skill
10. Potential need to work nightshift;
11. Be proactive responsible.
Developer Role:
1. Experience of Spring boot /Spring cloud /Spring MVC development;
2. Experience of Java socket development Multi threaded development;
3. Experience of JavaScript/Python/Linux Shell development;
4. Experience of restful API design;
5. Experience of UI development;
6. Experience of application servers (Tomcat/Websphere);
7. Experience of PL/SQL development;
8. Experience of DevOps tool, like Jenkins , Maven , docker,git;
9. Good at English writing & listening ;
10. Fluent English commutation skill ;
11. Team management skill ;
12. Potential need to work nightshift;
13. Be proactive responsible.
更新于 2026-04-02
查看更多崗位職責