Responsibilities:
- Provide 24/7 IT monitoring service: use the monitoring platform to monitor the operating status and performance of relevant systems (data center environment, servers, network, and application systems) in real time.
- Assess incident severity promptly, lead the troubleshooting bridge, and coordinate with second-line engineers or escalate to the appropriate level until the problem is resolved.
- Ability to prioritize tasks and work independently, recognize problems, and identify potential risks.
- Excellent communication skills to collaborate with teams globally.
- Prepare reports and record all information based on monitoring and incident-handling results.
- Assist with daily IT operations.
Requirements:
- Experience using VMware software tools to provision and manage VM infrastructure is a plus.
- Experience with configuration management and infrastructure-as-code tools such as Ansible, Puppet, or Terraform is a plus.
- Proficiency with network debugging tools such as ping, telnet, MTR, curl, traceroute, tcpdump, and Wireshark would be helpful.
- Experience with AWS or other cloud infrastructure providers is a plus.