IT Systems Engineer (Grafana, Kibana, SCOM)

Zürich, Switzerland, CHF 90 to CHF 105 per hour

Date Posted

2018-03-09 16:28:57 +0000

Reference

ITSE_1520612933

Job Type

Contract

Sector

DevOps

Position goals:

The IT Systems Engineer drives the on-boarding of additional IT services into the scope of IT Service Control. He or she builds up the capabilities required to obtain near-real-time health-state information for on-boarded IT services and their underlying components or dependencies. He or she analyzes recent (major) incidents to identify relevant use cases for incident detection where an end-to-end view adds value to the existing, but possibly limited, monitoring activities. The IT Systems Engineer works in close collaboration with the various IT service owners and is responsible for adequately training the members of the IT Service Coordination organization.

Main tasks/activities:

  • Analyze recent incidents to identify cases where proactive detection of patterns, with an end-to-end view, could have prevented an outage or at least mitigated its impact.
  • Understand the purpose as well as the composition of the IT service, and identify the critical components or dependencies in detail.
  • Describe the relevant dashboards, monitors, events and log-files already in place, and where/how they can be accessed.
  • Together with the IT service owner, define strategies for how to monitor the different aspects of the end-to-end health-state in near-real-time.
  • Specify in detail which events are to be aggregated in order to depict a relevant health-state view for a specific IT service.
  • Specify which additional aspects should be monitored or logged.
  • Implement a real-time aggregation and correlation of the relevant events.
  • Implement a near-real-time dashboard with a timely and accurate depiction of the end-to-end health-status of the IT service to be on-boarded.
  • Perform data analysis on the health-status information available across all on-boarded IT services, and search for patterns or anomalies that indicate a future degradation of an IT service or a potential intrusion event.
  • Describe these specific patterns of interest, with procedures on which events to correlate in order to detect operational issues as well as anomalies caused by potentially malicious activity.
  • Implement real-time triggers for such patterns of interest, so that the IT Service Coordination organization is actively alerted for an immediate first analysis, enabling an early response.
  • Document procedures for how to react to such specific scenarios and how to distinguish relevant events from false positives.
  • Describe the support organization of the IT service to be on-boarded and formalize the escalation paths, as well as the mutual expectations with the IT service owner.
  • Train the members of the IT Service Coordination organization on the IT services to be on-boarded and on the alerts they must react to.

Position requirements:

  • Self-motivated and highly proactive attitude
  • Broad technical background with expertise in network technologies, operating systems and typical application stacks (in particular Java and .NET)
  • Good understanding of cloud delivery models
  • Hands-on experience in aggregating data in ELK, SCOM and InfluxDB
  • Hands-on experience in developing dashboards with Grafana, SCOM and Kibana
  • Experience with statistical data analysis
  • Excellent written and oral communication skills (in English)
  • Ability and disposition to …
      • Understand complex technology stacks and their dependencies;
      • Understand business as well as operational requirements and translate them into technical solutions;
      • Work in a global company with people from different cultural backgrounds;
      • Present oneself professionally and communicate in a way suited to the target audience;
      • Assume responsibility and drive projects autonomously.

Managing This Role

Ashley Morton

Since starting as a Trainee Consultant in 2010, I have worked in positions such as Consultant, Senior Consultant, Team Leader and in recent years, working as Darwin's first Part...
