Site Reliability Engineer
London / permanent / £80k
Applicants must be eligible to work in the specified location.
Harris Global are currently on the lookout for a Site Reliability Engineer to work for one of our Leading Media clients. This is a contract role working on a Hybrid basis in London. The ideal candidate will have strong AWS experience.
Responsibilities:
- Confident in managing virtual platforms and the underpinning services such as Enterprise Storage, SD-Networking.
- Confident in managing infrastructure services such as Backup/Restore, non-native monitoring, Hardware monitoring.
- Provide technology insight and support to key management staff and peers.
- Manage tasks to tight deadlines and mange upwards regular updates and managing expectations accordingly.
- Have a good understanding of VMware, EMC and/or AWS cloud and other IASS solutions.
- Be keen to develop Scripting capability and API integration in one or more popular languages. (Puppet/Python/Shell)
- Act as a technical escalation point for the applications owners to resolve issues swiftly and find root cause and mitigate from happening again had too better improve service(s) we deliver.
- Be flexible when it comes to out of hours support. You will be required to be on-call evening and weekends as you will form part of an on-call system for escalating to out of core working hours and ask to carry out change controls in agreed business maintenance windows
Experience:
- A good understanding of Linux OS (Ubuntu/RHEL).
- A good understanding of configuration management and automation with Puppet.
- Experience using and configuring monitoring technologies, specifically ELK and Zabbix.
- Experience building and configuring Elasticsearch clusters at scale.
- A good understanding of VMware and virtualisation.
- A good understanding of caching both at the CDN layer and data centre tier (Akamai/Varnish).
- Programming and Scripting experience - Bash, Python, Perl etc