Site Reliability Engineer
London / permanent / £80k-£100k

Hollie Raeburn-ward
Harris Global are currently recruiting for a Site Reliability Engineer to join a leading media organisation on a permanent basis, working from their London office 2-3 days a week.
Key responsibilities:
- Be confident in managing virtual platforms and the underpinning services, such as Enterprise Storage and SD-Networking.
- Be confident in managing peripheral infrastructure services, such as Backup/Restore, non-native monitoring and hardware monitoring.
- Provide technology insight and support to key management staff and peers.
- Have the ability to manage tasks to tight deadlines, and to manage upwards by providing regular updates and managing expectations accordingly.
- Have a good understanding of VMware, EMC and/or AWS cloud and other IaaS solutions.
- Be keen to develop scripting capability and API integration in one or more popular languages (Puppet/Python/Shell).
- Act as a technical escalation point for application owners: resolve issues swiftly, find the root cause and prevent recurrence, so as to improve the service(s) we deliver.
- Understand and work towards the strategy set out by senior management, adhering to its direction and executing tasks by priority to meet strategy deadlines.
- Have the drive to constantly improve and to try out new technology offerings that improve operational efficiency and execution.
- Constantly improve functional and non-functional monitoring of the infrastructure, to head off issues before they occur.
- Be flexible when it comes to out-of-hours support. You will be required to be on call on evenings and weekends as part of an on-call system for escalations outside core working hours, and will be asked to carry out change controls in agreed business maintenance windows.
Technical Skills Required:
- A good understanding of Linux OS (Ubuntu/RHEL).
- A good understanding of configuration management and automation with Puppet.
- Experience using and configuring monitoring technologies, specifically ELK and Zabbix.
- Experience building and configuring Elasticsearch clusters at scale.
- A good understanding of VMware and virtualisation.
- A good understanding of caching both at the CDN layer and data centre tier (Akamai/Varnish).
- Programming and scripting experience (Bash, Python, Perl, etc.).
For more information, please apply now!