Harris Global Ltd, 3rd Floor, One Croydon, 12-16 Addiscombe Road, Croydon CR0 0XT
Site Reliability Engineer, 2022-11-11. Harris Global, 2022-12-11.

Site Reliability Engineer

London / permanent / £80k-£100k

Hollie Raeburn-ward

Harris Global are currently recruiting for a Site Reliability Engineer to join a leading media organisation on a permanent basis, working from their London office 2-3 days a week.

Key responsibilities:

  • Be confident in managing virtual platforms and the underpinning services, such as Enterprise Storage and SD-Networking.
  • Be confident in managing peripheral infrastructure services, such as Backup/Restore, non-native monitoring and hardware monitoring.
  • Provide technology insight and support to key management staff and peers.
  • Have the ability to manage tasks to tight deadlines, and to manage upwards by providing regular updates and setting expectations accordingly.
  • Have a good understanding of VMware, EMC and/or AWS cloud and other IaaS solutions.
  • Be keen to develop scripting capability and API integration in one or more popular languages (Puppet/Python/Shell).
  • Act as a technical escalation point for application owners to resolve issues swiftly, find the root cause and prevent recurrence, in order to improve the service(s) we deliver.
  • Understand and work towards the strategy set out by senior management, ensuring we adhere to its direction and execute tasks by priority to meet strategy deadlines.
  • Have the drive to constantly improve and try out new technology offerings to improve operational efficiency and execution.
  • Constantly improve functional and non-functional monitoring of the infrastructure, to head off issues before they occur.
  • Be flexible when it comes to out-of-hours support. You will be required to be on call on evenings and weekends as part of an on-call system for escalations outside core working hours, and will be asked to carry out change controls in agreed business maintenance windows.

Technical Skills Required:

  • A good understanding of Linux OS (Ubuntu/RHEL).
  • A good understanding of configuration management and automation with Puppet.
  • Experience using and configuring monitoring technologies, specifically ELK and Zabbix.
  • Experience building and configuring Elasticsearch clusters at scale.
  • A good understanding of VMware and virtualisation.
  • A good understanding of caching both at the CDN layer and data centre tier (Akamai/Varnish).
  • Programming and scripting experience (Bash, Python, Perl, etc.).

For more information, please apply now!