Skip to content

ComputerWork: Jobs for Technical People

 

Job Application

 
 
 

Please answer the following questions in order to process your application.

 
 
Email Address *
 
Select your working status in the UK *
 
 
 
File Attachments:
(2MB file maximum. doc, docx, pdf, rtf or txt files only)
 
Attach a CV * 
 
Optional covering letter 
OR
Clear covering letter
 
 
 * denotes required field
 
 
 
Additional Information:
 
First Name
 
Last Name
 
Address
 
Country
 
Home Telephone
 
Mobile/Cell
 
Availability/Notice
 
Hourly Rate GBP
 
Approximately how far are you willing to travel to work (in miles) ?
 
 
 

Key Privacy Information

When you apply for a job, ComputerWork will collect the information you provide in the application and disclose it to the advertiser of the job.

If the advertiser wishes to contact you they have agreed to use your information following data protection law.

ComputerWork will keep a copy of the application for 90 days.

More information about our Privacy Policy.

 

Job Details

 

Service Monitoring and Maintenance Engineer (Contract)

Location: United Kingdom Country: UK
 

Job Title: Service Monitoring and Maintenance Engineer

Contract Type: Contract (Inside IR35), 12 Months

Job Description:

We are seeking a highly skilled Service Monitoring and Maintenance Engineer to join our team on a 12-month contract basis. The successful candidate will be responsible for monitoring and maintaining the operational health of various services within our technology ecosystem. This role is crucial in ensuring the reliability and performance of our services, making use of a variety of tools and platforms.

Key Responsibilities:

  • Service Monitoring: Continuously monitor service metrics through various platforms such as BES, ECP Platform Health Dashboard, and CloudWatch metrics. Identify and respond to anomalies and performance issues promptly.

  • Application Maintenance: Regularly update and maintain application code across services. This includes managing:

    • Python runtime and dependencies
    • Terraform configurations
    • GitHub Actions workflows
  • Incident Management

  • Develop and execute runbooks/playbooks for efficient response to incidents and service requests. Ensure swift resolution and minimal downtime.
  • Testing and Quality Assurance: Create, maintain, and enhance testing frameworks and infrastructure.

Responsibilities include:

  • Developing and executing unit tests and synthetic tests
  • Integrating and maintaining BES Monitoring
  • Ensuring proper functioning with the ECP Platform Health Dashboard
  • Deployment and Configuration Management: Manage GitHub deployment workflows to ensure smooth and reliable deployment processes.

Responsibilities include:

  • Performing tests on deployments
  • Reverting configurations that compromise operational availability, such as erroneous Firewall rules
  • Service Review and Stakeholder Engagement: Regularly review service performance and incident reports.
  • Provide constructive feedback and recommendations to ECP stakeholders and incorporate feedback from customers to enhance service delivery.

Required Skills and Qualifications:

  • Technical Expertise: Proficiency in Python, Terraform, and GitHub. Experience with AWS CloudWatch or similar monitoring tools is highly desired.
  • Problem Solving: Strong analytical and problem-solving skills with the ability to handle multiple incidents and emergencies.
  • Communication: Excellent communication skills, capable of effectively articulating technical challenges and solutions to stakeholders and team members.
  • Experience: Proven experience in managing IT service delivery, monitoring, and incident response.

Additional Requirements:

  • Ability to work in a fast-paced, dynamic environment.
  • Demonstrated experience in handling large-scale services and deployments.
  • A proactive approach to service health and improvements.

Posted Date: 25 Apr 2024 Reference: JS Employment Business: WNTD Contact: Drew Delahunty