Looking to be a part of a highly innovative organization in your next opportunity!? Our client is currently looking for a Senior Site Reliability Engineer to join the team remotely within Canada! If you are looking for a new challenge, and have a great interest in technology, we want to hear from you!

What are the perks:

  • Exceptional Benefits!
  • RRSP Matching!
  • Great Team Culture!
  • Career Advancement Opportunities!

The Senior Site Reliability Engineer will be responsible for:

  • Responsible for monitoring, maintaining and supporting production applications, systems and connectivity issues
  • Engage, influence, and convert SRE practices with development, operational and product groups to align technology service/solution delivery
  • Oversee the creation, improvement, and implementation of Enterprise monitoring and alerting for tracking of golden signals of SRE: latency, traffic, error rate and saturation
  • Respond to requests of critical incidents impacting production
  • Participate in incident management and provide Tier II and Tier III support
  • Observe, respond and action network alarm conditions
  • Review high level and functional requirement documents after product implementation to resolve any problems
  • Work with necessary teams to develop and refine SLIs and SLOs to monitor and track the customer experience
  • Assist in application implementation, support, monitoring, change management, and training
  • Assess performance and provide/develop ideas for improvement of systems and products
  • Automate repetitive tasks for more efficient support and customer uptime
  • Coordinate with internal development teams to document and understand usage needs of platforms

The required qualifications for the Senior Site Reliability Engineer are:

  • 5+ years’ experience working in IT Operations
  • 3-5 years of SRE/DevOps experience
  • Experience in Administration/troubleshooting with both Windows and Unix platforms
  • Experience with SQL Query
  • Experience with Windows/UNIX command line authoring
  • Experienced with VPN, WLAN, TCP/IP and FTP/SMTP protocols and troubleshooting
  • Experience with scripting languages such as bash, Ruby, Python, JavaScript, etc.
  • Experience with Azure, AWS and, or GCP
  • Familiar with using and managing monitoring tools such as New Relic, DataDog, SolarWinds, Dynatrace, Splunk, Stackdriver, CloudWatch, etc.
  • Familiarity with Site Reliability Engineering (SRE) standards and practices (Service Level
  • Familiar with SRE standards and practices, SLI/SLO, automation, CICD, etc.
  • Ability to be part of an on-call rotation in order to respond to incidents on a 24×7 basis
  • Certifications in Microsoft, Cisco, Redhat, etc. would be considered an asset


If you are looking for a new challenge, professional growth and have a great interest in technology, please reach out to us today! We can’t wait to introduce you to your awesome new team.

Contract Info / Information sur le contrat
  • Job ID / No. du Poste: 26903
  • Open Positions / Postes Ouverts: 1
Aperçu du travail

Se connecter

Sign Up

Forgotten Password