US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs

   

Site Reliability Engineer III

Job Description:

JOB SUMMARY:

The Site Reliability Engineer helps Vertex to implement highly reliable, scalable, and performant system across the enterprise.

This is realized by relentlessly measuring the environments and finding areas that need improvement.

Improvements can range from education of engineering and operational resources, creating new capabilities, providing code enhancements, or implementing processes and tools.

Success is measured by data and backed by continued customer satisfaction.

The Site Reliability Engineer will use their infrastructure experiences combined with engineering best practices to build solutions to improve our environment.

ESSENTIAL JOB FUNCTIONS AND RESPONSIBILITIES:


* Responsible for designing, developing, implementing, and optimizing the efficiency of the environment including performance, reliability, and scalability of our services.


* Responsible for measuring the health and performance of the environments by implementing tooling such as Datadog to achieve the proper level of visibility of the environment.


* Enable teams to implement observability by developing and publishing standards and best practices and providing guidance and implementation assistance to engineering teams.


* Responsible for designing and implementing coding assignments related to applications, systems reliability, monitoring, alerting, and analytics.


* Participate in educating Engineering and Operations teams to ensure SRE principles are implemented consistently across the enterprise.


* Take a proactive approach to anticipate and correct a wide range of production issues including outages, processing slowdowns or stoppages, errors, and failures


* Implement engineering and operational improvements including code enhancements, process improvements, or procedural amendments.


* Ability to triage, isolate, and resolve environmental issues in an expedient and open fashion.


* Provide technical leadership for a wide range of projects.


* Assist and mentor other engineering staff

KNOWLEDGE, SKILLS AND ABILITIES:


* Experience with multiple software development languages including C#, Go, Python or Java.


* Experience with platform monitoring tools like Datadog, AWS CloudWatch, or similar


* Experience with Software as a Service (SaaS) environments


* Experience designing and deploying AWS services with an Infrastructure as Code (IaC) mindset with tools like Terraform.


* Experience with hyperscalers, most notably AWS, Azure, or OCI


* Experience in Agile development methodology.


* Good written / verbal communication skills


* Ability to listen and understand information and communicate the same.


* Ability to network with key contacts outside own area of expertise.


* Ability to work with minimal supervision, working with latitude for independent decision making.

EDUCATION, TRAINING:


* Undergraduate degree preferably in Computer Science or a ...




Share Job