Lead Software Engineer - Resiliency and Observability

As a Senior Lead Software Engineer at JPMorgan Chase within the Branch Workplace Technology product team, you will focus on site reliability engineering and application development.

You should be passionate about solving business problems through engineering excellence and innovation. You should also be able to proactively identify and solve complex problems and technology bottlenecks affecting the performance, reliability, and stability of Chase Branch applications and platforms, and to recommend solutions, using data-driven analytics and observability tooling.

This role is in a fast-paced environment where you will be challenged to solve, and provide recommendations for, technology-related bottlenecks and complex problems in enterprise system architecture, reliability, scalability, security, and performance. It calls for proficiency in observability practices such as white-box and black-box monitoring, SLO alerting, and telemetry collection, using tools such as Grafana, Dynatrace, AppDynamics, Prometheus, CloudWatch, Datadog, Splunk, Geneos, and Tivoli Netcool.

You will partner with stakeholders from multiple products to analyze, pilot, and certify technology solutions for Chase branches, ranging from web/mobile applications to AI-driven digital experiences, from physical devices to networking overlays, from files to modern distributed databases, and from video/telephony communication to software distribution processes.

The role will extend your contributions into areas such as architecture, infrastructure, automation, and SRE triage.

We embrace a culture of experimentation and constantly strive for improvement and learning.

You will work in a collaborative, trusting, thought-provoking environment, one that encourages diversity of thought and creative solutions that are in the best interests of our customers.

Job responsibilities


* Proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices, with the ability to identify bottlenecks in applications and infrastructure and to drive resolution with recommendations based on data points and observability metrics


* Fluency in at least one programming language (e.g., Java with Spring Boot, Python, etc.)


* Proficiency and experience in observability practices such as white-box and black-box monitoring, SLO alerting, and telemetry collection, using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.


* Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, GitHub, Terraform, etc.)


* Experience with common cloud platforms and services (AWS certification preferred)


* Experience with the design and architecture of Web/Java-based applications running in the cloud


* Advanced knowledge of application design and its impact on availability, resilience, and performance


* Experience with container and container ...



