Site Reliability Engineer Sr. Staff
This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office.
Who We Are:
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work.
We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world.
Our culture thrives on finding new and better ways to accelerate what's next.
We know varied backgrounds are valued and succeed here.
We have the flexibility to manage our work and personal needs.
We make bold moves, together, and are a force for good.
If you are looking to stretch and grow your career, our culture will embrace you.
Open up opportunities with HPE.
Job Description:
Responsibilities
As a Sr. Staff Site Reliability Engineer, you will play a key role in designing, building, and optimizing cloud infrastructure and deployment systems.
Your work will directly impact scalability, security, and operational efficiency across our platforms.
Key responsibilities include:
* Enhance Infrastructure as Code (IaC) and enforce best practices.
* Optimize cloud infrastructure for scalability, security, and cost-effectiveness.
* Develop internal tools to support and streamline cloud platform operations.
* Improve CI/CD pipelines and deployment workflows using FluxCD and Jenkins.
* Address container image vulnerabilities and standardize remediation processes.
* Build Amazon Machine Images (AMIs) aligned with CIS and STIG benchmarks.
* Strengthen monitoring, alerting, and observability using Prometheus, Grafana, and logging tools.
* Troubleshoot complex production issues to ensure system reliability and customer satisfaction.
* Fine-tune distributed systems such as Apache Kafka and Cassandra.
* Collaborate with development, security, and operations teams to align infrastructure with application needs.
Basic Qualifications
* Minimum of 12 years of hands-on experience in InfraOps, DevOps, or Site Reliability Engineering (SRE).
* Proficiency with Linux systems, especially Debian-based distributions.
* Strong experience with cloud platforms such as AWS and GCP.
* Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible.
* Solid programming skills in Python and/or Golang.
* Deep understanding of containerization (Docker, containerd) and orchestration tools (AWS EKS, GCP GKE).
* Experience with GitOps workflows.
* Proven track record in implementing and maintaining CI/CD pipelines.
* Strong background in security and familiarity with security programs.
* Experience with monitoring and logging tools (Prometheus, Grafana, ELK).
* Knowledge of both relational (SQL) and non-relational databases.
* Excellent problem-solving and debugging skills wi...
- Rate: Not Specified
- Location: San Jose, US-CA
- Type: Permanent
- Industry: Finance
- Recruiter: Hewlett Packard Enterprise Company
- Contact: Not Specified
- Reference: HPE1US1192777EXTERNALENUS
- Posted: 2025-08-28 09:05:25