We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Site Reliability Engineer, GovCloud 24x7

salesforce.com, inc.
parental leave, 401(k)
United States, Colorado, Denver
Jul 23, 2025

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

Job Details

About Salesforce

We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good - you've come to the right place.

This candidate must be a U.S. citizen (U.S. born or naturalized) operating on U.S. Soil who does not hold dual citizenship with the ability to meet customer and government screening standards applicable to this role.

Applications will be accepted until 08/27/2025.

This position requires onsite presence in the Denver, Colorado office.

Are you passionate about ensuring the reliability and performance of mission-critical cloud services? Salesforce is seeking a talented Site Reliability Engineer to join our dynamic team in our Denver, CO, location, supporting our GovCloud environment. As a key member of our Site Reliability organization, you'll play a vital role in maintaining 99.99% uptime for customer-facing services, proactively addressing issues, and ensuring the security of our data. We foster a collaborative and innovative culture, where you'll work alongside skilled engineers to solve complex problems and drive continuous improvement.

Please Note: This position requires a successful background investigation and the ability to obtain and maintain a specific level of U.S. government background clearance. Details will be provided during the interview process.

Shift Requirements: This role involves shift work, including night shifts, as part of a 24/7 support team. We provide a rotating schedule and ensure adequate compensation for shift differentials.

About the Role:

The Site Reliability team at Salesforce is the backbone of our cloud operations, working around the clock to keep our services available and our customers protected. You will be a crucial part of the GovCloud Incident Response (GIR) team, which maintains the current infrastructure through day-to-day alert response, smart hands support, and comprehensive incident management, including retrospectives and long-term remediation.

Your Responsibilities:

  • Ensure 99.99% uptime for customer-facing services by proactively monitoring and maintaining the health of supporting systems, contributing directly to customer satisfaction and trust.

  • Act in key support roles during major incidents (e.g., Sev0, Sev1) and participate in technical incident reviews for problem management.

  • Contribute to Problem Management by populating and participating in Root Cause Analyses (RCAs) and handing them off to the Global Solutions team.

  • Ensure all work carried out by the Site Reliability team aligns with the company's internal compliance policies and directives.

  • Collaborate with technical staff to solve complex technical issues and customer concerns.

  • Lead and mentor other team members in staying abreast of industry innovations and technologies, and assist in team development growth.

  • Thrive in a fast-paced environment, solving sophisticated issues quickly and successfully balancing multiple priorities.

  • Automate the detection and resolution of recurring issues in the production environment.

  • Help create and improve current processes to reduce operational and engineering toil, including the implementation of AI-driven automation for routine tasks.

Requirements:

  • A related technical degree required.

  • 5+ years systems engineering experience in enterprise-scale internet service engineering or support role.

Required Technical Skills:

  • Expertise in TCP/IP related technologies (networking protocols, network programming, etc.).

  • Expertise in CLI enterprise support of Unix variants (Linux/Solaris/BSD), with significant exposure to Red Hat Enterprise Linux and Solaris.

  • Strong understanding of monitoring security systems and administration.

  • Experience provisioning, operating, and running AWS/C2S based infrastructure and systems.

  • Proficiency in scripting with Python, Go, or other languages.

  • Communication: Strong written and oral communication skills.

  • Incident Management: Past experience in Incident Management and a good understanding of ITIL service operations.

  • Availability: Ability to participate in a 24/7 on-call rotation supporting large data center operations and be available for shift work.

Preferred Qualifications:

  • Prior experience with Chef/Puppet or automated deployment. (This helps streamline our infrastructure management.)

  • Prior experience with Jenkins/Bamboo/Spinnaker pipeline execution. (This aids in our continuous integration and deployment processes.)

  • Experience supporting and maintaining monitoring and alert systems. (Ensures proactive issue detection.)

  • Experience supporting and maintaining Java applications. (Supports our application stack.)

  • Hands-on experience configuring and running AWS (Amazon Web Services) using the CLI/SDKs. (Essential for our cloud infrastructure.)

  • Certifications in Linux+, RedHat, and AWS. (Validates technical expertise.)

  • Experience supporting and leading Kubernetes-based applications and services. (Supports our containerized environment.)

  • Familiarity with Agile Process and DevOps practices. (Enables efficient workflow and collaboration.)

  • Experience participating in blameless retrospectives, learning from incidents, and conducting post-incident investigations, with an interest in how AI can assist in root cause analysis and pattern identification. (Promotes a culture of continuous improvement.)

  • Working knowledge of and interest in resilience engineering, including concepts such as Safety II and proactive problem prevention, leveraging AI for proactive risk identification and system optimization. (Enhances system reliability.)

  • Experience with AI/ML concepts and tools for operational insights, predictive maintenance, or intelligent automation.

  • Familiarity with data analysis and visualization tools to interpret AI-generated insights.

This candidate must be a U.S. citizen (U.S. born or naturalized) operating on U.S. Soil who does not hold dual citizenship with the ability to meet customer and government screening standards applicable to this role, including a Criminal Justice Information Services screening with fingerprint scan. Due to the citizenship requirements for this role, which supports U.S. federal, state, and/or local government customers, citizenship will be verified through two of the following REAL ID Act documents: U.S. Passport, Passport Card, REAL Driver's License, Global Entry Card, U.S. Government CAC/PIV. You agree to complete a Minimum Background Investigation (MBI) for a Moderate Public Trust position with the U.S. federal government and gain other clearances as deemed appropriate for the role.

Benefits & Perks
Check out our benefits site which explains our various benefits, including wellbeing reimbursement, generous parental leave, adoption assistance, fertility benefits, and more.

Salesforce Information
Check out our Salesforce Engineering Site.

This candidate must be a U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship and agrees to complete a U.S. federal government Minimum Background Investigation (MBI) for a Moderate Public Trust position.

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that's inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications - without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.

In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link: https://www.salesforcebenefits.com. For Colorado-based roles, the base salary hiring range for this position is $143,300 to $197,000.
Applied = 0

(web-6886664d94-5gz94)