Foundational Site Reliability Engineer II
Microsoft
United States, Texas, Irving
7000 State Highway 161
Jan 25, 2025
Overview

Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end-to-end, simplified solutions. The Microsoft Security organization accelerates Microsoft's mission and bold ambitions to ensure that our company and industry are securing digital technology platforms, devices, and clouds in our customers' heterogeneous environments, as well as ensuring the security of our own internal estate. Our culture is centered on embracing a growth mindset, a theme of inspiring excellence, and encouraging teams and leaders to bring their best each day. In doing so, we create life-changing innovations that impact billions of lives around the world.

Our team is looking for a Foundational Site Reliability Engineer (SRE) II to join the team that serves as the bedrock of every cloud offering at Microsoft. We provide isolated production identities and tooling for internal users that are reliable, highly available, and secure. Our team's vision is to "Provide a secure, robust, and extensible identity platform for both on-premises and cloud-first solutions for managing all production resources." Our services are the first services in and last out of any cloud. We supply highly dependable, redundant, resilient, and hardened identities that Microsoft engineers use for cloud buildout, management, and recovery of services at all architectural layers of the Azure technology stack. Based on Zero Trust Architecture (ZTA), identity is the primary security perimeter for our production environment. Our mission is to improve the availability, latency, performance, and security of the identity systems behind Microsoft's cloud.
Like traditional operations, we keep important revenue-critical systems up and running, even when natural disasters, bandwidth outages and configuration problems occur. Unlike traditional operations groups, we identify and address these software problems directly through software improvements, innovative machine learning and systems automation.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
Responsibilities

* Proven experience dealing with large-scale data architecture, operational architecture, and/or network architecture.
* Design components of a service delivery system that defines tools, hardware, processes, role assignments, dependencies, and documentation, resulting in a complete system that supports service delivery and meets KPIs.
* Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Microsoft's Identity services.
* Design process or technology solutions that identify and resolve platform, system, deployment, and environmental issues prior to production release, and ensure an on-time release and measurable improvements against KPIs.
* Influence and create new designs, architectures, standards, and methods for large-scale distributed systems.
* Communicate data and information among stakeholders and apply advanced diagnostic experience during service issues to restore service with minimal disruption to the customer and business.
* Collaborate with Development, Test, and PM counterparts to translate customer, business, and technical requirements into components of a service architecture, operability, and customer scenarios that meet compliance standards and KPIs such as quality, cost, and customer expectation.
* Conduct periodic on-call duties.