hero

Discover the best
jobs in tech

From design and development to sales,
people, and management, get <matched>
with the best opportunities.
92
companies
11,008
Jobs

Senior Site Reliability Engineering Manager - CTJ TS

Microsoft

Microsoft

Software Engineering, Other Engineering
New York, USA
Posted on Jul 31, 2024
Do you have a passion for high scale services and working with some of Microsoft’s most critical customers? We’re looking for a Senior Site Reliability Engineering Manager with the right mix of software development, on-line services experience and passion for quality to envision, design, and deliver Office 365 government cloud service offerings.

Office 365 is at the center of Microsoft’s cloud first, devices first strategy as it brings together cloud versions of our most trusted communication and collaboration products like Exchange, SharePoint, and Teams with our cross-platform desktop suites and mobile apps. The Office 365 Enterprise Cloud team works with Microsoft’s largest enterprise and government customers to deliver features that meet their specific needs and enable cloud adoption. As you would expect, our customers have the highest expectations for feature quality, security, reliability, availability, and performance.

The Site Reliability Engineering (SRE) team provides leadership, direction and accountability for application architecture, system design, and end-to-end implementation. As a Senior SRE Manager, you build and devople a team to identify and deliver software improvements using expertise in software development, complexity analysis, and scalable system design. Collaboration skills will be required to work closely with other engineering teams to ensure services/systems are highly stable and performant, meeting the expectations of our government customers and users.

At Microsoft, we can offer you an amazing team, exciting challenges, and a fun place to work. The work environment empowers you to have a positive impact on millions of end users.

The Right Candidate For This Job (is)

  • Passionate about distributed systems and working with highly scalable services
  • Gains fulfillment developing others and building a postive and collabrative team culture.
  • Enjoys new technological challenges and is motivated to solve them
  • Excited about making better software and continuously improving the development, integration, and deployment processes
  • Smart, highly motivated, self-starter who thrives in a bottoms-up, fast-paced, highly technical environment
  • Effective collaborator, experienced in creating technical partnerships across teams

Responsibilities

  • Provide deep technical leadership to a team of highly passionate and skilled engineers
  • Recruit, on-board, and grow a team of Software Engineers focused on Site Reliability
  • Build, run and improve critical public-sector service environments
  • Coordinate planning and execution with internal engineering teams, business partners and technical leaders across the division
  • Own deployment, availability, reliability, performance and customer escalation targets for these environments
  • Proactive identification and reduction of issues through design, testing, and implementation of software
  • Uphold high organizational standard of great employee and team satisfaction

Qualifications

Required Qualifications:

  • 6+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.
  • 7+ years of Software, Site Reliability, Systems, or Service Engineering experience.
  • Current software development expertise in multiple programming languages (C#, C++, Python, Java, et al)
  • Proven experience with effectively driving improvement and delivering solutions with stakeholders across all levels of an organization
Other Requirements

  • Security Clearance Requirements: Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
    • Candidates must have an active TS and be willing to upgrade to TS/SCI (with polygraph) or have an active TS/SCI and be willing to upgrade to TS/SCI (with polygraph). This role will require candidates to maintain the TS/SCI (with polygraph) clearance. Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. Failure to maintain or obtain the appropriate clearance and/or customer screening requirements may result in employment action up to and including termination.
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
  • Citizenship & Citizenship Verification: This position requires verification of U.S. citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local United States government agency customer and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, citizenship will be verified via a valid passport, or other approved documents, or verified US government Clearance
Preferred Qualifications

  • 7+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    • OR Doctorate Degree in Computer Science, Information Technology, or related field.
  • 3+ years technical experience working with large-scale cloud or distributed systems.
  • 3+ years people management experience.
  • Experience designing, building, servicing, and driving ongoing improvement of service infrastructure & systems
  • Proven track record of improving reliability, available and performance of cloud services
  • Technical understanding of Office 365 and Exchange architecture
  • Previous work experience requiring government screening and clearance
Site Reliability Engineering M4 - The typical base pay range for this role across the U.S. is USD $112,000 - $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 - $238,600 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

#m365core

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.