Senior Software Engineer
Atlanta, GA 
Share
Posted 13 days ago
Job Description
OverviewMicrosoft Cloud Operations & Innovation (CO+I) is the engine that powers Microsoft cloud services through the operation of our unified global datacenters enabling ~30% of Microsoft revenue through Commercial Cloud ($38 billion in FY20 Q1). The Cloud Infrastructure Health team in CO+IE is focused on improving Cusomer Availability, Data center Safety, Capacity and helping optimize the utilization of Datacenter resources using telemetry and Insights. Our systems analyze petabyte scale telemetry data from Datacenter critical environments and secondary signals in near real time and offline that enable timesensitive insights directly impacting Cloud Operations. CO+IE is looking for a Senior Software Engineer to work closely with fellow software engineers, product managers, data scientists, and analysts to build Software solutions to Ingest petabyte scale scritical environment telemetry, data products, real time detections using Machine Learning and Heuristics. Our engineering team is at the forefront of Microsoft's cloud computing transformation working with state of the art distributed systems that deal with near real time detections on petabyte scale telemetry using Machine Learning and traditional software to deliver on Cloud Availability and Safety goals. Come joing a team of talented engineers delivering world class Software solutions building Petabyte scale Big Data Pipelines to ingest and process Data center Critical Environment Telemetry and draw deep insights designed to improve cutsomer availability of Microsoft Cloud, safety and Infrastructure optimization. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
ResponsibilitiesDesign, develop andoperatelarge scale,efficientand reliablecloud services.Write high quality, maintainable and high-performance code following demonstrated development principles.Mentor other engineers by teaching best practices and encouraging thought leadership.Work with Project Managers and business stakeholders to design and deliver new features, collaborating with partner teams across the org to ensure successful launches.Identifyopportunities and drive the implementation of monitoring, self-healing, and automation capabilities to improve service manageability and reliability.Participate in an on-call rotation (typically 24/7 for one week every 6-8 weeks) acting as Designated Responsible Individual (DRI) to monitor production systems for degradation, downtime, or interruptions, alerting stakeholders about status and taking appropriate actions to restore system/product/service.Investigate and resolve Customer Reported Incidents, continually looking for ways to minimize or eliminate future incidents and improve customer experiences.OtheEmbody our Culture and Values

 

Job Summary
Company
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Experience
Open
Email this Job to Yourself or a Friend
Indicates required fields