
Principal Software Engineer

Microsoft
United States, Washington, Redmond
Jan 09, 2025
Overview

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and is responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's more than 200 online businesses, including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform globally, with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide, and we are looking for passionate engineers to help achieve that mission.

To achieve this goal, we in the Hardware Health Service team within Azure are responsible for the design, implementation, and operation of globally scalable cloud services that monitor the fleet's hardware health and predict anomalies and pending failures. We focus on delivering solutions required for our cloud service platforms at the lowest possible total cost of ownership (TCO) and providing great customer experiences on unreliable hardware.

Azure Hardware Health Service is looking for a Principal Software Engineer to be a part of the fast-paced and exciting business of Azure. This is your chance to be part of one of the most exciting end-to-end teams within Microsoft. We are looking for a highly motivated Software Engineer with a track record in Cloud Service development to help us develop and light up innovative hardware solutions that power Azure and make our world-class cloud infrastructure even better. To be successful in this role, you will have a track record of delivering quality results to customers, an engineering mindset, an innate aptitude for agility, and technical excellence in software engineering.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities

Design, develop, and operate large-scale, low-latency, and high-throughput cloud services.
Lead and drive highly complex and mission-critical solutions that involve multiple Azure Services.
Conduct A/B analysis, create and validate metrics, and develop ML pipelines and modeling algorithms in the area of Information Retrieval and Machine Learning.
Determine the technique needed and develop analytic models to understand complex business issues and provide data-driven insights by integrating statistical inference, Machine Learning modeling, AI, and/or other advanced analytical methods to manage, classify, and analyze complex data from a variety of sources.
Perform data analysis using a variety of analytical tools (Python, KQL, Azure Databricks, Synapse, Power BI, Fabric, etc.) and interpret results with actionable recommendations.
Define and measure the success and impact of requested analytics and reporting features via quantitative measures.
Lead the development of cutting-edge models based on Hardware Telemetry.
Leverage and advance Deep Learning, Reinforcement Learning, Causal Inference, and other techniques to solve complex problems.
Provide overarching technical leadership and direction to a team of big-data-focused developers to deliver global-scale services that collect signals, monitor the fleet's hardware health, and predict anomalies and pending failures.
Be an active leader in the Azure infrastructure ecosystem, working closely with the core Azure teams and the data center operations teams to ensure customers are not impacted by unreliable hardware.
Take an active role and partner with internal peer teams and external partners to ensure highly available, fully secure, accurate, and actionable results based on hardware health signals, policies, and predictive analytics.
Embody our culture and values.