Datacenter Forensic Engineer

Dublin, Dublin, Ireland
Jun 11, 2024
Jun 24, 2025
Onsite
Full-Time
4 Years
Job Description

Microsoft Cloud Operations + Innovation (CO+I) seeks a highly motivated and skilled Datacenter Infrastructure Forensic Analyst to join our team. In this role, you will play a critical part in ensuring the reliability and performance of Microsoft's global data centers, supporting services for over 1 billion customers and 20 million businesses worldwide.

Responsibilities

  1. Lead Forensic Analysis. Lead forensic analysis of events occurring within the data center infrastructure, identifying root causes and implementing solutions to prevent recurrence.
  2. Subject Matter Expertise. Serve as a subject matter expert on all aspects of data center functions and failure modes in critical environments, providing insights and guidance to internal teams.
  3. Performance Validation. Develop methodologies to validate data center performance, control parameters, and operational efficiency against design intent, ensuring adherence to standards.
  4. Troubleshooting and Root Cause Analysis. Conduct troubleshooting and root cause analysis associated with equipment failure, leveraging analytical skills to identify issues through trend analysis.
  5. Compliance Review. Review compliance with existing corrective and preventative maintenance programs, enhancing operational readiness through proactive measures.
  6. Staffing Analysis. Analyze full-time employee and vendor staffing, including training, procedures, and site requirements, as part of root cause analysis and solution implementation.
  7. Process Improvement. Foster a culture of continuous improvement by proactively implementing lessons learned from analysis across multiple design, construction, and operational organizations.
  8. Solution Development. Develop solutions for defects identified through trends and data analysis, driving global standardization and consistency of processes, procedures, and reports.
  9. Collaboration and Communication. Collaborate with Site Operations Engineers to establish visual standards, process improvements, and error-proofing systems to drive efficiency and availability within the business.
  10. Tool Evaluation and Implementation. Identify and monitor the need for new tools to improve the quality of data and analytics, evaluating and implementing solutions as needed.

Qualifications

  • Bachelor's Degree in electrical, mechanical, or controls engineering or equivalent experience.
  • Solid experience working in critical environments.
  • Proficiency in data center or critical environment mechanical and electrical systems.
  • Communication and leadership skills to drive and support projects.
  • Proficient in Root Cause Analysis methodologies and creating Failure Mode and Effects Analyses (FMEA).
  • Experience leading construction, design, and process reviews.
  • Understanding of data center topologies or equivalent mission-critical facility background.
  • Strong analytical skills with the ability to summarize complex data.
  • Team player with the ability to influence cross-functional teams.
  • Good written and verbal communication skills.

Other Requirements

Ability to meet Microsoft, customer, and/or government security screening requirements, including Microsoft Cloud Background Check.

Join us in our mission to empower every person and organization on the planet to achieve more. Apply now to be part of a dynamic team driving innovation and excellence in the cloud infrastructure space at Microsoft.