Principal Domain Architect Job at National Grid plc, Waltham, MA

OVBRRFp5ZWw4a3hMSVdlL3JibU9CL0V0Y3c9PQ==
  • National Grid plc
  • Waltham, MA

Job Description

Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Division: IT Global Solution Development Job Type: Requisition Number: 62957 Department: Job Function: Information Technology Every day we deliver safe and secure energy to homes, communities, and businesses. We connect people to the energy they need for the lives they live. The pace of change in society and our industry is accelerating and our expertise and track record puts us in an unparalleled position to shape the sustainable future of our industry. To be successful we must anticipate the needs of our customers, reducing the cost of energy delivery today and pioneering the flexible energy systems of tomorrow. This requires us to deliver on our promises and always look for new opportunities to grow, both ourselves and our business. IT and Digital works in a harmonised partnership with the National Grid group of diverse energy businesses to deliver technology which revolutionises the way we operate. As we lead the charge towards a carbon-free future, our teams are embracing disruptive changes in our industry by working with Agile methodologies and adopting Digital mindsets to drive efficiency and bring new capabilities for our internal and external customers. Our work here is critical. National Grid moves energy to millions of homes and businesses in the UK and US and the technology we utilise to complete that task is down to us. The successful applicant for this position will be an integral contributor towards this goal and we will support your professional development as part of our multi-cultural, customer-centric global team. National Grid is hiring a Principal Domain Architect. This is a hybrid position open to our offices in Waltham, MA, Syracuse, NY or Brooklyn, NY. Job Purpose As a Principal Domain Architect for AI Ops and Site Reliability Engineering, your primary objective is to design and oversee the implementation of complex systems that meet functional and non-functional requirements. You will play a key role in developing system design policies, standards, and innovation processes specific to AI Ops and SRE. Additionally, you will actively monitor emerging technologies and assess their potential impact on the organization. Your responsibilities will include driving the strategic vision for AI Ops and SRE within the domain, ensuring alignment among stakeholders and promoting a cohesive approach. Key Accountabilities Developing AI Ops and Site Reliability Engineering (SRE) Strategies: As a Principal Cloud Domain Architect, your primary responsibility is to develop comprehensive strategies and architectures for implementing AI Ops and SRE practices within the data center and cloud domain. This involves understanding business requirements, assessing technical capabilities, and identifying areas where AI and automation can be leveraged to enhance reliability, performance, and operational efficiency. Designing Cloud Architecture Solutions: You will be responsible for designing cloud and on-premise architecture solutions that integrate AI technologies and SRE principles into the existing cloud infrastructure. This includes designing scalable and resilient systems, implementing monitoring and alerting mechanisms, and ensuring high availability and fault tolerance in the cloud environment. Collaborating with Development and Operations Teams: As a Principal Architect, you will work closely with development and operations teams to provide technical guidance and ensure the successful implementation of AI Ops and SRE practices. This involves reviewing designs, providing recommendations, and promoting best practices for building and operating reliable and efficient cloud-based applications. Implementing AI-Driven Monitoring and Analytics: You will be responsible for implementing AI-driven monitoring and analytics solutions in the cloud domain. This includes leveraging machine learning and data analysis techniques to identify and predict system anomalies, performance bottlenecks, and potential failures. These insights help in proactively addressing issues and optimizing the performance of cloud-based systems. Establishing Incident Response and Resolution Processes: You will define and establish incident response and resolution processes aligned with SRE practices within the cloud and on-premises domain. This includes setting up incident management frameworks, defining escalation paths, and implementing effective incident response strategies to minimize downtime and ensure quick resolution in the cloud environment. Driving Continuous Improvement and Optimization: As a Principal Architect, you will drive continuous improvement and optimization efforts within the cloud domain. This involves analyzing system metrics, conducting root cause analysis, and implementing changes to optimize cloud performance, reliability, and efficiency. Automation and self-healing mechanisms are often employed to enhance system resilience and reduce manual intervention. Qualifications We are looking for a skilled professional with experience in cloud architecture, automation, and reliability engineering. While the following skills are ideal, we encourage you to apply even if you don’t meet every requirement: Bachelor's degree in a relevant discipline, or an equivalent combination of education, training, and experience. 5-7 years of Cloud Platforms and Automation: Hands-on experience with cloud platforms like Azure (preferred), AWS, or GCP, alongside tools for containerization (e.g., Docker, Kubernetes) and infrastructure automation (e.g., Terraform, CloudFormation). Monitoring and Observability: Familiarity with tools like Prometheus, Grafana, Splunk, or the ELK Stack to ensure system performance and reliability. Continuous Integration and Deployment: Experience with CI/CD pipelines using tools such as GitHub or GitLab CI/CD. Incident Management and Collaboration: Knowledge of incident response tools (e.g., ServiceNow, PagerDuty) and collaboration platforms (e.g., Slack, Teams). Scripting and Databases: Exposure to scripting languages (e.g., Python, Bash) and database systems (e.g., MySQL, MongoDB, Redis). More Information #LI-RK1 #LI-HYBRID Waltham: $162k - $191k a year Brooklyn: $173k - $204k a year Syracuse:$145k - $170k a year This position has a career path which provides for advancement opportunities within and across bands as you develop and evolve in the position; gaining experience, expertise and acquiring and applying technical skills. Candidates will be assessed and provided offers against the minimum qualifications of this role and their individual experience. National Grid is an equal opportunity employer that values a broad diversity of talent, knowledge, experience and expertise. We foster a culture of inclusion that drives employee engagement to deliver superior performance to the communities we serve. National Grid is proud to be an affirmative action employer. We encourage minorities, women, individuals with disabilities and protected veterans to join the National Grid team. #J-18808-Ljbffr National Grid plc

Job Tags

Flexible hours,

Similar Jobs

DOROT

Paid Summer College Internship - Finance Field Job at DOROT

 ...bringing the generations together. For Summer 2025, we are seeking an intern to join our...  ...about the aging population and gain experience in the finance field. This individual will...  ...educational environment. Throughout the internship, this individual should expect to learn... 

Double Arrow LLC

Delivery Driver - Amazon Packages - $20.25 Job at Double Arrow LLC

 ...Double Arrow, LLC, an Amazon Delivery Service Partner (DSP), is looking for enthusiastic, high performing team players. Established...  ...of age or older to apply. Must have a good to excellent driving record. Must be able to speak, read and write in English.... 

The Walsh Group

Skilled Laborer Job at The Walsh Group

Overview: Archer Western Constructioin a member of The Walsh Group is currently seeking a Skilled Laborer for the SETN Region/I-26 Widening MM85-MM101 , in Columbia, SC . The Skilled Laborer will perform tasks involving physical laborer on construction... 

Integrated Real Estate Group

Activities Assistant - Landing at Watermere Woodland Lakes, Full Time Job at Integrated Real Estate Group

 ...Landing at Watermere Woodland Lakes is a brand-new resort-style assisted living community in the heart of Conroe, Texas. Get paid...  ...with ZayZoon ! Quick access up to 50% of your earned wages! Activities Assistant, Full Time The Activities Assistant is responsible... 

Yale New Haven Health

Perinatal Educator-CPR Instructor Job at Yale New Haven Health

 ...Perinatal Educator-CPR Instructor at Yale New Haven Health summary: The Perinatal Educator-CPR Instructor provides essential education to expectant parents and families, focusing on childbirth preparation and early parenting. This role involves collaboration with the...