• Cloud Incident Engineer Jobs in Seattle - 23296194

  • Seattle
  • 5 - 8 Years
  • Posted : more than 1 month ago

Job Description:

Cloud Incident Engineer - 19000YRY

Preferred Qualifications

The Oracle Cloud Infrastructure (OCI) Operations team is seeking accomplished and passionate individuals to lead and evolve our Incident Management practice to become a best-in-class service offering

The primary function of a Cloud Incident Engineer is to direct Subject Matter Experts (SMEs) and Service(s) leaders to restore service as quickly as possible during Major Incidents while keeping accurate and timely data on the progress of such incidents and keeping senior leaders, stakeholders and end users updated

Incident Commanders are also responsible for building and evolving the practice of Incident Management across OCI, using Post Incident Reviews and developing processes and systems that leverage the related metrics to identify and drive process and procedural improvements globally

Who are you

Passionate about Cloud, customer focused, have done incident management and problem management, and thrive in a dynamic team culture

A technologist at heart, curious about how things work and how things break - likely to be someone who enjoys finding a better way to do things using automation

Able to build, maintain and leverage key relationships with internal stakeholders and service leaders to drive increased engagement and accountability for your work

Love technology and how to apply it. Maybe you have set up your own environment in the cloud or have spent time developing apps or games that you share with others

Strong communicator who is passionate about the customer experience

Motivated to be resourceful, innovative and entrepreneurial

Driven to learn about cloud infrastructure and its inter-dependencies

Humble and committed to always improving

Key Responsibilities

Provides leadership in responding to and resolving major incidents that impact business critical services, applications and infrastructure for OCI

Leverages broad technical expertise to convene appropriate SMEs (resolvers) and to direct Major Incident response, with focus on impact mitigation and service restoration

Works closely with SMEs to quickly identify customer impact (who, how, when)

Conducts escalation to service teams, senior management and leaders to ensure appropriate awareness, engagement and focus

Produces accurate and timely communications tailored to relevant audience (Senior Leaders and internal Stakeholders)

Leads and/or participates in Post Incident Review and Problem Management meetings with key stakeholders and service owners to review events and opportunities for ongoing improvement

Documents pertinent information relating to Incidents that aids process improvement, identifies deviations and enables the creation of an Incident Knowledge Base

Monitors and evaluates high-level service and infrastructure dashboards and takes action to address identified anomalies

Collates and analyses incident-based data for team metrics and KPIs

Identifies opportunities and takes ownership for automation and/or continuous improvement of Incident Management process steps and best practices

Proactively engages with Service teams to identify and evaluate gaps in operational capabilities and improvements to support Cloud scalability and resiliency

Represents Incident Management at relevant software team Roadmap planning and backlog reviews, influencing the prioritization of automation and tooling enhancements

Works as part of the Major Incident Management team to ensure that the team achieves its defined performance targets and KPIs


Qualifications

Have a broad and deep knowledge of cloud infrastructure and related technologies

Experience in technical troubleshooting, with broad expertise in core infrastructure technologies (e.g., server, compute, storage, network, authentication, databases)

Able to review and edit automation code (e.g., Python, JavaScript, Linux shell) and data objects written in JSON or XML

Experience in managing and tuning systems and/or applications, with ability to review and validate system test output

Understand IP networking fundamentals and be familiar with Data Center network architectures and standard protocols (e.g., BGP, OSPF)

Experience in influencing internal/external teams within a diverse/large organization and skilled at building strong relationships, to deliver required & improved results

Strong leadership skills to direct service teams during Major Incidents that have the potential for significant business impact; remaining calm, professional and focused in high pressure situations

Excellent Incident and Problem Management knowledge and experience

Exceptional written and verbal communication skills with meticulous attention to detail

Able to work unsupervised, independently and within a global team

Experienced user of a trouble ticketing system (Jira, Remedy or similar)

Flexibility to work within a Follow the Sun global shift rota, covering local day-time hours, including holidays and weekends, on a rotational basis

Ability to be on-call as part of an on-call rotation shared across all team members

Ability to manage multiple tasks in a fast-paced, ever changing environment

Ability to think strategically and tactically and work in both a reactive (incident response) as well as proactive engagement model

US Citizenship or US Lawful Permanent Resident status/Protected Person status required (Federal Government customer)

Detailed Description and Job Requirements

Responsible for our production infrastructure, including the servers and services which support our growing client base, as well as designing and implementing highly scalable environments. This Engineer works with other teams in the organization and provides infrastructure solutions for their needs. Understands client systems and applications, networking, infrastructure, data centers, web tools and technologies, databases and Cloud, Big Data, Enterprise Resource Planning (ERP), and more.

Design new scalable solutions for a fast-changing infrastructure environment with complex needs in fields like configuration deployments, monitoring, and logging. Perform deep drill-down analysis into performance bottlenecks and provide necessary fixes. Bring in new ideas, change, evolve, improve and simplify the production infrastructure. Work closely with our development and research teams and provide customer-friendly solutions and support. Responsible for working on the design, development, and/or deployment of enterprise supporting systems.

BS degree or equivalent experience relevant to functional area. Suggested majors include Computer Science or Mathematics. Working knowledge of software development tools, methodologies, and programming languages. Experience working with external or internal customers to implement large scale solutions, business process architecture, application system design, and implementation. Design and implementation of Infrastructure as a Service (IaaS), Platform as a Service (PaaS) and Software as a Service (SaaS) solutions using a variety of cloud platform services. Highly technical and analytical, possessing significant implementation and operations experience. Identifies solutions in experience of application or server architecture and networking. A minimum of 5 years of experience in application or server architecture and networking or related experience.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veteran status or any other characteristic protected by law.

Job : Product Development

Location : US-WA, Washington-Seattle

Job Type : Regular Employee Hire

Organization : Oracle

Profile Summary:

Employment Type : Full Time
Eligibility : Any Graduate
Industry : Software Services, IT-Software
Functional Area : IT Software : Software Products & Services
Role : Software Engineer
Salary : As per Industry Standards
Deadline : 1st Feb 2020

