• Principal Service Reliability Engineer Jobs in United States Of America - 25312637

  • United States Of America, Usa
  • Save Job
  • 7 - 10 Years
  • Posted : above 1 month

Job Description:

Principal Service Reliability Engineer - 19001E37 No Visa Sponsorship is available for this position

Preferred Qualifications

The Oracle ERP Cloud Operations is looking for passionate, innovative, high caliber, team oriented super stars that seek being a major part of a transformative revolution in the development of modern business cloud based applications As part of market leading ERP Cloud, Oracle ERP Cloud Operations offers a broad suite of modules and capabilities designed to empower the development organization with world-class service reliability engineering disciplines and deliver customer success with streamlined processes, increased productivity, and improved business decisions

Oracle, the world leader in Enterprise Cloud, is hiring the best and brightest technologists in the industry as we continue to add customer-centric, world-class, leading edge, secure, hyper-scale based solutions throughout all levels of the cloud stack Oracles cloud eco-system is the only complete business cloud platform on the planet, with market leading and business transforming solutions spanning SaaS, DaaS, PaaS and IaaS Oracles Cloud applications, such as Enterprise Resource Management, Customer Relationship Management, Human Capital Management, and Supply Chain Management are used by thousands of customers across the globe and are the broadest, most innovative in the industry, providing businesses with adaptive intelligence, standardized business processes and competitive advantage at low cost

Key Tasks and Responsibilities

Service Ownership You will be part of the SRE team, whose mission is the shared full stack ownership of a collection of services, with our Service Development and Operations SRE partners
Ownership Scope You will understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the production services you own In partnership with your Service Development and Operations SRE partners, you will have the responsibility to ensure that services are designed and delivered to be mission critical with focus on monitoring, telemetry, security, resiliency, scale, and performance
Service Requirements - You will provide direction and prioritization to service Product Management and Service Development teams to engineer and add premier SRE capabilities to the Oracle SaaS/ERP services
Incident Response You will be the primary author of technical content for both customer and internal communication used throughout the incident response process, eg postmortem/root cause analysis, end-to-end repair item definition, fixes in production
Prevention - Using data-driven incident findings, you will work on solutions that will ultimately prevent the incident/problem from arising ever again, and interim solutions to more quickly resolve the problem next time
Technical Experts - You are the ultimate escalation point for complex or critical issues that have not yet been documented as SOPs for Level1 staff You will usually get called in during major incidents as an SME, when the source of a problem is unclear You will have the deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations
Evangelize and Educate You will play a critical role in making the transformational culture change to an SRE mindset within Service Development You will be responsible for evangelizing and educating Service Product Management and Service Development on the service centric, full stack approach and principles of SRE as well as the architectures and solutions used for Oracle SaaS/ERP services
Operations Engineering You will understand and be able to communicate the scale, capacity, security, performance attributes and requirements of the services you own You are a Subject Matter Expert, able to understand and communicate every characteristic of your service stack, such as Degradation and behavior under load of the services and their dependencies
End-to-end tuning needs, optimizing resource utilization, as load patterns fluctuate
Instrumentation and metrics that clearly describe the service behaviors
Scaling requirements and patterns
Resiliency and recoverability, ensuring that backup / restore and disaster recovery capabilities are implemented, tested and maintained

Automation You will have a clear understanding of automation and orchestration principles, and will be eager to automate, wherever and whenever the possibility arises, while simultaneously eliminating technical debt Automation must be part of your DNA

Skills and Qualifications

Minimum of 5 years of software development, with demonstrated knowledge of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews, source control, build and release processes, continuous deployment, and test suite development and maintenance
Experience deploying and running large scale online systems built on Cloud platforms such as Oracle Cloud, AWS, Azure, Google Cloud Platform, and/or OpenStack
Experience designing and implementing solutions for platform and application layer telemetry, monitoring, scalability, performance and reliability
Excellent written and verbal technical communications with technical and non-technical peers, customers, and at times, executive leadership
Proven success in contributing in a collaborative, team-oriented environment, with the ability to establish and nurture relationships between multiple teams and navigate dependencies
3 years of experience Working in systems and network administration, application security, DevOps and/or Site Reliability Engineering
Hands-on with web protocols and Linux/Unix tools and architecture, from kernel to shell, file systems, and client-server protocols
Using C#, PowerShell/Shell script, ASPNET/MVC, JavaScript, TypeScript, React, or T-SQL
Maintaining, analyzing, and troubleshooting large-scale distributed services
Building automated tools in Python, Java, GoLang, and/or Ruby

Profile Summary:

Employment Type : Full Time
Eligibility : Any Graduate
Industry : Software Services, IT-Software
Functional Area : IT Software : Software Products & Services
Role : Software Engineer
Salary : As per Industry Standards
Deadline : 03rd Jun 2020

Key Skills:

These free online tutorials may interest you

People who search this job also searched for the following Keywords

Salary trends based on over 1 crore profiles

View Salaries

All rights reserved © 2018 Wisdom IT Services India Pvt. Ltd DMCA.com Protection Status