We’re looking for hands-on engineers with a passion for solving problems in Database, distributed systems, virtualized infrastructure, and highly available services. Joining Oracle and System Test team will give you the opportunity to learn and help build innovative new systems from the ground up and operate services at large scale.

Engineers at every level can have significant technical and business impact while delivering critical enterprise level features.

As an  SRE, you will work as part of a highly collaborative team to support features/tools while operating and growing the current service offering. You should value simplicity and scale, work comfortably in a collaborative environment, be familiar with ITL and Agile methodologies, produce good technical documentation and be excited to learn. 

 

AS or BS degree or equivalent experience relevant to functional area.

 

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.

  • 2+ years’ experience supporting commercial software in a distributed environment.
  • Good knowledge of Oracle Databases, Any high level language (Python, Java), Bash programming.
  • Strong knowledge of Operations, Linux, operating systems, distributed systems fundamentals and Networking
  • Strong troubleshooting, debugging skills.
Responsibilities displayed in the job posting 

Dealing with operational tickets that have well defined Runbooks. These include:

  • Auto generated tickets that are associated with known problems and need to be cleared manually - pending service enhancements or bug fixes.
  • Human cut tickets that are associated with common customer issues or require some initial triage before escalating.
  • Participating in Large Scale Events involving Service dependencies.
  • Learn and Adapt for new technologies at a high pace environment.
  • Identify opportunities for improvement and documenting operational processes 
  • Create and track bug fixes.
  • Enhance the service.
  • Identify missing metrics.
  • Proficient in Change Management best practices. 

Duties and tasks are varied and require independent judgment.

Technical Skills
Is a Remote Job?
Remote
Employment Type
Full time

Oracle is the cloud leader for global business. Present in over 175 countries, we’re one of the biggest technology companies on the planet. With a fully integrated suite of cloud applications and infr...

Apply Now