Site Reliability Developer 3
Bucharest, RO,Romania, RO
3 zile în urmă

Preferred Qualifications

Oracle's Demonstration Services

Our mission is to continuously deploy, integrate, and manage Oracle's products to create rich content to demonstrate Oracle's Cloud platform to potential and current Oracle customers. In order to support this mission we deliver applications and tooling, what we call our Demo Cloud Platform, to Oracle's Sales, Consulting and strategic partners to manage their demonstration environments. Our applications and tooling support billions, yes billions, of dollars in sales annually. This is where you come in. Reliability of the applications and tools that support the demonstration environments are critical to Oracle's success as a leading cloud provider.

Site Reliability Engineering Team

The SRE team is responsible for the overall health, performance and reliability of our Demo Cloud Platform. Demo Services (DS) leverages Oracle Cloud services as well as open source components to deliver a full featured, self-service and extensible demo platform for Oracle Sales, Consulting and strategic partners. In order for Demo Services to meet the goals of the Business, the SRE team works alongside the DS Development and DS Architecture teams to rapidly deploy new functionality for the platform through CI/CD methodologies. We are looking for candidates that have a strong passion for automation and are enthusiastic to pickup new technologies, product stacks and industry current solutions. 

As part of the SRE team you...

  • Will develop, enhance and maintain automation solutions to create infrastructure as code in Oracle Cloud aimed to improve and support productivity of the Demo ecosystem

  • Possess a contagious sense of ownership and are capable of using all available tools to solve any issues you encounter
  • Extend and improve ChatOps automation to speed-up deployment, triaging, system tracking and metrics

  • Act as the primary point of contact for Production incidents, perform detailed root cause analysis, identify and resolve underlying problem patterns, while developing automated and self-healing solutions

  • Monitor, detect and troubleshoot issues during code deployments in Production. Analyze real-time data to determine impact and advise development on release GO/NO-GO

  • Participate in the development of tools and processes that leverage observability best practices to proactively identify and resolve issues before they become incidents

  • Work hand-in-hand with the Development team and participate in team rotations

  • Your Skills..

  • You have a Bachelor’s Degree in Computer Science, Software Engineering, Information Systems or equivalent and 4+ years of relevant work experience.
  • You have worked in an SRE/DevOps role and managed highly complex production environments at scale
  • You are fluent in writing code, with 3+ years of experience in developing in languages like JavaScript, NodeJS, Java, Python & Perl
  • You have developed tools and provided scalable, maintainable and autonomized solutions to support mission-critical applications
  • You have practical experience with continuous integration and continuous delivery methodologies
  • You have hands-on experience with orchestration and configuration management tools such as Ansible, Terraform, Puppet or others
  • You have deep understanding of monitoring and observability best practices across distributed systems
  • You have a solid foundation on network concepts - DNS, load balancing, VCN, firewall, proxy server
  • You are intimately familiar with Linux and its administration life cycle - deployment, upgrading, compiling, and debugging
  • You have a solid foundation in database administration and are comfortable with the complete database Life Cycle, including provisioning, backup&recovery, cloning, performance tuning, maintenance and troubleshooting
  • You are able and willing to work in on-call rotation that will include weekend coverage
  • Your Bonus Skills...

  • You have a Masters Degree in Computer Science or related studies
  • You have experience in working with major cloud platform(s): Oracle Cloud, Microsoft Azure, Google Cloud Platform or AWS - any certification(s) a plus
  • You have experience with Container and Container Management technologies: Docker, Kubernetes
  • You are adept with SQL, PL/SQL and query performance tuning
  • You have worked with monitoring solutions such as Prometheus, Grafana, Nagios, Oracle Enterprise manager/Management Cloud or similar software
  • Raportați această lucrare

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    Email-ul meu
    Făcând clic pe "Continuă", acord nevoo consimțământ de a procesa datele mele și de a-mi trimite alerte prin e-mail, așa cum este detaliat în policyApplicația de confidențialitate a lui neuvoo. Pot să-mi retrag consimțământul sau să mă dezabonez în orice moment.