Cloud Operations Database Reliability Engineer
5 zile în urmă

Job Responsibility :

Cloud Operations Reliability Engineering, part of Cloud Services, has an exciting and challenging mission : Build, deploy, operate, scale and maintain company-

wide platforms (PlaaS) for customer facing Adobe SaaS solutions . While various development groups focus on building our platforms, Cloud Ops provides operational / engineering support for both the platform as well as the product teams that leverage the platforms.

Database Reliability Engineers (DBRE) are responsible for keeping database systems that support all Adobe user-facing services running smoothly 24 / 7 / 365 .

DBREs are a blend of database engineering and DB administration enthusiasts and software crafters that apply sound engineering principles, operational discipline and mature automation, specializing in databases.

In that capacity, DBREs are peers to SREs and bring database expertise to the SRE and Infrastructure teams as well as our engineering teams.

This role will interact with various Adobe cloud native engineering teams that are building customer-facing services by providing operational support and architectural guidance for a series of database technologies.

Areas of Responsibility :

  • Work on database reliability and performance aspects for core database infrastructure pieces that allow Adobe products to scale
  • Ensure the highest level of uptime and Quality of Service (QoS) to Adobe’s customers through operational excellence
  • Work with engineering teams on database architectural designs, performance optimization, environment build-out
  • Work with peer SREs to roll out changes to production environments and help mitigate database-related production incidents
  • Serve as the primary point of contact for advanced level troubleshooting issues
  • Work on automation of database infrastructure
  • Identify areas to improve service resiliency through techniques such as chaos engineering, performance / load testing, etc
  • Support and maintain globally distributed, multi-cloud (public and / or private) database environments
  • Document and automate common, repeatable tasks at large scale to streamline operational procedures and reduce the human footprint
  • Implement proactive monitoring and alerting
  • Follow change management processes during implementations
  • Work in a diverse and global team environment
  • Cross-train with other global team members
  • Participate in an on-call rotation as required
  • Job Requirements :

  • 3-5 years relevant production experience in supporting at-scale, highly available, mission-criticalenvironments running at least one of the following open source database management systems :
  • oNOSQL - MongoDB, CouchDB, Cassandra

    oRDBMS - Percona XtraDB Cluster, MariaDB Galera Cluster, PostgreSQL

  • at least 1 year of experience running databases in public clouds (AWS and / or Azure)
  • at least 1 year of experience with infrastructure automation and configuration management (Chef, Ansible, Puppet, Terraform )
  • Deep understanding of cluster management areas, such as adding / bootstrapping / removing nodes, scaling, consistency tuning, replication, and multi-
  • datacenter configuration

  • Solid understanding of security, monitoring, management and capacity planning for databases
  • Solid understanding of HA strategies including database replication, clustering, sharding
  • Experience in creating and implementing full-proof DR plans for vlarge databases including backup and recovery solutions
  • Solid experience in performance monitoring and storage performance optimization.
  • Solid experience in tuning database server configurations, queries, and indexes
  • Strong data modeling and data structure design skills
  • Good understanding of Linux OS concepts and of Linux and Unix Shell
  • Proficiency in any of the scripting language ( e.g. Python / PHP / Perl / Ruby)
  • Demonstrated ability to adapt to new technologies and learn quickly
  • Excellent written and verbal English communication skills
  • Desirable :

  • Exposure to any other open-source DBMS solutions
  • B.S. degree in Computer Science or related technical field
  • Experience with Log management tools like Splunk.
  • Experience working within software development or Internet-related industries, particularly in the context of a SaaS offering.
  • Understanding of Service Delivery / support Processes or ITIL Framework.
  • Qualities :
  • Team player
  • Team player
  • The ability to communicate technical knowledge in a clear, concise and easy to understand manner is essential, as the candidate will participate in technical and customer reviews and must communicate effectively with both technical and non-technical audiences.
  • Should have the aptitude to work independently on projects / tasks with minimal instructions.
  • Participation in technical blogging, PoCs and community projects.
  • Strong sense of humor
  • At Adobe, you will be immersed in an exceptional work environment that is recognized throughout the world on Best Companies lists.

    You will also be surrounded by colleagues who are committed to helping each other grow through our unique Check-In approach where ongoing feedback flows freely.

    If you’re looking to make an impact, Adobe's the place for you. Discover what our employees are saying about their career experiences on the Adobe Life blog and explore the meaningful benefits we offer.

    Adobe is an equal opportunity employer. We welcome and encourage diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability or veteran status.

    Adaugați la favorite
    Eliminați de la favorite
    Email-ul meu
    Făcând clic pe "Continuă", acord nevoo consimțământ de a procesa datele mele și de a-mi trimite alerte prin e-mail, așa cum este detaliat în policyApplicația de confidențialitate a lui neuvoo. Pot să-mi retrag consimțământul sau să mă dezabonez în orice moment.