Job Responsibility :
Cloud Operations Reliability Engineering, part of Cloud Services, has an exciting and challenging mission : Build, deploy, operate, scale and maintain company-
wide platforms (PlaaS) for customer facing Adobe SaaS solutions . While various development groups focus on building our platforms, Cloud Ops provides operational / engineering support for both the platform as well as the product teams that leverage the platforms.
Database Reliability Engineers (DBRE) are responsible for keeping database systems that support all Adobe user-facing services running smoothly 24 / 7 / 365 .
DBREs are a blend of database engineering and DB administration enthusiasts and software crafters that apply sound engineering principles, operational discipline and mature automation, specializing in databases.
In that capacity, DBREs are peers to SREs and bring database expertise to the SRE and Infrastructure teams as well as our engineering teams.
This role will interact with various Adobe cloud native engineering teams that are building customer-facing services by providing operational support and architectural guidance for a series of database technologies.
Areas of Responsibility :
Work on database reliability and performance aspects for core database infrastructure pieces that allow Adobe products to scale
Ensure the highest level of uptime and Quality of Service (QoS) to Adobe’s customers through operational excellence
Work with engineering teams on database architectural designs, performance optimization, environment build-out
Work with peer SREs to roll out changes to production environments and help mitigate database-related production incidents
Serve as the primary point of contact for advanced level troubleshooting issues
Work on automation of database infrastructure
Identify areas to improve service resiliency through techniques such as chaos engineering, performance / load testing, etc
Support and maintain globally distributed, multi-cloud (public and / or private) database environments
Document and automate common, repeatable tasks at large scale to streamline operational procedures and reduce the human footprint
Implement proactive monitoring and alerting
Follow change management processes during implementations
Work in a diverse and global team environment
Cross-train with other global team members
Participate in an on-call rotation as required
Job Requirements :
3-5 years relevant production experience in supporting at-scale, highly available, mission-criticalenvironments running at least one of the following open source database management systems :
oNOSQL - MongoDB, CouchDB, Cassandra
oRDBMS - Percona XtraDB Cluster, MariaDB Galera Cluster, PostgreSQL
at least 1 year of experience running databases in public clouds (AWS and / or Azure)
at least 1 year of experience with infrastructure automation and configuration management (Chef, Ansible, Puppet, Terraform )
Deep understanding of cluster management areas, such as adding / bootstrapping / removing nodes, scaling, consistency tuning, replication, and multi-
Solid understanding of security, monitoring, management and capacity planning for databases
Solid understanding of HA strategies including database replication, clustering, sharding
Experience in creating and implementing full-proof DR plans for vlarge databases including backup and recovery solutions
Solid experience in performance monitoring and storage performance optimization.
Solid experience in tuning database server configurations, queries, and indexes
Strong data modeling and data structure design skills
Good understanding of Linux OS concepts and of Linux and Unix Shell
Proficiency in any of the scripting language ( e.g. Python / PHP / Perl / Ruby)
Demonstrated ability to adapt to new technologies and learn quickly
Excellent written and verbal English communication skills
Exposure to any other open-source DBMS solutions
B.S. degree in Computer Science or related technical field
Experience with Log management tools like Splunk.
Experience working within software development or Internet-related industries, particularly in the context of a SaaS offering.
Understanding of Service Delivery / support Processes or ITIL Framework.
Qualities : Team player
The ability to communicate technical knowledge in a clear, concise and easy to understand manner is essential, as the candidate will participate in technical and customer reviews and must communicate effectively with both technical and non-technical audiences.
Should have the aptitude to work independently on projects / tasks with minimal instructions.
Participation in technical blogging, PoCs and community projects.
Strong sense of humor
At Adobe, you will be immersed in an exceptional work environment that is recognized throughout the world on Best Companies lists.
You will also be surrounded by colleagues who are committed to helping each other grow through our unique Check-In approach where ongoing feedback flows freely.
If you’re looking to make an impact, Adobe's the place for you. Discover what our employees are saying about their career experiences on the Adobe Life blog and explore the meaningful benefits we offer.
Adobe is an equal opportunity employer. We welcome and encourage diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability or veteran status.