Site Reliability Engineer, Fitbit
Bucharest, Romania
6 days ago

The Fitbit Site Reliability Engineering team ensures that Fitbit's site and backend services are available and healthy, and that customers have a positive experience.

We advocate best practices to measure, manage and enhance site reliability. We encourage others to treat change, operational flexibility and observability as first-class concerns and make informed tradeoffs between functional and operational goals.

We are involved in incident and change management. We also act as consultants for engineers when new code and services are getting ready to launch.

Our goal is to transform the system so that every service has service-level objectives (SLOs) and an appropriate amount of monitoring and alerting, allowing teams to balance shipping fast with maintaining the stability and reliability of their features and services.

You are familiar with Java and its ecosystem, and can contribute to the SRE team's commitments in building, scaling, and operating Java-based applications.

Google's mission is to organize the world's information and make it universally accessible and useful. Our Devices & Services team combines the best of Google AI, Software, and Hardware to create radically helpful experiences for users.

We research, design, and develop new technologies and hardware to make our users' interaction with computing faster, more seamless, and more powerful.

Whether finding new ways to capture and sense the world around us, advancing form factors, or improving interaction methods, the Devices & Services team is making people's lives better through technology.


Responsibilities:

  • Create and own technical design documents
  • Write code in Java and perhaps Python
  • Act as a product owner for the tools, systems, and / or applications you work on
  • Contribute to process improvements that boost productivity and quality
  • Participate in the team’s production on-call rotation ("follow the sun")

Minimum qualifications:

  • Software engineering and programming experience in Java
  • Experience as a site reliability engineer
  • Experience with Unix / Linux operating systems

Preferred qualifications:

  • Experience being part of an on-call rotation and responding to production incidents
  • Experience with cloud computing platforms
  • Knowledge of the Python programming language and ecosystem
  • Knowledge of the internals of applications and frameworks such as Kafka, Aurora / Mesos, ZooKeeper, Spring, Hibernate, Finagle, Thrift, gRPC, Prometheus, Elasticsearch, Kibana, Grafana, and Terraform
  • Ability to effectively lead small to medium-sized projects