Data Engineer
ALTEN
Bucureşti, Romania
3 zile în urmă

General Responsibilities :

  • Quality of data transformed in the Datalake, proper operation of data processing systems and optimizing the use of on-
  • premise or cloud cluster resources by data processing channels.

    Responsibilities :

    During project definition :

  • Design of data ingestion chains
  • Design of data preparation chains
  • Basic ML algorithm design
  • Data product design
  • Design of NoSQL data models
  • Design of data visualizations
  • Participation in the selection of services / solutions to be used according to uses
  • Participation in the development of a data toolbox
  • During the iterative implementation phase :

  • Implementation of data ingestion chains
  • Implementation of data preparation chains
  • Implementation of basic ML algorithms
  • Implementing data visualizations
  • Using ML framework
  • Implementation of data products
  • Exhibition of data products
  • NoSQL database configuration
  • Use of functional languages
  • Debugging distributed processes and algorithms
  • Identifying and cataloging reusable elements
  • Contribution to the development of working standards
  • Contribution and opinion on data processing problems
  • During integration and deployment :

  • Participation in problem solving
  • Requirements :

  • Experience in the implementation of end-to-end data processing chains and Big data architectures (Hadoop cluster, noSQL databases, Elastic search) mastering languages and frameworks for distributed data processing (Spark / Scala).
  • Practice of Agile methods.

  • Expertise in the implementation of end-to-end data processing chains
  • Mastery of distributed development
  • Basic knowledge and interest in the development of ML algorithms
  • Knowledge of the ingestion framework
  • Knowledge of Spark and its different modules
  • Proficiency of Scala and / or Python
  • Knowledge of the AWS or GCP ecosystem
  • Knowledge of the ecosystem of NOSQL databases
  • Knowledge in building API data products
  • Knowledge of Dataviz tools and libraries
  • Spark ease in debugging and distributed systems
  • Extension of complex systems
  • Proficiency in the use of notebook data
  • Experience in data testing strategies
  • Strong problem-solving skills, intelligence, initiative and ability to withstand pressure
  • Strong interpersonal skills and great communication skills (ability to go into detail)
  • Aplică
    Adaugați la favorite
    Eliminați de la favorite
    Aplică
    Email-ul meu
    Făcând clic pe "Continuă", acord nevoo consimțământ de a procesa datele mele și de a-mi trimite alerte prin e-mail, așa cum este detaliat în policyApplicația de confidențialitate a lui neuvoo. Pot să-mi retrag consimțământul sau să mă dezabonez în orice moment.
    Continuă
    Formular