Alfa-Bank

Data Engineer

6.0/10

Not specified
Hybrid
mid
28 days ago
ai, data, tech, fintech, Python, SQL, Airflow, PySpark, Oracle, Postgres, Greenplum, numpy

AI Summary

The vacancy is well-defined but lacks compensation details, affecting overall attractiveness to applicants.

Description

Responsibilities

  • Implement high-load data processing pipelines to ensure reliable data replication from the Bank's IT systems.
  • Prepare data in target analytical storage (DataLake, SandBox, FeatureStore) for building features necessary for machine learning models.
  • Develop and maintain documentation for the developed functionality.
  • Reflect task status in Jira in a timely manner.
  • Review code quality (code review) written by data engineers and junior data engineers.

Requirements

  • Python - strong knowledge of data structures and algorithms, effective application of OOP and FP principles, experience in writing unit and integration tests, knowledge and experience with data processing and analysis libraries - numpy, pandas.
  • Experience in developing and implementing services for loading and processing unstructured and weakly structured data (text, xml, json) from external sources.
  • Ability to understand data provider APIs using available documentation.
  • SQL - ability to create complex queries using analytical window functions and use profiling tools to optimize their performance, experience with Oracle, Postgres, Greenplum databases.
  • Strong knowledge and experience with development, planning, and monitoring tools (workflow engines) for batch data processing.
  • Airflow - experience in developing complex, high-load data processing applications based on PySpark, strong knowledge of Spark settings and their impact on Spark application performance.