Junior Data Engineer
6.0/10
МТС
Not specified
Remote
junior
about 1 month ago
May be outdated
ai, data, tech, ETL, SQL, Apache Spark, Python, Git, Hadoop, Airflow
AI Summary
The vacancy is well-defined but gives no compensation details, which makes it less attractive to applicants.
Description
Responsibilities
- Develop ETL processes for data extraction, cleansing, transformation, and loading;
- Integrate new data sources into existing architecture;
- Adapt and refine existing processes to changing business requirements;
- Write and optimize SQL queries for large data volumes;
- Participate in the creation and support of data warehouses.
Requirements
- Experience with Apache Spark (fundamental concepts: RDD, DataFrame, transformations, actions);
- Proficiency in SQL (JOIN, UNION, window functions, understanding of relational databases);
- Basic knowledge of Python (syntax, collections, functions);
- Understanding of the difference between ETL and ELT; familiarity with orchestration concepts (Airflow);
- Experience with Git (repositories, branches, commits);
- Familiarity with the Hadoop ecosystem (HDFS, YARN, Hive);
- Willingness to learn and to understand tasks in depth;
- Advantage: experience with Kafka, Flink, Spark Streaming.