Required Skills/Responsibilities:
?Expertise and knowledge: Cloudera Data Platform, Oozie, Hive, Spark, Spark Streaming and Presto
Data Pipeline Development:
Design, develop, and implement scalable data pipelines using Cloudera tools like Hadoop, Spark, Hive, Impala, and HDFS.
Write and optimize ETL processes to extract, transform, and load data into data lakes or warehouses.
Big Data Application Development:
-??System dimensioning (computational resources/Storage/Networks).
-??System reconfiguration in case of HW extension and/or replacement.
-??OS and Cloudera Software upgrades.
-??Cloudera SW vulnerabilities and patching management.
-??Access and permission management.
-??Installation of any other Cloudera application if needed.
-??Assist in Designing scalable architectures for high volume data.
-??Ensure E2E pipeline stability for already developed and future use cases.
-??Performance tuning of Spark workflows.
Srinithi / srinithi@vysystems.com
Key Skills: