Expertise and knowledge: Cloudera Data Platform, Oozie, Hive, Spark, Spark Streaming, and Presto
Data Pipeline Development:
Big Data Application Development:
Cluster Management:
- System dimensioning (compute resources, storage, and network).
- System reconfiguration in case of hardware extension and/or replacement.
- OS and Cloudera software upgrades.
- Cloudera software vulnerability and patch management.
- Access and permission management.
- Installation of other Cloudera applications as needed.
Data Storage and Management:
Performance Tuning:
- Assist in designing scalable architectures for high-volume data.
- Ensure end-to-end (E2E) pipeline stability for existing and future use cases.
- Performance tuning of Spark workflows.
Integration and Collaboration:
Key Skills: