Required Skills/Responsibilities:
Expertise and knowledge: Cloudera Data Platform, Oozie, Hive, Spark, Spark Streaming and Presto
Data Pipeline Development:
Big Data Application Development:
Cluster Management:
Work with Cloudera Manager for cluster setup, configuration, monitoring, and performance optimization.
Ensure high availability and scalability of Cloudera clusters.
System dimensioning (computational resources/Storage/Networks).
System reconfiguration in case of HW extension and/or replacement.
OS and Cloudera Software upgrades.
Cloudera SW vulnerabilities and patching management.
Access and permission management.
Installation of any other Cloudera application if needed.
Data Storage and Management:
Performance Tuning:
Assist in Designing scalable architectures for high volume data.
Ensure E2E pipeline stability for already developed and future use cases.
Performance tuning of Spark workflows.
Integration and Collaboration:
Key Skills: