Understanding of distributed computing principles
Management of Hadoop, Spark, Mesos clusters.
Issue handling in cluster
Experience with stream processing systems (Spark streaming)
Some experience with NoSQL databases (HBase, Cassandra, MongoDB)
Knowledge of various ETL and logging frameworks (Flume, Splunk)
Experience with Cloudera/MapR/HortonWorks
Familiarity with cluster management frameworks (Mesos, Yarn)
Working knowledge and experience with either Python or Scala
Knowledge of Linux OS and scripting.
Good understanding of PostgreSQL, Columnar databases.
Must be strong in one of these programming languages i.e. Java, C++, Python
- Experience Level Mid-Level
- No. Of Positions 1
- Experience in Years 2