• Experience in installation and configuration of various versions of Hadoop MR, YARN and High availability clusters all flavors of Hadoop (Apache, Cloudera & Hortonworks).
• Implemented High Availability and automatic failover infrastructure to overcome single point of failure for Name node utilizing Zookeeper services.
• Performed Backup and Recovery process in order to Upgrade Hadoop stack.
• Excellent knowledge of Spark architecture and its components - Spark core, Spark SQL and Spark Streaming.
• Strong Knowledge of using SPARK, PIG and Hive for processing and analyzing large volumes of data.
• Experienced in Writing Hive queries for data analysis and to process the data for visualization using Ambari views, Zeppelin and Hue.
• Created Hive internal and external tables to load data from DB2 database using Sqoop.