Candidate needs to be very strong Cloudera/ Enterprise Data Lake(EDL) Application architect

Candidate should be hands on and has Technical Expertise on Cloudera Stack – Hadoop, YARN, Spark, Hbase, Hive, Pig, Oozie, Sqoop, Flume, Kafka, R

Candidate should have led complex EDL Build projects using Agile / Scale Agile Framework to Ingested data from Heterogeneous source systems into EDL using Best Data Ingestion framework / Strategy and application should have scaled very well to handle Global / Enterprise data volumes.

Designed and Implemented Complex Data Transformation / Harmonization and EDL Semantic layer to enable Tableau based Reporting and Self service Analytics.

Candidate should have delivered big EDL projects using Distributed Delivery model.

Secondary ASK

Knowledge of Alteryx based Data blending and modelling is added advantage.

About Capgemini

With more than 190,000 people, Capgemini is present in over 40 countries and celebrates its 50th Anniversary year in 2017. A global leader in consulting, technology and outsourcing services, the Group reported 2016 global revenues of EUR 12.5 billion (about $13.8 billion USD at 2016 average rate). Together with its clients, Capgemini creates and delivers business, technology and digital solutions that fit their needs, enabling them to achieve innovation and competitiveness. A deeply multicultural organization, Capgemini has developed its own way of working, the Collaborative Business ExperienceTM, and draws on Rightshore, its worldwide delivery model.

Learn more about us at

Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

Click the following link for more information on your rights as an Applicant:

Apply now