Data Engineer Spring 2021


Job Title: Jr Data Engineer

Travel/Relocation: Must be open to both


Working at Capgemini:

Are you interested in working in a high-tech global consulting firm focused on digital transformations and tech solution implementations? We are looking for a goal oriented compassionate person who can work in a fast-paced environment. Due to our rapid growth we are looking to hire a motivated person who is excited to take the next steps with Capgemini.

About Capgemini



Capgemini is a global leader in consulting, digital transformation, technology and engineering services. The Group is at the forefront of innovation to address the entire breadth of clients’ opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50-year+ heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. Today, it is a multicultural company of 270,000 team members in almost 50 co

Job Description

You will be working with Capgemini’s Banking and Insurance clients for developing, testing and maintaining business applications after initial 4-week training program. During the training program, you will be trained to enhance your relevant programming, delivery and consultative skills to be successful in your role. During your tenure, you’ll be working on our client projects in financial services industry on challenging projects that have the potential to change the way we trade. You’ll work closely with a mentor who will provide guidance, growth opportunities, and genuine feedback. We provide an ideal opportunity for bright engineering-savvy minds to put their talents to the test in real-world applications

Job Responsibilities

  • Design, develop, and automate data pipelines in agile environment leveraging commercial and open source technologies such as Cloud offerings, Hadoop, Databricks, Spark, NoSQL, and In-memory Data Grids
  • Collaborate with senior data engineers, be self-driven, and able to work independently and in teams 
  • Develop functional and technical specifications from business requirement
  • Perform unit and integration testing, work closely with testing teams to identify root causes of bugs and provide fixes
  • Ensure application quality and adherence to performance requirements
  • Provide practical and creative data engineering solutions
  • Participate in code review process


  • Bachelor’s degree in Computer Science, Computer Engineering, Information Systems, Data Science or related field
  • Experience working with and developing big data solutions
  • Exposure to cloud technologies
  • Hands-on experience on writing shell scripts, complex SQL queries, Hadoop commands, and Git
  • Ability to write abstracted, reusable code components
  • Programming experience in at least two of the following languages: Scala, Java, C/C++, or Python
  • Performance tuning experience
  • Experience in developing Hive, Sqoop, Spark, Kafka, HBase on Hadoop and cloud technologies
  • Familiarity with ETL tools like Informatica, Ab Initio, Hortonworks, Zookeeper, and Oozie is a plus
  • Willingness to learn new technologies quickly
  • Excellent verbal and written communication skills, as well as the willingness to collaborate across teams of internal and external technical staff, business analysts, software support and operations staff.
  • US Citizen or Permanent Resident



Posted on:

April 6, 2021

Experience level:

Experienced (non-manager)

Contract type:

Permanent Full Time (us-en)

Business units:

FS (us-en)


Financial Services