Insights & Data

Qixuan Yang

Expert in Data analysis, Machine Learning, Research

As a Data Scientist in the AI Garage of Capgemini Invent, I work with technical as well as subject matter experts to develop data-driven solutions and products. I am particularly interested in machine learning, especially its applications in the natural language processing (NLP). Being in-fluent in both Python and R and close to academic discussions, I attempt to identify the best strategies for enterprises to turn internal and external data into valuable analytical assets and distill the signals from noises.

Since I joined Capgemini, I have been working in the banking and the pharmaceutical industry for standardizing the analytical pipeline.

My recent work includes:

– Designing and implementing a reporting tool for a leading German bank, which significantly reduces manual efforts and saves cost
– Functioning as NLP expert in a pharmaceutical company in Germany to build a plausible ML/NLP pipeline

I previously worked in the academic field and had important operating role in algorithmic design of data-driven social science projects, incl. text classification and topic modeling. I am particularly interested in the intersection between social science and statistics.

Previously, I have studied at University of Konstanz and Yale University, and joined research teams and led training and tutorials in both institutions. For instance, I created a tutorial for NLP at Yale University and led a training at its graduate school.

I am fluent in Chinese, English, and German. In his spare time, he is a passionate swimmer and blogger.

Qixuan Yang

My experience

Data Scientist @ Capgemini Invent

Oct 2019 - Present

Research Assistant @ University

Nov 2016 - Aug 2019

Latest Publications


Enabling text-based insights: infrastructure matters

Qixuan Yang

Techniques in data collection and analytics are soft assets that need an adequate pipeline to...


Deriving sharp insights from unstructured texts: analytical approaches

Qixuan Yang

In the previous post on the use of NLP in the public sector, some techniques and use cases...


By continuing to navigate on this website, you accept the use of cookies.

For more information and to change the setting of cookies on your computer, please read our Privacy Policy.


Close cookie information