Summary
Overview
Work History
Education
Skills
Certification
Interests
Timeline
Generic
Shubham Keshri

Shubham Keshri

Principal Data Engineer
Vallikatu 11 A 2, Espoo Finland

Summary

An experienced, proactive and fast-learning Data professional with strong education background in Data Engineering. Solid solution design and logic building skills. Abilities in leading agile teams, working independent and as a team-player, self-driven and motivated.

Overview

10
10
years of professional experience
7
7
years of post-secondary education
2
2
Certifications

Work History

Etlia Oy

Principal Data Engineer
04.2022 - Current

Capgemini Oy

Principal Data Engineer
11.2021 - 04.2022

TietoEvry Oy, Finland

Senior Data Engineer
09.2019 - 10.2021

Solita Oy

Data Engineer
12.2018 - 08.2019

Affecto (Now CGI)

Consultant
08.2017 - 11.2018

Wärtsilä

Master's Thesis / Trainee
05.2016 - 07.2017

Sears Holdings

Software Developer
06.2014 - 07.2015

Project Assigments

Customers

Principal Data Engineer

Basware
12.2023 - Current

Role / Project Description

Principal Data Engineer

Created IaC code using terraform to deploy new dev/prod environment from scratch, migrated data from Synapse and Storage account, setup CI/CD for Azure Data Factory, Azure Databricks and Synapse using DevOps.

Technology Stack : Terraform, Azure Synapse Analytics, Azure Data Factory, Azure Databricks, Azure DevOps

Customer Benefits: Customer had only one environment for both dev and prod. Now there is a process established to handle all infra changes using terraform (IAC, RBAC) and CI/CD for data pipeline development using databricks/adf

Contact person at the customer: Manager, R&D Platform

Senior Data Engineer / Architect

Microsoft
08.2022 - 12.2023

Role / Project Description

Senior Data Engineer / Architect

Responsible for migrating (Retail Demo Experience reporting solution) legacy Cosmos + SQL server + SSAS solution to Azure Synapse using Deltalake. Technical hands-on work started with POC and deployed to production. Established CI/CD for ADF, ADB

Technology Stack : Azure Synapse Analytics, Azure Data Factory, PySpark, SparkSQL, Azure Analysis Services, Azure DevOps

Customer Benefits: Modernize their legacy solution from on-premise to Azure and reduced cost

Contact person at the customer: Director, Data and Analytics Platform

Volvo

Principal Data Engineer
11.2021 - 04.2022

Role / Project Description

Principal Data Engineer

Azure Data Platform

Migration from Financial Data Platform to Centralized Data Plaform

Assessment, Architecture design, Project planning, estimations

Technology Stack : Azure Databricks, PySpark, Azure Storage Account, Azure Data Factory, SQL DW, Azure DevOps, Github, JIRA, Scrum

Customer Benefits: Assessment for Risks, Strategy and estimation for the migration

Contact person at the customer: Product Owner

ABB

Data Architect
06.2021 - 10.2021

Role / Project Description

Lead Data Architect

Azure Data Platform

Solution Designing - Azure Data Platform

Pre-sales, Architecture design, Project planning, estimations

Technology Stack : Azure Databricks, PySpark, Azure Storage Account, Azure Data Factory, SQL DW, Azure DevOps, Github, JIRA, Scrum

Customer Benefits: Understanding of modern tools in Azure, project plan, design.

Contact person at the customer: Data Platform and Technology Manager

Valmet

Senior Data Engineer
08.2021 - 10.2021

Role / Project Description

Senior Data Engineer

Azure Data Platform

Design and implement DevOps practices in Azure Data Platform development

Technology Stack : Azure Databricks, PySpark, Azure Storage Account, Azure Data Factory, SQL DW, Azure DevOps, Github, JIRA, Scrum

Customer Benefits: Smooth elevation of code to higher environments, Automated pipelines in Azure Datalake

Contact person at the customer: Data Platform and Technology Manager

Kesko

Senior Data Engineer
08.2020 - 04.2021

Role / Project Description

Senior Data Engineer

Common data platform - Azure

Solution Designing and developing Azure Data Platform

Designed and implemented data platform accelerator tool using Azure Databricks and Data factory

Technology Evaluation for modelling tools for Azure Synapse Analytics

Technology Stack : Azure Databricks, PySpark, Azure Storage Account, Azure Data Factory, SQL DW, Azure DevOps, Github, JIRA, Scrum

Customer Benefits: Customer data is modelled and utilised in creating self service reports on Power BI

Contact person at the customer: Data Platform and Technology Manager

Scanfil

Lead Data Engineer
06.2020 - 08.2020

Role / Project Description

Lead Data Engineer

Big Data Platform Azure

Solution Designing and developing Azure Data Platform

Designed and automated ingestion of data from SAP to Azure Datalake using Azure Databricks and Data factory. Created views on top of hive views using databricks database. Helped customer to learn new tools and self develop KPI's

Technology Stack : Azure Databricks, PySpark, Azure Storage Account, Azure Data Factory, SQL DW, Azure DevOps, Github, JIRA, Scrum

Customer Benefits: Customer was able to leverage cloud big data tools to create self service reports. Customer send a video feedback appreciating our efforts in the project

Contact person at the customer: Head of Data & Analytics

Glaston

Senior Data Engineer
09.2019 - 12.2019

Role / Project Description

· Team lead - Leading a team of 5 members

· Analytics project building data lake and data warehouse on Azure ecosystem

· Architecting and developing Data warehouse automation tool

· Build end - end data fuelling data to Business reports

Technology Stack : Data Vault 2.0 (Modelling), Azure Storage Account, Azure Data Factory, SQL DW, PowerBI, JIRA, Scrum, Python

Customer Benefits: Customer data is modelled and utilised in creating self service reports on Power BI

Contact person at the customer: Manager, ICT Applications

Stockmann

Data Engineer
12.2018 - 08.2019

Role / Project Description

· Data Engineer

· Analytics project building data lake and data wareshouse on AWS ecosystem

· Building DevOps data pipelines using python / ansible scripts to automate BI solutions

· Build end - end data fuelling data to Business reports

Technology Stack : AWS S3, Python, AWS Redshift, Github, Ansible, Jenkins, Rundeck, Qlikview, Superset, Linux, EC2, JIRA, Scrum

Customer Benefits: Business is able to make better decisions based on automated reports

Contact person at the customer: Product Owner

Valtiokonttori

Data Virtualization Consultant
07.2018 - 10.2018

Role / Project Description

· Project Lead

· Setting up architecture and environment for Data virtualization POC. Requirement analysis and Pre-sales

· Presented Demo, Solution Design, implementation of Denodo and created reports using Power BI

Technology Stack: Denodo 7.0, MS SQL Server, AWS Infrastructure, Azure Infrastructure, Load Balancer, Share Point

Customer Benefits: Customer finds value in the solution.

Contact person at the customer: Process Manager BI & Analytics, Solution Manager

VR Group

Informatica Consultant
08.2017 - 07.2018

Role / Project Description

· Architect / Developer

· ETL solution design and development

· Informatica Platform administration

· Providing alternatives to ETL tools and cloud migration

Technology Stack: Informatica Power Center 10.x, Informatica Cloud, Service Now, SQL Server, Oracle DW, Azure SQL server

Customer Benefits: ETL implementation to automate HR solution, improved report reliability, cost effective integration solution alternatives

Contact person at the customer: Integration Owner, Head of Data and Analytics

Konecranes

Cloud Data Virtualization consultant
08.2017 - 06.2018

Role / Project Description

· Data Virtualization Consultant

· Setting up architecture and environment for Denodo (Production and non-prod), RFP, requirement analysis, installation.

· Project started with POC for Data Virtualization. It realized into project for setting up high Availability clustered environment.

Technology Stack : Denodo 7.0, MS SQL Server, AWS Infrastructure, Azure Infrastructure, Load Balance, JIRA, Share point

Customer Benefits: Customer is able to utilize Data Virtualization tool for GDPR project and Single view of customer project.

Contact person at the customer: Enterprise Data Architect, Head of MDM

Wärtsilä

Master's Thesis
05.2016 - 07.2017

Role / Project Description

· Master's Thesis / Trainee

· Leading Business reporting team of 5 members

· Research, Development, Solution Design, Business analysis

· Integrating Product Data Management (PDM) tool (Teamcenter) with EDW for Business Reporting

Technology Stack: Informatica Power Center 9.x, SQL Server, Teamcenter, Oracle Exadata DW, Docker, Talend, Web Service, Trello, MSBI, TCL, JAVA

Customer Benefits: Improved integration by reducing integration time from 7 hours to 15 minutes

Contact person at the customer: Enterprise Architect, Solution Architect, Solution Manager

Sears Holdings

Software Developer
05.2014 - 06.2015

Role / Project Description

· Software Developer

· Loading data from database shards into HDFS automating it using Scoop. Creating Hive queries for predictive analytics for Sears products

Technology Stack: Hadoop, Hive, Scoop, MYSQL, Shell Scripts, API

Customer Benefits: Predictive analytics for shopping products for next year

Contact person at the customer: Team Lead, Group Manager

Education

Tampere University of Technology, Finland -

Master's in Information Technology
01.2015 - 04.2017

Siliguri Institute of Technology, India -

Bachelor's in Information Technology
01.2010 - 04.2014

Skills

    Azure Databricks

undefined

Certification

Microsoft Certified: Azure Data Engineer Associate (renewed from 2019)

Interests

To maintain energy, focus and motivation, I participate in organising events for Suomi-Intia Seura Spending time with family, long drive, photography ,cooking, swimming, and following new technology trends are my favorite hobbies

Timeline

Principal Data Engineer

Basware
12.2023 - Current

Microsoft Certified: Azure Data Engineer Associate (renewed from 2019)

06-2023

Senior Data Engineer / Architect

Microsoft
08.2022 - 12.2023

Etlia Oy

Principal Data Engineer
04.2022 - Current

Volvo

Principal Data Engineer
11.2021 - 04.2022

Capgemini Oy

Principal Data Engineer
11.2021 - 04.2022

Valmet

Senior Data Engineer
08.2021 - 10.2021

ABB

Data Architect
06.2021 - 10.2021

Kesko

Senior Data Engineer
08.2020 - 04.2021

Certified Data Vault 2.0 Practitioner

07-2020

Scanfil

Lead Data Engineer
06.2020 - 08.2020

TietoEvry Oy, Finland

Senior Data Engineer
09.2019 - 10.2021

Glaston

Senior Data Engineer
09.2019 - 12.2019

Stockmann

Data Engineer
12.2018 - 08.2019

Solita Oy

Data Engineer
12.2018 - 08.2019

Valtiokonttori

Data Virtualization Consultant
07.2018 - 10.2018

VR Group

Informatica Consultant
08.2017 - 07.2018

Konecranes

Cloud Data Virtualization consultant
08.2017 - 06.2018

Affecto (Now CGI)

Consultant
08.2017 - 11.2018

Wärtsilä

Master's Thesis
05.2016 - 07.2017

Wärtsilä

Master's Thesis / Trainee
05.2016 - 07.2017

Tampere University of Technology, Finland -

Master's in Information Technology
01.2015 - 04.2017

Sears Holdings

Software Developer
06.2014 - 07.2015

Sears Holdings

Software Developer
05.2014 - 06.2015

Siliguri Institute of Technology, India -

Bachelor's in Information Technology
01.2010 - 04.2014

Project Assigments

Customers
Shubham KeshriPrincipal Data Engineer