
Ankit Rathi

Delhi/NCR, India
+91 9891650969
rathi.ankit@gmail.com
Lead Data Architect
linkedin.com/in/ankitrathi
ankitrathi.com

Summary

Seasoned Data Science Architect, Kaggle expert & Data blogger/author with 13 years in data engineering/
architecture and 5 years in data science/machine learning.
Designed & developed many data intensive technology solutions using various tools in data architecture,
data science, big data & cloud.
Translated complex business problems into technology & analytics solutions for data-driven decision
making.
Demonstrated knowledge of the business success drivers, industry trends, regulatory issues and
competitive marketplace.
Keeps up to date with the latest methodologies and technologies in the analytics field.
Participating in Kaggle (Data Science) competitions since 2014; achieved Kaggle Expert level in 2017.
Experienced in leading technical teams and handling technical & business stakeholders.
Familiar with project management aspects (scope, time & budget) of deliverables.
Built analytics teams from scratch by hiring & developing analytics professionals.
Experienced in full SDLC (Requirement Analysis, Estimation, Architecture/Design Specifications,
Coding/Code Reviews, Testing and Deployment) with Agile/Scrum and Waterfall methodologies.
Completed 20+ end-to-end project module implementations with 9 clients/businesses.
B.Tech (Electronics) from Harcourt Butler Technological Institute (HBTI), Kanpur in 2005.
Possesses strong analytical, problem-solving and communication/presentation skills.

Skills Overview

Development: SQL, Python, R, Scala, Keras, TensorFlow


Big Data/Hadoop/Spark: HDFS, YARN/MapReduce, Pig, Hive, HBase, Cassandra
Cloud Computing: AWS, Azure, Google Cloud Platform (GCP)
DevOps (CI/CD): Kubernetes, GoCD
Data Science/AI: Classification/Regression, Clustering/Associative Mining/PCA, NLP, Image/Video Analytics
BI/Reporting: Tableau/PowerBI/QlikView/OBIEE.
Data Governance: Data Quality, Master/Reference Data, Metadata management.
Architecture: Enterprise (Business/Application/Data/Technology) Architecture (TOGAF/ArchiMate2)
Design/Modeling: Database Design (Oracle/Teradata/MySQL), ER/Dimensional Modeling (Erwin/Visio), ETL
Designs (Informatica), Normalization, De-normalization
Functional/Domain: Banking (AML, Mortgages), Health Insurance & ATI

Work experience

Lead Architect Dec '17 - Till now


SITA.aero
Project: Data Science Platform (Dec ‘17– till date)

Project Environment: Python/R/SQL, ML/DL/NLP, Keras/TensorFlow, AWS, Spark (SQL/PySpark/MLlib),
Sqoop/Flume/Kafka, Tableau/PowerBI, DevOps (Kubernetes/GoCD).

Domain: Air Travel Industry (ATI)

Team-size: 11
SITA portfolios/products (iBorder/Baggage/Flight Ops) require built-in intelligence to collect, process &
analyse data so that our customers can take appropriate action in a timely manner. The objective of the
programme is to optimize operations & improve decision-making using cutting-edge technologies like ML &
DL.

Achievements as Data Science Architect:

Built an end-to-end Data Science Platform integrating various data sources into a data lake.
Mentored data engineers & data scientists to build the platform and analytics models.
Designed & developed many ML/DL models using Python & R.
Integrated ML/DL models into existing products & portfolios.
Worked on Flight Prediction, Spoof Image Detection & Smart FlightOps analytics projects.

Principal Consultant Jun '16 - Dec '17


Genpact HCM
Project: Digital Analytics Platform (Jun ‘16– Dec ‘17)

Project Environment: Python/R/Unix, Keras/TensorFlow, Azure, Spark(SQL/PySpark/MLlib), PowerBI.

Domain: Finance & Accounting (F&A), Insurance

Team-size: 16

We started the Digital Analytics Platform to help executives make better data-driven business decisions & to use
Analytics/ML models to automate exceptions previously handled by operators. We studied the current
business processes, identified Analytics/ML opportunities and designed/developed prediction and
recommendation models to provide tangible business benefits.

Achievements as Data Scientist/Data Architect

Built an end-to-end Digital Analytics Platform integrating different data sources into a data lake.
Guided analytics professionals to build the platform and analytics models.
Designed & deployed ML/DL models using Python & R.
Built models for a Case Recommendation System and Dashboard Forecasting.

Technical Lead Nov '10 - Jun '16


RBS IDC
Project: Advanced Analytics - Retail Banking (Jul ’13 – Jun ‘16)

Project Environment: SQL/R/ODM, Cloud (GCP), Unix/Java, Hadoop (HDFS/YARN/HBase/Pig/Hive/
Sqoop/Flume), QlikView.

Domain: Retail Banking (AML/Mortgages)

Team-size: 12

We started the Analytics - Retail Banking programme to build Data Science capability within the bank and explore
ways to process/analyse existing data via Big Data/Cloud, comparing the results with traditional
data processing/analysis before actual migration projects.

Achievements as Data Specialist

Evaluated different Hadoop/ML/Cloud components for data collection, processing, analysis & reporting.
Built analytics platform on cloud using Hadoop by ingesting data from traditional data sources.
Translated complex business problems to analytical solutions.
Mentored team members on Big Data, Cloud & Analytics.
Built models for Customer Profiling, Market Basket Analysis & Anomaly Detection.

Project: MTP (Business/Technology Transformation) (Nov ‘10 – Jul ‘13)

Project Environment: Oracle 11g, UNIX server, Java/J2EE, Informatica, OBIEE, Teradata.

Domain: Mortgages (Retail Banking)

Team-size: 15
MTP stands for Mortgage Transformation Program, which focused on Business Architecture driven
Technology transformation. Under the program, based on the current and aspired Business Architecture, we
transformed the Application/Data/Technology Architecture to consolidate all mortgage systems into one strategic
system and introduced Fee & Products configuration across the Mortgages platform.

Achievements as Database Architect

Built data architecture for business requirements (functional & non-functional).
Derived design (Visio/Erwin) changes in current Data Architecture (OLTP/ETL/OLAP).
Performed Data Analysis/Mining using ODM & R to get in-depth insight.

Sr. Software Engineer Oct '07 - Nov '10


Mastek Ltd
Project: Apollo Munich (Oct ‘07 – Nov ‘10)

Client: Apollo Munich Health Insurance, Gurgaon

Project Environment: Oracle 10g with Report server, UNIX Server, Java/J2EE.

Domain: Health Insurance

Team-size: 8

ApolloMunich application was a policy life-cycle management solution designed by Mastek for Apollo Munich,
which delivers a group-wide customer-centric system to handle membership, finance, reinsurance, claims and
payment processing, with the ability to support multiple products/brands. We built an MIS dashboard for the
business to understand how it was growing.

Achievements as ETL/BI Lead

Designed & built data warehouse (DWH) for business.
Built ETL pipelines from DBs to DWH.
Designed & built MIS reports based on business requirements.

Associate Jul '05 - Oct '07


Perot Systems
Project: LGRS Application (Jul ‘05 – Oct ‘07)

Client: Blue Cross Blue Shield of Rhode Island (BCBSRI)

Project Environment: Oracle 9i (SQL, PL/SQL), VB 6.0, UNIX Server.

Domain: Healthcare

Team-size: 6

The project involved the Rating System of Large Group claims in Healthcare for BCBS. With the help of the
application, BCBSRI evaluated its customers' performance on a monthly basis. The application pulled data from
the data mart using SQL*Loader, and tables were manipulated by PL/SQL modules that maintained
customer information and claims for BCBSRI.

Achievements as DB Developer

Built rating system using SQL queries & PL/SQL programming.
Created MIS reports for business using SQL queries.

Education

Continuous Self-learning 2012 - till now


Books, MOOCs, Blogs
Learning from the MS in Data Science course at Harvard University in 2018.
Attended Deep Learning course by Vincent Vanhoucke on Udacity in 2016.
Learnt Probability & Statistics for deeper understanding of Data Science in 2014.
Attended Machine Learning course by Andrew Ng on Coursera in 2012.
B. Tech Aug '01 - Jun '05
Harcourt Butler Technological Institute (HBTI), Kanpur
Passed with 66% marks.
Attended Vocational Training at HCL Infosystems.
Presented a model ‘Intruder Alarm with Timer’ at Tech-Era (a national-level seminar on recent trends in
electronics technology) in 2003.
Executive Member of Literary Sub-Council in college.

Other Activities/Achievements

Authored 'Probability & Statistics for Data Science' book in 2019
(https://www.goodreads.com/book/show/43693434-probability-statistics-for-data-science).
Achieved ‘Kaggle Expert’ level on Kaggle Data Science platform in 2017.
Attended workshop on ‘ArchiMate2/TOGAF’ in 2016.
Participated in seminar on ‘Data Governance & Architecture’ in 2014.
Attended workshop on ‘Data Analytics in Banking’ in 2012.
Passed Oracle SQL and PL/SQL certification (OCA) with 93% score in 2009.
Passed the MCP (Microsoft Certified Professional) exam for ASP.NET with a 98.4% score in 2005.
Attended Vocational Training at HCL Infosystems on ‘Hardware components troubleshooting and
maintenance’ in 2004.
Presented a model ‘Intruder Alarm with Timer’ at Tech-Era (a national-level seminar on recent trends in
electronics technology) in 2003 held at HBTI, Kanpur.
