Project Description

Diploma in Data Science

Course ID – GKA-070

Data Science certification course is an extremely popular, in-demand profession which requires a professional to possess sound knowledge of analysing data in all dimensions and uncover the unseen truth coupled with logic and domain knowledge to impact the top-line (increase business) and bottom-line (increase revenue). Also, Google Trends shows an upward trajectory with an exponential increase in the volume of searches like never seen before. This is proof enough to back the statements made by Harvard Business Review and the business research giants, that Business Analytics will be the most sought-after profession the world has ever witnessed.
Data Science / Analytics is creating myriad jobs in all the domains across the globe. Business organizations realised the value of analysing historical data in order to make informed decisions and improve their businesses.

“Steinbeis University, Berlin Accreditation, “It is continuing faithfulness in the founding and development of this institution and outstanding competence, and excellence in deliveries”. Steinbeis University Berlin works as a knowledge and technology transfer partner to companies in trade and industry. Its core services encompass: research and development, consulting, and – as a basis for all of this – education. Founded in 1998, the Steinbeis University Berlin (German abbreviation: SHB) offers executive degrees and employee training and development programs matched to the needs of ‘knowledge and technology
transfer’ – that is ideally suited to the requirements of modern, knowledge based society. Steinbeis University aims to increase competitiveness among the students and the companies beyond the pure knowledge transfer and application. Steinbeis accreditation recognizes ExcelR’s excellence in areas such as curriculum, faculty qualifications, support services, institutional effectiveness, planning and learning resources and student learning outcomes.

ExcelR’s Diploma Program in Data Science curriculum is meticulously designed and delivered matching the industry needs and considered to be the best in the industry.

All our trainers have extensive experience as Data Scientists in leading Multinational companies and have passion for teaching and considered to be the best in the industry. At ExceIR, we hand pick the trainers subsequent to a thorough evaluation of knowledge, presentation skills, experience and passion for training. No wonder that our trainers are the best in the industry. Participants can be rest assured about the real-life practical exposure along-side with the theory.

ExceIR is an initiative from alumni of IIM & IIT with branches across India ,USA, Malaysia and spreading its footprint across the globe. ExceIR has been awarded as Deloitte’s fastest 50 technological companies for the year 2018, and proud business partner of Tata Consultancy Services (TCS) in Analytics space. We are a trusted training delivery partner of 350+ corporate clients and universities across the globe with 140,000+ professionals trained across various courses.

Enquire Today

Diploma in Data Science

Course ID – GKA-070

Pre Requisites

Graduates from any stream with good logical, mathematical and analogical skills. Working Professional from any domain, who has good logical, mathematical and analytical skills


Introduction to core python Programming

  • Overview of Python-Starting with Python
  • Why Python for data science?
  • Anaconda vs. python
  • Introduction to installation of Python and Packages
  • Introduction to Python Editors & IDE’s(Jupyter/Ipython)
  • Understand Jupyter notebook & Customize
  • Settings;Data Types & Data objects/structures (strings, Tuples, Lists, Dictionaries)
  • List and Dictionary Comprehensions
  • Debugging & Code profiling;
  • Built-in Functions (Text, numeric, date, utility functions);
  • User defined functions – Lambda functions

Data Science Project Lifecycle

  • Introduction to Types of analytics
  • project life cycle
  • Introduction to R Programming
  • Overview of R – Starting with R
  • Installation R and R studio
  • Data Types & Data structures
  • Data Importing and Exporting

Basic Statistics

  • Data Types, Measure Of central tendency, Sampling Funnel
  • Python DS libraries Pandas, Numpy, Scikit, matplotlib);
  • Measures of Dispersion, Expected Value
  • R coding; Random Variable, Probability, Probability Distribution (Normal and Logistic)
  • Graphical Techniques (Bar, Boxplot and histogram etc)
  • Skewness & Kurtosis
  • Sampling Variation

Interferential Statistics

  • CLT, Confidence interval
  • R coding; Introduction to concept with examples( 2 proportion test, 2 t sample t test)
  • Python DS coding concepts and challenge
  • Anova and Chisquare case studies

Linear Regression

  • Scatter Diagram, Corr Analysis, Principles of Regression
  • Python DS coding challenges
  • Intro to Simple Linear Regression
  • Python DS coding challenges
  • Multiple Linear Regression

Logistic Regression

  • Principles of Logistic regression
  • Python DS coding challenges
  • Multiple Logistic Regression, ROC curve, Gain chart, Chisquare theory hands on Python DS coding challenges

Data Mining – Unsupervised

  • Clustering – Hierarchical
  • Python DS coding challenges
  • Clustering – Kmeans
  • Unsupervised – Network Analytics(update the code in better way)
  • Python DS coding challenges
  • Association Rules
  • Python DS coding challenges
  • Recommender System

Dimension Reduction

  • PCA

Text Mining

  • Introduction to Text Mining and applications Python DS coding challenges
  • Vector Space Method (VSM)[Text processing, TDM and Weights]
  • Python DS coding challenges
  • Word clouds and LDA
  • Python DS coding challenges
  • Latent Semantic Analysis (LSA)
  • NLP introduction, NER and Emotion mining


  • Chatbot introduction, types bots and demo


  • Naïve Bayes
  • KNN
  • Decision Tree
  • Random Forest
  • Bagging, boosting and stocking
  • XGBM
  • LGBM
  • SVM


  • Lasso and Ridge Regressions


  • Introduction to Timeseries, Level, Trend and Seasonality, strategy (Python DS coding challenges)
  • Scatter plot, Lag plot, ACF, Principles of Visualization, Naïve forecasts (Introduction to R shiny (deployment))
  • Forecast in Error and it metrics, Model Based Approaches (Introduction to Python flask (deployment))
  • Model Based approach cont,
  • AR Model for errors Data driven approaches, MA and exp Smoothing

Survival Analysis

  • Concept with a business case

Enquire Today

Diploma in Data Science

Course ID – GKA-070

Course Duration

  • 90 Hours

Examination Pattern


  • 60 Multiple Choice Questions


  • DS (70%), other topics (30%)

Time in minutes

  • 90


  • Online – Computer Based(Web proctoring)

Pass Percentage

  • 60%

Number of attempts allowed

  • 2

When will be the examination

  • Every 1st / 3rd Sunday of a month


  • 80% attendance

Mock tests

  • 2

Enquire Today

Diploma in Data Science

Course ID – GKA-070

Program Brochure


Enquire Today


Become Part of Global Knowledge Alliance to Further Your Career.