top of page
Here I blog on all topics related to Big Data and Data Science. Articles could be of types: Executive Summaries, Tools analysis, Tool Comparisons, Architectural Patterns, Introductions to complex topics, and 'How to' or Tutorial types that share code snippets.
Search
Sai Geetha M N
Nov 7, 20225 min read
Decision Trees through an Example
We have so far seen what decision trees are, why we need them, what are certain measures that help in creating a decision tree and how...
286 views0 comments
Sai Geetha M N
Sep 17, 20224 min read
Decision Trees - Feature Selection for a Split
In the previous two articles "Decision Trees- How to decide the split?" and "Decision Trees - Homogeneity Measures", I have laid the...
896 views1 comment
Sai Geetha M N
Sep 4, 20224 min read
Decision Trees - Homogeneity Measures
Having had an introduction to what is homogeneity and what are the 3 basic types of measures that can be used in the previous article on...
996 views0 comments
Sai Geetha M N
Aug 16, 20223 min read
Decision Trees - How to decide the split?
In the introduction to Decision trees, we have seen that the whole process is to keep splitting one node into two based on certain...
139 views0 comments
Sai Geetha M N
Jul 30, 20222 min read
Why Decision Trees?
As we saw in the last article introducing Decision Trees, decision trees can be used for classification or regression. But the same can...
114 views0 comments
Sai Geetha M N
Jul 23, 20224 min read
Decision Trees - An Introduction
Decision trees are an algorithm class that form the foundation for Random Forests, a class of algorithms that is extensively used in...
276 views0 comments
Sai Geetha M N
Jan 23, 20224 min read
Hierarchical Clustering Through an Example
I have taken a problem statement of an NGO wanting to find the top 5-10 countries from a list of 169 who are in dire need of aid, in the...
673 views0 comments
Sai Geetha M N
Jan 16, 20223 min read
Hierarchical Clustering - Types of Linkages
We have seen in the previous post about Hierarchical Clustering, when it is used and why. We glossed over the criteria for creating...
3,836 views0 comments
Sai Geetha M N
Nov 26, 20215 min read
Hierarchical Clustering: A Deep Dive
In the last five blog posts, I have discussed the basics of Clustering and then, K-Means clustering in detail. In my "Introduction to...
206 views0 comments
Sai Geetha M N
Nov 4, 202110 min read
K-Means Clustering through An Example
Now that we have understood the basics of K-Means Clustering, let us dive a little deeper today. Let us look at one practical problem and...
324 views0 comments
Sai Geetha M N
Oct 21, 20214 min read
Steps towards Data Science or Machine Learning Models
Having completed the basics of K-Means clustering in the last 3 weeks, I was tempted to take you through an example problem through code....
120 views0 comments
Sai Geetha M N
Oct 8, 20216 min read
K-Means Clustering: Part 3 of 3
Theoretically and mathematically, we have understood a great deal about K-Means Clustering through Part 1 and Part 2 of this series. If...
57 views0 comments
Sai Geetha M N
Oct 1, 20214 min read
K-Means Clustering: Part 2 of 3
Last week, we looked at the basic understanding of how K-Means Clustering works through the 5-step process where the two steps of...
40 views0 comments
Sai Geetha M N
Sep 23, 20214 min read
K-Means Clustering: Part 1 of 3
Having looked at Clustering in general and also having heard that K-Means is one of the simplest and most popular clustering algorithms,...
98 views0 comments
Sai Geetha M N
Sep 9, 20215 min read
When can you use Linear Regression?
It's been a while since my last post, as I was caught up with a couple of talking engagements - one at a university for engineering...
208 views1 comment
Sai Geetha M N
Aug 3, 20214 min read
Prediction Vs Forecasting in Supervised Learning
In supervised learning and especially in the context of Linear regression, we often use these two terms: Prediction and Forecast. We also...
90 views0 comments
Sai Geetha M N
Jul 19, 20217 min read
Feature Selection in Machine Learning
Selecting the right features that contribute to your model is an art and a science. I call it art because much pain can be saved if you...
828 views0 comments
Sai Geetha M N
Jul 8, 20218 min read
Machine Learning - Rendezvous Architecture
The Rendezvous architecture proposed by Ted Dunning and Ellen Friedman in their book on Machine Learning Logistics was a wonderful...
513 views0 comments
Sai Geetha M N
Jul 1, 20217 min read
Big Data Architecture for Machine Learning
Machine Learning by itself is a branch of Artificial Intelligence that has a large variety of algorithms and applications. One of my...
438 views0 comments
Sai Geetha M N
May 27, 20215 min read
MultiCollinearity
Multicollinearity is a concept relevant to all the input data that is used in a Machine learning Algorithm. This has to be understood...
191 views0 comments
bottom of page