top of page
Here I blog on all topics related to Big Data and Data Science. Articles could be of types: Executive Summaries, Tools analysis, Tool Comparisons, Architectural Patterns, Introductions to complex topics, and 'How to' or Tutorial types that share code snippets.
Search

Decision Trees - An Introduction
Decision trees are an algorithm class that form the foundation for Random Forests, a class of algorithms that is extensively used in...
Sai Geetha M N
Jul 23, 20224 min read
279 views
0 comments


K-Means Clustering: Part 3 of 3
Theoretically and mathematically, we have understood a great deal about K-Means Clustering through Part 1 and Part 2 of this series. If...
Sai Geetha M N
Oct 8, 20216 min read
59 views
0 comments

K-Means Clustering: Part 2 of 3
Last week, we looked at the basic understanding of how K-Means Clustering works through the 5-step process where the two steps of...
Sai Geetha M N
Oct 1, 20214 min read
40 views
0 comments

K-Means Clustering: Part 1 of 3
Having looked at Clustering in general and also having heard that K-Means is one of the simplest and most popular clustering algorithms,...
Sai Geetha M N
Sep 23, 20214 min read
99 views
0 comments

Feature Selection in Machine Learning
Selecting the right features that contribute to your model is an art and a science. I call it art because much pain can be saved if you...
Sai Geetha M N
Jul 19, 20217 min read
904 views
0 comments

HBase Architecture
We looked at the basics of HBase in the previous article, last week. Today we will understand the Architecture of HBase. We all agree...
Sai Geetha M N
Jun 10, 20217 min read
560 views
0 comments

HBase Fundamentals
HBase is a NoSQL DB that uses some capabilities of the Hadoop ecosystem to provide its features. NoSQL DBs (a.k.a Not Only SQL) are...
Sai Geetha M N
Jun 3, 20219 min read
1,460 views
2 comments

MultiCollinearity
Multicollinearity is a concept relevant to all the input data that is used in a Machine learning Algorithm. This has to be understood...
Sai Geetha M N
May 27, 20215 min read
196 views
0 comments

Feature Scaling and its Importance
Feature Scaling is a very important aspect of data preparation for many Machine Learning Algorithms. Let us understand what is feature...
Sai Geetha M N
May 13, 20215 min read
98 views
1 comment

Regression Algorithms
#ExecutiveSummary #MLModels What is Regression? Regression is a statistical model/method used to determine the strength and character of...
Sai Geetha M N
Apr 8, 20212 min read
109 views
0 comments

Machine Learning Algorithms Categories
Machine Learning Algorithms learn from data as humans learn from experience. But the type of learning and the goal varies from algorithm...
Sai Geetha M N
Mar 16, 20214 min read
358 views
0 comments

The Machine Learning Landscape
If you are looking to start learning about the basics of Machine learning, you are at the right place. My blog will cover overviews of...
Sai Geetha M N
Mar 10, 20213 min read
612 views
3 comments
bottom of page