top of page
Here I blog on all topics related to Big Data and Data Science. Articles could be of types: Executive Summaries, Tools analysis, Tool Comparisons, Architectural Patterns, Introductions to complex topics, and 'How to' or Tutorial types that share code snippets.
Search

Decision Trees - Homogeneity Measures
Having had an introduction to what is homogeneity and what are the 3 basic types of measures that can be used in the previous article on...
Sai Geetha M N
Sep 4, 20224 min read
1,049 views
0 comments

Decision Trees - How to decide the split?
In the introduction to Decision trees, we have seen that the whole process is to keep splitting one node into two based on certain...
Sai Geetha M N
Aug 16, 20223 min read
140 views
0 comments

Introduction to Clustering Algorithms
As mentioned in one of my introductory articles on "Machine Learning Algorithms Categories", Clustering Algorithms are a set of...
Sai Geetha M N
Sep 16, 20216 min read
170 views
0 comments

Feature Selection in Machine Learning
Selecting the right features that contribute to your model is an art and a science. I call it art because much pain can be saved if you...
Sai Geetha M N
Jul 19, 20217 min read
897 views
0 comments

Data Scientists, Data Engineers, ML Engineers And More - Demystified
As the world of Big Data, Machine Learning and Artificial Intelligence is taking off, there is an overlap of roles and responsibilities...
Sai Geetha M N
Jun 25, 20216 min read
192 views
0 comments

MultiCollinearity
Multicollinearity is a concept relevant to all the input data that is used in a Machine learning Algorithm. This has to be understood...
Sai Geetha M N
May 27, 20215 min read
196 views
0 comments

Feature Scaling and its Importance
Feature Scaling is a very important aspect of data preparation for many Machine Learning Algorithms. Let us understand what is feature...
Sai Geetha M N
May 13, 20215 min read
98 views
1 comment

Types of Variables - Definition
#Definition There are different characteristics of data that are used for analysis and machine learning. One very fundamental...
Sai Geetha M N
Apr 28, 20212 min read
43 views
0 comments

ACID Vs BASE - A definition
#Definitions ACID is a characteristic of RDBMS databases Atomic: Each task in a transaction succeeds or the entire transaction is rolled...
Sai Geetha M N
Apr 17, 20211 min read
76 views
0 comments
bottom of page