top of page
Here I blog on all topics related to Big Data and Data Science. Articles could be of types: Executive Summaries, Tools analysis, Tool Comparisons, Architectural Patterns, Introductions to complex topics, and 'How to' or Tutorial types that share code snippets.
Search
Sai Geetha M N
Sep 4, 20224 min read
Decision Trees - Homogeneity Measures
Having had an introduction to what is homogeneity and what are the 3 basic types of measures that can be used in the previous article on...
996 views0 comments
Sai Geetha M N
Aug 16, 20223 min read
Decision Trees - How to decide the split?
In the introduction to Decision trees, we have seen that the whole process is to keep splitting one node into two based on certain...
139 views0 comments
Sai Geetha M N
Sep 16, 20216 min read
Introduction to Clustering Algorithms
As mentioned in one of my introductory articles on "Machine Learning Algorithms Categories", Clustering Algorithms are a set of...
159 views0 comments
Sai Geetha M N
Jul 19, 20217 min read
Feature Selection in Machine Learning
Selecting the right features that contribute to your model is an art and a science. I call it art because much pain can be saved if you...
828 views0 comments
Sai Geetha M N
Jun 25, 20216 min read
Data Scientists, Data Engineers, ML Engineers And More - Demystified
As the world of Big Data, Machine Learning and Artificial Intelligence is taking off, there is an overlap of roles and responsibilities...
187 views0 comments
Sai Geetha M N
May 27, 20215 min read
MultiCollinearity
Multicollinearity is a concept relevant to all the input data that is used in a Machine learning Algorithm. This has to be understood...
191 views0 comments
Sai Geetha M N
May 13, 20215 min read
Feature Scaling and its Importance
Feature Scaling is a very important aspect of data preparation for many Machine Learning Algorithms. Let us understand what is feature...
96 views1 comment
Sai Geetha M N
Apr 28, 20212 min read
Types of Variables - Definition
#Definition There are different characteristics of data that are used for analysis and machine learning. One very fundamental...
42 views0 comments
Sai Geetha M N
Apr 17, 20211 min read
ACID Vs BASE - A definition
#Definitions ACID is a characteristic of RDBMS databases Atomic: Each task in a transaction succeeds or the entire transaction is rolled...
75 views0 comments
bottom of page