top of page
Here I blog on all topics related to Big Data and Data Science. Articles could be of types: Executive Summaries, Tools analysis, Tool Comparisons, Architectural Patterns, Introductions to complex topics, and 'How to' or Tutorial types that share code snippets.
Search
Sai Geetha M N
Jul 8, 20218 min read
Machine Learning - Rendezvous Architecture
The Rendezvous architecture proposed by Ted Dunning and Ellen Friedman in their book on Machine Learning Logistics was a wonderful...
513 views0 comments
Sai Geetha M N
Jul 1, 20217 min read
Big Data Architecture for Machine Learning
Machine Learning by itself is a branch of Artificial Intelligence that has a large variety of algorithms and applications. One of my...
438 views0 comments
Sai Geetha M N
Jun 25, 20216 min read
Data Scientists, Data Engineers, ML Engineers And More - Demystified
As the world of Big Data, Machine Learning and Artificial Intelligence is taking off, there is an overlap of roles and responsibilities...
187 views0 comments
Sai Geetha M N
Jun 18, 202113 min read
HBase Design - Guidelines & Best Practices
We have looked at HBase Fundamentals and HBase Architecture in the last two weeks. Today I will look at a few best practices and...
1,270 views1 comment
Sai Geetha M N
Jun 10, 20217 min read
HBase Architecture
We looked at the basics of HBase in the previous article, last week. Today we will understand the Architecture of HBase. We all agree...
519 views0 comments
Sai Geetha M N
Jun 3, 20219 min read
HBase Fundamentals
HBase is a NoSQL DB that uses some capabilities of the Hadoop ecosystem to provide its features. NoSQL DBs (a.k.a Not Only SQL) are...
1,408 views2 comments
Sai Geetha M N
Apr 22, 20219 min read
Data Validation - During Ingestion into Data Lake
Any enterprise that wants to harness the power of data, almost always begins with building a data lake. By definition, a data lake is a...
3,357 views6 comments
Sai Geetha M N
Apr 17, 20211 min read
ACID Vs BASE - A definition
#Definitions ACID is a characteristic of RDBMS databases Atomic: Each task in a transaction succeeds or the entire transaction is rolled...
75 views0 comments
Sai Geetha M N
Apr 16, 20214 min read
Making the Right Database Choice
#ArchitecturalDecision If someone were to ask, should I use SQL or NoSQL database, the obvious answer is "it depends". Depends on what?...
840 views3 comments
Sai Geetha M N
Mar 19, 20214 min read
Hadoop for Analysts - Apache Druid, Apache Kylin and Interactive Query Tools
#ToolComparison #ArchitectureDecision Introduction Traditional Data Warehouses have existed in the industry for quite some time now. They...
674 views3 comments
bottom of page