Its all about Data
Sunday, December 15, 2013
Map/Reduce Programming Paradigm Example
›
MapReduce is the heart of Hadoop. It is the programming paradigm that allows for massive scalability across hundreds or thousands of server...
4 comments:
Friday, November 8, 2013
Steps to run Presto on linux m/c
›
Facebook announced on 6th November 2013 that it is committing its Presto low-latency, SQL-compliant query system for Hadoop to open source...
3 comments:
Tuesday, October 29, 2013
Steps to run K-Means Clustering on Apache Mahout
›
For k-means example on Mahout, here i am downloading reuters dataset from : http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.t...
1 comment:
Wednesday, October 9, 2013
Ensemble Learning
›
Ensemble learning is to build a prediction model by combining the strengths of a collection of simpler base models Bagging, Boosting and...
Monday, July 22, 2013
Regression Model Analysis
›
Regression model analysis is done by RMSE(Root Mean Square Error) value. An RMSE of zero, meaning that the estimator predicts observ...
Tuesday, June 11, 2013
Data Preparation and Predictive Modeling Mistakes in M/C Learning and How to Avoid These Mistakes?
›
1) Mistake (Including ID Fields as Predictors) : Because most IDs look like continuous integers (and older IDs are typically smaller), it...
Monday, June 10, 2013
Underfitting/Overfitting Problem in M/C learning
›
Underfitting : If our algorithm works badly with points in our data set, then the algorithm underfitting the data set. It can be check eas...
›
Home
View web version