Filed under IT – random_reading

random reading

Deduplication: how does it work? It's like saving a letterhead only once and reusing it for multiple letters.

IPython Notebook: a bit like a spreadsheet, except that you write code in the "cells". A cell marked In [n] holds input and Out[n] shows the corresponding output. You can write math formulas and import libraries. The output appears in the next "cell". You can save and print the whole page. Good …
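The letterhead analogy for deduplication can be sketched in a few lines of Python. This is only an illustration, not how any particular storage product does it: the `DedupStore` class, the choice of SHA-256, and the sample letters are all made up for the example.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store (hypothetical, for illustration only)."""

    def __init__(self):
        self.blocks = {}  # digest -> bytes; each unique block is kept once
        self.files = {}   # file name -> list of digests that make up the file

    def put(self, name, parts):
        refs = []
        for part in parts:
            digest = hashlib.sha256(part).hexdigest()
            # Store the block only if this content has not been seen before.
            self.blocks.setdefault(digest, part)
            refs.append(digest)
        self.files[name] = refs

    def get(self, name):
        # Rebuild the file by following its references to the stored blocks.
        return b"".join(self.blocks[d] for d in self.files[name])


letterhead = b"ACME Corp, 1 Main St\n"  # assumed sample data
store = DedupStore()
store.put("letter1", [letterhead, b"Dear Bart,\n"])
store.put("letter2", [letterhead, b"Dear Lisa,\n"])

# Four parts were written, but the shared letterhead is stored only once.
print(len(store.blocks))
```

Both letters still read back in full through `store.get(...)`; only the storage of the repeated part is shared.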

MapReduce basics

MapReduce

invalid user 'bart'
invalid user 'bart'
invalid user 'lisa'
invalid user 'dave'
user 'admin' logs in successfully

--> map: 1 1 1
--> reduce: 3

Used on large data sets, for example analysing server logs, spam filtering, etc.

Hadoop, a Java framework.

cd /tmp/hadoop-1.0.4
bin/hadoop …
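The map and reduce steps can be tried out in plain Python before touching Hadoop. This is a rough single-machine sketch, not Hadoop's API: the function names `map_phase`, `shuffle` and `reduce_phase` are invented here, and this variation counts failed logins per user rather than producing a single total.

```python
import re
from collections import defaultdict

# Sample log lines, assumed to look like the ones in the post.
LOG_LINES = [
    "invalid user 'bart'",
    "invalid user 'bart'",
    "invalid user 'lisa'",
    "invalid user 'dave'",
    "user 'admin' logs in successfully",
]


def map_phase(line):
    """Emit (user, 1) for every failed login; ignore all other lines."""
    match = re.match(r"invalid\s+user\s+'(\w+)'", line)
    if match:
        yield (match.group(1), 1)


def shuffle(pairs):
    """Group the emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Sum the 1s collected for one user."""
    return key, sum(values)


pairs = [pair for line in LOG_LINES for pair in map_phase(line)]
counts = dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
print(counts)  # {'bart': 2, 'lisa': 1, 'dave': 1}
```

On a real cluster the map calls run in parallel on different chunks of the log, and the framework does the grouping step itself; only the map and reduce functions are written by hand.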