Filed under IT – random_reading

random reading

Deduplication

How does it work? It's like saving a letterhead only once for multiple letters.

IPython / Notebook

Kind of like a spreadsheet, but you can write code in the "cells". [in] marks input and [out] marks output. You can write math formulas and import libraries. The output shows in the next "cell". You can save and print the whole page. Good …
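The letterhead analogy for deduplication can be sketched in a few lines of Python. This is only a toy, assuming each "letter" is already split into blocks (a shared header plus a body); the `DedupStore` class, the letterhead text, and the letter names are all made up for illustration. Identical blocks are recognised by their SHA-256 hash and kept only once.

```python
import hashlib


class DedupStore:
    """Toy block store: identical blocks are kept only once (hypothetical class)."""

    def __init__(self):
        self.blocks = {}  # hash -> block bytes, each unique block stored once
        self.files = {}   # file name -> ordered list of block hashes

    def put(self, name, blocks):
        """Store a file given as a list of byte blocks."""
        hashes = []
        for block in blocks:
            digest = hashlib.sha256(block).hexdigest()
            # setdefault keeps the existing copy if this block was seen before
            self.blocks.setdefault(digest, block)
            hashes.append(digest)
        self.files[name] = hashes

    def get(self, name):
        """Rebuild a file from its block hashes."""
        return b"".join(self.blocks[h] for h in self.files[name])

    def stored_bytes(self):
        """Bytes actually kept, counting each unique block once."""
        return sum(len(b) for b in self.blocks.values())


# Made-up sample data: one shared letterhead, two different letter bodies.
LETTERHEAD = b"Example Corp, 1 Main Street\n"
BODY_1 = b"Dear Bart, please stop guessing passwords.\n"
BODY_2 = b"Dear Lisa, thank you for your report.\n"

store = DedupStore()
store.put("letter1", [LETTERHEAD, BODY_1])
store.put("letter2", [LETTERHEAD, BODY_2])

logical_bytes = len(LETTERHEAD + BODY_1) + len(LETTERHEAD + BODY_2)
print("logical size:", logical_bytes, "stored size:", store.stored_bytes())
```

Both letters can still be read back in full, but the letterhead block exists only once in `store.blocks`. Real deduplication systems work on disk blocks or variable-size chunks rather than hand-split pieces.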

MapReduce basics

MapReduce

    dsl5.example.com: invalid user 'bart'
    dsl5.example.com: invalid user 'bart'
    dsl5.example.com: invalid user 'lisa'
    dsl5.example.com: invalid user 'dave'
    dsl5.example.com: user 'admin' logs in successfully

---> map

    dsl5.example.com: 1
    dsl5.example.com: 1
    dsl5.example.com: 1
    dsl5.example.com: 1

---> reduce

    dsl5.example.com: 4

Map emits a 1 for every invalid-login line; reduce adds them up per host. Used on large data sets, e.g. analysing server logs, spam filtering, etc.

Hadoop, a Java framework.

    cd /tmp/hadoop-1.0.4
    bin/hadoop …
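The map and reduce steps on the sample log can be imitated in plain Python, without Hadoop. This is a minimal single-machine sketch; the function names (`map_line`, `shuffle`, `reduce_counts`, `run`) are invented here and are not part of any Hadoop API. It counts every invalid-user line, so the repeated 'bart' attempt is counted twice.

```python
from collections import defaultdict

LOG_LINES = [
    "dsl5.example.com: invalid user 'bart'",
    "dsl5.example.com: invalid user 'bart'",
    "dsl5.example.com: invalid user 'lisa'",
    "dsl5.example.com: invalid user 'dave'",
    "dsl5.example.com: user 'admin' logs in successfully",
]


def map_line(line):
    """Map step: emit (host, 1) for each invalid-login line, nothing otherwise."""
    host, _, message = line.partition(":")
    # split() tolerates extra spaces between the words of the message
    if message.split()[:2] == ["invalid", "user"]:
        yield host.strip(), 1


def shuffle(pairs):
    """Group the emitted values by key, as the framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: sum the 1s collected for one host."""
    return key, sum(values)


def run(lines):
    """Run map, shuffle, and reduce over a list of log lines."""
    pairs = [pair for line in lines for pair in map_line(line)]
    return dict(reduce_counts(k, vs) for k, vs in shuffle(pairs).items())


result = run(LOG_LINES)
print(result)
```

In a real cluster the map calls run in parallel on different chunks of the log, and the framework does the grouping before handing each key to a reducer.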