Collecting and scraping data, data cleansing and transformation. data visualization. building supervised and unsupervised machine learning algorithms, building statistical and predictive models. Big data analysis using Hadoop and mapreduce computing model.