Monthly Archives: January 2015

Datascience explained in form of a poster

by awahid on January 25, 2015 in blog

ICRIS (http://www.icris.nl) made a simple poster describing the fundamentals of data science. Click on the following image to see the poster in high resolution. Read more →

Basics of Bigdata

by awahid on January 22, 2015 in blog • 0 Comments

Bigdata is often misunderstood as simply very large data; however, size is just one aspect of bigdata. The term Bigdata refers to data that is too complex for traditional approaches to handle. Bigdata has the following characteristics. Volume – Large amount of data. Velocity – Rapid generation of data. Variability – Inconsistency of the data. Veracity – Quality of… Read more →

Weka or LingPipe for New Data Scientist

by awahid on January 11, 2015 in blog • 0 Comments

I started working with Weka and LingPipe around two years ago. My task was to develop a better clustering algorithm for text data. I initially used Weka to familiarize myself with basic clustering algorithms; however, I found that Weka has more documentation for classification algorithms than for clustering algorithms. I came across the LingPipe framework on the internet and found that their blog provides… Read more →

Clustering Bigdata

by awahid on January 4, 2015 in blog • 0 Comments

Clustering a large amount of data adds complexity and requires specialized clustering algorithms. Common clustering algorithms like k-means are not designed to handle such tasks. Anil K. Jain, a big name in the domain of clustering algorithms, explains this phenomenon in his video lecture (http://videolectures.net/single_jain_bigdata/). He provides a solution, an “approximate k-means algorithm”, which clusters large amounts of data (bigdata). Other researchers like Xiao Cai et.… Read more →

Abdul Wahid

Providing AI/Machine Learning/Data Science/BigData Solutions
