Author: Scott King

  • Book Review: Exploring Data with Rapidminer

    Synopsis: Data preparation and visualization within a popular software toolDifficulty: Beginner Most books about a particular software tool will focus on the core capabilities of that tool, giving only cursory (if any) coverage about the housekeeping or preparatory functions. For example, most books I’ve read that discuss RapidMiner (and there aren’t many) spend 90% of…

  • Book Review: Introduction to Data Science by Jeffrey Stanton

    Synopsis: An explanation of common data science tasks like text analysis and regressions, coupled with an intro to the R programming language.Difficulty: Beginner There must be a hundred resources online called “Introduction to Data Science”, but let’s face it–most of them are crap. I included Dr. Stanton’s name in the title to make it easy for you…

  • Introduction to RapidMiner Part 2

    Now that we have RapidMiner downloaded and installed, it’s time to import a dataset so we can begin to examine it. Hopefully the quick overview of the GUI in our last post has given you enough to be able to navigate with some degree of effectiveness, but I’ll go over it a little more this…

  • Book Review: Big Data – Understanding How Data Powers Big Business

    Synopsis: A guide for managers to plan for incorporation of Big Data and Analytics in their company. Difficulty: Not really applicable. After reading through “Big Data”, I had to go back and read it again. Not because it was confusing or poorly laid out, but because I had a hard time understanding how author Bill Schmarzo managed…

  • Introduction to RapidMiner Part 1

    RapidMiner is widely used and has a free version that is ideal for learning (and even useful enough to solve some real problems day-to-day). During this three-part tutorial series we’ll download the software, load data into it, and learn how to do simple customer segmentation. Let’s go To get the free version of RapidMiner, go…