This book is meant to be an introduction to advanced data manipulation in R. The first chapter will deal with R structures, vectors, matrixes, lists, and dataframes. We will explain how to design objects in R and how to use R main functions, such as rearranging a vector or adding columns to a matrix.
The second chapter will cover several topics. We will see how to generate random sequences, how to make a subsetting, use conditional instructions and logical operators and, more in depth, how to deal with the missing values.
Third chapter is about importing data on R using various formats, mainly .csv, but also Excel, .txt, and .sav data
The fourth chapter will introduce data processing and manipulation through some functions and packages, including dplyr and reshape2
In the fifth chapter we will see how to use the data.table package to handle big dataset.
In the sixth chapter, we will see how to manipulate strings and text data using various strings management functions. We will also deal with the use of regular expressions.
The seventh chapter will be about functions - how to write a function and all the conditional operators that make our functions more complex.
Last, in chapter eight, we will quickly take a glance at how to export and reuse a model of analysis on other software using XML and the pmml package.