From the course: Cleaning Bad Data in R

What is tidy data?

- [Instructor] The goal of this course is to help you use R to transform your data sets into a consistent format known as tidy data. You do this through a process known as data wrangling. Data wrangling is the art of taking messy data and manipulating it into a format that is well-suited for analysis. It goes by many other names. Some people call this work data cleaning, data munging, or data preparation. Whatever name you choose to use, it's important to remember that this is not a one-time task. While it's true that most data projects will involve a lot of data wrangling up front, data wrangling is a continuous process. And as you encounter new data sets, new problems, and new ideas during the course of your project, you'll likely return to perform some new data wrangling.

The term tidy data describes data that has been put into a standardized format that facilitates future analytic work. Hadley Wickham, a data scientist who is one of the key developers of the R language, coined the…
