From the course: Designing Big Data Healthcare Studies, Part Two


Finalizing the analytic dataset

Finalizing the analytic dataset - R Tutorial


- [Instructor] Hello there. In this video, we will move forward with finalizing our analytic dataset. In the previous section, we talked about first removing rows, so that you narrow your dataset down to your subpopulation. After that, you generate all the variables you need from your data dictionary, putting in rollback points as you go along. If you check yourself at each step of the way, you can catch mistakes before they carry forward. Once that is done, you need to make sure you've generated the right columns. Then you choose the ones you need for the analysis and write them out to your analytic dataset, called Analytic or whatever standard name you use. This is the step where you want to make sure that unnecessary identifiers have been removed from your analytic dataset to maintain privacy. When you write out this analytic dataset, you need to make sure that your documentation is updated and matches the dataset. Then, the final step is share and share alike. If you have private data or…
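
The workflow described so far (remove rows, generate variables, check each step, keep only the needed columns, drop identifiers, write out the analytic dataset) can be sketched roughly as follows. This is a minimal illustration in Python using only the standard library, not the course's own code. The column names, the adult-only subpopulation rule, the blood-pressure cutoff, and the helper names `build_analytic` and `write_analytic` are all assumptions made up for the example.

```python
import csv
import io

# Hypothetical raw extract; every column name and value here is illustrative.
RAW_CSV = """patient_id,name,age,sex,systolic_bp
101,Ann Lee,34,F,118
102,Bo Chan,17,M,110
103,Cy Diaz,52,M,141
104,Di Evans,45,F,135
"""

# Columns chosen for the analysis (assumed). Direct identifiers such as
# patient_id and name are deliberately left out.
KEEP_COLUMNS = ["study_id", "age", "sex", "high_bp"]


def build_analytic(raw_text):
    """Subset rows, derive variables, and keep only the analysis columns."""
    rows = list(csv.DictReader(io.StringIO(raw_text)))

    # Step 1: remove rows outside the subpopulation (assumed rule: age 18+).
    adults = [r for r in rows if int(r["age"]) >= 18]
    # Simple check at this step: subsetting must never add rows.
    assert len(adults) <= len(rows)

    # Step 2: generate the derived variables (assumed definitions).
    for i, r in enumerate(adults, start=1):
        r["study_id"] = i  # arbitrary sequential ID replacing patient_id
        r["high_bp"] = 1 if int(r["systolic_bp"]) >= 140 else 0

    # Step 3: keep only the chosen columns, which drops the identifiers.
    return [{c: r[c] for c in KEEP_COLUMNS} for r in adults]


def write_analytic(rows, handle):
    """Write the analytic dataset as CSV to an open text handle."""
    writer = csv.DictWriter(handle, fieldnames=KEEP_COLUMNS)
    writer.writeheader()
    writer.writerows(rows)


analytic = build_analytic(RAW_CSV)
out = io.StringIO()  # stands in for a file such as a hypothetical analytic.csv
write_analytic(analytic, out)
```

Listing the kept columns in one place (`KEEP_COLUMNS` here) also gives you a single list to compare against the documentation when you confirm that it matches the dataset.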
