From the course: Designing Big Data Healthcare Studies, Part Two
Finalizing the analytic dataset - R Tutorial
Finalizing the analytic dataset
- [Instructor] Hello there. In this video, we will move forward with finalizing our analytic dataset. In the previous section, we talked about first removing rows, so you narrow your dataset down to your subpopulation. After that, you generate all the variables you need from your data dictionary, setting rollback points as you go along. If you check yourself each step of the way, you can catch mistakes before they carry forward. Once that is done, you need to make sure you've generated the right columns. And then you choose the ones you need for the analysis and write them out to your analytic dataset, called Analytic or some standard name you will use. This is the step where you remove unnecessary identifiers from your analytic dataset to maintain privacy. When you are writing out this analytic dataset, you need to make sure that your documentation is updated and matches the dataset. Then, the final step is share and share alike. If you have private data or…
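The column selection and write-out step described in the transcript could look roughly like the base R sketch below. It is only an illustration under stated assumptions: the data frame `working`, its column names, the `keep_vars` list standing in for the data dictionary, and the output file name `analytic.csv` are all hypothetical and do not come from the course's exercise files.

```r
# Hypothetical working data standing in for the fully processed study dataset.
# All column names here are assumptions made for illustration.
working <- data.frame(
  respondent_id = c(101, 102, 103),              # direct identifier
  zip_code      = c("02139", "55401", "94110"),  # quasi-identifier
  age_group     = c(2, 3, 1),
  diabetes_flag = c(0, 1, 0),
  outcome       = c(1, 1, 0),
  stringsAsFactors = FALSE
)

# Columns that the (hypothetical) data dictionary lists for the analysis.
keep_vars <- c("age_group", "diabetes_flag", "outcome")

# Make sure every expected column was actually generated.
missing_vars <- setdiff(keep_vars, names(working))
stopifnot(length(missing_vars) == 0)

# Keep only the analysis columns; identifiers are dropped by omission.
analytic <- working[, keep_vars, drop = FALSE]

# Confirm that no identifier slipped into the analytic dataset.
identifiers <- c("respondent_id", "zip_code")
stopifnot(!any(identifiers %in% names(analytic)))

# Write the analytic dataset out under a standard name.
out_path <- file.path(tempdir(), "analytic.csv")
write.csv(analytic, out_path, row.names = FALSE)

# Read the file back and check that its columns match the documented list.
check <- read.csv(out_path)
stopifnot(identical(names(check), keep_vars))
```

Selecting columns by an explicit keep list, rather than deleting identifiers one at a time, means any column not named in the documentation is left out by default, which keeps the written file and the data dictionary in step.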