From the course: Data Science for Java Developers

Unlock this course with a free trial

Join today to access over 22,600 courses taught by industry experts.

Splitting your dataset by label

Splitting your dataset by label - Java Tutorial

From the course: Data Science for Java Developers

Splitting your dataset by label

- [Instructor] So now that we have all of our unique labels the next thing we're going to want to do, is calculate the summary, a summary being just the mean and standard deviation. And those are just statistical terms don't worry if you're not super familiar with those at the moment, just kind of follow along with me here, if you aren't familiar with those. So we're going to need to calculate the mean and standard deviation of our data set after we've split it into different pieces by the label. So in other words, what we're going to do, is we're going to take our flour data here. We're going to split it apart into three parts in our case, that's three because we have three different labels and we're going to want to calculate the mean and standard deviation of each of these parts. So we're going to have one part for this species here, we're going to have one part for this species here, and we're going to have one part…

Contents