From the course: GitHub for Data Scientists

What is GitHub?

- As a data scientist, there are many platforms that you are often told you need to know to work on different projects. It can be overwhelming to try to figure out what is important to learn and which softwares are used for which tasks. GitHub is one of the platforms you will likely encounter when working on projects with teams. By the end of this lesson, you will understand what functionality GitHub provides to data scientists. GitHub is a platform that is used by programmers and data scientists for collaboration on projects that involve source code. GitHub provides two main functionalities that make it easy to involve multiple people on a project simultaneously. First, the GitHub website provides a graphical user interface or GUI for interacting with Git, which makes it easy to create and host repositories, view changes to your code, and create or accept alterations to your project. Using GitHub, you can also keep track of progress on projects by creating tasks and assigning them to teammates or yourself. Additionally, GitHub acts is a form of social media for technical individuals. Using GitHub, data scientists and programmers can create projects for the world to see and post them on their GitHub profile. This is used to increase collaboration among users and creates a personal portfolio of your experience that can be shared with your peers and potential employers. GitHub is commonly used in technical fields and is an important part of how data scientists continue to work together, especially when your teams are remote or you're working with people outside of your organization. The more you use GitHub, the more you will be exposed to new ideas and allow others to contribute to your projects in ways you might have never thought of. If you want to expand outside your own horizons and understand how collaboration leads to richer output, using GitHub is a must.

Contents