开始时间: 随时 持续时间: 自主
Lesson 1: Getting and Cleaning Data Part 1
Lesson 2: Getting and Cleaning Data Part 2
Lesson 3: Data Quality and Beyond
Lesson 4: Storing Data
Lesson 5: Analyzing Data
Lesson 6: Case Study ---RNA Data/Human Transcriptome
This is one of the first courses we offer for students interested in the emerging field of data science.
In this course, we will explore how to wrangle data from diverse sources and shape it to enable data-driven applications. Some data scientists spend the bulk of their time doing this!
Students will learn how to collect, clean, and extract needed data and store it in MongoDB. We will also cover schema design, learn how to process data within MongoDB, and utilize Hadoop along with MongoDB to perform MapReduce operations.
This is a great course for those interested in entry-level data analyst positions as well as current business/data analysts looking to add big data to their repertoire, and managers working with data professionals or looking to leverage big data.