Module Overview
Data science is the study and practice of how we can extract insight and knowledge from large amounts of data. It is a developing field, currently attracting substantial demand from both academia and industry.
This course provides a practical introduction to a data science analysis, including data collection and processing, data visualization and presentation, statistical model building using machine learning for scaling these methods.
Topics covered include: Collecting and processing data, free text analysis; Analyzing the data using a variety of statistical and machine learning methods.
Learning Outcomes
Upon successfully completing the course, you will be able to:
Understand the full data science pipeline, and be familiar with programming tools to accomplish the different portions
Use of Python and its modules to scrape, clean, and process data
Use of data management techniques to store data
Use of statistical methods and visualization to quickly explore data
Apply statistics and computational analysis to analyze the data