Course Outline

Introduction

  • The Data Science Process
  • Roles and responsibilities of a Data Scientist

Preparing the Development Environment

  • Libraries, frameworks, languages and tools
  • Local development
  • Collaborative web-based development

Data Collection

  • Different Types of Data
    • Structured 
      • Local databases
      • Database connectors
      • Common formats: XLSX, XML, JSON, CSV, ...
    • Unstructured
      • Clicks, sensors, smartphones
      • APIs
      • Internet of Things (IoT)
      • Documents, pictures, videos, sounds
  • Case study: Collecting large amounts of unstructured data continuously
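
To make the "common formats" item above concrete, here is a minimal sketch that loads the same kind of record from CSV and from JSON using only the Python standard library. The field names (`sensor`, `reading`), the sample values, and the helpers `load_csv` and `load_json` are hypothetical and chosen only for illustration; they are not taken from the course material.

```python
import csv
import io
import json

# Hypothetical sample payloads; real data would come from files, databases, or APIs.
CSV_TEXT = "sensor,reading\nA,21.5\nB,19.0\n"
JSON_TEXT = '[{"sensor": "C", "reading": 22.1}]'


def normalize(row):
    """Convert one raw row into a record with a numeric reading."""
    return {"sensor": row["sensor"], "reading": float(row["reading"])}


def load_csv(text):
    """Parse CSV text with a header row into a list of records."""
    return [normalize(row) for row in csv.DictReader(io.StringIO(text))]


def load_json(text):
    """Parse a JSON array of objects into a list of records."""
    return [normalize(row) for row in json.loads(text)]


# Both sources end up in one uniform list, whatever the original format was.
records = load_csv(CSV_TEXT) + load_json(JSON_TEXT)
```

Normalizing every source into one record shape early tends to keep the later storage and preparation steps independent of the original file format.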

Data Storage

  • Relational databases
  • Non-relational databases
  • Hadoop: Distributed File System (HDFS)
  • Spark: Resilient Distributed Dataset (RDD)
  • Cloud storage

Data Preparation

  • Ingestion, selection, cleansing, and transformation
  • Ensuring data quality: correctness, meaningfulness, and security
  • Exception reports
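
As a rough illustration of cleansing and exception reporting, the sketch below validates rows and collects the rejected ones instead of silently dropping them. It assumes a made-up CSV layout with `id` and `age` columns, and the `prepare` helper is hypothetical; it is one possible approach, not the method taught in the course.

```python
import csv
import io

# Hypothetical raw input: the column names and values are made up for illustration.
RAW_CSV = """id,age
1, 34
2,abc
3,
4,51
"""


def prepare(raw_text):
    """Split rows into cleaned records and an exception report."""
    cleaned, exceptions = [], []
    reader = csv.DictReader(io.StringIO(raw_text))
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        age_text = (row.get("age") or "").strip()
        if not age_text:
            exceptions.append((line_no, "missing age"))
            continue
        try:
            age = int(age_text)
        except ValueError:
            exceptions.append((line_no, f"non-numeric age: {age_text!r}"))
            continue
        cleaned.append({"id": int(row["id"]), "age": age})
    return cleaned, exceptions


cleaned, exceptions = prepare(RAW_CSV)

# The exception report lists each rejected line with the reason it was rejected.
for line_no, reason in exceptions:
    print(f"line {line_no}: {reason}")
```

Keeping the line number with each rejection makes it possible to trace a problem back to the source data rather than only counting failures.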

Languages used for Preparation, Processing and Analysis

  • R language
    • Introduction to R
    • Data manipulation, calculation and graphical display
  • Python
    • Introduction to Python
    • Manipulating, processing, cleaning, and crunching data

Data Analytics

  • Exploratory analysis
    • Basic statistics
    • Draft visualizations
    • Understanding the data
  • Causality
  • Features and transformations
  • Machine Learning
    • Supervised vs. unsupervised
    • When to use what model
  • Natural Language Processing (NLP)
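
For the "basic statistics" step of exploratory analysis, a minimal sketch using Python's standard `statistics` module might look like the following. The sample values are invented for illustration, including one deliberate outlier, and do not come from the course.

```python
import statistics

# Hypothetical sample: made-up measurements, including one obvious outlier.
values = [12.0, 15.0, 14.0, 10.0, 13.0, 95.0]

summary = {
    "count": len(values),
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "stdev": statistics.stdev(values),  # sample standard deviation
    "min": min(values),
    "max": max(values),
}

for name, value in summary.items():
    print(f"{name:>6}: {value:.2f}")
```

In this sample the mean is far above the median, which is the kind of signal that suggests checking for outliers with a draft visualization before choosing features or models.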

Data Visualization

  • Best Practices
  • Selecting the right chart for the right data
  • Color palettes
  • Taking it to the next level
    • Dashboards
    • Interactive Visualizations
  • Storytelling with data

Summary and Conclusion

Requirements

  • A general understanding of database concepts
  • A basic understanding of statistics

Duration: 35 hours
