-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.Rmd
More file actions
56 lines (39 loc) · 3.98 KB
/
index.Rmd
File metadata and controls
56 lines (39 loc) · 3.98 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
---
title: "The Dataist"
output:
html_document:
toc: true
number_sections: false
toc_depth: 4
theme: lumen
highlight: tango
---
<style type="text/css">
.title {
display: none;
}
#getting-started img {
margin-right: 10px;
}
</style>
<div class="row" style="padding-top: 30px;">
<div class="col-sm-6">
# **Introduction to Data Science**
### *Social & Behavioral Scientific Research Methods Using R* {style="padding-top: 0px;"}
#### Dr. Fatih Uenal
Last updated: `r Sys.Date()`
This page is designed to provide you with the necessary resources and a step-by-step guide to learn the fundamentals of Data Science in R. It is mainly designed for a social scientist audience, meaning you are training/trained as a psychologist, sociologist, political scientist, etc., and you want to augment your skills to run all your analyses in a powerful programming language while also expanding into more novel data science techniques such as Machine Learning, Web Scraping and more. The materials provided here are designed for people with **no previous programming experience** or familiarity with traditional social scientific methods.
The entire course is build on the textbook *Introduction to Data Science* by Prof Rafael A. Irizarry (Harvard University). The textbook has also been adapted for an online course on edX. Here, I provide an augmented version of these materials as an interactive learning course created with the `learnr` package `datsci`. The `datsci` package blends together the textbook materials with several complementary courses available on DataCamp, allowing you to practice and apply your skills on real-world datasets. Together, these resources form **a step-by-step syllabus which will allow you to teach yourself R in an interactive and fun way!** The materials used here are carefully selected and blended such that they complement and reinforce each other. The combination of online reading materials, comprehension checks, assessments, and practical courses and projects will allow for a **smooth and efficient learning experience.**
The entire course is spread over 8 sequential modules (datsci_01 - datsci_08): R Basics, Data Visualization, Probability, Inference and Modeling, Productivity Tools, Data Wrangling, Linear Regression, and Machine Learning. In each module, we use motivating case studies. In each case study, we try to realistically mimic a data scientist’s experience. For each of the concepts covered, we start by asking specific questions and answer these through data analysis. We learn the concepts as a means to answer the questions.
The materials are **free and open**, including the primary textbook [*Introduction to Data Science*](https://rafalab.github.io/dsbook/). We will provide assessment materials to test your learning success as you work through the syllabus. For students taking my course at the University I will provide free access to the DataCamp materials by the beginning of the trimester. If you are not participating in my University course, you will need to purchase a subscription for access to DataCamp.
### Copying/Adopting these materials
This website is written using R Markdown. The source code for the website is adapted from Dr. Matt Crump's webpage here [Link](https://github.com/CrumpLab/psyc3400). For anyone interested in adopting these materials, the idea is you can fork the repo for the [website](https://github.com/CrumpLab/psyc3400). Then edit as desired for your purposes, as did I. I am grateful for the amazing materials provided by Prof. Rafael A. Irizarry (Harvard University) which are licenced under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/). More materials by Prof. Rafael A. Irizarry can be found [here](http://rafalab.github.io/).
</div>
<div class="col-sm-6">
<a href="https://fuenal.github.io/dataism/Lectures.html">
```{r statsTextbook, out.width=400, echo=F}
knitr::include_graphics("images/The Brain.png")
```
</a>
</div>
</div>