Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added .DS_Store
Binary file not shown.
2,570 changes: 2,570 additions & 0 deletions .ipynb_checkpoints/profiling-checkpoint.ipynb

Large diffs are not rendered by default.

1,867 changes: 1,867 additions & 0 deletions .~tableau__3670.twbr

Large diffs are not rendered by default.

92 changes: 20 additions & 72 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,72 +1,20 @@
![IronHack Logo](https://s3-eu-west-1.amazonaws.com/ih-materials/uploads/upload_d5c5793015fec3be28a63c4fa3dd4d55.png)

# Project: Business Intelligence with Tableau

## Overview

The goal of this project is for you to practice what you have learned in the Business Intelligence chapter of this program. For this project, you will choose a data set, explore the it using Tableau, and put together a Story for presentation showing the insights you have derived from the data. You should demonstrate your proficiency using Tableau and the concepts you have learned throughout the chapter. The workbook should be saved to Tableau Public and a link to the workbook should be provided.

**You will be working in pairs for this project**

---

## Technical Requirements

The technical requirements for this project are as follows:

- You must construct a Tableau Story consisting of at least 5 story points for the data set you have chosen.
- You must use Story features such as captions and annotations.
- You must demonstrate all the concepts we covered in the chapter (sorting, filtering, different visualizations types, aggregations, etc.).
- Your Tableau workbook consisting of at least 5 visualizations and 1 Story should be saved to Tableau Public.
- You should create a Github repo for this project, and your data should be saved to that repo in a folder named data.
- You should also include a README.md file that describes the steps you took, your thought process as you built your visualizations and Story in Tableau, and a link to your workbook on Tableau Public.

## Necessary Deliverables

The following deliverables should be pushed to your Github repo for this chapter.

- **A Tableau workbook uploaded to Tableau Public** that contains the visualizations and Story you created from your data set.
- **An data folder** containing the data set you used for your project.
- **A `README.md` file** containing a detailed explanation of your approach and code for constructing visualizations and organizing them into a Story as well as your results, obstacles encountered, lessons learned, and a link to your completed Tableau workbook.

## Suggested Ways to Get Started

- **Find a data set to process** - As great places to start looking we recommend [Kaggle](https://www.kaggle.com/datasets), [Pordata](https://www.pordata.pt), and [EuroStat](https://ec.europa.eu/eurostat/data/database).
- **Explore the data set** and come up with a variety of visualizations that you can potentially include in your story.
- **Break the project down into different steps** - identify the entities/dimensions in your data set, explore them each individually, and then progress to analyzing different combinations of them.
- **Use the tools in your tool kit** - the concepts and methods you have learned in the business intelligence chapter as well as some of the things you've learned in previous chapters. This is a great way to start tying everything you've learned together!
- **Work through the lessons in class** & ask questions when you need to!
- **Commit early, commit often**, don’t be afraid of doing something incorrectly because you can always roll back to a previous version.
- **Consult documentation and resources provided** to better understand the tools you are using and how to accomplish what you want.

## Useful Resources

- [Tableau Getting Started Tutorial](https://onlinehelp.tableau.com/current/guides/get-started-tutorial/en-us/get-started-tutorial-home.html)
- [Tableau Training Videos](https://www.tableau.com/learn/training)
- [Tableau Learning Library](https://onlinehelp.tableau.com/current/guides/get-started-tutorial/en-us/get-started-tutorial-next.html)

## Evaluation topics

**Text**

- Subtitle and/or annotations provide additional information
- Text is hierarchical in size, readable and horizontal
- Data is labeled directly and labels are used sparingly

**Arrangement**

- Data is intentionally ordered

**Color**

- Color scheme is intentional, used to highlight key patterns, readable when printed in black & white, and sufficiently contrasts with background

**Lines**

- Axes do not have unnecessary tick marks, and graph has one horizontal and one vertical axis

**Overall**

- Graphs highlights significant finding or conclusions
- No addition of unnecessary graphs
- Individual chart elements work together to reinforce the overarching takeaway message
- Detailed explanation of your approach and code for constructing visualisations and organising them into a Story:
1) I started by plotting conditions vs. waiting time, so we could quickly see which conditions were taking longer to be addressed; I connected gender and age group to this so we could have a nice overview of the female/male and age ratio inside each condition.
2) I went to investigate which hospital units (neighbourhoods) were having longer waiting times, and if that was somehow resulting in more no-shows on the appointments.
3) Finally I went to investigate a bit more on the scholarship: if patients with it were actually showing up or not (not goof spending government money for anything), and who were these patients, how many days they were waiting for an appointment, and if they had received a warning message or not.

- Results:
> (see above/see plots on workbook)

- Obstacles encountered:
> Working in general with tableau... it's not very flexible. I end up not fully understanding how to work around with
calculated fields and parameters so end up not using them.
> Shifting from pandas (matplolib and seaborn) to tableau in general. The data cleaning and manipulation has to go in a different way.
> Not being able to perform correlations. They might be possible for sure but is something that was so immediate and useful in pandas...

- Lessons learned:
> Work in group, not matter out.
> I think brushing is on point.

- Link to your completed Tableau workbook:
> I uploaded the complete workbook. Couldn't login to my tableau account for some reason.
Loading