Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added .DS_Store
Binary file not shown.
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
Archive/
.gitignore
.DS_Store
Binary file added Archive/.DS_Store
Binary file not shown.
Binary file added Archive/Pictures/.DS_Store
Binary file not shown.
Binary file added Archive/Pictures/NoStopsEur_corr.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/NoStopsEur_scatter.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/baixachiado1.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/baixachiado2.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/conclusion.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/map.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/oldman1.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/pracodocomercio1.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/pracodocomercio2.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Archive/Pictures/tldr.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
117 changes: 117 additions & 0 deletions Archive/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,117 @@
<img src="https://bit.ly/2VnXWr2" alt="Ironhack Logo" width="100"/>

# Welcome to Your Own Project!
This project is completely up to you!*
\*terms and conditions may apply. Consult your TA or lead teacher for full details of this limited offer

## Content
- [Project Description](#project-description)
- [Project Goals](#project-goals)
- [Requirements](#requirements)
- [Deliverables](#deliverables)
- [Mentoring](#mentoring)
- [Schedule](#schedule)
- [Presentation](#presentation)
- [Tips & Tricks](#tips-&-tricks)
- [Resources](#resources)

<a name="project-description"></a>

## Project Description
In this project, you will think of a topic and problem, collect experimental data, complete an end-to-end analysis and present the results, all by yourself.

First, choose a topic of interest to you and understand what research has already been done in that area. What are some interesting questions that remain? Can you turn those questions into a product (i.e. can you extract value out of answering those questions)?

You will then collect some data you think could help answer those questions. Choose your main source of data wisely, since in this project you have a restriction that tries to emulate a common corporate setting: you won't have access to a census of the universe of your choice. You must collect the data yourself in such a way that the universe of datapoints available to you is limted. For example, you may be limited by time (e.g. watching and categorizing Youtube videos or Instagram pictures), by cost (e.g. querying Google Maps for public transport routes via the gui, without paying for the API access) or by access (e.g. surveying people on their preferences). In the end, you should aim at collecting between 30 and 100 observations (rows) and between 5 and 10 features (columns) per observation.

Once you have your data, complete an analysis that answers your original question and/or related ancilliary questions. Please make sure that the main observations you make hold to scientific scrutiny at some level of significance. You can and should supplement your analysis with visual intuition and highlights of hypothesis that the data seems to support, even if you are not necessarily able to hold those insights to the same level of scrutiny as your main question.

You can enrich your limited dataset with information from richer sources that you can obtain trough any means you've learned before (e.g. you may web scrape the weights of car models if that is one of your observations).

Like in the previous project, package your results with a product or service mindset. You will present your findings in a presentation (possibly supported by an interactive visualization) where you should evidence principles of dashboarding and storytelling.

<a name="project-goals"></a>

## Project Goals
* Research, collect and analyse data on a topic of interest to you.
* Feel free to use additional data to enrich your dataset, maybe using an API or web scraping.
* Apply the statistical techniques we have learned, along with techniques from EDA.
* Create useful and easily-interpretable plots.
* Prepare a presentation keeping in mind the finer points of storytelling.
* Communicate the results of your analysis clearly, accurately and engagingly.

<a name="requirements"></a>

## Requirements
* You must plan your project. That is why creating a Kanban or Trello Board is mandatory. You have a template for Trello [here](https://trello.com/b/YX8EsB2P/project-4-your-own-project).
* You **CAN'T CODE** until you project is planned.
* Create a *.gitignore* file and include it in your repository.

<a name="deliverables"></a>

## Deliverables
* All the scripts you used for your analysis.
* Slides and a 5 minute presentation in the classroom.
* Repository with your workflow + documentation + code. Even if you are working alone, you need to maintain good practices!
* A short report including your motivation, methodology and results.

<a name="mentoring"></a>

## Mentoring
One of the TAs will be your mentor!
Your mentor will:
* Follow your project in general.
* Check if you are following the tasks, your blockers, etc
* Help/support you in specific questions.

<a name="schedule"></a>

## Schedule
*Monday*
* Think about a topic and propose some core questions.
* Choose data that is relevant to your questions and devise ways of collecting such data.
* Choose ancilliary data that would allow you to acheive your strech goals.
* Look for documentation to give context to your project.
* Write the README file in your repository.
* Get approval for your project
* **DO NOT START CODING**
* Start collecting the data for your core questions

**NO CODE UNTIL HERE**

* Tuesday - Thursday morning*
* Data entry, cleaning and transformation.
* Start the analysis. Remember all the techniques you have learned!
* Prepare a draft of your first slides presentation (no analysis or conclusions yet): title, motivation, context, ...

*Thursday afternoon*
* Rehearsal. Take the feedback and use it!
* Finish the analysis. Finish the slides.
* Final improvements!

*Friday*
* Presentation!

<a name="presentation"></a>

## Presentation
Presentations for this project will be in the classroom! Presentations will be **EXACTLY** 5 minutes long, with 2 additional minutes for questions. We will stop you!

<a name="tips-&-tricks"></a>

## Tips & Tricks
* Organize yourself (don't get lost!).
* Ask for help vs Google is your friend.
* Define a simple approach first. You never know how the data can betray you ;)
* Learn about your subject and understand what other research has been done before you.
* You *can* use data from the projects your partners did in the last weeks.
* Before making a graph, think about what you want to represent.

<a name="resources"></a>

## Resources
Here are some data sources that could be interesting to you:
* [Explore the Google Trends](http://pages.today/trends4)
* [Hans Roslings's vizs](http://b.link/ted52)
* [See an analysis about heart diseases](http://b.link/kaggle10)
* [Your body's data](http://body.media/ted6)
Binary file added Presentation.key
Binary file not shown.
Binary file added Presentation.pdf
Binary file not shown.
135 changes: 38 additions & 97 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,117 +1,58 @@
<img src="https://bit.ly/2VnXWr2" alt="Ironhack Logo" width="100"/>

# Welcome to Your Own Project!
This project is completely up to you!*
\*terms and conditions may apply. Consult your TA or lead teacher for full details of this limited offer
# Return on Street Performance in Lisbon
*by Uli @ Ironhack Lisbon Nov 20th 2020*

## Content
- [Project Description](#project-description)
- [Project Goals](#project-goals)
- [Requirements](#requirements)
- [Deliverables](#deliverables)
- [Mentoring](#mentoring)
- [Schedule](#schedule)
- [Presentation](#presentation)
- [Tips & Tricks](#tips-&-tricks)
- [Resources](#resources)
- [Hypotheses / Questions](#hypotheses-/-questions)
- [Dataset](#dataset)
- [Workflow](#workflow)
- [Organization](#organization)
- [Links](#links)

<a name="project-description"></a>

## Project Description
In this project, you will think of a topic and problem, collect experimental data, complete an end-to-end analysis and present the results, all by yourself.

First, choose a topic of interest to you and understand what research has already been done in that area. What are some interesting questions that remain? Can you turn those questions into a product (i.e. can you extract value out of answering those questions)?
My neighbor in Lisbon performed Flamenco dances in the streets of Lisbon. The project was about testing some hypotheses regarding what drives the return of a street performaner.

You will then collect some data you think could help answer those questions. Choose your main source of data wisely, since in this project you have a restriction that tries to emulate a common corporate setting: you won't have access to a census of the universe of your choice. You must collect the data yourself in such a way that the universe of datapoints available to you is limted. For example, you may be limited by time (e.g. watching and categorizing Youtube videos or Instagram pictures), by cost (e.g. querying Google Maps for public transport routes via the gui, without paying for the API access) or by access (e.g. surveying people on their preferences). In the end, you should aim at collecting between 30 and 100 observations (rows) and between 5 and 10 features (columns) per observation.
<a name="hypotheses-/-questions"></a>

Once you have your data, complete an analysis that answers your original question and/or related ancilliary questions. Please make sure that the main observations you make hold to scientific scrutiny at some level of significance. You can and should supplement your analysis with visual intuition and highlights of hypothesis that the data seems to support, even if you are not necessarily able to hold those insights to the same level of scrutiny as your main question.
## Hypotheses / Questions
- Old individual men donate more?
- Gender 'study' (male vs. female vs. couples).
- Number of stops has a positive effect on return?
- Location matters?
- Songs matter?

You can enrich your limited dataset with information from richer sources that you can obtain trough any means you've learned before (e.g. you may web scrape the weights of car models if that is one of your observations).
<a name="dataset"></a>

Like in the previous project, package your results with a product or service mindset. You will present your findings in a presentation (possibly supported by an interactive visualization) where you should evidence principles of dashboarding and storytelling.
## Dataset
The dataset contains 50 observations including two outliers of:
- 6 groups (time slots of 20 min each)
- 2 locations [Baixa-Chiado, Praça do Comércio]
- Genders [Male, Female, Couples]
- Individuals [True/False]
- Age Group [25-49, 50+]
- Songs (4 out of 5 songs)
- Return in EUR
[Dataset]()

<a name="project-goals"></a>
<a name="workflow"></a>

## Project Goals
* Research, collect and analyse data on a topic of interest to you.
* Feel free to use additional data to enrich your dataset, maybe using an API or web scraping.
* Apply the statistical techniques we have learned, along with techniques from EDA.
* Create useful and easily-interpretable plots.
* Prepare a presentation keeping in mind the finer points of storytelling.
* Communicate the results of your analysis clearly, accurately and engagingly.
## Workflow
Outline the workflow you used in your project. What were the steps you went through?

<a name="requirements"></a>
<a name="organization"></a>

## Requirements
* You must plan your project. That is why creating a Kanban or Trello Board is mandatory. You have a template for Trello [here](https://trello.com/b/YX8EsB2P/project-4-your-own-project).
* You **CAN'T CODE** until you project is planned.
* Create a *.gitignore* file and include it in your repository.
## Organization
How did you organize yourself? Did you use any tools?

<a name="deliverables"></a>
<a name="links"></a>

## Deliverables
* All the scripts you used for your analysis.
* Slides and a 5 minute presentation in the classroom.
* Repository with your workflow + documentation + code. Even if you are working alone, you need to maintain good practices!
* A short report including your motivation, methodology and results.
## Links
Include the links to your repository, slides and trello. Feel free to include any other links associated to your project.

<a name="mentoring"></a>

## Mentoring
One of the TAs will be your mentor!
Your mentor will:
* Follow your project in general.
* Check if you are following the tasks, your blockers, etc
* Help/support you in specific questions.

<a name="schedule"></a>

## Schedule
*Monday*
* Think about a topic and propose some core questions.
* Choose data that is relevant to your questions and devise ways of collecting such data.
* Choose ancilliary data that would allow you to acheive your strech goals.
* Look for documentation to give context to your project.
* Write the README file in your repository.
* Get approval for your project
* **DO NOT START CODING**
* Start collecting the data for your core questions

**NO CODE UNTIL HERE**

* Tuesday - Thursday morning*
* Data entry, cleaning and transformation.
* Start the analysis. Remember all the techniques you have learned!
* Prepare a draft of your first slides presentation (no analysis or conclusions yet): title, motivation, context, ...

*Thursday afternoon*
* Rehearsal. Take the feedback and use it!
* Finish the analysis. Finish the slides.
* Final improvements!

*Friday*
* Presentation!

<a name="presentation"></a>

## Presentation
Presentations for this project will be in the classroom! Presentations will be **EXACTLY** 5 minutes long, with 2 additional minutes for questions. We will stop you!

<a name="tips-&-tricks"></a>

## Tips & Tricks
* Organize yourself (don't get lost!).
* Ask for help vs Google is your friend.
* Define a simple approach first. You never know how the data can betray you ;)
* Learn about your subject and understand what other research has been done before you.
* You *can* use data from the projects your partners did in the last weeks.
* Before making a graph, think about what you want to represent.

<a name="resources"></a>

## Resources
Here are some data sources that could be interesting to you:
* [Explore the Google Trends](http://pages.today/trends4)
* [Hans Roslings's vizs](http://b.link/ted52)
* [See an analysis about heart diseases](http://b.link/kaggle10)
* [Your body's data](http://body.media/ted6)
[Repository](https://github.com/)
[Slides](https://slides.com/)
[Trello](https://trello.com/en)
51 changes: 51 additions & 0 deletions data.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,51 @@
#,group,date,h_from,h_to,place,gender,individual,age_group,eur,song_order,comment
1,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
2,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
3,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
4,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
5,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
6,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
7,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
8,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,25-49,0.00,3 - 4 - 5 - 1,
9,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,50+,0.00,3 - 4 - 5 - 1,
10,1,13.11.2020,12:10,12:30,Baixa-Chiado,Male,TRUE,50+,0.00,3 - 4 - 5 - 1,
11,1,13.11.2020,12:10,12:30,Baixa-Chiado,Couple,FALSE,25-49,0.00,3 - 4 - 5 - 1,
12,1,13.11.2020,12:10,12:30,Baixa-Chiado,Couple,FALSE,25-49,0.00,3 - 4 - 5 - 1,
13,1,13.11.2020,12:10,12:30,Baixa-Chiado,Couple,FALSE,50+,0.00,3 - 4 - 5 - 1,
14,1,13.11.2020,12:10,12:30,Baixa-Chiado,Couple,FALSE,50+,0.00,3 - 4 - 5 - 1,
15,1,13.11.2020,12:10,12:30,Baixa-Chiado,Couple,FALSE,50+,0.00,3 - 4 - 5 - 1,
16,2,13.11.2020,12:40,13:00,Baixa-Chiado,Male,TRUE,25-49,0.00,2 - 3 - 1 - 4,
17,2,13.11.2020,12:40,13:00,Baixa-Chiado,Male,TRUE,25-49,0.00,2 - 3 - 1 - 4,
18,2,13.11.2020,12:40,13:00,Baixa-Chiado,Male,TRUE,50+,1.00,2 - 3 - 1 - 4,
19,2,13.11.2020,12:40,13:00,Baixa-Chiado,Male,TRUE,50+,0.00,2 - 3 - 1 - 4,
20,2,13.11.2020,12:40,13:00,Baixa-Chiado,Couple,FALSE,50+,0.00,2 - 3 - 1 - 4,
21,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
22,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
23,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
24,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
25,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
26,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
27,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
28,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,25-49,0.00,5 - 4 - 1 - 2,
29,3,13.11.2020,13:10,13:30,Baixa-Chiado,Male,TRUE,50+,0.00,5 - 4 - 1 - 2,
30,3,13.11.2020,13:10,13:30,Baixa-Chiado,Female,TRUE,25-49,1.00,5 - 4 - 1 - 2,
31,4,13.11.2020,14:30,14:50,Praça do Comércio,Male,TRUE,25-49,2.00,3 - 4 - 2 - 1,
32,4,13.11.2020,14:30,14:50,Praça do Comércio,Male,TRUE,25-49,2.00,3 - 4 - 2 - 1,
33,4,13.11.2020,14:30,14:50,Praça do Comércio,Male,TRUE,25-49,0.00,3 - 4 - 2 - 1,
34,4,13.11.2020,14:30,14:50,Praça do Comércio,Male,TRUE,50+,0.00,3 - 4 - 2 - 1,
35,4,13.11.2020,14:30,14:50,Praça do Comércio,Female,TRUE,25-49,2.00,3 - 4 - 2 - 1,
36,4,13.11.2020,14:30,14:50,Praça do Comércio,Female,TRUE,25-49,0.00,3 - 4 - 2 - 1,
37,4,13.11.2020,14:30,14:50,Praça do Comércio,Female,TRUE,25-49,0.00,3 - 4 - 2 - 1,
38,4,13.11.2020,14:30,14:50,Praça do Comércio,Female,TRUE,25-49,0.00,3 - 4 - 2 - 1,
39,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,TRUE,25-49,1.00,1 - 4 - 5 - 3,
40,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,FALSE,25-49,2.00,1 - 4 - 5 - 3,interviewer
41,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,FALSE,25-49,5.00,1 - 4 - 5 - 3,interviewer
42,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,TRUE,25-49,0.00,1 - 4 - 5 - 3,
43,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,TRUE,25-49,0.00,1 - 4 - 5 - 3,
44,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,TRUE,25-49,0.00,1 - 4 - 5 - 3,
45,5,13.11.2020,15:00,15:20,Praça do Comércio,Female,TRUE,25-49,0.00,1 - 4 - 5 - 3,
46,6,13.11.2020,15:30,15:50,Praça do Comércio,Male,TRUE,25-49,2.00,2 - 3 - 1 - 4,
47,6,13.11.2020,15:30,15:50,Praça do Comércio,Male,TRUE,50+,0.00,2 - 3 - 1 - 4,
48,6,13.11.2020,15:30,15:50,Praça do Comércio,Female,FALSE,25-49,2.00,2 - 3 - 1 - 4,
49,6,13.11.2020,15:30,15:50,Praça do Comércio,Female,FALSE,25-49,0.00,2 - 3 - 1 - 4,
50,6,13.11.2020,15:30,15:50,Praça do Comércio,Female,TRUE,25-49,0.00,2 - 3 - 1 - 4,
Loading