Skip to content

Methods from 2017 study

Nic edited this page Jun 22, 2020 · 1 revision

Research Questions

  • What fields of academic research are using OGD?
  • What types (themes/ formats/ sources) of OGD do researchers use?
  • What purposes do OGD serve in academic research?/ What are utilization types of OGD in academic research?
    • Types of utilization: Source data, data for evaluating results, inputs / outputs, ground truthing?
  • How much impact has OGD made through advancing academic research?
    • Citation counts - justify the use of citation counts / journal impact factors as a proximate measure of impact.

Goal

The goal of lit review is to understand how OGD is being used in academic research and what is the impact of utilization of OGD in research.

I suggest something like: To better understand the transfer of open government data initiatives to academic research and development activities.

Sources

  • ACM : excluded because cannot do reference search
  • IEEE: 263
  • Scopus: 1997
  • Springer: 226

After removing duplicates and retaining only journal/conferences articles: 2377

Search terms:

for example: "data.seattle.gov" in full text and references

in Scopus:

ALL ( "odaa.dk" OR "open.alabama.gov" OR "data.alberta.ca" OR "cabq.gov/abq-data" OR "alkmaar.nl/opendat" OR "datacatalogs.org/catalog/allerdal" OR "os.amsterdam.nl" OR "data.angers.fr" OR "a2gov.org/data" OR "opendata.antwerpen.be" ) AND PUBYEAR > 2008 AND ( LIMIT-TO ( DOCTYPE , "ar" ) OR LIMIT-TO ( DOCTYPE , "cp" ) ) AND ( LIMIT-TO ( LANGUAGE , "English" ) )

in IEEE:

("odaa.dk" OR "open.alabama.gov" OR "data.alberta.ca" OR "cabq.gov/abq-data" OR "alkmaar.nl/opendat" OR "datacatalogs.org/catalog/allerdal" OR "os.amsterdam.nl" OR "data.angers.fr" OR "a2gov.org/data" OR "opendata.antwerpen.be") and refined by Content Type: Conference Publications Journals & Magazines Early Access Articles   Year: 2009-2018

notes: when retrieving 'data.gov' in Scopus, additional conditions used: excluded papers which has 'open government data' in title/keywords/abstract. Otherwise, there are way too many results...

Selection

Inclusion Criteria:

  • 2009 - Forward (data.gov launched in May, 2009/ )
  • Data must be used - and not simply mentioned
  • Only peer-reviewed research articles, no abstracts, chapters or books
  • Only English
  • Only full-text accessible (qualification on this)

Exclusion Criteria:

  • Articles describing the importance of open data, describing the OGD platforms(technology/policy/usage)/data holdings but not the actual datasets, or reviewing open data policies (and do not actually analyze data)
  • Pre 2009

Question: Does Census data count as open government data? What about data gathered or available from national research laboratories?

  • retain for now, but may exclude from further analysis.

Timeline

  • retrieve materials and remove duplicates (August, 4th)

  • Nic and Annie examine 50 samples (From Scopus sample 1 ~ 50) to determine in/out (August, 4th)

  • Nic and Annie examine anther 98 samples (Springer 1~98) to determine use types (August, 11st)

  • Nic and Annie examine anther 52 samples (Springer 99~150) to determine in/out and use types (August, 18st)

  • Nic and Annie examine 10 samples (Springer 185, 191-194, 196, 198,199,200,203) to determine usage types (8.29)

  • Nic and Annie examine 50 samples (Scopus, 52-114, only those determined as "in") to determine usage types (8.29)

  • Write lit retrieval outline (August, 11st)

  • Finalize a set of papers to be analyzed (August 11st)

  • descriptive analysis (year, subjects, trends) (August 30th)

  • classification of utilization (August 30th)

  • classification of types/sources of OGD used. (August 30th)

  • select and analyse typical use cases (August 30)

  • estimate impact via paper citation/journal IF. (September 22)

  • draft writing (September 30)

initial samples coding

Annie's annotation of first 50 samples: https://docs.google.com/a/uw.edu/spreadsheets/d/1DrfMxdu4L1FjeGx9T7e3JMDp5APZ52lj40awMphG4iU/edit?usp=sharing

Nic and Annie determine in this round:

Annie's annotation of second round 100 samples from Springer:

https://docs.google.com/a/uw.edu/spreadsheets/d/1PufdOkukrXerqiuvrhhSv1aFcw1hcU016UvYDvP2H58/edit?usp=sharing

or use copy on Github : https://github.com/OpenDataLiteracy/ODG-usage-in-Research/blob/master/lit_mining/Springer_sample2_20170808_Annie.xlsx

Nic and Annie determine in this round:

Third round of annotation of 50 samples from Springer

Nic and Annie determine in this round:

4th round of annotation, 10 samples: https://docs.google.com/a/uw.edu/spreadsheets/d/1MX2FjN1HyJ3x4RCAPdDmqMeFxsvMTR3loOSRZfpZKBg/edit?usp=sharing

**5th round of annotation, 50 samples: ** https://docs.google.com/a/uw.edu/spreadsheets/d/10Om6eWpjNqiCTJbs1pttKZxJlMKVKXvCHxu8vzfLqYg/edit?usp=sharing

descriptive analysis

Clone this wiki locally