Economics Education Research

Previous research has established that the transition from face-to-face instruction to distance education during the COVID-19 pandemic adversely affected the academic performance of students in grades 3 through 8. To date, however, little is known about whether similar effects apply to students in higher grade levels. This study addresses this gap using high school test proficiency and dropout rate data from six U.S. states. Contrary to the existing literature, it finds that high school students on average did not experience considerable learning losses during the pandemic, with no compelling evidence of widening achievement gaps among marginalized groups. Additionally, the effect of instruction mode varied considerably across educational and social contexts, with some states showing notable benefits from virtual and hybrid learning. Changes in dropout rates exhibit greater variability and less economic and statistical significance, suggesting intricate dropout dynamics influenced by pandemic-related disruptions beyond instruction mode changes alone.

Special thanks to Prof. Arvind Krishna, Prof. Charles Manski, and Prof. Joel L. Horowitz for their guidance and constructive feedback.

Full Paper: Impacts of Pandemic Instruction Mode on High School Students' Education Outcomes: Evidence from U.S. States

Repository Description

Tip

<outcome> is one of mathpass, elapass, dropout, or all
<state> is the name of a U.S. state

data:

Each state folder contains the replication code <state>_cleaning.ipynb, which creates <state>_<outcome>.csv in the final_csv folder

Data Sources: Arizona, Colorado, Georgia, Illinois, Indiana, Wisconsin, COVID-19 Instruction Mode

descriptive_analysis:

Note

Descriptive analysis reveals patterns and characteristics in the data. Visualizing changes in test proficiency and dropout rates by state, demographic group, and instruction mode communicates complex information concisely. However, these changes alone cannot support causal conclusions about the effect of instruction mode on education outcomes; an identification strategy is needed to separate that effect from other factors and biases. The statistical significance of the weighted point estimates should also be checked before drawing insights from the descriptive graphs.

descriptive_analysis.ipynb: replication code used to create the weighted-mean CSV files in the figure_csv folder

descriptive_analysis_ci.ipynb: replication code used to output the 95% confidence intervals of the weighted means

figure_csv: CSV files used to generate the figures in the figure folder through Stata
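
As a rough illustration of the weighted-mean and confidence-interval step described above, the sketch below computes a weighted mean with a normal-approximation 95% interval. It is not taken from the notebooks: the function name `weighted_mean_ci`, the use of enrollment counts as weights, and the linearization variance formula are all assumptions chosen for illustration, and the numbers are made up.

```python
import math

def weighted_mean_ci(values, weights, z=1.96):
    """Return (mean, lower, upper) for a weighted mean.

    Assumption: the variance uses a common linearization approximation,
    sum(w_i^2 * (x_i - mean)^2) / (sum w_i)^2, which may differ from the
    formula used in descriptive_analysis_ci.ipynb.
    """
    if len(values) != len(weights) or not values:
        raise ValueError("values and weights must be non-empty and equal in length")
    total_w = sum(weights)
    if total_w <= 0:
        raise ValueError("weights must sum to a positive number")
    mean = sum(w * x for x, w in zip(values, weights)) / total_w
    var = sum((w * (x - mean)) ** 2 for x, w in zip(values, weights)) / total_w ** 2
    half_width = z * math.sqrt(var)
    return mean, mean - half_width, mean + half_width

# Hypothetical example: pass rates for two schools, weighted by enrollment.
pass_rates = [0.5, 0.7]
enrollment = [100, 300]
mean, lower, upper = weighted_mean_ci(pass_rates, enrollment)
print(f"weighted mean = {mean:.3f}, 95% CI = ({lower:.3f}, {upper:.3f})")
```

An interval that comfortably includes zero change is one reason the descriptive graphs should be read with caution.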

influential_analysis:

Note

Influential point analysis identifies and removes influential observations so that regression results are not driven by a handful of entities. DFBETAS, which measure how much each regression coefficient changes (in standard-error units) when a single observation is deleted, are computed to guide this process. Observations with large absolute DFBETAS values exert substantial influence and are candidates for removal, so that the coefficients remain representative of the population.

before_removal_dfbetas: DFBETAS values for each parameter of interest before the influential entities are removed

after_removal_dfbetas: DFBETAS values for each parameter of interest after the influential entities are removed

dfbetas_computation.ipynb: replication code used to create DFBETAS values

dfbetas_visualization.ipynb: replication code used to create DFBETAS plots

removed_entities: influential entities ultimately removed from the final data sample
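
To make the DFBETAS idea concrete, here is a minimal sketch for a one-regressor model that refits the line with each observation left out. It is only an illustration under stated assumptions: the helper names `fit_simple_ols` and `slope_dfbetas`, the toy data, and the size-adjusted cutoff of 2/sqrt(n) (a conventional rule of thumb) are not taken from dfbetas_computation.ipynb, whose models and thresholds may differ.

```python
import math

def fit_simple_ols(x, y):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    slope = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
    return ybar - slope * xbar, slope

def slope_dfbetas(x, y):
    """DFBETAS of the slope for each observation, via leave-one-out refits."""
    n = len(x)
    _, slope_full = fit_simple_ols(x, y)
    xbar = sum(x) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)  # full-sample sum of squares
    result = []
    for i in range(n):
        x_loo, y_loo = x[:i] + x[i + 1:], y[:i] + y[i + 1:]
        a, b = fit_simple_ols(x_loo, y_loo)
        sse = sum((yj - a - b * xj) ** 2 for xj, yj in zip(x_loo, y_loo))
        s_loo = math.sqrt(sse / (n - 1 - 2))  # residual std. error without obs. i
        # Scale the coefficient change by the slope's standard error.
        result.append((slope_full - b) / (s_loo / math.sqrt(sxx)))
    return result

# Toy data (hypothetical): a clean linear trend with one outlier at the end.
x = [float(v) for v in range(10)]
y = [1.3, 2.8, 5.1, 6.6, 9.2, 10.9, 13.4, 14.7, 17.1, 34.0]
dfb = slope_dfbetas(x, y)
cutoff = 2 / math.sqrt(len(x))  # assumed rule-of-thumb threshold
flagged = [i for i, d in enumerate(dfb) if abs(d) > cutoff]
print("flagged observations:", flagged)
```

Recomputing DFBETAS after dropping the flagged entities corresponds to the before and after comparison stored in the two DFBETAS folders.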

model_diagnostic:

Note

Model specifications begin with domain knowledge, but statistical diagnostics are needed to check that the selected model and sample satisfy the key model assumptions. Plots of residuals, studentized residuals, and Cook's distance help assess model fit and identify influential data points, strengthening the robustness of the regression results.

model_diagnostic.ipynb: replication code for model selection, model assumption checks, and plots of residuals, studentized residuals, and Cook's distance

pairwise_correlation.ipynb: replication code used to compute pairwise correlations between demographic variables and instruction mode
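
The diagnostics named above can be sketched for a one-regressor model as follows. This is an illustrative sketch rather than the code in model_diagnostic.ipynb: the function name `simple_ols_diagnostics` and the toy data are hypothetical, the residuals are internally studentized, and the actual models in the paper include more regressors.

```python
import math

def simple_ols_diagnostics(x, y):
    """Return (leverages, studentized residuals, Cook's distances) for y = a + b*x."""
    n, p = len(x), 2  # p counts the intercept and the slope
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    slope = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
    intercept = ybar - slope * xbar
    resid = [yi - intercept - slope * xi for xi, yi in zip(x, y)]
    s2 = sum(e * e for e in resid) / (n - p)  # residual variance estimate
    leverage = [1 / n + (xi - xbar) ** 2 / sxx for xi in x]
    # Internally studentized residual: e_i / (s * sqrt(1 - h_i)).
    student = [e / math.sqrt(s2 * (1 - h)) for e, h in zip(resid, leverage)]
    # Cook's distance: r_i^2 * h_i / (p * (1 - h_i)).
    cooks = [r * r * h / (p * (1 - h)) for r, h in zip(student, leverage)]
    return leverage, student, cooks

# Toy data (hypothetical): a clean linear trend with one outlier at the end.
x = [float(v) for v in range(10)]
y = [1.3, 2.8, 5.1, 6.6, 9.2, 10.9, 13.4, 14.7, 17.1, 34.0]
leverage, student, cooks = simple_ols_diagnostics(x, y)
worst = max(range(len(x)), key=lambda i: cooks[i])
print(f"largest Cook's distance at observation {worst}: {cooks[worst]:.2f}")
```

Plotting `student` and `cooks` against the observation index gives the kind of diagnostic plots stored in the visualization folder; a commonly cited rule of thumb treats Cook's distances above 4/n as worth a closer look.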

visualization:

Each outcome folder contains DFBETAS plots, residual plots, studentized residual plots, and Cook's distance plots
