This repository contains the thesis workflow on AfD polling support, media salience, framing/content indicators, and a supplementary media tone layer.
- Primary project guide:
README.md(this file) - Data map and canonical files:
data/README.md - Notebook map and outputs:
notebooks/README.md
Key thesis-facing notes:
docs/final_empirical_results_summary.mddocs/supplementary_media_tone_results_note.mddocs/pre_writing_empirical_synthesis_note.md
Corpora/: source corpus layer (monthly main corpus + shock corpora)archive_pipeline/: archived post-retrieval technical pipeline stagesdata/: canonical analysis-ready tables and panel outputsnotebooks/: analysis/reporting notebooksfigures/: notebook-generated figure outputsdocs/: interpretation and methods notesscripts/: rebuild helpers
- Main panel:
data/panel/monthly_full_analysis_panel_2013_2025.parquetdata/panel/monthly_full_analysis_panel_with_tone_2013_2025.parquet
- Synthesis tables:
data/panel/thesis_master_shock_results_table.csvdata/panel/thesis_dataset_counts_summary.csv
- Synthesis note:
docs/pre_writing_empirical_synthesis_note.md
Clean layer refresh:
python scripts/build_postretrieval_dataset.pypython scripts/build_postretrieval_dataset.py --run-archived-pipeline
Salience rebuild:
python scripts/build_monthly_media_salience_indicator.py
Main corpus processing (only when corpus batches change):
python "Corpora/Main Corpus/run_main_corpus.py"- full reprocess:
python "Corpora/Main Corpus/run_main_corpus.py" --rerun --reextract - target month example:
python "Corpora/Main Corpus/run_main_corpus.py" --only 2021-09 --rerun --reextract
Note:
- Corpus scripts read source PDFs from
raw_unarchivefolders. - Shock-specific corpora (for example Cologne) also read only from
raw_unarchive; run their localaudit_*.pythenbuild_clean_*.pyscripts after archive updates.
- DAWUM is the primary source from
2017-01onward. - Historical manual supplement (Wahlrecht-based) covers
2013-09to2016-12.
- Do not manually edit canonical generated datasets in
data/. - Regenerate via scripts/notebooks.
- Tone indicators are supplementary; core analysis uses salience + framing/content indicators.