Skip to content

Feature/post-processing-fixes#1

Draft
fheitzer wants to merge 6 commits intoDBO-DKFZ:mainfrom
fheitzer:feature/post-processing-fixes
Draft

Feature/post-processing-fixes#1
fheitzer wants to merge 6 commits intoDBO-DKFZ:mainfrom
fheitzer:feature/post-processing-fixes

Conversation

@fheitzer
Copy link
Copy Markdown

@fheitzer fheitzer commented Jan 28, 2026

Hi, as I am continuing your research I had to do some changes to the post processing. I was working with the data dir that was already put on the cluster. So, I hope this is universal and not just because of the cluster data. The commits are straight forward, but ill list my changes here:

  • tma column stripping to get the exact identifier had to be adjusted to eliminate all Grading prefixes and suffixes --> tma_identifiers stripped and double checked
  • the free text mapping wasn't utilised at all --> utilised
  • the "free text: " prefix wasn't removed --> now removed
  • there still were explanations with 2 parts separated by a "\n" --> now they are separated and the first valid explanation is used
  • the German / English errors mappings were not caught --> map them (German error straight to English)
  • there were still double spaces in some explanations --> replaced with single spaces

The training runs for me now and the val loss decreases yippie.

@fheitzer
Copy link
Copy Markdown
Author

fheitzer commented Feb 4, 2026

Most definitely due to outdated csv

@fheitzer fheitzer marked this pull request as draft March 10, 2026 08:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant