Hi,
I have implemented this spreadsheet benchmark within the Prime Intellect environments.
I based the implementation on the code from this GitHub repo. I would really appreciate it if you could review it and point out any inconsistencies or necessary adjustments.
Here is the implementation PR: PrimeIntellect-ai/community-environments#287
Thanks