Fix CI: apt update on runner, file lock race condition #341
mikekryjak wants to merge 6 commits into master
Was erroring out on installing libcurl4-openssl-dev before.
…start Fixes CI hang on Python >= 3.13
Cause of the next hang locally. This is a different file than last time, which suggests that many tests may need to be fixed. For now, I am pushing this to see if it is enough.
The tests were still failing locally, but intermittently. I made a bash script which loops the tests until they fail, and then prints a stack trace. I found another cause in another test. I also cherry-picked @dschwoerer's timeout and stack trace from 62ab549. While the CI continues, I will keep looping the tests locally to see if I can find more. If I do, I will make all file loads safe in this test file. There is still the mystery of why it fails every time on CI but only intermittently locally. My LLM thinks it could be because the runners are slow, which could make timing and file locking issues worse.
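For anyone who wants to reproduce the looping approach, here is a minimal sketch of the idea. It is written in Python rather than bash and is not the script used in this PR; the function name `run_until_failure` and its parameters are hypothetical. It reruns a command until it exits non-zero or exceeds a timeout, then returns the captured output of that run.

```python
import subprocess
import sys


def run_until_failure(cmd, max_runs=100, timeout=600):
    """Rerun `cmd` until it fails or hangs.

    Returns (run_number, reason, output) for the first bad run,
    or None if all `max_runs` runs pass. Hypothetical helper.
    """
    for run in range(1, max_runs + 1):
        try:
            proc = subprocess.run(
                cmd, capture_output=True, text=True, timeout=timeout
            )
        except subprocess.TimeoutExpired as exc:
            # The child was killed after `timeout` seconds: treat as a hang.
            out = exc.stdout or ""
            if isinstance(out, bytes):
                # Partial output can arrive as bytes even with text=True.
                out = out.decode(errors="replace")
            return run, "timeout", out
        if proc.returncode != 0:
            # Ordinary failure: keep both streams for inspection.
            return run, "failed", proc.stdout + proc.stderr
    return None


# Hypothetical usage (the test path is an assumption about the checkout):
# result = run_until_failure(
#     [sys.executable, "-m", "pytest", "-x", "test_boutdataset.py"],
#     max_runs=50,
#     timeout=300,
# )
# if result is not None:
#     run, reason, output = result
#     print(f"run {run}: {reason}\n{output}")
```

The timeout matters here because the symptom is a hang rather than a failed assertion; without it the loop would simply stall on the first bad run.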
The first error is:
I tried to follow the error advice and add `sudo apt-get update`, which seems to resolve the same issue in xHermes. This PR adds it to xBOUT CI.

The second issue is where the tests hang on `test_boutdataset.py::TestSaveRestart::test_to_restart`. I reproduced the CI environment locally and reproduced the issue. Thanks to @dschwoerer's stack trace debug and the help of an LLM, I was then able to narrow this down to an unsafely loaded dataset in that test. This resolves the issue on my end at least.
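As a rough illustration of how a stack trace can be obtained from a hung test process, the standard library's `faulthandler` module can dump every thread's traceback after a delay. This is only a sketch of the general technique, not the code from 62ab549; the helper names `arm_hang_watchdog` and `disarm_hang_watchdog` are hypothetical.

```python
import faulthandler
import sys


def arm_hang_watchdog(timeout_s=300.0, stream=None):
    """Dump all thread tracebacks to `stream` if still running after `timeout_s`.

    `stream` must be a real file with a file descriptor; defaults to stderr.
    Hypothetical helper for illustration only.
    """
    if stream is None:
        stream = sys.stderr
    # exit=False: only report the hang, leave the process running.
    faulthandler.dump_traceback_later(timeout_s, repeat=False, file=stream, exit=False)


def disarm_hang_watchdog():
    """Cancel a pending dump, e.g. once the test has finished normally."""
    faulthandler.cancel_dump_traceback_later()
```

Arming the watchdog before a suspect test and disarming it afterwards shows which call the process is blocked in, which is the kind of information that points at a file load waiting on a lock.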