-
Notifications
You must be signed in to change notification settings - Fork 383
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Experiments seem stuck when acquiring lock #273
Comments
UPDATE: I was wrong, the metrics were saved. And it has nothing to do with macOS because now I am having the same problem when running on Ubuntu (both the experiments and MongoDB, but different machines). Also, I am using Python 3.6. |
Ugh, this is an ugly one, and it is going to be difficult to track down. Just to be sure, could you check if the problem persists with the current
|
Interesting. It seems the issue is fixed in |
Yes. I had a bad way of dealing with the heartbeat thread, such that it would sometimes not exit when the experiment finished. This is probably also where it got stuck in your case. I just released |
I have |
Hi,
Firstly, I want to thank you for your amazing work. Sacred really, really helps me organizing and analyzing my experiments. Love it sooo much!
When I run several experiments in a row though, sometimes Sacred got stuck after an experiment is finished. Last time I waited for hours but it was still stuck. I had to manually press CTRL-C for the next experiment to start. And when I did that, this traceback occurred:
Also, the metrics for that run weren't saved. This always happens after ~7 experiments in a row. I am running and storing my experiments locally on my macOS 10.13. I am using Sacred 0.7.2, Pymongo 3.4.0, and Mongodb 3.6.4. I'm probably wrong but this might have something to do with macOS because I had no such issues when running on Ubuntu.
The text was updated successfully, but these errors were encountered: