Checkpointing lost if BOINC shut down
log in

Advanced search

Message boards : CPU : Checkpointing lost if BOINC shut down

Author Message
Yavanius
Avatar
Send message
Joined: 18 Nov 16
Posts: 42
Credit: 454,617
RAC: 0
Message 85 - Posted: 26 Nov 2016, 21:32:09 UTC

Hi Vlad,

It appears if the client is shut down, the checkpointing is lost. I tried suspending one WU and repeating this and the progress was lost for that too.

Sofar as I can see, if the client (or WU) is simply paused, this doesn't affect them.

I've heard of this occurring on other projects before. Offhand, I don't recall what the solution was. If you aren't able to figure out, you might try contacting Dave Anderson at BOINC.

~Y

Vlad
Project administrator
Project developer
Project tester
Project scientist
Help desk expert
Send message
Joined: 26 Oct 16
Posts: 322
Credit: 103,382
RAC: 0
Message 91 - Posted: 27 Nov 2016, 9:24:25 UTC - in response to Message 85.

Hi Yavanius,

The app does not make any checkpoints. It's hard to implement, a lot of data should be stored including the full atomic ensemble and the histograms of interatomic distances (hundreds of megabytes for the large WUs). I think, the checkpointing is unnecessary because the execution time is not very large.

Yavanius
Avatar
Send message
Joined: 18 Nov 16
Posts: 42
Credit: 454,617
RAC: 0
Message 96 - Posted: 28 Nov 2016, 7:00:12 UTC - in response to Message 91.

Hi Yavanius,

The app does not make any checkpoints. It's hard to implement, a lot of data should be stored including the full atomic ensemble and the histograms of interatomic distances (hundreds of megabytes for the large WUs). I think, the checkpointing is unnecessary because the execution time is not very large.



True. But just make a sticky note to remind you to make sure you implement that in the future before the new launch. ;)

Message boards : CPU : Checkpointing lost if BOINC shut down


Main page · Your account · Message boards


© 2021 Vladislav Neverov (NRC 'Kurchatov institute'), Nikolay Khrapov (Institute for Information Transmission Problems of RAS)