
Constant "downloading" #138

Closed
sprig opened this issue May 8, 2020 · 4 comments
Labels: bug (Something isn't working)

Comments

sprig commented May 8, 2020

In the same setup as #133 (modified 0.6.4 from git on Ubuntu 19.10, for anyone else reading this), maestral now appears to be constantly in a "downloading X/600K files" state after the initial download. Reading the logs, I mostly see that the files are resolved as unchanged. I can't paste the log at the moment, but the vast majority of files state that they have "Equal Content Hashes".

I have seen it reach very low numbers, once below 10k remaining files, but when I come back to the window, maestral appears to be syncing all over again. Since I have a rather large library, this means that the sync (if it even actually happens) takes over a day, which is rather long.

Is this normal/expected? From https://github.com/SamSchott/maestral/blob/09417eee2ba5e1d9bfb3a42ffe90a1e13d09ff34/maestral/sync.py#L1959 it seems like this is by design, but I'm not certain.

If it is indeed normal, I'm also curious about the reasoning - I would have expected "normal" syncs to be quick, after the initial slow bulk download.
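(For context on the "Equal Content Hashes" log messages: Dropbox documents its content-hash scheme as the SHA-256 of each 4 MB block of a file, concatenated, then hashed again with SHA-256. A minimal sketch of that check, written independently here rather than taken from maestral's code:)

```python
import hashlib

BLOCK_SIZE = 4 * 1024 * 1024  # Dropbox hashes files in 4 MB blocks


def dropbox_content_hash(path):
    """Compute a Dropbox-style content hash: SHA-256 of each 4 MB
    block, with the block digests concatenated and hashed again."""
    overall = hashlib.sha256()
    with open(path, "rb") as f:
        while True:
            block = f.read(BLOCK_SIZE)
            if not block:
                break
            overall.update(hashlib.sha256(block).digest())
    return overall.hexdigest()
```

Comparing this hash for a local file against the `content_hash` reported by the API is how a sync client can skip files that are already up to date.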

samschott (Owner) commented May 8, 2020

This is definitely not by design! The method get_remote_folder is only called during the initial sync. Any future syncs will only apply changes which are provided by the Dropbox API. Additionally, a complete resync is performed once per week but you can modify this interval in the config file.

I suspect that this is a similar issue as with indexing: A connection timeout causes the downloads to be interrupted and to eventually restart from the beginning. The log output would be helpful to see which API call exactly is problematic.

I have so far chosen the easy route of not saving any download state on connection errors or crashes but just restarting from the last successful sync. This is typically not a problem because already downloaded files will be skipped if their content hashes are equal. However, it becomes problematic if the initial sync never completes. There are in principle two solutions: First, retry a certain number of times on timeouts for all API calls before giving up. Second, save the current state on a connection error and resume from there if not too much time has elapsed.
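(The first of those two approaches, bounded retries on timeouts before giving up, could look roughly like the sketch below. This is illustrative only; the function names are hypothetical and not maestral's actual API.)

```python
import time


def call_with_retries(func, *args, max_retries=3, backoff=2.0, **kwargs):
    """Retry a flaky network call a bounded number of times before
    giving up, instead of restarting the whole sync from scratch.

    Illustrative sketch only: `func` stands in for an arbitrary
    API call that may raise TimeoutError on a slow connection.
    """
    for attempt in range(max_retries + 1):
        try:
            return func(*args, **kwargs)
        except TimeoutError:
            if attempt == max_retries:
                raise  # out of retries, surface the error to the caller
            time.sleep(backoff * (attempt + 1))  # linear backoff between attempts
```

The second approach (persisting the download cursor on a connection error and resuming later) avoids redoing the skipped-file checks entirely, at the cost of having to decide when a saved state is too stale to trust.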

samschott (Owner) commented May 8, 2020

I had hoped increasing the timeout from 60 to 100 sec in v1.0.0 would take care of most problems due to a slow internet connection or a slowly responding API.


sprig commented May 8, 2020

Thanks for the explanation! I'll retry the sync and post the logs.

> I had hoped increasing the timeout from 60 to 100 sec in v1.0.0 would take care of most problems due to a slow internet connection or a slowly responding API.

I don't know; 60 seconds is quite a long time to respond as it is.

@samschott samschott added the bug Something isn't working label Jul 16, 2020
@samschott samschott mentioned this issue Jan 31, 2021
samschott (Owner) commented

Closing since this has been addressed in v1.4.0. Indexing now resumes from where it was interrupted.
