Backup to B2 has tons of rclone rcat and rclone put errors and it never finishes #683
-
Creating a discussion because I'm not quite sure this is a bug in rustic, but need some direction on how to narrow this down. I'm running a backup of several ZFS datasets to Backblaze B2. I have backed up a smaller ~17G dataset without any issues. I am now trying to back up a 1.08T dataset and running into some issues. Setup:
I have created separate buckets in b2 for each of my ZFS datasets. Figured this would allow for easier management and more flexible restore options in the future. I have a script that runs each backup synchronously, so there is only one backup happening with rustic at a time. I wanted to prevent issues with the files changing mid backup so I decided to use the ZFS snapshot as the source for my backup. My script queries ZFS to get the read-only mountpoint for the latest daily ZFS snapshot. I then call rustic with the --ignore-devid and I set the --as-path to reflect the actual dataset path, not the snapshot path. This allows rustic to properly recognize the parent of subsequent backups even though the snapshot path is different every time a backup runs. For my smaller ~17G dataset, the backup is successful and subsequent backups correctly identify changed files, etc. However, when running a backup of my ~1.08T dataset, it runs smooth for a few days and then starts spitting out a ton of rclone errors. Now, my backup has been running for 14 days and my current B2 bucket size is actually larger than the source dataset being backed up (the bucket was empty at the beginning of this backup run). My bucket size is now ~1.15T while the data to be backed up from the source is only ~1.08T as mentioned before. Even though it seems there is a more data in the bucket than in the source at this point, the backup is still running and the errors are still being thrown. The following are the configuration and run time details for my backup run (x's replacing sensitive information): Rustic Profile toml: [global]
log-level = "info"
no-progress = true
[repository]
repository = "rclone:Backblaze:XXXXX" <-XXXXX = BUCKET NAME
password-file = "XXXXXXX"
no-cache = true
[forget]
keep-daily = 8
keep-weekly = 5
keep-monthly = 13
keep-yearly = 10
[backup]
exclude-if-present = [".nobackup", "CACHEDIR.TAG"] rclone configuration: [Backblaze]
type = b2
account = XXXXXXXXXXXXXXXXXXXXX
key = XXXXXXXXXXXXXXXXXXXXX
hard_delete = true Errors found in logs. This is just a few lines. I can provide more if necessary: [INFO] rclone output: <3>ERROR : data/fd/fd0564df471ff3802e4e3e42b2df4bc7b928ceb9cc927caff1194e331d2f25c3: Post request put error: Post "https://pod-050-1001-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501001_t0045": context canceled
[INFO] rclone output: <3>ERROR : data/fd/fd0564df471ff3802e4e3e42b2df4bc7b928ceb9cc927caff1194e331d2f25c3: Post request rcat error: Post "https://pod-050-1001-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501001_t0045": context canceled
[WARN] Error error sending request for url (http://127.0.0.1:44079/data/fd0564df471ff3802e4e3e42b2df4bc7b928ceb9cc927caff1194e331d2f25c3): operation timed out at 3.305898633s, retrying
[INFO] rclone output: <3>ERROR : data/fd/fd0564df471ff3802e4e3e42b2df4bc7b928ceb9cc927caff1194e331d2f25c3: Post request put error: Post "https://pod-050-1005-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501005_t0027": context canceled
[INFO] rclone output: <3>ERROR : data/fd/fd0564df471ff3802e4e3e42b2df4bc7b928ceb9cc927caff1194e331d2f25c3: Post request rcat error: Post "https://pod-050-1005-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501005_t0027": context canceled
[WARN] Error error sending request for url (http://127.0.0.1:44079/data/1d9c19c83688d4446ae7678d95086da765a6fd7e2a0c3c643aa2e391f542b74b): operation timed out at 654.849972ms, retrying
[INFO] rclone output: <3>ERROR : data/1d/1d9c19c83688d4446ae7678d95086da765a6fd7e2a0c3c643aa2e391f542b74b: Post request put error: Post "https://pod-050-1005-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501005_t0027": context canceled
[INFO] rclone output: <3>ERROR : data/1d/1d9c19c83688d4446ae7678d95086da765a6fd7e2a0c3c643aa2e391f542b74b: Post request rcat error: Post "https://pod-050-1005-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501005_t0027": context canceled
[WARN] Error error sending request for url (http://127.0.0.1:44079/data/1d9c19c83688d4446ae7678d95086da765a6fd7e2a0c3c643aa2e391f542b74b): operation timed out at 518.998993ms, retrying
[INFO] rclone output: <3>ERROR : data/1d/1d9c19c83688d4446ae7678d95086da765a6fd7e2a0c3c643aa2e391f542b74b: Post request put error: Post "https://pod-050-1001-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501001_t0045": context canceled
[INFO] rclone output: <3>ERROR : data/1d/1d9c19c83688d4446ae7678d95086da765a6fd7e2a0c3c643aa2e391f542b74b: Post request rcat error: Post "https://pod-050-1001-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501001_t0045": context canceled
[WARN] Error error sending request for url (http://127.0.0.1:44079/data/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552): operation timed out at 470.218131ms, retrying
[INFO] rclone output: <3>ERROR : data/e3/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552: Post request put error: Post "https://pod-050-1001-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501001_t0045": context canceled
[INFO] rclone output: <3>ERROR : data/e3/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552: Post request rcat error: Post "https://pod-050-1001-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501001_t0045": context canceled
[WARN] Error error sending request for url (http://127.0.0.1:44079/data/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552): operation timed out at 1.078056185s, retrying
[INFO] rclone output: <3>ERROR : data/e3/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552: Post request put error: Post "https://pod-050-1005-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501005_t0027": context canceled
[INFO] rclone output: <3>ERROR : data/e3/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552: Post request rcat error: Post "https://pod-050-1005-01.backblaze.com/b2api/v1/b2_upload_file/583356dd947c7f8f80820613/c005_v0501005_t0027": context canceled
[WARN] Error error sending request for url (http://127.0.0.1:44079/data/e31271d79d989248835f8511d033a096b27a6fbd70a1692b985a7a1322c39552): operation timed out at 901.616388ms, retrying Thanks in advance for any guidance you can provide! Been googling and cannot find any concrete answers. |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 33 replies
-
Thanks for opening this discussion! Besides that, here are some thoughts:
|
Beta Was this translation helpful? Give feedback.
-
I did dig a bit into the retry/timeout topic. Actually there are three things:
|
Beta Was this translation helpful? Give feedback.
-
Great! Did you see some retries in your log?
Can you give the output of |
Beta Was this translation helpful? Give feedback.
-
Yes.
repository files
And if I run rustic snapshots I get: snapshots for (host [Kraken], label [], paths [/naspool/Elnaz])
|
Beta Was this translation helpful? Give feedback.
-
As #720 has been just merged and there seems to be no open questions, I'll mark this discussion as answered. |
Beta Was this translation helpful? Give feedback.
As #720 has been just merged and there seems to be no open questions, I'll mark this discussion as answered.
Thanks again @RobTheBuilder2 for the intensive testing!
If you could create a PR putting your configuration in the
config
dir, I would very much appreciate that!