Longhorn v1.6.0-dev-20231217
Pre-release
Pre-release
·
377 commits
to master
since this release
This is a sprint release for internal testing and development purposes only. DO NOT INSTALL or UPGRADE to any sprint releases, as this operation is not supported.
Handled issues in the sprint
- [BUG] Warning events are being spammed by Longhorn - CRD 7290 - @m-ildefons
- [TASK] Add linear dm device on the top of v2 volume 7357 - @derekbit
- [BUG] Persistent volume is not ready for workloads 6776 - @james-munson @roger-ryao
- [DOC] Fix erronous value for default StorageMinimalAvailablePercentage setting. 7342 - @james-munson
- [BUG] A race after a node reboot leads to I/O errors with migratable volumes 6961 - @yangchiu @ejweber
- [BUG] CSI components CrashLoopBackOff, failed to connect to unix://csi/csi.sock after cluster restart 7116 - @ejweber
- [IMPROVEMENT] Don't crash the migration engine when kubelet restarts 7302 - @ejweber
- [TASK] Ensure the Go version in Dockerfile.dapper matches go.mod for all Longhorn components 7341 - @ejweber
- [BUG] Kubelet cannot finish terminating a pod that uses a PVC with volumeMode: Block when restarting the node 6919 - @PhanLe1010
- [IMPROVEMENT] deploy: driver deployer shouldn't cleanup previous deployment if Kubernetes version changes 5474 - @PhanLe1010
- [IMPROVEMENT] BackingImage should be compressed when downloading and use the name as filename instead of UUID 7295 - @ChanYiLin
- [FEATURE] Allow kubectl drain to stop manually attached volumes 6978 - @ChanYiLin @chriscchien
- [IMPROVEMENT] Reconcile replica instance state after its associated block-type disk is ready 7338 - @derekbit
- [BUG] Helm2 install error: 'lookup' function not defined in validate-psp-install.yaml 6318 -
- [BUG] multiple "for-cloning-volume" snapshots created after cloning volume 5835 - @PhanLe1010 @chriscchien
- [IMPROVEMENT] Collect SPDK related info for the usage metrics 6033 - @c3y1huang
- [BUG] Test case
test_node_eviction
failed 7210 - @ejweber @roger-ryao - [TASK] Revisit volume automatic recover mechanism 6962 - @PhanLe1010
- [BUG] During volume live engine upgrade, delete replica with old engine image will make volume degraded forever 7012 - @PhanLe1010 @chriscchien
- [FEATURE] Add a new settings that allows Longhorn to evict replicas automatically when a node is drained 2238 - @ejweber
- [BUG] backingimage download server error 7288 - @ChanYiLin @scures
- [IMPROVEMENT] Allow deployment of Prometheus ServiceMonitor with the Longhorn helm chart 7041 - @mantissahz @chriscchien
- [TEST] Introduce Negative Testing 5240 - @yangchiu @khushboo-rancher
- [DOC] FS Trim for RWX is supported, but docs are out of date. 6733 - @james-munson
- [TASK] Collect coverage profiles for Go components during e2e test cases 5681 - @FrankYang0529
- [IMPROVEMENT] Reconcile engine/replica instance state of v2 volume like v1 volume 7326 - @derekbit
- [BUG] Backing image disk state unknown after unmount disk 6443 - @ChanYiLin
- [IMPROVEMENT] add build script to generate gRPC related code more convenient 6973 - @Vicente-Cheng
- [BUG] Backup NFS - Operation not permitted during mount 6114 - @mantissahz
- [BUG] Failed RWX mount due to connection timeout still happening 7301 - @james-munson
- [IMPROVEMENT] Disable CGO in longhorn components if not used 7135 - @derekbit
- [IMPROVEMENT] Environment setup and check tooling container 6657 - @derekbit
- [BUG] Cannot add block-type disk to node resource due to timeout error 7253 - @shuo-wu
- [IMPROVEMENT] Add test for longhorn-spdk-engine 6060 - @shuo-wu
- [IMPROVEMENT] Thread-safe SPDK JSON client 6106 - @shuo-wu
- [FEATURE] Support non-disruptive volume-related setting updates 7173 - @mantissahz
- [BUG] Test case
test_volume_reattach_after_engine_sigkill
failed 6751 - @c3y1huang - [FEATURE] Container-Optimized OS support 6165 - @c3y1huang
- [IMPROVEMENT] Blindly stop raid bdev exposure before exposing it for V2 volume 7324 - @derekbit
- [IMPROVEMENT] BackingImage UI improvement 7293 - @scures
- [BUG] Volume has 2 active engines at the same time that blocks the volume controller reconciliation loop 4827 - @PhanLe1010 @chriscchien @roger-ryao
- [BUG] Volume UI displays only the last backup when using the recurring job 2997 - @mantissahz @chriscchien @roger-ryao
- [BUG] Volume gets stuck in an unknown state forever if created in an engine not fully deployed environment 6131 - @yangchiu @PhanLe1010
- [BUG] Continuously auto-balancing replicas when zone does not have enough space 6671 - @yangchiu @c3y1huang @roger-ryao
- [REFACTOR] Abstract the disk/lvol file operations in backupstore 6576 - @derekbit @chriscchien
- [BUG]
backing-image-manager-
hostPath selection exception 7062 - @ChanYiLin @chriscchien - [IMPROVEMENT] Upgrade support bundle kit version to v0.0.33 7277 - @c3y1huang
- [BUG] GET error for volume attachment on node reboot 4188 - @PhanLe1010
- [BUG] MultiUnmapper floods logs with warnings about size mismatch. 6406 - @james-munson
- [TASK] Remove namespaceOverride from the chart values 6527 - @mantissahz @roger-ryao
- [IMPROVEMENT] Volumes: metrics for backups and snapshots include (count and size and type: system vs user), to better understand how the actual size is summed up. 5869 - @c3y1huang
- [TASK] Bump up the minimum supported Kubernetes version 7224 - @c3y1huang @roger-ryao
- [TASK] Implement xattr get and set operations on SPDK logical volumes (lvol) 6604 - @derekbit
- [DOC] Stress using object store as best practice for backups. 6773 - @james-munson
- [BUG] Errors found by static checker in volume controller 7009 - @m-ildefons
- [FEATURE] Have default priorityClass to prevent unexpected longhorn pods eviction 6528 - @mantissahz
- [BUG] Deadlock for RWX volume if an error occurs in its share-manager pod 7183 - @derekbit @chriscchien
- [TASK] Remove
namespaceOverride
from values.yaml in charts repo 7311 - @innobead - [FEATURE] Talos support 3161 - @c3y1huang
- [IMPROVEMENT] Bypass upgrade when installing a fresh setup 6988 - @mantissahz @roger-ryao
- [IMPROVEMENT] Replace deprecated grpc.WithInsecure 7291 - @c3y1huang
- [BUG] Download backing image failed with HTTP 502 error if Storage Network configured 7236 - @ChanYiLin
- [TASK] Investigate and update the Rancher chart annotation requirements 7024 - @PhanLe1010 @chriscchien
- [BUG] Enabling replica-auto-balance tries to replicate to disabled nodes causing lots of errors in the logs and in the UI 6508 - @c3y1huang @chriscchien
- [IMPROVEMENT] Gracefully shut down spdk_tgt 7263 - @derekbit
- [FEATURE] Single Node Disk affinity 3823 - @ejweber @roger-ryao
- [FEATURE] Restore BackingImage for BackupVolume in a new cluster 4165 - @ChanYiLin
- [BUG] Storage Class Configuration Not Applied to PV When Created via Web UI 6802 -
- [IMPROVEMENT] Upgrade support bundle kit version to v0.0.32 7152 - @c3y1huang @chriscchien
- [TEST] tests.test_basic.test_aws_iam_role_arn failed on SLES on 1.4.x (AMD) 6912 - @yangchiu
- [BUG] ShareManager pod is dying repeatedly until it starts eventually 7112 - @james-munson
- [IMPROVEMENT] Have a setting to disable snapshot purge for maintenance purpose 7075 - @ejweber @roger-ryao
- [IMPROVEMENT] Expose actual size of a logical volume 5947 - @DamiaSan
- [DOC] Run fsck.ext4 on newer Longhorn volume from older Linux distro 6859 - @ejweber @roger-ryao
- [BUG] Confusing logging when trying to attach a new volume with no scheduled replicas 7244 - @ejweber @chriscchien
- [BUG] Supported minimum K8s version not correct 7196 - @c3y1huang
- [BUG]
allow-collecting-longhorn-usage-metrics
setting is missing from chart settings 7050 - @ChanYiLin @yardenshoham @roger-ryao - [TASK] Enable s309x CI build back from maintenance 7259 - @Anarkis
- [IMPROVEMENT] Support custom options for network filesystems for backup 6608 - @james-munson @roger-ryao
- [BUG] DataEngineV2 Unable to attach a PV to a pod in the newer kernel 7190 - @DamiaSan
- [IMPROVEMENT] Global setting
default-data-path
supports block device 7234 - @derekbit @chriscchien - [BUG] Longhorn storage network is incompatible with Multus version above v4.0.0 6953 - @c3y1huang @chriscchien
- [FEATURE] Support Encryption for VolumeMode Block 4883 - @derekbit @roger-ryao
Changes since the previous sprint
longhorn/longhorn-engine
- ci(drone): remove privileged in build stage c672c3a30435e50d3922b739ac274e15c7e19da1 by @yangchiu
- refactor: replace deprecated grpc.WithInsecure 636a99aad3ba51dd44693a72da5b2d2353734399 by @c3y1huang
- remove s390 arch during drone workers maintenance 014125a20fd26e78be607365b6d090fe93d1f00f by @thomasferrandiz
longhorn/longhorn-manager
- ci: fix invalid version validation to support sprint release 8567455620e53fa1c9f6e3a79e4eb3ab23ffe9e0 by @innobead
- refactor(metrics_collector): replace abbreviations 51524e0ace25b472fd369a2f182b9b543586b5ae by @c3y1huang
- improvement: driver deployer shouldn't cleanup previous deployment if Kubernetes version changes c48d06067f735d8768fd2d9d7f0efb0570a63d61 by @PhanLe1010
- Allow kubelet to be down for 10 seconds before responding ae6b6aa99d0e2eb12fd0d868f5e67105bb6b685c by @ejweber
- refactor: move non-entry app functions away from app pkg ac3a130101fc0a660e0ea57dc8d73a00b02e4e3b by @innobead
- feat(telemetry): collect SPDK underlying device type 27101bb1ad4c287d2957003b51189da6ffa74cff by @c3y1huang
- feat(telemetry): collect volume backend store driver e716691e52ad1e7fcd319730f4e8748df5d35adb by @c3y1huang
- feat(telemetry): collect Longhorn disk type 420568893dcf34fca451336e5121a0f8924f3418 by @c3y1huang
- doc: add comment for Longhorn disk types 72fc244be57b223614d14fed4dbe93af777824e6 by @c3y1huang
- refactor: correcting variable names fae1cda479d501e2d595d7d74f43a9503250dc7e by @c3y1huang
- feat(node drain): add DetachManuallyAttachedVolumesWhenCordoned setting 5d5ee68508f8eff8f0fff39e49ab1cb66c1c7500 by @ChanYiLin
- fix: scope of err variable while migrating volume 268b1d63b9570e750bc13e011f475a17e6878041 by @m-ildefons
- fix(replica-auto-balance): loop when node has no storage available dccf8de02f67db2f0c78a72e83f81d859e0e7a13 by @c3y1huang
- fix: skip upgrade path for new installation a074cd96678c01b828c4d9ac95aeedc1d2f57e11 by @mantissahz
- refactor: remove duplicated ready node filtering 6b207162fb021b809927133019cbcc589646a9ec by @c3y1huang
- fix(replica-auto-balance): loop when node has no schedulable disk d4700d32d656a560712c27b3212b19224eda01e1 by @c3y1huang
- Use the share-manager pod owner node for volume attachments. 266210f7c2ac6e4d8b80381d25daa673dd9577d5 by @james-munson
- Disable purging in snapshot controller for disable-snapshot-purge setting 51106c8b05d59820f043f6767399db7e920a49af by @ejweber
- Disable purging via API for disable-snapshot-purge setting 8f12da1ca16a0557aa7320286487f5bbb236664b by @ejweber
- Add disable-snapshot-purge setting and validation 30eebb8ca378dc08a9c71f777f7609169233449b by @ejweber
- Return a clear error when attaching with no scheduled replicas 75e44368278f00967dbb7361e47aa247db25a505 by @ejweber
- ci(drone): remove privileged in build stage 4c841f73f8cd1cae5be491d0852645aee4da650c by @yangchiu
- remove s390 arch during drone workers maintenance a09908dab732d514a8750dd8ead776129c92838e by @thomasferrandiz
longhorn/longhorn-ui
- build(deps-dev): bump browserify-sign from 4.2.1 to 4.2.2 ed5d0a3fd058f7cb045d0770dc5650f14c408edf by @dependabot[bot]
- fix: case mismatch of status conditions c3cfb8f79ea28d402129c30112bd616c6507dba9 by @m-ildefons
- fix: display volume conditions 6572cfd6d3ef89b5abd487c371ca8210f304a412 by @m-ildefons
- remove s390 arch during drone workers maintenance 3b154752549fd827f134728d7c914558b1a2bd15 by @yangchiu
longhorn/longhorn
- feat(telemetry): add LonghornVolumeBackendStoreDriver(V1/V2)Count a91f7966185ae0d5e19e1c2946ac551163bb0ebc by @c3y1huang
- feat(telemetry): add LonghornDisk(Filesystem/Block)Count f3286f524b877867aac5be65637f2ba46c02baad by @c3y1huang
- Release v1.5.3 and bump memmory request/limit for upgrade responder f44cdd64363f9fd1ff86d9e396bfc7488e5f05a4 by @PhanLe1010
- ci: add created automation test issue to qa project (#7229) bac6446821cc53db8358a4c0665fea1b01643db7 by @yangchiu
- ci(gh-action): add github action for automatically creating ui ticket 8014c44f25b33c70407b595205ce1e257b12c6ba by @yangchiu
- Update MAINTAINERS 97e8c77229a77e19a1f832017d3b2c4006610224 by @innobead
- fix(cve): support-bundle-kit 875a5f2cbca335a59dcd4fe6d95d7f1ca267c1a3 by @c3y1huang
- Add LEP for AD refactoring bff5fe520783d52092d9ebd8b5a1774138a4c215 by @PhanLe1010
- feat(lep): default priority class d183f643cd528cc372c0803c7a3ca9159e22a719 by @mantissahz
- doc: update chart/README.md 41f2fd88438cca50b77e36f37a5a0d781cbf7c9a by @mantissahz
- Update refactor.md c989803ff5761e0257182960a041edbb6c20cdff by @innobead
- Update ci.md 466612533570f6957a881891ace5da1fa3ac1593 by @innobead
- Create ci.md 9918a89461825632abf980dd1c96628f761db117 by @innobead
- Add ServiceMonitor in helm chart (#6157) 69739d8a63231934c5f7bb0c70619130be25fe23 by @enrico9034
- chore(setting): keep snapshots for the backup 69c456a63554d631374e6af7d38994c4be75db09 by @mantissahz
- Revert "build: bump up the minimum supported Kubernetes version" 7624ed554e61e645424f12b4392085d16567eea4 by @c3y1huang
- fix: remove the namespaceOverride value 722642abc5fc2cda610975c89a3464189addc5ac by @mantissahz
- feat(node drain): add DetachManuallyAttachedVolumesWhenCordoned setting 5f453051052ea3c75a707031621a0bc8f29e8e23 by @ChanYiLin
- Add disableSnapshotPurge Helm variable 6627b0c55c15adb0831546ec2412ce2623462ce7 by @ejweber
- build: bump up the minimum supported Kubernetes version 6ad473c1b68da13da8337489eefcc5cab26dd891 by @c3y1huang
- Run
./scripts/helm-docs.sh
2704917cb6ff631cac6e646e354866732c00e2d7 by @yardenshoham - chart: Add "Allow Collecting Longhorn Usage Metrics" to settings 1d964865b8162cc5b1c7f56f2e43ccf19a24a790 by @yardenshoham
longhorn/longhorn-share-manager
- Disable CGO support 3d0816dc76cb23a4d46175ceeb5233371c3f89bc by @derekbit
- ci(drone): remove privileged in build stage 69e0cdf3e8ee337c1980dc3cc4ffc49d884bf010 by @yangchiu
- remove s390 arch during drone workers maintenance 517e42ea86698ff69966fe08eb34fcba58b1d533 by @thomasferrandiz
longhorn/backing-image-manager
- refactor: replace deprecated grpc.WithInsecure c6cc6d2e0e41cb7a93ac182d61af49e4e793046d by @c3y1huang
- ci(drone): remove privileged in build stage 5b3fa35867067bfa4f75235e80268073cc4c4126 by @yangchiu
- build(deps): bump google.golang.org/grpc from 1.58.1 to 1.58.3 d00e7378800d3191e8301b6233a772875a6b497d by @dependabot[bot]
- remove s390 arch during drone workers maintenance 49eef3a5ce8ee4dabf08e951e95271a1fe47fecf by @yangchiu
- fix: add timeout parameter for export volume 5d90d3c608e1765771d8f7aa55a58c6de1f6250e by @FrankYang0529
- feat(proto): upgrade to v24.3 f8ccb0c2721e7babd2ef61ed521af053a8f2536b by @FrankYang0529
- feat: upgrade packages version fcd19e6bd5ff419f9a02e4b6b832ae9e73264ddc by @FrankYang0529
longhorn/longhorn-instance-manager
- feat(proto): upgrade to v24.3 0d573243a775ceccd4228796596f64091d06b091 by @FrankYang0529
- feat: upgrade packages version a78f734aa7d087c1c198240d3696ce795728bcfc by @FrankYang0529
- build(deps): bump google.golang.org/grpc from 1.58.1 to 1.58.3 7c41635e05f142e854b8d8d7d9fd1f50a15d8733 by @dependabot[bot]
- vendor: update dependencies 86608edd1948e979a3a4fe03a5ed535b491247fd by @derekbit
- refactor: replace deprecated grpc.WithInsecure 4e7bee4eda9f78a21cb0199b92b5402ab93c10c1 by @c3y1huang
- build(deps): bump google.golang.org/grpc from 1.53.0 to 1.56.3 b081a58ec546b2c8d6ab7c6b4a4bb420d27536e8 by @dependabot[bot]
- ci(drone): remove privileged in build stage bed512f7bd21d2946995975df4688eafa150124c by @yangchiu
- remove s390 arch during drone workers maintenance 5170baa133b22ec91896ddf79a2b52b3bc748c1b by @thomasferrandiz
- vendor: update go-spdk-helper and longhorn-spdk-engine 5652de640e48b47b73a3250941ba7ef92303b491 by @derekbit