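Longhorn v1.8.1-rc2 Release Notes
Release date: 2025-02-26
Version: v1.8.1-rc2
This is the second release candidate (rc2) for Longhorn v1.8.1. It focuses on bug fixes and a small number of improvements. Note: upgrading from or to any RC/Preview/Sprint release is not supported.
Improvements:
- Support for a configurable upgrade-responder URL.
- Settings change validation goes back to using the Volume state to determine whether all volumes are detached.
- Fixes for several warnings of unknown cause.
Key issues fixed:
- Data and storage: replica rebuild progress showing 99 instead of 100, the backup store setting not being applied to cloned volumes, V2 volumes looping between detaching and attaching when a previous replica was not cleaned up correctly, the checksum of a V2 volume created with a backing image changing after replica rebuilding, and a snapshot count enforcement bug that left volumes faulted.
- Stability and compatibility: V2 backing images failing after a node reboot, long-running backups preventing workload pods from moving to a new node, the CSI plugin performing a wrong filesystem format in a rare race condition, and a Backup Execution Timeout setting issue in the Helm chart.
- UI and operations: bulk backup creation returning an error for detached volumes, the 'Create' button on the System Backup page being disabled after a page reload, volumes disappearing and reappearing in the volume list, and the version number missing from the web UI.
- Other fixes: CVE fixes, inconsistent output formats from the proxy gRPC API ReplicaList, and fewer metrics collection errors.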
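Overall, these fixes improve data reliability, operational stability, and user experience. Users are advised to upgrade to avoid the known issues, keeping in mind that upgrading to an RC build such as this one is not supported.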
Release notes (original)
DON’T UPGRADE from/to any RC/Preview/Sprint releases because the operation is not supported.
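The restriction above can be turned into a simple pre-flight check. The following is a minimal sketch, not part of Longhorn: the function names are hypothetical, and only the "-rc<N>" suffix is confirmed by this release's own tag (v1.8.1-rc2). The "-preview" and "-sprint" suffix patterns are assumptions about how such builds are tagged.

```python
import re

# Assumed tag suffixes. Only "-rc<N>" is confirmed by the tag v1.8.1-rc2;
# "-preview" and "-sprint" are guesses at how those builds are named.
_PRERELEASE = re.compile(r"-(rc|preview|sprint)\d*$", re.IGNORECASE)


def is_prerelease(tag: str) -> bool:
    """Return True if the tag looks like an RC/Preview/Sprint build."""
    return _PRERELEASE.search(tag.strip()) is not None


def upgrade_supported(current: str, target: str) -> bool:
    """An upgrade is unsupported if either end is a pre-release build."""
    return not (is_prerelease(current) or is_prerelease(target))


if __name__ == "__main__":
    for src, dst in [("v1.8.0", "v1.8.1"), ("v1.8.0", "v1.8.1-rc2")]:
        verdict = "supported" if upgrade_supported(src, dst) else "NOT supported"
        print(f"{src} -> {dst}: {verdict}")
```

A check like this only inspects version strings. It does not replace the upgrade instructions in the official Longhorn documentation.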
Resolved Issues in this release
Improvement
- [BACKPORT][v1.8.1][IMPROVEMENT] Support configurable upgrade-responder URL 10439 - @derekbit @roger-ryao
- [BACKPORT][v1.8.1][IMPROVEMENT] Several warnings for unknown reasons 10420 - @roger-ryao
- [BACKPORT][v1.8.1][IMPROVEMENT] Settings change validation should go back to using Volume state to determine “are all volumes detached” 10376 - @yangchiu @james-munson
Bug
- [BACKPORT][v1.8.1][BUG] When replica rebuilding completed, the progress could be 99 instead of 100 10485 - @shuo-wu @chriscchien
- [BACKPORT][v1.8.1][BUG] list_backupVolume API could randomly return “failed to find a node that is ready and has the default engine image” error 10478 - @yangchiu @mantissahz
- [BACKPORT][v1.8.1][BUG] nil pointer when the backing image copy is deleted from the spec but also gets evicted at the same time 10466 - @yangchiu @ChanYiLin
- [BACKPORT][v1.8.1][BUG] 2 uninstall pods could be created after uninstall job was created, one failed with “deleting-confirmation-flag is set to false” error, while the other completed successfully 10484
- [BACKPORT][v1.8.1][BUG][UI] Backup store setting doesn’t apply to the cloned volume 10468 - @yangchiu @mantissahz
- [BACKPORT][v1.8.1][BUG] v2 volume workload FailedMount with message “Staging target path /var/lib/kubelet/plugins/kubernetes.io/csi/driver.longhorn.io/xxx/globalmount is no longer valid” 10477
- [BACKPORT][v1.8.1][BUG][UI] Bulk backup creation with a detached volume returns error 405 and error messages show in browser console 10462 - @mantissahz
- [BACKPORT][v1.8.1][BUG] V2 volume fails to cleanup error replica and rebuild new one - test_data_locality_basic 10364 - @shuo-wu @chriscchien
- [BACKPORT][v1.8.1][BUG] Data loss caused by Longhorn CSI plugin doing a wrong filesystem format action in a rare race condition 10418 - @yangchiu @PhanLe1010
- [BACKPORT][v1.8.1][BUG] v2 Engine loops in detaching and attaching state after rebuilding 10397 - @shuo-wu
- [BACKPORT][v1.8.1][BUG] A V2 volume checksum will change after replica rebuilding if the volume created with backing image 10341 - @shuo-wu @chriscchien
- [BACKPORT][v1.8.1][BUG] Bug in snapshot count enforcement causes volume to fault and get stuck in detaching/attaching loop 10309 - @PhanLe1010 @roger-ryao
- [BACKPORT][v1.8.1][BUG] Test case test_csi_mount_volume_online_expansion is failing due to unable to expand PVC 10414 - @yangchiu @c3y1huang
- [BACKPORT][v1.8.1][BUG] V2 BackingImage failed after node reboot 10343 - @ChanYiLin @chriscchien
- [BACKPORT][v1.8.1][BUG] Workload pod will not be able to move to new node when backup operation is taking a long time 10172 - @PhanLe1010 @chriscchien
- [BACKPORT][v1.8.1][BUG] WebUI Volumes Disappear and Reappear 10332 - @PhanLe1010 @chriscchien @houhoucoop
- [BACKPORT][v1.8.1][BUG] “Error get size” from “metrics_collector.(*BackupCollector).Collect” on every metric scrape 10361 - @derekbit @chriscchien
- [BACKPORT][v1.8.1][BUG][UI] ‘Create’ button on the System Backup page is disabled after reloading page 10354 - @chriscchien @houhoucoop
- [BACKPORT][v1.8.1][BUG] Proxy gRPC API ReplicaList returns different output formats for v1 and v2 volumes 10353 - @shuo-wu @roger-ryao
- [BACKPORT][v1.8.1][BUG] constant attaching/reattaching of volumes after upgrading to 1.8 10315 - @james-munson
- [BACKPORT][v1.8.1][BUG] Backup Execution Timeout setting issue in Helm chart 10325 - @james-munson @chriscchien
- [BACKPORT][v1.8.1][BUG] v2 engine stuck in detaching-attaching loop if the previous replica is not cleaned up correctly 10363 - @shuo-wu @chriscchien
- [BACKPORT][v1.8.1][BUG] Longhorn CSI plugin 1.8.0 crashes consistently when trying to create a snapshot 10319 - @PhanLe1010 @chriscchien
- [BACKPORT][v1.8.1][BUG] Engine stuck in “stopped” state, preventing volume attach 10329 - @ChanYiLin @chriscchien
- [BACKPORT][v1.8.1][BUG] After upgrading to v1.8.0 the version number lost on the web-ui 10337 - @derekbit
- [BACKPORT][v1.8.1][BUG] insufficient storage; precheck new replica failed after a temporary shutdown of a node 10234 - @PhanLe1010
Misc
- [TASK] Fix CVE issues for v1.8.1 10318 - @c3y1huang
Contributors
- @ChanYiLin
- @PhanLe1010
- @c3y1huang
- @chriscchien
- @derekbit
- @houhoucoop
- @innobead
- @james-munson
- @mantissahz
- @roger-ryao
- @shuo-wu
- @yangchiu