Skip to content

Conversation

@nixpanic
Copy link
Member

@nixpanic nixpanic commented Oct 15, 2025

Recent versions of Rook use Ceph-CSI Operator by default. There is no need to have Ceph-CSI deployed by Rook, as our CI deploys it from the PR that is being tested.

Rook v1.18.4 works well with Ceph Squid (v19). But for NVMe-oF testing we need a Ceph Tentacle (v20) cluster.


Show available bot commands

These commands are normally not required, but in case of issues, leave any of
the following bot commands in an otherwise empty comment in this PR:

  • /retest ci/centos/<job-name>: retest the <job-name> after unrelated
    failure (please report the failure too!)

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@mergify mergify bot added the component/testing Additional test cases or CI work label Oct 15, 2025
nixpanic added a commit to nixpanic/ceph-csi that referenced this pull request Oct 15, 2025
The devel branch will use a newer version of Rook and Ceph in the near
future. That requires the images to be mirrored in the CI registry.

See-also: ceph#5672
Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic nixpanic added the ci/skip/multi-arch-build skip building on multiple architectures label Oct 15, 2025
@nixpanic
Copy link
Member Author

nixpanic commented Oct 15, 2025

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

1 similar comment
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic changed the title ci: use Rook 1.18.4 ci: use Rook 1.18.4 and Ceph Tentacle Oct 15, 2025
mergify bot pushed a commit that referenced this pull request Oct 16, 2025
The devel branch will use a newer version of Rook and Ceph in the near
future. That requires the images to be mirrored in the CI registry.

See-also: #5672
Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from c2dabc9 to 03040f1 Compare October 16, 2025 09:07
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.33

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 03040f1 to 870215d Compare October 16, 2025 09:54
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 870215d to 025e47a Compare October 16, 2025 10:13
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 025e47a to a40f06c Compare October 16, 2025 11:19
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from a40f06c to 9b9119c Compare October 16, 2025 11:59
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

1 similar comment
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 960e97e to 4b7516f Compare October 16, 2025 14:44
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 4b7516f to 48a4306 Compare October 16, 2025 14:59
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

1 similar comment
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from b919ccd to 96abede Compare October 16, 2025 15:49
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

Recent versions of Rook use Ceph-CSI Operator by default. There is no
need to have Ceph-CSI deployed by Rook, as our CI deploys it from the PR
that is being tested.

Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 2ae4c4f to 8a40298 Compare November 24, 2025 09:07
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-helm/k8s-1.34

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e/k8s-1.34

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34/nfs

Signed-off-by: Niels de Vos <ndevos@ibm.com>
ceph-volume in Tentacle (v20) uses udev data to detect some properties
of the potential OSDs. Inside the minikube cluster (running with
Podman), there is no udev. Mounting the host udev runtime data into the
containers makes ceph-volume happy.

Signed-off-by: Niels de Vos <ndevos@ibm.com>
When the CephFS e2e tests run on Ceph Tentacle (v20), they always fail
during

    check data persist after recreating pod

With standard yaml deployment, the e2e tests succeed.
With Ceph Reef (v19) and Helm, the e2e tests succeed.

See-also: ceph#5772
Signed-off-by: Niels de Vos <ndevos@ibm.com>
Signed-off-by: Niels de Vos <ndevos@ibm.com>
With Ceph Tentacle creating a (the first?) NFS-export fails with:

```
Error EINVAL: Traceback (most recent call last):
    File "/usr/share/ceph/mgr/mgr_module.py", line 2010, in _handle_command
      return CLICommand.COMMANDS[cmd['prefix']].call(self, cmd, inbuf)
    File "/usr/share/ceph/mgr/mgr_module.py", line 534, in call
      return self.func(mgr, **kwargs)
    File "/usr/share/ceph/mgr/object_format.py", line 592, in _format_response
      robj = f(*args, **kwargs)
    File "/usr/share/ceph/mgr/nfs/module.py", line 45, in
_cmd_nfs_export_create_cephfs
      earmark_resolver = CephFSEarmarkResolver(self)
    File "/usr/share/ceph/mgr/mgr_util.py", line 356, in __init__
      self._cephfs_client = client or CephfsClient(mgr)
    File "/usr/share/ceph/mgr/mgr_util.py", line 318, in __init__
      self.connection_pool = CephfsConnectionPool(self.mgr)
    File "/usr/share/ceph/mgr/mgr_util.py", line 223, in __init__
      self.timer_task.start()
    File "/usr/lib64/python3.9/threading.py", line 899, in start
      _start_new_thread(self._bootstrap, ())
  RuntimeError: can't start new thread
```

It seems this can only happen when the CLI is used, that means go-ceph
failed to create the export in the first place. It would be useful to
log the go-ceph failure, so that improving is possible.

Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 8a40298 to 0c82877 Compare November 24, 2025 13:25
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 0c82877 to 2855fcd Compare November 24, 2025 13:46
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

@nixpanic
Copy link
Member Author

ci/centos/mini-e2e-operator/k8s-1.34 didn't really fail, it timed out after 2 hours of e2e

nixpanic added a commit to nixpanic/ceph-csi that referenced this pull request Nov 24, 2025
The other two (Helm and simple yaml) deployments already have a timeout
of 150 minutes. Just the operator tests time-out at the moment.

See-also: ceph#5672
Signed-off-by: Niels de Vos <ndevos@ibm.com>
mergify bot pushed a commit that referenced this pull request Nov 24, 2025
The other two (Helm and simple yaml) deployments already have a timeout
of 150 minutes. Just the operator tests time-out at the moment.

See-also: #5672
Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

1 similar comment
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

Signed-off-by: Niels de Vos <ndevos@ibm.com>
@nixpanic nixpanic force-pushed the testing/ceph-tentacle branch from 2d17f62 to 703779e Compare November 26, 2025 12:50
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34

@nixpanic nixpanic marked this pull request as draft November 26, 2025 12:50
@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34/rbd

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34/cephfs

@nixpanic
Copy link
Member Author

/test ci/centos/mini-e2e-operator/k8s-1.34/nfs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/skip/multi-arch-build skip building on multiple architectures component/testing Additional test cases or CI work

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants