-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FirstFitRouting Allocator only tries the first cluster it finds then fails before attempting others #10598
Comments
As per my understanding, when the host on which the VM is running on is placed in maintenance mode, it will consider only those clusters that have the same hypervisor type as that of the host on which the VM is running for migrating VMs on the host put into maintenance mode. Do both the clusters have the same hypervisor type? |
The original host is node99 on the Linux cluster (ID 16) with tags: Both clusters are KVM type |
it looks both service offering and template have host tags.
btw: is global setting |
I'm not sure, is it unsupported to use both? Even if it was only using the service offering, both windows/linux hypervisors have the 'generic' tag which would mean it should still try to migrate to the other cluster. Yes, EDIT: I set |
Hi @loth , @Pearl1594 has found some clues which caused the issue. we will update you later. |
Thanks for reporting this observation @loth seems like you have hit a bug. Investigating the details you shared, we identified that this issue of other clusters not being considered when the 1st cluster has no suitable hosts matching the constraints is caused by: This issue is most likely going to be encountered even with the RandomAllocator: https://github.com/apache/cloudstack/pull/7489/files#diff-039cdf205f1edfd0c7c6ebb0cb97e2caef1d08a8dd7dbcbd30813a68da5dae1bR100-R103 We'll look into it. Thanks Edit: Seems like part of the issue was fixed in: #8611 - available in 4.19.1. But is missing in 4.20.0 |
Our CS environment has two clusters, one windows and one linux. We have host tags that allocate where VM's are placed. We are facing an issue when setting host in maintenance that sometimes it will just fall to error state. Upon checking the logs I found that while it checked the first cluster (ID 19, Windows) however it never tried any hosts in the Linux cluster (ID 16).
Here is the log from the management server. As we can see, the allocator tried to find resources in cluster 19, failed to find any with the Linux tag, then failed out before trying the other cluster.
I also checked the UI upon doing a manual migration, all windows hosts are invalid destinations in the options to select a host (which I believe findHostsForMigration uses, so that should be working fine)
versions
ACS 4.20, KVM with NFS Primary/Secondary storage
The steps to reproduce the bug
Setup two clusters, with host tags to identify where VM's should go (windows/linux).
Try to set maintenance on one of the hosts.
Check for clusters in the log to see if it attempts others.
What to do about it?
Correct the code to try all clusters instead of just the first.
The text was updated successfully, but these errors were encountered: