-
Notifications
You must be signed in to change notification settings - Fork 183
Add pulse checks for ensuring disk health on VmHosts #2202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add pulse checks for ensuring disk health on VmHosts #2202
Conversation
All contributors have signed the CLA ✍️ ✅ |
c74acaf
to
eede09c
Compare
cd3bc90
to
e2bd310
Compare
Since this is also a warm-up, it could be nice to split this change into multiple commits and explain each change in the commit messages. The migration could be one commit, population of the UNIX devices could be another and finally the change in the health check logic, each with their own specs. |
453f521
to
e1c64f0
Compare
I have read the CLA Document and I hereby sign the CLA |
recheck |
17deb53
to
a928c4a
Compare
a928c4a
to
cd4381d
Compare
cd4381d
to
f7f62f5
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, I would however like to have one more eye on this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
at a minimum, two things:
- If storing an array, why not use either a text array, or even more likely, a jsonb
- The migration should be broken out from other application changes to not make rollbacks fussy.
I'll also fix the commits and separate the migration commit and others when review is done. thanks for pointing out. @fdr |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. I only have one remaining comment.
2ceaa28
to
b5b05fe
Compare
b5b05fe
to
aaf6c55
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Previous comment was addressed. One additional comment.
f00ed76
to
df2f8ce
Compare
df2f8ce
to
02a5609
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall, looks OK IMO.
@mohi-kalantari, do we have any blocker for this PR, other than Daniel's comments?
02a5609
to
84a5b06
Compare
69517bf
to
76b482d
Compare
76b482d
to
3bbdcfb
Compare
1501ef3
to
6bc194f
Compare
In order to determine and save the disk devices of each VmHost We store an array of strings representing the device names. Also updated the schema caches.
Added functions for determining the disk devices and also extracting the underlying device names of raided disks.
6bc194f
to
db540db
Compare
smartmontools is installed when bootstrapping a VmHost. It will be used as a healthcheck for disk devices of VmHosts.
Multiple healthchecks were added to check disks using different tools like smartctl, nvme cli, read/write on disk and reading kernel logs.
db540db
to
f8e147a
Compare
Multiple checks using different linux tools are added to determine the health of the disks. The code also supports both raided and not raided VmHosts