-
-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Health metrics for device #238
Comments
Adding to this topic, a possibility would be to implement simple device metrics, as already suggested here #100 (comment) for those checks that can be done in platform. A proposal could be to add a
Data gaps / completenessTo be done at ingestion time by ahoy or similar library. The kit's firmware will post the intervals for reading and publication on This could also provide a metric that represents the variability of the posts interval and raise a flag for a sensor that is not posting data regularly. Missing sensorsThe kit's firmware will send data normally, and the platform needs to know what to expect. This is now done by blueprints (kits) but we would like to change this as discussed in #241. This would present a list of sensors to the user, on the onboarding or on the kit edit page (device edit) in which the user can select which sensors are to be expected, and whether or not a notification should be sent in case one of them is not received after a certain threshold has been passed (related to the reading/publication intervals from above). The user could select notifications in this page, and mark sensors in the front end for misbehaving sensors: Hardware reportThe kit's firmware would post at least these new sensors:
These shouldn't be presented in the frontend to avoid confusion, but would be supporting health diagnosis. |
Summary of action points for now:
|
This issue is to open the discussion about health metrics for a device. Currently we see some common issues when devices are deployed, such as connectivity issues or hardware problems to name the most common ones. We need an easier debugging process for the users, which can be provided by some metrics and analytics of the data, and ad-hoc physical device metrics.
Initially, we are addressing this issue offline, with custom requests to the API, but down the line, the process should be integrated in the platform for an easier debugging.
To start with this, we suggest adding a property to the device indicating the device
health
, in which we can collect various metrics, some calculated in the physical device side, and some on the platform side. Current proposals:Platform checks
hardware_info
for instanceFirmware checks
This could be sent on a
/device/<token>/health
mqtt topic, and ingested on thehealth
table for later. Could be sentad-hoc
, or on boot:@pral2a @vicobarberan please provide inputs to build it progressively.
The text was updated successfully, but these errors were encountered: