System healthcheck
System health check¶
The target service required to check for all. This can be handle in inno-transflow
[1] Experiment 1: Call SSH to user preferences to check health
[2] Experiment 2: Deploy using Prometheus for health of user
[3] Experiment 3: Using a plane to invoke the user and call <- kind of health ->
[4] Experiment 4: maintainer required check on daily basic
[5] xử lý dữ liệu trước đi, các process chạy gặp lỗi hay chạy tính toán đc bao nhiêu mã, thì email. như vậy chẽcklist đầu ngày là biết đc hay sai
- Pipeline để gửi Errors Reporting + SMS maintainer
Data health insights can help stop data anomalies
-
Ansible Server Checking for Deployment:
-
Check the exists of GCP
- Check service accounts place
- Check the permissions of GCP
-
Check the set up project for GCP (to avoid problems on the host server)
-
Post hook for overall project
-
Check table class is available or not. And if not, created one.
-
Check server hit or resources ping
-
Database reset Connection when Error has been establish by connection
This required verify by Connection Errors
and when change the local internet or IP changes
Must test the IP changes affected the connection
Has the IP changes we can captured the information has been take placed.
https://www.montecarlodata.com/blog-stopping-data-anomalies/