Skip to content

System healthcheck

System health check

The target service required to check for all. This can be handle in inno-transflow

[1] Experiment 1: Call SSH to user preferences to check health

[2] Experiment 2: Deploy using Prometheus for health of user

[3] Experiment 3: Using a plane to invoke the user and call <- kind of health ->

[4] Experiment 4: maintainer required check on daily basic

[5] xử lý dữ liệu trước đi, các process chạy gặp lỗi hay chạy tính toán đc bao nhiêu mã, thì email. như vậy chẽcklist đầu ngày là biết đc hay sai

  • Pipeline để gửi Errors Reporting + SMS maintainer

Data health insights can help stop data anomalies

  • Ansible Server Checking for Deployment:

  • Check the exists of GCP

  • Check service accounts place
  • Check the permissions of GCP
  • Check the set up project for GCP (to avoid problems on the host server)

  • Post hook for overall project

  • Check table class is available or not. And if not, created one.

  • Check server hit or resources ping

  • Database reset Connection when Error has been establish by connection

This required verify by Connection Errors

and when change the local internet or IP changes

Must test the IP changes affected the connection

Has the IP changes we can captured the information has been take placed.

https://www.montecarlodata.com/blog-stopping-data-anomalies/