- git > 2.22
- hatch
- kubectl
- yq, jq
- power meter if available
- Fork and clone this repository and move to the model_training folder
git clone
cd model_training
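Before continuing, the requirements listed above can be checked with a short script. This is a minimal sketch, not part of the repository: the helper names `missing_tools` and `version_gt` are hypothetical, and the version comparison assumes a `sort` that supports `-V`.

```shell
#!/usr/bin/env bash
# Sketch: report required tools that are missing from PATH.
# The tool list mirrors the requirements above (the power meter is hardware).
required_tools="git hatch kubectl yq jq"

# Print each given tool name that cannot be found on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Succeed when version $1 is strictly greater than version $2.
# Assumption: sort supports -V (version sort).
version_gt() {
  [ "$1" != "$2" ] &&
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | tail -n 1)" = "$1" ]
}

# $required_tools is intentionally unquoted so it splits into words.
missing="$(missing_tools $required_tools)"
if [ -n "$missing" ]; then
  echo "missing tools:" $missing
fi

if command -v git >/dev/null 2>&1; then
  # "git version 2.39.2" -> third field is the version number.
  git_version="$(git --version | awk '{print $3}')"
  version_gt "$git_version" 2.22 || echo "git $git_version is too old; need > 2.22"
fi
```

The script only reports problems and does not exit on failure, so it can be run as many times as needed while installing the missing pieces.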
Note: ports 9090 and 5101 must not be in use. They are used for port-forwarding to Prometheus and the kind registry, respectively.
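One way to confirm that both ports are free is sketched below. It assumes bash, whose `/dev/tcp` pseudo-device is used to attempt a connection, and it only probes 127.0.0.1; the function names `port_in_use` and `report_ports` are hypothetical.

```shell
#!/usr/bin/env bash
# Succeed when something accepts TCP connections on 127.0.0.1:<port>.
# Assumption: bash with /dev/tcp support; other shells report every port as free.
port_in_use() {
  (exec 3<>"/dev/tcp/127.0.0.1/$1") 2>/dev/null
}

# Print one status line per port given as an argument.
report_ports() {
  for port in "$@"; do
    if port_in_use "$port"; then
      echo "port $port is already in use"
    else
      echo "port $port is free"
    fi
  done
}

# 9090 is forwarded to Prometheus, 5101 to the kind registry.
report_ports 9090 5101
```

A listener bound only to a non-loopback interface would not be detected by this probe, so treat a "free" result as a hint rather than a guarantee.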
./script.sh prepare_cluster
The script will:
- create a kind cluster kind-for-training with a registry at port 5101.
- deploy Prometheus.
- deploy the Prometheus RBAC and a node port at 30090 on the kind node, which is forwarded to port 9090 on the host.
- deploy a service monitor for Kepler and reload the Prometheus server.
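After the script finishes, the cluster can be checked roughly as follows. This is a sketch under assumptions: it expects the `kind` CLI on PATH, relies on kind's convention of naming the kubectl context `kind-<cluster name>`, and uses hypothetical helper names (`kind_context`, `cluster_exists`).

```shell
#!/usr/bin/env bash
cluster_name="kind-for-training"

# kind registers its kubectl context as "kind-<cluster name>".
kind_context() {
  echo "kind-$1"
}

# Succeed when kind lists a cluster with exactly this name.
cluster_exists() {
  command -v kind >/dev/null 2>&1 &&
    kind get clusters 2>/dev/null | grep -qx "$1"
}

if cluster_exists "$cluster_name"; then
  # List the nodes of the training cluster.
  kubectl --context "$(kind_context "$cluster_name")" get nodes
else
  echo "cluster $cluster_name not found"
fi
```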
Please confirm the following requirements:
- Kepler installation
- Prometheus installation
- Kepler metrics are exported to the Prometheus server
- Prometheus server is available at http://localhost:9090. Otherwise, set the PROM_SERVER environment variable.
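The last two requirements can be checked from the command line. The sketch below assumes `curl` is installed (it is not in the requirements list) and that Kepler metric names start with `kepler_`; the helper names are hypothetical. It uses Prometheus' `/-/ready` endpoint and the label-values API at `/api/v1/label/__name__/values`.

```shell
#!/usr/bin/env bash
# Use PROM_SERVER when set, otherwise the default address.
prom_server() {
  echo "${PROM_SERVER:-http://localhost:9090}"
}

# Succeed when the Prometheus readiness endpoint answers successfully.
prom_ready() {
  curl --silent --fail --max-time 5 "$(prom_server)/-/ready" >/dev/null
}

# Succeed when at least one metric name starts with "kepler_".
# Assumption: Kepler metrics use the kepler_ prefix.
kepler_metrics_present() {
  curl --silent --fail --max-time 5 \
    "$(prom_server)/api/v1/label/__name__/values" | grep -q '"kepler_'
}

if prom_ready; then
  echo "Prometheus is ready at $(prom_server)"
  if kepler_metrics_present; then
    echo "Kepler metrics found"
  else
    echo "no Kepler metrics found yet"
  fi
else
  echo "Prometheus is not reachable at $(prom_server)"
fi
```

To point the check at a different server, export the variable first, for example `export PROM_SERVER=http://<host>:<port>`.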
In addition to the above approach, you can manually run your own benchmarks, then collect, train, and export the models using the entrypoint. See Manual Metric Collection and Training with Entrypoint.
For kind-for-training cluster:
./script.sh cleanup