edge-accelerator-monitor

所属分类:物联网
开发工具:Dockerfile
文件大小:12KB
下载次数:0
上传日期:2021-11-24 07:52:13
上 传 者sh-1993
说明:  边缘计算环境下支持加速器的DNN任务Kubernetes调度器
(Accelerator-Aware Kubernetes Scheduler for DNN Tasks on Edge Computing Environment)

文件列表:
Deployment (0, 2021-11-24)
Deployment\AMD64-Daemonset.yaml (951, 2021-11-24)
Deployment\ARM64-Daemonset.yaml (1164, 2021-11-24)
Deployment\ServiceAccount.yaml (597, 2021-11-24)
LICENSE (11357, 2021-11-24)
MonitorContainer (0, 2021-11-24)
MonitorContainer\AMD64 (0, 2021-11-24)
MonitorContainer\AMD64\Dockerfile (741, 2021-11-24)
MonitorContainer\AMD64\hw-monitor-automatic-lableing.sh (348, 2021-11-24)
MonitorContainer\ARM64 (0, 2021-11-24)
MonitorContainer\ARM64\Dockerfile (3968, 2021-11-24)
MonitorContainer\ARM64\hw-monitor-automatic-lableing.sh (1498, 2021-11-24)

# edge-accelerator-monitor [Accelerator-Aware Kubernetes Scheduler for DNN Tasks on Edge Computing Environment](https://edge-k8s-project.s3.amazonaws.com/Accelerator-Aware+Kubernetes+Scheduler+for+DNN+Tasks+on+Edge+Computing+Environment.pdf) Configure a Kubernetes cluster environment for edge accelerator hardware information extraction and automatic labeling ### 1. Docker install ``` sudo apt-get install -y apt-transport-https ca-certificates curl gnupg-agent software-properties-common sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add - sudo add-apt-repository "deb [arch=arm***] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable" sudo apt-get update sudo apt-get install -y containerd.io docker-ce docker-ce-cli ``` ### 2. Kubernetes install ``` sudo apt-get update && apt-get install -y apt-transport-https curl sudo curl -s https://packages.cloud.google.com/apt/doc/apt-key.gpg | apt-key add - sudo echo deb http://apt.kubernetes.io/ kubernetes-xenial main > /etc/apt/sources.list.d/kubernetes.list sudo apt-get update sudo apt-get install -y kubelet kubeadm ``` ### 3. Disable kubernetes container swap,zram ``` sudo swapoff -a sudo rm /etc/systemd/nvzramconfig.sh ``` ### 4. Kubernetes cluster setting (master node) Cluster api initialization on the master node and the token is issued. ``` sudo kubeadm init --apiserver-advertise-address=[master ip] --pod-network-cidr=10.244.0.0/16 --kubernetes-version=v1.18.14 ``` ``` sudo mkdir -p $HOME/.kube sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config sudo chown $(id -u):$(id -g) $HOME/.kube/config ``` How to reissue a token ``` sudo kubeadm token create --print-join-command ``` ### 5. Kubernetes cluster setting (worker node) Join the cluster using the token value from the worker node. ``` sudo kubeadm join [master ip : port] --token [token data] --discovery-token-ca-cert-hash [token hash data] ``` In Google Coral TPU device, execute join after setting cgroup memory. ``` sudo vi /boot/firmware/nobtcmd.txt add line >> cgroup_ena vv b ble=cpuset cgroup_enable=memory cgroup_memory=1 sudo reboot ``` ### 6. Flannel network plugin install (master node) Tasks to configure the container's network and assign an IP ``` sudo kubectl apply -f https://raw.githubusercontent.com/coreos/flannel/v0.13.0/Documentation/kube-flannel.yml ``` ### 7. worker node role setting (master node) ``` sudo kubectl label node [node name] node-role.kubernetes.io/worker=worker ``` ### 8. Kubernetes cluster setting check (master node) ``` sudo kubectl get nodes ``` ### 9. NVIDIA GPU physical device separate settings (GPU runtime) Register nvidia-docker related repository on host ``` sudo distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \ && curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - \ && curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list ``` nvidia-docker package install ``` sudo apt-get update sudo apt-get install -y nvidia-docker2 ``` default runtime setting ``` sudo vi /etc/docker/daemon.json add line >> "default-runtime": "nvidia" ``` docker daemon service restart ``` sudo systemctl restart docker ``` Install NVIDIA device plugin on master node ``` sudo kubectl apply -f https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/v0.9.0/nvidia-device-plugin.yml ```

近期下载者

相关文件


收藏者