...
Virtual Machine Deployment
Deploy a virtual machine from fileDownload the
vss-cli
configuration spec:
and update the following attributes:View file name ubuntu-llm-privategpt.yaml machine.folder
: target logical folder. List available folders withvss-cli compute folder ls
metadata.client
: your department client.metadata.inform
: email address for automated notifications
Deploy with the following command:
Code Block vss-cli cli --wait compute vm mk from-file ubuntuwait compute vm mk from-file ubuntu-llm-privategpt.yaml
Add a virtual GPU of
16GB
, specifically the16q
profile. For more information in the profile used, check the following document How to Request a Virtual GPUCode Block vss-cli compute vm set ubuntu-llm gpu mk --profile 16q
Once the VM has been deployed, a confirmation email will be sent with the assigned IP address and credentials.
Power on virtual machine
Code Block vss-cli compute vm set ubuntu-llm state on
NVIDIA Driver and Licensing
NVIDIA Driver and Licensing
Current supported driver is nvidia-linux-grid-535_535.183.01_amd64.deb
Login to the server via ssh. Note that the username may change if you further customized the VM with
cloud-init
Code Block ssh -p 2222 vss-admin@XXX.XXX.XXX.XXX
Download the NVIDIA drivers from VKSEY-STOR:
Code Block scp {vss-user}@vskey-stor.eis.utoronto.ca:/ut-vss-lib/nvidia-grid-vpsherevsphere-7.0-525535.147183.0104-525535.147183.0501-529538.1967/Guest_Drivers/nvidia-linux-grid-525535_525535.147183.0501_amd64.deb /tmp/
Install the drivers as privileged user:
Code Block apt install dkms nvtop dpkg -i apt install /tmp/nvidia-linux-grid-525535_525535.147183.0501_amd64.deb
Create directory: ClientConfigToken
Code Block mkdir /etc/nvidia/ClientConfigToken
Create the NVIDIA token file:
Code Block echo -n -e $(vmware-rpctool "info-get guestinfo.ut.vss.nvidia_token") > /etc/nvidia/ClientConfigToken/client_configuration_token_12-05-2023-11-26-05.tok
Set permissions to the NVIDIA token:
Code Block chmod 744 /etc/nvidia/ClientConfigToken/client_configuration_token_12-05-2023-11-26-05.tok
Set the
FeatureType
to2
for NVIDIA “NVIDIA RTX Virtual WorkstationWorkstation” in/etc/nvidia/gridd.conf
with the following command:Code Block sed -i 's/FeatureType=0/FeatureType=2/g' /etc/nvidia/gridd.conf
Restart
nvidia-gridd
service to pick up the new license token:Code Block systemctl restart nvidia-gridd
Check for any Error or successful activation:
Code Block journalctl -u nvidia-gridd
output:
Code Block Dec 13 11:23:20 ubu-llm systemd[1]: Stopped NVIDIA Grid Daemon. Dec 13 11:23:20 ubu-llm systemd[1]: Starting NVIDIA Grid Daemon... Dec 13 11:23:20 ubu-llm systemd[1]: Started NVIDIA Grid Daemon. Dec 13 11:23:20 ubu-llm nvidia-gridd[2017]: Started (2017) Dec 13 11:23:20 ubu-llm nvidia-gridd[2017]: vGPU Software package (0) Dec 13 11:23:20 ubu-llm nvidia-gridd[2017]: Ignore service provider and node-locked licensing Dec 13 11:23:20 ubu-llm nvidia-gridd[2017]: NLS initialized Dec 13 11:23:20 ubu-llm nvidia-gridd[2017]: Acquiring license. (Info: vss-nvidia-ls.eis.utoronto.ca; NVIDIA RTX Virtual Workstation) Dec 13 11:23:22 ubu-llm nvidia-gridd[2017]: License acquired successfully. (Info: vss-nvidia-ls.eis.utoronto.ca, NVIDIA RTX Virtual Workstation; Expiry: 2023-12-14 16:23:22 GMT) Dec 13 14:59:24 ubu-llm nvidia-gridd[2017]: License renewed successfully. (Info: vss-nvidia-ls.eis.utoronto.ca, NVIDIA RTX Virtual Workstation; Expiry: 2023-12-14 19:59:23 GMT)
Verify GPU status with
nvidia-smi
:Code Block user@test:~$ nvidia-smi Wed Dec 13 14:18:27 2023 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 525.147.05 Driver Version: 525.147.05 CUDA Version: 12.0 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 GRID P6-16Q On | 00000000:02:00.0 Off | N/A | | N/A N/A P8 N/A / N/A | 6426MiB / 16384MiB | 0% Default | | | | Disabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | 0 N/A N/A 3458 C ...ld/cuda/bin/ollama-runner 6426MiB | +-----------------------------------------------------------------------------+
You can also monitor in console the gpu usage with
nvtop
:
...