background
Check the graphics card
lshw
yum install -y lshw lshw -numeric -C display *-display description: 3D controller product: GK110BGL [Tesla K40c] [10DE:1024] vendor: NVIDIA Corporation [10DE] physical id: 0 bus info: pci@0000 :03:00.0 version: a1 width: 64 bits clock: 33MHz capabilities: pm msi pciexpress bus_master cap_list configuration: driver=nvidia latency=0 resources: iomemory:21f0-21ef iomemory:21f0-21ef irq:40 memory:ca000000-caffffff memory:21fe0000000-21fefffffff memory:21ff0000000-21ff1ffffff
nvidia-detect
yum install nvidia-detect
[ root@original ~]# nvidia-detect -v Probing for supported NVIDIA devices... [102b:0532] Matrox Electronics Systems Ltd. MGA G200eW WPCM450 [10de:1024] NVIDIA Corporation GK110BGL [Tesla K40c] This device requires the current 460.84 NVIDIA driver kmod-nvidia WARNING: Xorg log file /var/log/Xorg.0.log does not exist WARNING: Unable to determine Xorg ABI compatibility WARNING: The driver for this device does not support the current Xorg version
Update graphics card driver
Unload old drive
yum remove -y nvidia* reboot
Find new drive
wget https://us.download.nvidia.com/tesla/470.57.02/NVIDIA-Linux-x86_64-470.57.02.run
Shield nouveau
#blacklist nvidiafb
blacklist nouveau options nouveau modeset=0
Install new drive
Chmod a+x NVIDIA-Linux-x86_64-470.57.02. run # Add execution permission to the driver ./NVIDIA-Linux-x86_64-470.57.02.run -no-x-check -no-nouveau-check -no-opengl-files
-
-No-x-check: Close the X service when installing the driver -
-No nouveau check: disable nouvea when installing the driver -
No open files: only install driver files, not OpenGL files. If OpenGL is disabled, the installation will not have the problem of cyclic login
inspect
lspci |grep NVIDIA 03:00.0 3D controller: NVIDIA Corporation GK110BGL [Tesla K40c] (rev a1)
nvidia-smi Thu Oct 21 18:29:45 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 Tesla K40c Off | 00000000:03:00.0 Off | 0 | | 23% 40C P0 67W / 235W | 0MiB / 11441MiB | 98% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | No running processes found | +-----------------------------------------------------------------------------+
reboot
Install CUDA
download
install
wget https://developer.download.nvidia.com/compute/cuda/11.4.2/local_installers/cuda_11.4.2_470.57.02_linux.run sudo sh cuda_11.4.2_470.57.02_linux.run
Graphics card