We are having problems getting our NVIDIATesla M2070-Q working with vsphere 5.1
if we set a vm to hardware graphics it gets stuck at 95% starting up
we have rebooted multiple time to no avail, and we are out of ideas
here are some commands we have run and there outputs :
~ # esxcli software vib list | grep NVIDIA
NVIDIA-VMware_ESXi_5.1_Host_Driver 304.76-1OEM.510.0.0.802205 NVIDIA VMwareAccepted 2013-04-17
________________________________________________________
~ # esxcli system module load -m nvidia
Unable to load module /usr/lib/vmware/vmkmod/nvidia: Busy
________________________________________________________
~ # esxcli hardware pci list -c 0x300 -m 0xff
000:001:04.0
Address: 000:001:04.0
Segment: 0x0000
Bus: 0x01
Slot: 0x04
Function: 0x00
VMkernel Name:
Vendor Name: ASPEED Technology, Inc.
Device Name: ASPEED Graphics Family
Configured Owner: Unknown
Current Owner: VMkernel
Vendor ID: 0x1a03
Device ID: 0x2000
SubVendor ID: 0x1028
SubDevice ID: 0x04e3
Device Class: 0x0300
Device Class Name: VGA compatible controller
Programming Interface: 0x00
Revision ID: 0x10
Interrupt Line: 0x0b
IRQ: 11
Interrupt Vector: 0xd0
PCI Pin: 0x79
Spawned Bus: 0x00
Flags: 0x0221
Module ID: -1
Module Name: None
Chassis: 0
Physical Slot: 0
Slot Description: AST2050 VGA
Passthru Capable: false
Parent Device: PCI 0:0:20:4
Dependent Device: PCI 0:0:20:4
Reset Method: Bridge reset
FPT Sharable: false
000:04a:00.0
Address: 000:04a:00.0
Segment: 0x0000
Bus: 0x4a
Slot: 0x00
Function: 0x00
VMkernel Name:
Vendor Name: NVIDIA Corporation
Device Name: NVIDIATesla M2070-Q
Configured Owner: Unknown
Current Owner: VMkernel
Vendor ID: 0x10de
Device ID: 0x06df
SubVendor ID: 0x10de
SubDevice ID: 0x087f
Device Class: 0x0302
Device Class Name: 3D controller
Programming Interface: 0x00
Revision ID: 0xa3
Interrupt Line: 0x0a
IRQ: 10
Interrupt Vector: 0x31
PCI Pin: 0x00
Spawned Bus: 0x00
Flags: 0x0201
Module ID: 76
Module Name: nvidia
Chassis: 0
Physical Slot: 8
Slot Description:
Passthru Capable: false
Parent Device: PCI 0:73:8:0
Dependent Device: PCI 0:73:8:0
Reset Method: Bridge reset
FPT Sharable: false
________________________________________________________
~ # nvidia-smi
Mon Apr 22 18:32:21 2013
+------------------------------------------------------+
| NVIDIA-SMI 4.304.76 Driver Version: 304.76 |
|-------------------------------+----------------------+----------------------+
| GPU Name | Bus-Id Disp. | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla M2070-Q | 0000:4A:00.0 Off | 0 |
| N/A N/A P8 N/A / N/A | 0% 12MB / 5375MB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Compute processes: GPU Memory |
| GPU PID Process name Usage |
|=============================================================================|
| No running compute processes found |
+-----------------------------------------------------------------------------+
________________________________________________________
~ # nvidia-smi -q
==============NVSMI LOG==============
Timestamp : Mon Apr 22 18:32:40 2013
Driver Version : 304.76
Attached GPUs : 1
GPU 0000:4A:00.0
Product Name : Tesla M2070-Q
Display Mode : Disabled
Persistence Mode : Disabled
Driver Model
Current : N/A
Pending : N/A
Serial Number : 0322111036660
GPU UUID : GPU-9680525a-c033-1363-519d-1f8c6a61164f
VBIOS Version : 70.00.41.00.04
Inforom Version
Image Version : N/A
OEM Object : 1.0
ECC Object : 1.0
Power Management Object : 1.0
GPU Operation Mode
Current : N/A
Pending : N/A
PCI
Bus : 0x4A
Device : 0x00
Domain : 0x0000
Device Id : 0x06DF10DE
Bus Id : 0000:4A:00.0
Sub System Id : 0x087F10DE
GPU Link Info
PCIe Generation
Max : 2
Current : 2
Link Width
Max : 16x
Current : 16x
Fan Speed : N/A
Performance State : P8
Clocks Throttle Reasons : N/A
Memory Usage
Total : 5375 MB
Used : 12 MB
Free : 5363 MB
Compute Mode : Default
Utilization
Gpu : 0 %
Memory : 0 %
Ecc Mode
Current : Enabled
Pending : Enabled
ECC Errors
Volatile
Single Bit
Device Memory : 0
Register File : 0
L1 Cache : 0
L2 Cache : 0
Texture Memory : N/A
Total : 0
Double Bit
Device Memory : 0
Register File : 0
L1 Cache : 0
L2 Cache : 0
Texture Memory : N/A
Total : 0
Aggregate
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Total : 0
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Total : 1
Temperature
Gpu : N/A
Power Readings
Power Management : N/A
Power Draw : N/A
Power Limit : N/A
Default Power Limit : N/A
Min Power Limit : N/A
Max Power Limit : N/A
Clocks
Graphics : 270 MHz
SM : 540 MHz
Memory : 1566 MHz
Applications Clocks
Graphics : N/A
Memory : N/A
Max Clocks
Graphics : 573 MHz
SM : 1147 MHz
Memory : 1566 MHz
Compute Processes : None
____________________________
gpuvm
no output will not quit on it own.