We generally followed the instructions in Setting up a Deep Learning Machine from Scratch.
Before installing the drivers, make sure there are no nvidia drivers already installed by doing the following:
$ sudo apt-get --purge remove nvidia-*
And reboot. Then download NVidia drivers from here and install them according to instructions.
First, install some AMD drivers that will be used by CUDA.
$ sudo apt-get install fglrx-updates
After installation, reboot the computer.
Next, stop the SSH X11 forwarding by modifying /etc/ssh/sshd_config:
$ sudo nano /etc/ssh/sshd_config
Find the following line:
X11Forwarding yes
and change it to:
X11Forwarding no
After saving the changes, restart the ssh service by typing:
$ sudo service ssh restart
Now, stop the default X server by typing:
$ sudo service mdm stop
Now we can install CUDA. The latest supported version is 7.5 which can be downloaded from here. Select Linux > X86_64 > Ubuntu > 14.04 > runfile (local). Follow the instructions on the page to complete the installation.
After finishing the installation of CUDA, undo the change that you made to /etc/ssh/sshd_config, and restart the ssh server using sudo service ssh restart. Also, start the X server again: sudo service mdm start
Download cuDNN from here. Select a version which is compatible with CUDA. After downloading, do the following:
$ tar xvf <DOWNLOADED FILE NAME>
$ cd cuda
$ sudo cp */*.h /usr/local/cuda/include/
$ sudo cp */libcudnn* /usr/local/cuda/lib64/
$ sudo chmod a+r /usr/local/cuda/lib64/libcudnn*
To install for all users
$ sudo -i
then install package. git clone in ~\repos if need.
To install only for user then add --user at the end. E.g.:
$ python setup.py install --user
Here are some specific instructions:
$ pip install theano
Edit/create ~/.theanorc:
[global]
floatX = float32
device = gpuX
choose as gpuX your assigned gpu, there will be random inspections. Grad students all share gpu1. klab uses gpu0
$ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/gpu/tensorflow-0.10.0rc0-cp27-none-linux_x86_64.whl
$ pip install --ignore-installed --upgrade $TF_BINARY_URL
Note: right now tensorflow is not working because we are using cuDNN v5. Either wait for it to be compatible or downgrade cuDNN to v4 when we really need to use tensorflow (don’t do it). Or try to compile/install tensorflow from source (good luck with that).
$ pip install keras
See here to choose tensorflow and theano in Keras
$ git clone --recursive https://github.com/dmlc/xgboost
$ cd xgboost; make -j4
$ cd python-package
$ python setup.py install
$ conda install opencv