Projet

Général

Profil

Install the platform » Historique » Version 26

Arnaud Sevin, 22/09/2014 14:54

1 14 Arnaud Sevin
{{toc}}
2 1 Damien Gratadour
3 14 Arnaud Sevin
h1. Install the platform without MAGMA
4
5 1 Damien Gratadour
The COMPASS platform is distributed as a single bundle of CArMA and SuTrA libraries and YoGA and its AO extension for Yorick. 
6
7
h2. Hardware requirements
8 2 Damien Gratadour
9 10 Damien Gratadour
The system must contain at least an x86 CPU and a CUDA capable GPU. list of compatible GPUs can be found here http://www.nvidia.com/object/cuda_gpus.html. Specific requirements apply to clusters (to be updated).
10 1 Damien Gratadour
11
h2. Environment requirements
12 2 Damien Gratadour
13 11 Damien Gratadour
The system must be running a 64 bit distribution of Linux or Mac OS with the latest NVIDIA drivers and "CUDA toolkit":https://developer.nvidia.com/cuda-downloads. The installation of the corresponding version of the "CULA tools":http://www.culatools.com/downloads/dense/ is also required. The following installation instructions are valid if the default installation paths have been selected for these components.
14 1 Damien Gratadour
15 5 Damien Gratadour
Additionally, to benefit from the user-oriented features of the platform, Yorick should be installed as well as the latest version of Python and the associated pygtk module. 
16 1 Damien Gratadour
17 3 Damien Gratadour
To install Yorick, download the latest version from the github repository:
18
<pre>
19
git clone https://github.com/dhmunro/yorick.git yorick.git
20
</pre>
21
then cd onto the created directory and install:
22
<pre>
23
./configure && make && make install
24
</pre>
25
once Yorick is locally installed, you will have to add this directory : yorick.git/relocate/bin to your PATH to have an easy access to the yorick executable. You may want to add support for command history by using rlwrap and alias the yorick executable as :
26
<pre>
27 4 Damien Gratadour
alias yorick='rlwrap path_to_yorick_executable/yorick'
28 3 Damien Gratadour
</pre>
29 1 Damien Gratadour
30 3 Damien Gratadour
31 1 Damien Gratadour
h2. Installation process
32
33
First check out the latest version from the svn repository :
34
<pre>
35 3 Damien Gratadour
svn co https://version-lesia.obspm.fr/repos/compass compass
36 1 Damien Gratadour
</pre>
37
then go in the newly created directory and then trunk:
38
<pre>
39
cd compass/trunk
40
</pre>
41
once there, you need to modify system variables in the define_var.sh executable :
42
<pre>
43
emacs define_var.sh
44
</pre>
45
in this file define properly CUDA_ROOT, CULA_ROOT and YoGA path. Note that for the latter, as YoGA is distributed with SUTrA you should just point to the newly created trunk directory. On a Linux system you should normally have:
46
<pre>
47
export CUDA_ROOT=/usr/local/cuda
48
export CULA_ROOT=/usr/local/cula
49
export YOGA_DIR=/home/MyUserName/path2compass/trunk
50
</pre>
51 12 Damien Gratadour
in this file, you also have to indicate the proper architecture of your GPU so as the compiler will generate the appropriate code. Modify the following line:
52 13 Damien Gratadour
<pre>
53 12 Damien Gratadour
export GENCODE="arch=compute_12,code=sm_12"
54
</pre>
55 24 pierre kestener
and change both 12 to your architecture : for instance a Tesla Fermi will have 2.0 computing capabilities so change 12 to 20, a Kepler GPU will have 3.0 or 3.5 (K20) computing capabilities, change 12 to 30 (or 35).
56 1 Damien Gratadour
57
Once this is done, you're ready to compile the whole library. First run define_var.sh to define the system variables that will be used during the compilation process:
58
<pre>
59
./define_var.sh
60
</pre>
61
62
then identify the absolute path to your Yorick executable using: 
63
<pre>
64
which yorick
65
</pre>
66
and run the compilation script:
67
<pre>
68 6 Damien Gratadour
./reinstall absolute_path_to_yorick
69 1 Damien Gratadour
</pre>
70
71 7 Damien Gratadour
If you did not get any error, CArMA, SuTrA and YoGA are now installed on your machine. You can check that everything is working by launching a GUI to test a simulation:
72 1 Damien Gratadour
<pre>
73
yorick -i yoga_ao/ywidgets/widget_ao.i
74
</pre>
75 14 Arnaud Sevin
76
h1. Install the platform with MAGMA
77
78
h2. Why MAGMA ?
79
80
The MAGMA project aims to develop a dense linear algebra library similar to LAPACK but for heterogeneous/hybrid architectures, starting with current "Multicore+GPU" systems.
81
82
Unlike CULA, MAGMA propose a dense linear algebra library handling double for free.
83
84
But MAGMA needs a LAPACK and a BLAS implementation. Actually, we try two options : ATLAS BLAS (free, easy to install) and MKL (free, need a registration but more powerful)
85
86 18 Arnaud Sevin
h2. Dependencies : gfortran
87
88
Use your package manager to install dependencies:
89
* on scientific linux : yum install gcc-gfortran libgfortran
90
* on debian : apt-get install gfortran gfortran-multilib
91
92 14 Arnaud Sevin
h2. Configure MAGMA with ATLAS
93
94
h3. Dependencies : blas, lapack, atlas
95
96
Use your package manager to install dependencies:
97
* on scientific linux : yum install blas-devel lapack-devel atlas-devel
98 1 Damien Gratadour
* on debian : apt-get install libblas-dev liblapack-dev libatlas-base-dev libatlas-dev
99 24 pierre kestener
100
The binary packages of ATLAS (and also OpenBLAS / GotoBLAS2) distributed by your Linux distribution (SL, Fedora, Debian,...) are generic packages, which are not optimized for a specific machine.
101
It is strongly advised to recompile ATLAS on your local machine to get best performances.
102 14 Arnaud Sevin
103 26 Arnaud Sevin
debian easy method :
104
sudo apt-get build-dep atlas
105
apt-get source atlas
106
cd atlas-*
107
sudo fakeroot debian/rules custom
108
cd ..
109
ls libatlas*.deb
110
111
Then, for each of the entries listed by the ls command (there may be a quicker way to do it, using "*"), type:
112
sudo dpkg -i <filename here>.deb
113
114
This should install an optimised build of the version of ATLAS that Ubuntu provides. There's a more recent version, but it doesn't come with the easy "debian/rules" stuff, so this method won't work.
115
116 25 pierre kestener
IMPORTANT NOTE: when building ATLAS, you must ensure that cpu throtling is disabled (if not timing measurement are erroneous, which may lead to an unoptimized build of ATLAS); see page
117
http://math-atlas.sourceforge.net/atlas_install/node5.html
118
119
120 14 Arnaud Sevin
h3. extraction
121
122
MAGMA is available here : http://icl.cs.utk.edu/magma/software/index.html
123
124
extract the tgz file and go into the new directory
125
> ~$ tar xf magma-1.4.1-beta.tar.gz
126
> ~$ cd magma-1.4.1
127
128
h3. configuration
129
130
You have to create your own make.inc :
131
132
* example on a scientific linux : *please verify GPU_TARGET, LAPACKDIR, ATLASDIR, CUDADIR*
133
134
<pre><code class="Makefile">
135
#//////////////////////////////////////////////////////////////////////////////
136
#   -- MAGMA (version 1.4.1) --
137
#      Univ. of Tennessee, Knoxville
138
#      Univ. of California, Berkeley
139
#      Univ. of Colorado, Denver
140
#      November 2013
141
#//////////////////////////////////////////////////////////////////////////////
142
143
# GPU_TARGET specifies for which GPU you want to compile MAGMA:
144
#     "Tesla"  (NVIDIA compute capability 1.x cards)
145
#     "Fermi"  (NVIDIA compute capability 2.x cards)
146
#     "Kepler" (NVIDIA compute capability 3.x cards)
147
# See http://developer.nvidia.com/cuda-gpus
148
149
GPU_TARGET ?= Fermi
150
151
CC        = gcc
152
NVCC      = nvcc
153
FORT      = gfortran
154
155
ARCH      = ar
156
ARCHFLAGS = cr
157
RANLIB    = ranlib
158
159
OPTS      = -fPIC -O3 -DADD_ -fopenmp -DMAGMA_SETAFFINITY
160
F77OPTS   = -fPIC -O3 -DADD_
161
FOPTS     = -fPIC -O3 -DADD_ -x f95-cpp-input
162
NVOPTS    =       -O3 -DADD_ -Xcompiler "-fno-strict-aliasing -fPIC"
163
LDOPTS    = -fPIC -fopenmp
164
165
# Depending on how ATLAS and LAPACK were compiled, you may need one or more of:
166
LIB       = -llapack -lf77blas -latlas -lcblas -lcublas -lcudart -lstdc++ -lm -lgfortran
167
168
# define library directories here or in your environment
169
LAPACKDIR = /usr/lib64
170
ATLASDIR  = /usr/lib64/atlas
171
CUDADIR   = /usr/local/cuda
172
173
LIBDIR    = -L$(LAPACKDIR) \
174
            -L$(ATLASDIR) \
175
            -L$(CUDADIR)/lib64
176
177
INC       = -I$(CUDADIR)/include
178
</code></pre>
179
180 20 Arnaud Sevin
* example on debian : *please verify GPU_TARGET, LAPACKDIR, ATLASDIR, CUDADIR*
181 17 Arnaud Sevin
<pre><code class="Makefile">
182
#//////////////////////////////////////////////////////////////////////////////
183
#   -- MAGMA (version 1.4.1) --
184
#      Univ. of Tennessee, Knoxville
185
#      Univ. of California, Berkeley
186
#      Univ. of Colorado, Denver
187
#      November 2013
188
#//////////////////////////////////////////////////////////////////////////////
189
190
# GPU_TARGET specifies for which GPU you want to compile MAGMA:
191
#     "Tesla"  (NVIDIA compute capability 1.x cards)
192
#     "Fermi"  (NVIDIA compute capability 2.x cards)
193
#     "Kepler" (NVIDIA compute capability 3.x cards)
194
# See http://developer.nvidia.com/cuda-gpus
195
196
GPU_TARGET ?= Fermi
197
198
CC        = gcc
199
NVCC      = nvcc
200
FORT      = gfortran
201
202
ARCH      = ar
203
ARCHFLAGS = cr
204
RANLIB    = ranlib
205
206
OPTS      = -fPIC -O3 -DADD_ -fopenmp -DMAGMA_SETAFFINITY
207
F77OPTS   = -fPIC -O3 -DADD_
208
FOPTS     = -fPIC -O3 -DADD_ -x f95-cpp-input
209
NVOPTS    =       -O3 -DADD_ -Xcompiler "-fno-strict-aliasing -fPIC" 
210
LDOPTS    = -fPIC -fopenmp
211
212
# Depending on how ATLAS and LAPACK were compiled, you may need one or more of:
213
LIB       = -llapack -lf77blas -latlas -lcblas -lcublas -lcudart -lstdc++ -lm -lgfortran
214
215
# define library directories here or in your environment
216
LAPACKDIR = /usr/lib
217
ATLASDIR  = /usr/lib
218
CUDADIR   = /usr/local/cuda
219
220
LIBDIR    = -L$(LAPACKDIR) \
221
            -L$(ATLASDIR) \
222 19 Arnaud Sevin
            -L$(CUDADIR)/lib64 \
223
            -L/usr/lib/x86_64-linux-gnu
224 17 Arnaud Sevin
225
INC       = -I$(CUDADIR)/include
226
</code></pre>
227 14 Arnaud Sevin
228
h2. Configure MAGMA with MKL
229
230
h3. extraction
231
232
To download MKL, you have to create a account here : https://registrationcenter.intel.com/RegCenter/NComForm.aspx?ProductID=1517
233
234
extract l_ccompxe_2013_sp1.1.106.tgz and go into l_ccompxe_2013_sp1.1.106
235
236
install it with ./install_GUI.sh and add IPP stuff to default choices
237
238
h3. configuration
239 1 Damien Gratadour
240 23 Arnaud Sevin
* example on debian : *please verify GPU_TARGET, MKLROOT, CUDADIR*
241 20 Arnaud Sevin
<pre><code class="Makefile">
242
#//////////////////////////////////////////////////////////////////////////////
243
#   -- MAGMA (version 1.4.1-beta2) --
244
#      Univ. of Tennessee, Knoxville
245
#      Univ. of California, Berkeley
246
#      Univ. of Colorado, Denver
247
#      December 2013
248
#//////////////////////////////////////////////////////////////////////////////
249
250
# GPU_TARGET contains one or more of Tesla, Fermi, or Kepler,
251
# to specify for which GPUs you want to compile MAGMA:
252
#     Tesla  - NVIDIA compute capability 1.x cards
253
#     Fermi  - NVIDIA compute capability 2.x cards
254
#     Kepler - NVIDIA compute capability 3.x cards
255
# The default is all, "Tesla Fermi Kepler".
256
# See http://developer.nvidia.com/cuda-gpus
257
#
258
GPU_TARGET ?= Fermi
259
260
CC        = gcc
261
NVCC      = nvcc
262
FORT      = gfortran
263
264
ARCH      = ar
265
ARCHFLAGS = cr
266
RANLIB    = ranlib
267
268
OPTS      = -fPIC -O3 -DADD_ -Wall -fno-strict-aliasing -fopenmp -DMAGMA_WITH_MKL -DMAGMA_SETAFFINITY
269
F77OPTS   = -fPIC -O3 -DADD_ -Wall
270
FOPTS     = -fPIC -O3 -DADD_ -Wall -x f95-cpp-input
271
NVOPTS    =       -O3 -DADD_ -Xcompiler "-fno-strict-aliasing -fPIC"
272
LDOPTS    = -fPIC -fopenmp
273
274
# gcc with MKL 10.3, Intel threads
275
LIB       = -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -lpthread -lcublas -lcudart -lstdc++ -lm -liomp5 -lgfortran
276
277
# define library directories preferably in your environment, or here.
278
# for MKL run, e.g.: source /opt/intel/composerxe/mkl/bin/mklvars.sh intel64
279
MKLROOT ?= /opt/intel/composerxe/mkl
280
CUDADIR ?= /usr/local/cuda
281
-include make.check-mkl
282
-include make.check-cuda
283
284
LIBDIR    = -L$(MKLROOT)/lib/intel64 \
285
            -L$(CUDADIR)/lib64
286
287
INC       = -I$(CUDADIR)/include -I$(MKLROOT)/include
288
</code></pre>
289 14 Arnaud Sevin
290 22 Arnaud Sevin
In this example, I use gcc but with MKL, you can use icc instead of gcc. In this case, you have to compile yorick with icc. For this, you have to change the CC flag in Make.cfg  
291 21 Arnaud Sevin
292 14 Arnaud Sevin
h2. compilation and installation
293
294
h3. compilation
295
296
just compile the shared target (and test if you want)
297
> ~$ make -j 8 shared
298
299
h3. installation
300
301
To install libraries and include files in a given prefix, run:
302
> ~$ make install prefix=/usr/local/magma
303
  
304
The default prefix is /usr/local/magma. You can also set prefix in make.inc.
305
306
h3. tune (not tested)
307
308 15 Arnaud Sevin
For multi-GPU functions, set $MAGMA_NUM_GPUS to set the number of GPUs to use.
309
For multi-core BLAS libraries, set $OMP_NUM_THREADS or $MKL_NUM_THREADS or $VECLIB_MAXIMUM_THREADS to set the number of CPU threads, depending on your BLAS library.
310
311
h2. Platform installation
312
313
Just just define $MAGMA_PATH and use the standard procedure