/articles/toma

To get this branch, use:
bzr branch http://darksoft.org/webbzr/articles/toma
34 by Suren A. Chilingaryan
Fix cross-references and some latex complaints
1
\begin{table}[htb] %[htbp]
27 by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished
2
\begin{threeparttable}
3
\caption{\label{tbl:hybrid} Performance and configuration of hybrid back-projection kernel}
4
\centering
5
\noindent
6
%\resizebox{\columnwidth}{!}{\begin{tabular}{} ... \end{\tabular}}
7
\begin{tabularx}{\columnwidth}{ | X  c | r | l l l l l l | }
8
\hline
9
%& & & \multicolumn{5}{c|}{Configuration} \\
10
%\mhd{|c}{GPU} & \mhd{c|}{Slices} & \mhd{c|}{Perf.} & \mhd{c}{Area} & \mhd{c}{Blocks} & \mhd{c}{L1/SM} & \mhd{c}{CC} & \mhd{c|}{PaO} \\
11
& & \mhd{c|}{Perf} & \multicolumn{6}{c|}{Configuration} \\
32 by Suren A. Chilingaryan
Fix algorithms and separate them into the files as well
12
\mhd{|c}{GPU} & \mhd{c|}{$n_v$} & GU/s & \mhd{c}{T/A} &\mhd{c}{$n_q$} & \mhd{c}{$s_d$} & \mhd{c}{U} & \mhd{c}{R} & \mhd{c|}{O} \\
27 by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished
13
%&            & Hybrid        & T/A     & Px.         &  Pr.           & U        & Rnd.    & Occ.        \\
14
\hline                                                                                         
15
                                                                                         
16
\multirow{3}{*}{GTX980}
48 by Suren A. Chilingaryan
Fix tables: Maxwell/Pascal only use fancy round if half-float numbers are utilized
17
& 1          & 266            &  3/5     & 16          & 8              & -        & SFU     &100\%      \\
27 by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished
18
& 2          & 389            &  1/1     & 4           & 16             & -        & SFU     &100\%      \\
19
\hline                                                                                   
20
                                                                                         
21
\multirow{3}{*}{Titan}
48 by Suren A. Chilingaryan
Fix tables: Maxwell/Pascal only use fancy round if half-float numbers are utilized
22
& 1          & 734            &  3/5     & 16          & 8              & -        & SFU     &100\%      \\
27 by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished
23
& 2          & 995            &  1/1     & 4           & 16             & -        & SFU     &100\%      \\
24
\hline                                                                                         
25
                                                                                   
26
       
27
\end{tabularx}
28
\begin{tablenotes}
35 by Suren A. Chilingaryan
Few more fixes
29
\item The table summarizes the performance and optimal configuration for the hybrid back-projection kernel. Both texture engine and ALUs are used to perform interpolation. The configuration specifies: \tblcol{T/A} - is a ratio between the blocks executing \textbf{T}exture-based reconstruction and the blocks running \textbf{A}LU-based algorithm, \tblcol{$n_q$} - a number of pixels per thread, \tblcol{$s_d$} - a number of cached projections, \tblcol{U} - unrolling hint for inner projection loop, \tblcol{R} - the units to perform rounding and type conversions (index is always computed using SFU), \tblcol{O} - the requested occupancy. The caches are configured as specified in \tablename~\ref{tbl:cacheconf}. The number of threads to cache a projection row is determined according to guidelines in \tablename~\ref{tbl:shmemconf}.
27 by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished
30
\end{tablenotes}
31
\end{threeparttable}
32
\end{table}