bzr branch
http://darksoft.org/webbzr/articles/toma
34
by Suren A. Chilingaryan
Fix cross-references and some latex complaints |
1 |
\begin{table}[htb] %[htbp] |
27
by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished |
2 |
\begin{threeparttable} |
3 |
\caption{\label{tbl:hybrid} Performance and configuration of hybrid back-projection kernel} |
|
4 |
\centering
|
|
5 |
\noindent
|
|
6 |
%\resizebox{\columnwidth}{!}{\begin{tabular}{} ... \end{\tabular}}
|
|
7 |
\begin{tabularx}{\columnwidth}{ | X c | r | l l l l l l | } |
|
8 |
\hline
|
|
9 |
%& & & \multicolumn{5}{c|}{Configuration} \\
|
|
10 |
%\mhd{|c}{GPU} & \mhd{c|}{Slices} & \mhd{c|}{Perf.} & \mhd{c}{Area} & \mhd{c}{Blocks} & \mhd{c}{L1/SM} & \mhd{c}{CC} & \mhd{c|}{PaO} \\
|
|
11 |
& & \mhd{c|}{Perf} & \multicolumn{6}{c|}{Configuration} \\ |
|
32
by Suren A. Chilingaryan
Fix algorithms and separate them into the files as well |
12 |
\mhd{|c}{GPU} & \mhd{c|}{$n_v$} & GU/s & \mhd{c}{T/A} &\mhd{c}{$n_q$} & \mhd{c}{$s_d$} & \mhd{c}{U} & \mhd{c}{R} & \mhd{c|}{O} \\ |
27
by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished |
13 |
%& & Hybrid & T/A & Px. & Pr. & U & Rnd. & Occ. \\
|
14 |
\hline
|
|
15 |
||
16 |
\multirow{3}{*}{GTX980} |
|
48
by Suren A. Chilingaryan
Fix tables: Maxwell/Pascal only use fancy round if half-float numbers are utilized |
17 |
& 1 & 266 & 3/5 & 16 & 8 & - & SFU &100\% \\ |
27
by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished |
18 |
& 2 & 389 & 1/1 & 4 & 16 & - & SFU &100\% \\ |
19 |
\hline
|
|
20 |
||
21 |
\multirow{3}{*}{Titan} |
|
48
by Suren A. Chilingaryan
Fix tables: Maxwell/Pascal only use fancy round if half-float numbers are utilized |
22 |
& 1 & 734 & 3/5 & 16 & 8 & - & SFU &100\% \\ |
27
by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished |
23 |
& 2 & 995 & 1/1 & 4 & 16 & - & SFU &100\% \\ |
24 |
\hline
|
|
25 |
||
26 |
||
27 |
\end{tabularx} |
|
28 |
\begin{tablenotes} |
|
35
by Suren A. Chilingaryan
Few more fixes |
29 |
\item The table summarizes the performance and optimal configuration for the hybrid back-projection kernel. Both texture engine and ALUs are used to perform interpolation. The configuration specifies: \tblcol{T/A} - is a ratio between the blocks executing \textbf{T}exture-based reconstruction and the blocks running \textbf{A}LU-based algorithm, \tblcol{$n_q$} - a number of pixels per thread, \tblcol{$s_d$} - a number of cached projections, \tblcol{U} - unrolling hint for inner projection loop, \tblcol{R} - the units to perform rounding and type conversions (index is always computed using SFU), \tblcol{O} - the requested occupancy. The caches are configured as specified in \tablename~\ref{tbl:cacheconf}. The number of threads to cache a projection row is determined according to guidelines in \tablename~\ref{tbl:shmemconf}. |
27
by Suren A. Chilingaryan
Alutex/Oversampling tables are fixed and hybrid section is finished |
30 |
\end{tablenotes} |
31 |
\end{threeparttable} |
|
32 |
\end{table} |