Year |
Citation |
Score |
2020 |
Haidar A, Bayraktar H, Tomov S, Dongarra J, Higham NJ. Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems. Proceedings. Mathematical, Physical, and Engineering Sciences. 476: 20200110. PMID 33363437 DOI: 10.1098/rspa.2020.0110 |
0.372 |
|
2020 |
Farhan MA, Abdelfattah A, Tomov S, Gates M, Sukkari D, Haidar A, Rosenberg R, Dongarra J. MAGMA templates for scalable linear algebra on emerging architectures International Journal of High Performance Computing Applications. 109434202093842. DOI: 10.1177/1094342020938421 |
0.34 |
|
2020 |
Abdelfattah A, Tomov S, Dongarra J. Matrix multiplication on batches of small matrices in half and half-complex precisions Journal of Parallel and Distributed Computing. 145: 188-201. DOI: 10.1016/J.Jpdc.2020.07.001 |
0.42 |
|
2020 |
Dongarra J, Gates M, Luszczek P, Tomov S. Translational Process: Mathematical Software Perspective Journal of Computational Science. 101216. DOI: 10.1016/J.Jocs.2020.101216 |
0.382 |
|
2019 |
Zaitsev D, Tomov S, Dongarra J. Solving Linear Diophantine Systems on Parallel Architectures Ieee Transactions On Parallel and Distributed Systems. 30: 1158-1169. DOI: 10.1109/Tpds.2018.2873354 |
0.374 |
|
2018 |
Dongarra J, Gates M, Haidar A, Kurzak J, Luszczek P, Tomov S, Yamazaki I. The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme Scale Siam Review. 60: 808-865. DOI: 10.1137/17M1117732 |
0.36 |
|
2018 |
Abdelfattah A, Haidar A, Tomov S, Dongarra J. Analysis and Design Techniques towards High-Performance and Energy-Efficient Dense Linear Solvers on GPUs Ieee Transactions On Parallel and Distributed Systems. 29: 2700-2712. DOI: 10.1109/Tpds.2018.2842785 |
0.391 |
|
2018 |
Haidar A, Abdelfattah A, Zounon M, Tomov S, Dongarra J. A Guide for Achieving High Performance with Very Small Matrices on GPU: A Case Study of Batched LU and Cholesky Factorizations Ieee Transactions On Parallel and Distributed Systems. 29: 973-984. DOI: 10.1109/Tpds.2017.2783929 |
0.448 |
|
2018 |
Abdelfattah A, Haidar A, Tomov S, Dongarra J. Batched one-sided factorizations of tiny matrices using GPUs: Challenges and countermeasures Journal of Computational Science. 26: 226-236. DOI: 10.1016/J.Jocs.2018.01.005 |
0.427 |
|
2017 |
Dongarra J, Tomov S, Luszczek P, Kurzak J, Gates M, Yamazaki I, Anzt H, Haidar A, Abdelfattah A. With Extreme Computing, the Rules Have Changed Computing in Science & Engineering. 19: 52-62. DOI: 10.1109/Mcse.2017.48 |
0.395 |
|
2017 |
Yamazaki I, Nooshabadi S, Tomov S, Dongarra J. Structure-Aware Linear Solver for Realtime Convex Optimization for Embedded Systems Ieee Embedded Systems Letters. 9: 61-64. DOI: 10.1109/Les.2017.2700401 |
0.342 |
|
2017 |
Baboulin M, Dongarra J, Rémy A, Tomov S, Yamazaki I. Solving dense symmetric indefinite systems using GPUs Concurrency and Computation: Practice and Experience. 29: e4055. DOI: 10.1002/Cpe.4055 |
0.456 |
|
2016 |
Anzt H, Tomov S, Dongarra J. On the performance and energy efficiency of sparse linear algebra on GPUs The International Journal of High Performance Computing Applications. 31: 375-390. DOI: 10.1177/1094342016672081 |
0.357 |
|
2016 |
Yamazaki I, Tomov S, Dongarra J. Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU Acm Transactions On Mathematical Software. 43: 1-18. DOI: 10.1145/2898347 |
0.435 |
|
2016 |
Abdelfattah A, Anzt H, Dongarra J, Gates M, Haidar A, Kurzak J, Luszczek P, Tomov S, Yamazaki I, YarKhan A. Linear algebra software for large-scale accelerated multicore computing Acta Numerica. 25: 1-160. DOI: 10.1017/S0962492916000015 |
0.438 |
|
2016 |
Yamazaki I, Tomov S, Dongarra J. Non-GPU-resident symmetric indefinite factorization Concurrency and Computation: Practice and Experience. 29: e4012. DOI: 10.1002/Cpe.4012 |
0.355 |
|
2015 |
Dongarra J, Gates M, Haidar A, Jia Y, Kabir K, Luszczek P, Tomov S. HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi Scientific Programming. 2015. DOI: 10.1155/2015/502593 |
0.422 |
|
2015 |
Yamazaki I, Tomov S, Dongarra J. Computing Low-Rank Approximation of a Dense Matrix on Multicore CPUs with a GPU and Its Application to Solving a Hierarchically Semiseparable Linear System of Equations Scientific Programming. 2015. DOI: 10.1155/2015/246019 |
0.434 |
|
2015 |
Yamazaki I, Tomov S, Dongarra J. Mixed-Precision Cholesky QR Factorization and Its Case Studies on Multicore CPU with Multiple GPUs Siam Journal On Scientific Computing. 37: C307-C330. DOI: 10.1137/14M0973773 |
0.434 |
|
2013 |
Baboulin M, Dongarra J, Herrmann J, Tomov S. Accelerating linear system solutions using randomization techniques Acm Transactions On Mathematical Software. 39. DOI: 10.1145/2427023.2427025 |
0.391 |
|
2013 |
Du P, Luszczek P, Tomov S, Dongarra J. Soft error resilient QR factorization for hybrid system with GPGPU Journal of Computational Science. 4: 457-464. DOI: 10.1016/J.Jocs.2013.01.004 |
0.425 |
|
2012 |
Vömel C, Tomov S, Dongarra J. Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems Siam Journal On Scientific Computing. 34: C70-C82. DOI: 10.1137/100806783 |
0.448 |
|
2012 |
Kurzak J, Tomov S, Dongarra J. Autotuning GEMM kernels for the fermi GPU Ieee Transactions On Parallel and Distributed Systems. 23: 2045-2057. DOI: 10.1109/Tpds.2011.311 |
0.424 |
|
2009 |
Baboulin M, Buttari A, Dongarra J, Kurzak J, Langou J, Luszczek P, Tomov S. Accelerating scientific computations with mixed precision algorithms Computer Physics Communications. 180: 2526-2533. DOI: 10.1016/J.Cpc.2008.11.005 |
0.434 |
|
2005 |
Carstensen C, Lazarov R, Tomov S. Explicit and averaging a posteriori error estimates for adaptive finite volume methods Siam Journal On Numerical Analysis. 42: 2496-2521. DOI: 10.1137/S0036142903425422 |
0.589 |
|
2005 |
Tomov S, McGuigan M, Bennett R, Smith G, Spiletic J. Benchmarking and implementation of probability-based simulations on programmable graphics cards Computers and Graphics (Pergamon). 29: 71-80. DOI: 10.1016/J.Cag.2004.11.008 |
0.398 |
|
2002 |
Lazarov R, Tomov S. A posteriori error estimates for finite volume element approximations of convection-diffusion-reaction equations Computational Geosciences. 6: 483-503. DOI: 10.1023/A:1021247300362 |
0.568 |
|
2001 |
Lazarov RD, Tomov SZ, Vassilevski PS. Interior Penalty Discontinuous Approximations of Elliptic Problems Computational Methods in Applied Mathematics Comput. 1: 367-382. DOI: 10.2478/Cmam-2001-0024 |
0.353 |
|
Show low-probability matches. |