Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/11212
DC FieldValueLanguage
dc.contributor.authorBaboulin, Marc-
dc.contributor.authorDongarra, Jack-
dc.contributor.authorTomov, Stanimire-
dc.date.accessioned2009-08-27T10:29:12Z-
dc.date.available2009-08-27T10:29:12Z-
dc.date.issued2008-
dc.identifier.citationPré-Publicações DMUC. 08-53 (2008)en_US
dc.identifier.urihttps://hdl.handle.net/10316/11212-
dc.description.abstractIf multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system.en_US
dc.language.isoengen_US
dc.publisherCentro de Matemática da Universidade de Coimbraen_US
dc.rightsopenAccesseng
dc.subjectHybrid computingen_US
dc.subjectDense linear algebraen_US
dc.subjectParallel algorithmsen_US
dc.subjectLU factorizationen_US
dc.subjectMulticore processorsen_US
dc.subjectGraphic process unitsen_US
dc.subjectAcceleratorsen_US
dc.titleTowards dense linear algebra for hybrid GPU accelerated manycore systemsen_US
dc.typepreprinten_US
item.grantfulltextopen-
item.fulltextCom Texto completo-
item.openairetypepreprint-
item.languageiso639-1en-
item.openairecristypehttp://purl.org/coar/resource_type/c_816b-
item.cerifentitytypePublications-
Appears in Collections:FCTUC Matemática - Vários
Files in This Item:
Show simple item record

Page view(s) 50

335
checked on Apr 23, 2024

Download(s)

139
checked on Apr 23, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.