Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration

Speedup Hardware acceleration Implementation
DOI: 10.48550/arxiv.2201.08603 Publication Date: 2022-01-01
ABSTRACT
The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts an application to accelerate in hardware leave software. Moreover, applications domains such as Extended Reality (XR) offer opportunities for various forms parallel execution, including loop level, task level pipeline parallelism. To assist the process expose every possible parallelism, we present Trireme, fully automated tool-chain explores multiple levels parallelism produces accelerator designs configurations maximize performance, given budget. Experiments on demanding benchmarks from XR revealed speedup up 20x, well 37x smaller applications, compared software-only implementations.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....