Final Report Summary - FTK (Fast Tracker for Hadron Collider Experiments)
Work performed since the beginning of the project: electronic boards were developed for the FTK Processor [2], an approved ATLAS upgrade. The implemented system is based on the optimal mapping of a complex algorithm onto different technologies: a combination of high performances dedicated hardware and distinctive flexibility of general-purpose but lower-performance CPUs. A key role in the architecture is played by high-performance field programmable gate arrays (FPGAs), while most of the computing power is provided by full-custom ASICs named AM. We built a powerful highly parallel dedicated hardware that provides excellent performance, reaching resolutions, efficiencies and fake track rejection typical of offline algorithms. The latencies are short (about one microseconds), power usage is low (the AM chip, has a power consumption below 3 W for 1M fit/10ns), and the system is small (the boards contained in 4 racks perform a task that would require a farm of O(1k) of commercial CPUs). The AM, the central device of our system, store each pattern in a single memory location consisting of 8 independent words of 16 bits each. Each of these words is separately compared to its own input data stream [1]; the coincidence of the 8 words matches, found at any time during data arrival, determines the match of the pattern. FPGAs control, configure and handle the AMs, providing the flexible computing power to process the selected patterns. Distributed debugging and monitoring tools suited for a pipelined, highly parallelized structure and a high degree of configurability have been developed to cope with different applications.
Main goals achieved: We have built the AM [4] and clustering [5] system prototypes, tested and integrated them in the global FTK system at CERN. The design of the AM system was a particularly challenging task for the following factors: the high pattern density (8M patterns/board), required a large silicon area for the AM ASICs; the I/O signal congestion at the board level, required heavy use of serial links; and the power limitation due to the maximum cooling power cooling resulted in setting the maximum power to 250 W per AM board. The extreme challenge presented by the AM chip design forced us to adapt the project schedule however it did not impact any of the project milestones. The infrastructures (crates, heatsinks, fans, power supplies) were studied and optimized [6]. The described system was developed and tested for HEP detectors and it is expected to highly increase the physics discovery potential of ATLAS however this system is essentially a versatile and powerful image reconstruction system adaptable for generic image processing applications. The AM system applies a filter that can be applied to images of different kinds. The AM-based processor can simulate the preliminary stages of image processing performed by the brain for vision, such as the identification of shape edges [6]. In the second two years of the project the main aim was in fact demonstrating the potentiality of the AM-processor to reconstruct medical images. In collaboration with various experts in medical image reconstruction the AM-system has been used to reconstruct Magnetic Resonance Imaging (MRI) of the brain with the aim of recognizing MRI area with sign of degenerative diseases. We have used our AM based pattern reconstruction to produce an image pre-processing that implements a 3D edge enhancing filter. This first stage allows a huge decrease of the image processing time (more than a factor 1000) [8, 9] giving the possibility to make this diagnostic exam applicable to a wider audience of patients.
[1] Annovi, A., et al. "VLSI Processor for Fast Track Finding Based on Content Addressable Memories" - IEEE Trans. Nucl. Sci. 53, 4, Aug(2006), 2428-2433.
[2] Andreani A. et al. "The FastTracker Real Time Processor and Its Impact on Muon Isolation, Tau and b-Jet Online Selections at ATLAS"- IEEE Trans. Nucl. Sci. 59, 2, 348-357.
[4] S. Citraro et al., “Highly Parallelized Pattern Matching Hardware for Fast Tracking at Hadron Colliders” - IEEE Trans. Nucl. Sci., 63, 2, Apr(2016), 1147 - 1154
[5] Sotiropoulou, C.-L. et al. "A Multi-Core FPGA-based 2D-Clustering Implementation for Real-Time Image Processing" - IEEE Trans. Nucl. Sci. 61, 6, Dec(2014), 3599-3606.
[6] IEEE Real Time 2016, Padova. C.Sotiropoulou et al.; “The Associative Memory System Infrastructures for the ATLAS Fast Tracker” to be published IEEE Trans. Nucl. Sci.
[7] Del Viva, M., Punzi, G., and Benedetti, D. 2013. "Information and Perception of Meaningful Patterns" - PloS one 8.7 (July 2013): e69154.
[8] IEEE Real Time 2016, Padova. P. Luciano et al.; “Brain Emulation for Image Processing” to be published IEEE Trans. on Nuclear Science
[9] IEEE NSS/MIC 2016, Strasbourg. C. Sotiropoulou et al. “Medical Image Processing Using Brain Emultion”. Proceeding to be published as IEEE Conference Record.