Big Data and AI Pipeline Framework Technology Analysis from a Benchmarking Perspective

Big Data and AI Pipeline patterns provide a good foundation for the analysis and selection of technical architectures for Big Data and AI systems. Expe- riences from many projects in the Big Data PPP program has shown that a number of projects use similar architectural patterns with variations only in the choice of vari- ous technology components in the same pattern. The project DataBench has devel- oped a Big Data and AI pipeline framework, which is used for the description of pipeline steps in Big Data and AI projects, and supports the classification of bench- marks. This includes the four pipeline steps of Data Acquisition/Collection and Storage, Data Preparation and Curation, Data Analytics with AI/Machine Learning, and Action and Interaction, including Data Visualization and User Interaction as well as API Access. It has also created a toolbox which supports the identification and use of existing benchmarks according to these steps in addition to all of the different technical areas and different data types in the BDV Reference Model. An observatory, which is a tool, accessed via the toolbox, for observing the popularity, importance and the visibility of topic terms related to Artificial Intelligence and Big Data technologies has also been developed and is described in this chapter.

Keywords: Benchmarking · Big data and AI pipeline · Blueprint · Toolbox · Observatory

Excerpt from: J. Berre A, et al. (2021) Big Data and AI Pipeline Framework: Technology Analysis from a Benchmarking Perspective. In: Curry E., Auer S., Berre A. J., Metzger A., Perez M. S., Zillner S. (eds) Technologies and Applications for Big Data Value. Springer, Cham.