Software Tools
The Graph-Massivizer Toolkit
The Graph-Massivizer project will deliver a toolkit consisting of different tools. The acompanying picture (we can also say the picture above or below, depending on where you place the picture) depicts an architecture container diagram that illustrates how the tools and stakeholders interface and interact with each other into a combined integrated metaphactory platform.
- The metaphactory platform provides a graphical front-end for users to visualise and interact with the toolkit and to query the platform for information. Within the toolkit, complex operational pathways enable users to interact with separate tools or complex workflow combinations;
- The metaphactory platform contains components that enable KG management, such as modelling, visualization, and graph editing.
Graph-Massivizer Workflow
A typical workflow enables complex operations that call each tool sequentially as a pipeline. This operational pattern enables various classic optimisations to improve performance, energy or other trade-offs. Common workflows and pipelines in a deployed toolkit configure each tool for running a default execution pipeline, with users potentially re-configuring (or modifying, for advanced users) parts depending on the Use Case.
- The first tool in the pipeline is the Graph-Inceptor, which the toolkit uses to initialize and handle massive graph ingestion and storage. Graph-Inceptor supports BGOs that create graphs and move them to the graph storage in coordination with the graph database when necessary;
- Graph-Scrutinizer uses this graph data, which supports BGOs to analyze and perform reasoning, including graph sampling and probabilistic reasoning;
- Graph-Optimizer optimizes the workflow by selecting the best-performing mix of BGO (Basic Graph Operations) implementations for the workflow.
- Graph-Greenifier analyses the graph data and processing metrics for performance and sustainability optimizations.
- Finally, the Graph-Choreographer leverages information provided by the performance and sustainability models to enable serverless BGOs and workflow processing by deploying on the compute infrastructure.
Each tool also has an internal log accessed by the other platform tools for general inspection, development and debugging. The final components external to the Graph-Massivizer toolkit, depicted to the left and bottom of the figure, include data sources and the computing infrastructure.
- Data sources are always accessible to the toolkit, independent of the pipeline and the graph created by Graph-Inceptor. Data providers can supply files, streams, databases, APIs for external graph resources, or anything that can store or represent graphs;
- All tools use the computing continuum ranging from high-performance computers, cloud and edge devices to traditional local machines with their own hardware, storage and graph processing workload allocation method;
- Graph database is a shared resource comprising graph storage with SPARQL query access and various formats (RDF and RDF-star) and representations used in the toolkit.