YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment



Maxim Kolchin, Peter Wetz, Elmar Kiesling, A Min Tjoa
ITMO University, Russia | TU Wien, Austria

The 16th International Conference on Web Engineering 2016, Lugano, Switzerland

RDF Stream Processing (RSP)

RDF Stream - a potentially infinite sequence of time-varying data elements encoded in RDF

Continuous query - a query registered over streams that in most cases are observed through windows

Query results - similarly to SPARQL, they can be tuples, an RDF dataset, or a new RDF stream
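The windowing idea behind continuous queries can be sketched in a few lines. The `Obs` model and `tumbling_windows` helper below are illustrative assumptions for this deck, not YABench code:

```python
from collections import namedtuple

# A hypothetical, minimal model of an RDF stream: timestamped observations.
Obs = namedtuple("Obs", ["timestamp", "subject", "predicate", "value"])

def tumbling_windows(stream, size):
    """Group a time-ordered stream into non-overlapping windows of `size` time units."""
    windows, current, boundary = [], [], size
    for obs in stream:
        # Close finished windows before placing the next observation.
        while obs.timestamp >= boundary:
            windows.append(current)
            current, boundary = [], boundary + size
        current.append(obs)
    windows.append(current)
    return windows

stream = [Obs(t, "station1", "hasTemperature", 20 + t) for t in range(10)]
print([len(w) for w in tumbling_windows(stream, 5)])  # → [5, 5]
```

A continuous query is then (conceptually) an ordinary query re-evaluated over each window's contents as the stream advances.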

2

State of the art

■ LSBench (2012)
■ SRBench (2012)
■ CSRBench (2013)
■ CityBench (2015)

Details can be found at W3C RSP Community Group’s Wiki: https://www.w3.org/community/rsp/wiki/RSP_Benchmarking

3

Our contribution

■ We propose a benchmarking framework for RDF Stream Processing engines that focuses on correctness and performance
○ Stream generator (generates configurable RDF streams)
○ Oracle (validates correctness of the results)
○ Runner (measures performance of an RSP engine)

■ We run a benchmark with two window-based RDF stream processing engines:
○ C-SPARQL
○ CQELS
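The oracle's core correctness check can be sketched as a set comparison between expected and actual window results. This is a minimal illustration of the idea, not the actual oracle implementation:

```python
def precision_recall(expected, actual):
    """Compare an engine's window output against the oracle's expected results.

    Both arguments are collections of result tuples; duplicates are ignored.
    """
    expected, actual = set(expected), set(actual)
    tp = len(expected & actual)  # true positives: results both sides agree on
    precision = tp / len(actual) if actual else 1.0
    recall = tp / len(expected) if expected else 1.0
    return precision, recall

# One correct result, one missing, one spurious:
print(precision_recall({("s1", 21.0), ("s2", 23.5)},
                       {("s1", 21.0), ("s3", 19.0)}))  # → (0.5, 0.5)
```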

4

Requirements

■ Scalable and configurable input
■ Comprehensive correctness checking
■ Flexible queries
■ Reproducibility

5

Architecture

1. Define tests,
2. Generate data streams,
3. Run the tests with a given engine,
   a. Performance metrics are collected in a separate process,
4. At the end, validate the results with the oracle.

6

Architecture: Reporting tool

7

Validation against CSRBench

We validated the correctness checking functionality of YABench by reproducing the CSRBench* benchmark.

CSRBench defines 7 queries for C-SPARQL, CQELS and SPARQLstream engines.

Datasets, test configurations and results are available online: github.com/YABench/csrbench-validation

*Daniele Dell’Aglio, et al. “On Correctness in RDF Stream Processor Benchmarking”, 2013

8

Validation against CSRBench (C-SPARQL)

Query | CSRBench | YABench
Q1    | ✓        | ✓
Q2    | ✓        | ✓
Q3    | ✓        | ✓*
Q4    | ✓        | ✓
Q5    | ✗        | ✗
Q6    | ✓        | ✓*
Q7    | ✓        | ✓*

* - the results are the same, but due to timing discrepancies some results occasionally appear in the subsequent window

9

Validation against CSRBench (CQELS)

Query | CSRBench | YABench
Q1    | ✓        | ✓
Q2    | ✓        | ✓
Q3    | ✓        | ✓
Q4    | ✗        | ✗
Q5    | ✓        | ✓
Q6    | ✗        | ✗
Q7    | ✗        | ✗**

** - the query did not execute successfully on the CQELS engine; it crashed before returning the query results

10

Benchmark

We reuse the queries introduced by CSRBench, but we are able to parametrize them, e.g., window size, window slide, filter values, etc.
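Such parametrization can be sketched as a simple query template. The window syntax and URIs below are approximations in the spirit of C-SPARQL, not the benchmark's actual queries:

```python
# Hypothetical parametrized query template; window size, slide, and the
# filter threshold are the tunable benchmark parameters.
QUERY_TEMPLATE = """
SELECT ?station ?temp
FROM STREAM <http://example.org/observations> [RANGE {window_size}s STEP {window_slide}s]
WHERE {{
  ?station <http://example.org/hasTemperature> ?temp .
  FILTER(?temp > {threshold})
}}
"""

def make_query(window_size, window_slide, threshold):
    """Instantiate the template for one test configuration."""
    return QUERY_TEMPLATE.format(window_size=window_size,
                                 window_slide=window_slide,
                                 threshold=threshold)

print("[RANGE 10s STEP 5s]" in make_query(10, 5, 30.0))  # → True
```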

Measure:

- Precision and recall,
- Window size, result size, and delay,
- Memory and CPU usage, number of threads

We run each test 10 times to compute the distribution of precision/recall.

Detailed results are available online: github.com/YABench/yabench-one

11

Benchmark: Data Stream Model

A data stream is generated based on:

■ Number of weather stations,

■ Time interval between two observations of a single station,

■ Duration of the stream,

■ A seed for the randomize function
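A stream generator driven by these four parameters might look like the sketch below; the temperature model and naming are illustrative assumptions, not YABench's actual generator:

```python
import random

def generate_stream(num_stations, interval, duration, seed):
    """Generate a synthetic observation stream.

    Each station emits one observation every `interval` time units over
    `duration` time units. A fixed seed makes runs reproducible.
    """
    rng = random.Random(seed)  # seeded RNG => identical streams across runs
    stream = []
    for t in range(0, duration, interval):
        for station in range(num_stations):
            # Placeholder temperature model for illustration.
            stream.append((t, f"station{station}", round(rng.uniform(-5.0, 35.0), 1)))
    return stream

stream = generate_stream(num_stations=2, interval=5, duration=20, seed=42)
print(len(stream))  # → 8 (4 ticks x 2 stations)
```

The seed is what makes the benchmark's "reproducibility" requirement concrete: the same configuration always yields the same stream.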

12

Benchmark: Queries

Experiment 1: SELECT + FILTER

Experiment 2: SELECT + AVG + FILTER

Experiment 3: joining of triples from different timestamps

Experiment 4: demonstrates the use of gracious mode, which is implemented by the oracle to eliminate the timing discrepancy issues of the engines

13

Experiment 1 (precision/recall): 50 stations

14

Experiment 1 (precision/recall): 1000 stations

15

Experiment 1 (precision/recall): 10000 stations

16

Experiment 1 (memory usage): 50 stations

17

Experiment 1 (memory usage): 1000 stations

18

Experiment 1 (memory usage): 10000 stations

19

Experiment 1 (delay): 50 stations

20

Experiment 1 (delay): 1000 stations

21

Experiment 1 (delay): 10000 stations

22

Experiment 1 (C-SPARQL): delay vs result size

23

Architecture: Gracious mode

In this mode the oracle tries to adjust its window scope to match the scope of the engine's actual window, by moving the left and right borders back and/or forth as long as precision and recall improve.

It allows us to:

(a) confirm our assumption about why precision and recall are low,
(b) reconstruct and visualize the actual window borders
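The border adjustment can be sketched as a search over shifted window borders, keeping the pair that maximizes F1 against the engine's output. This toy version uses a bounded exhaustive search rather than the incremental adjustment the slides describe; `expected_fn` is a hypothetical oracle callback:

```python
def gracious_adjust(expected_fn, actual, left, right, max_shift=5):
    """Shift the expected window's borders to best match the engine's output.

    `expected_fn(l, r)` returns the oracle's expected result set for window
    borders [l, r); `actual` is the engine's reported result set.
    """
    def f1(l, r):
        exp, act = expected_fn(l, r), set(actual)
        tp = len(exp & act)
        p = tp / len(act) if act else 1.0
        rec = tp / len(exp) if exp else 1.0
        return 2 * p * rec / (p + rec) if p + rec else 0.0

    best = (f1(left, right), left, right)
    for dl in range(-max_shift, max_shift + 1):
        for dr in range(-max_shift, max_shift + 1):
            score = f1(left + dl, right + dr)
            if score > best[0]:
                best = (score, left + dl, right + dr)
    return best  # (best F1, reconstructed left border, reconstructed right border)

# Toy stream: value t arrives at time t. The engine actually evaluated [2, 12)
# instead of the requested [0, 10); gracious mode recovers those borders.
expected_fn = lambda l, r: {t for t in range(20) if l <= t < r}
print(gracious_adjust(expected_fn, list(range(2, 12)), 0, 10))  # → (1.0, 2, 12)
```

When the reconstructed borders differ from the requested ones, that gap is exactly the timing discrepancy the slides attribute to the engines.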

24

Experiment 4: gracious vs non-gracious modes

(a) In non-gracious (default) mode (b) In gracious mode

C-SPARQL

25

Experiment 4: gracious vs non-gracious modes

(a) In non-gracious (default) mode (b) In gracious mode

CQELS

26

Conclusion

■ We built a framework for benchmarking RSP engines that allows assessing their correctness and performance

■ We ran a benchmark that revealed some insights:
○ CQELS shows better precision/recall for simple queries,
○ C-SPARQL is slightly more memory efficient than CQELS,
○ C-SPARQL outperforms CQELS in terms of delay for more complex queries, which is mainly caused by a different reporting strategy

■ By introducing gracious mode we’re able to estimate the extent of the timing discrepancy

27

Thank you!

github.com/YABench
