I think that the data set and the algorithm implementations should be placed in separate repositories.
One for the data set, and one for each family of algorithms: the matrix-based algorithms and the (G)LL-LR-based algorithms. We should also provide a way to integrate new implementations into the benchmark. I think this can be done with git submodules: each implementation can be included as a submodule.
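
To make the submodule idea more concrete, here is a rough sketch of how the benchmark repository could discover which implementations are registered, by reading its `.gitmodules` file. Everything specific in it is an assumption for illustration only: the `algorithms/` directory prefix, the submodule names, the `example.org` URLs, and the helper names `list_submodules` and `list_implementations` are all hypothetical.

```python
import configparser

# Hypothetical .gitmodules content for the benchmark repository.
# The paths and URLs are placeholders, not real repositories.
SAMPLE_GITMODULES = """\
[submodule "dataset"]
    path = dataset
    url = https://example.org/benchmark/dataset.git
[submodule "algorithms/matrix-based"]
    path = algorithms/matrix-based
    url = https://example.org/benchmark/matrix-based.git
[submodule "algorithms/gll-lr-based"]
    path = algorithms/gll-lr-based
    url = https://example.org/benchmark/gll-lr-based.git
"""


def list_submodules(gitmodules_text):
    """Return {name: {"path": ..., "url": ...}} for every submodule entry."""
    # Interpolation is disabled so that '%' in a URL is read literally.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(gitmodules_text)

    submodules = {}
    for section in parser.sections():
        # Section headers in .gitmodules look like: submodule "name"
        if section.startswith('submodule "') and section.endswith('"'):
            name = section[len('submodule "'):-1]
            submodules[name] = {
                "path": parser[section]["path"],
                "url": parser[section]["url"],
            }
    return submodules


def list_implementations(gitmodules_text, prefix="algorithms/"):
    """Keep only submodules under the (assumed) implementations directory."""
    return {
        name: info
        for name, info in list_submodules(gitmodules_text).items()
        if info["path"].startswith(prefix)
    }


if __name__ == "__main__":
    for name, info in sorted(list_implementations(SAMPLE_GITMODULES).items()):
        print(name, info["url"])
```

With a layout like this, adding a new implementation would amount to adding one more submodule under the agreed directory, and the benchmark runner would pick it up without any other change.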