Could you please provide a benchmark (the argument values used for each experiment) so that the results can be reproduced accurately? Perhaps something similar to the Loglizer repository, which has a benchmarks folder.