From 5ee966d8b39d0b7caf01817bfecc3b2d494a18ef Mon Sep 17 00:00:00 2001
From: Daniel Obraczka
Date: Wed, 13 Mar 2024 16:32:41 +0100
Subject: [PATCH] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index fac8650..4c9a6bb 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,6 @@
 # !! Update 2024-02-24 (fixed in 1.1.0) !!
 We found that `ent_links` in some cases contained intra-dataset links, which is not immediately noticable by the user.
-Another round of clerical review was performed, transitive links, which were previously missed are added and the `ent_links` files now only contain entity links _between_ the datasets. The `721_5fold` directories have been adapted accordingly.
+Another round of clerical review was performed, (transitive) links, which were previously missed are added and the `ent_links` files now only contain entity links _between_ the datasets. The `721_5fold` directories have been adapted accordingly.
 The intra-dataset links are now in `{dataset_name}_intra_ent_links` for each of the three datasets.
 
 What might also not be immediately obvious is that this dataset can be used as multi-source entity resolution task. We therefore provide a `multi_source_cluster` file with each line consisting of a cluster id and comma-seperated cluster members of the three datasets, which can also include multiple entries for a single dataset.
@@ -60,7 +60,7 @@ For the binary cases each dataset has a `cluster` file in the respective folder.
 For the multi-source setting, you can use the `multi_source_cluster` file in the `data` folder.
 
 Using [`sylloge`](https://github.com/dobraczka/sylloge) you can also easily load this dataset as a multi-source task:
-```
+```python
 from sylloge import MovieGraphBenchmark
 ds = MovieGraphBenchmark(graph_pair='multi')
 ```