replace_blank_node must go

I just realized there is a critical problem with the piece of code below.

https://github.com/UUDigitalHumanitieslab/EDPOP/blob/f7de000426e60a4ba7d9753d62777d8a7620a526/backend/triplestore/utils.py#L74-L80

When I commented on it in #163, @tijmenbaarda [stated](https://github.com/UUDigitalHumanitieslab/EDPOP/pull/163/files#r1594196894) that this function was necessary because Blazegraph did not support blank nodes, so we left it in.

At first sight, this function just replaces real blank nodes by an emulation of blank nodes. However, it actually introduces a risk of collision between unrelated nodes. Real blank nodes are intrinsically distinct; they can only be identified if they occur within the same representation. The emulation replacement, on the other hand, produces URIs consisting of a small random number embedded in a fixed string. It is only a matter of time before the same random number is reused. The duplicates will be generated on different dates, possibly by different WSGI workers, and most likely in the process of serializing different queries. This will compromise data integrity.

As for the argument that Blazegraph does not support blank nodes: this *should* not be true. Blank nodes are too essential to leave them out of any serious RDF implementation. In fact, the Blazegraph wiki [mentions](https://github.com/blazegraph/database/wiki/DataMigration#blank-nodes) support for blank nodes and even an alternative mode of supporting them.

@tijmenbaarda, could you retrace the origin of your belief that Blazegraph does not support blank nodes? I'm sure there is a real problem that needs to be addressed, but we must find an approach that does not involve `replace_blank_node` or anything similar.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace_blank_node must go #172

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

replace_blank_node must go #172

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions