FEATURE: Introduce `NodeIdentity` #4868

mhsdesign · 2024-02-02T09:16:18Z

Resolves partially: #4564
Required for a new NodeUriBuilder API (#4552), which in turn "should" solve cross-CR linking: #4441

The NodeIdentity will contain WorkspaceName instead of the ContentStreamId

For most, if not all, use cases, like caching a reference to a node in Fusion's @cache.context or serializing a node into the frontend and passing it around as a query parameter, the workspace name should be used.

That guarantees that a node is still findable after a workspace has been rebased (which can happen at any time, and would change the content stream ID).
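
To illustrate the point about rebasing, here is a minimal sketch. It assumes a simplified, hypothetical `WorkspaceRegistry` that maps workspace names to their current content stream ID; none of these names are the real Neos API. It only shows why a reference stored by content stream ID goes stale after a rebase while a reference stored by workspace name still resolves.

```php
<?php
declare(strict_types=1);

// Hypothetical stand-in for the workspace name -> content stream ID mapping (not the real Neos API).
final class WorkspaceRegistry
{
    /** @var array<string,string> workspace name => current content stream ID */
    private array $contentStreamIdByWorkspaceName = [];

    public function set(string $workspaceName, string $contentStreamId): void
    {
        $this->contentStreamIdByWorkspaceName[$workspaceName] = $contentStreamId;
    }

    public function resolve(string $workspaceName): ?string
    {
        return $this->contentStreamIdByWorkspaceName[$workspaceName] ?? null;
    }
}

$registry = new WorkspaceRegistry();
$registry->set('user-alice', 'cs-1');

// The same node reference, serialized before the rebase, once per strategy.
$byContentStreamId = ['contentStreamId' => 'cs-1', 'nodeAggregateId' => 'homepage'];
$byWorkspaceName = ['workspaceName' => 'user-alice', 'nodeAggregateId' => 'homepage'];

// A rebase points the workspace at a new content stream.
$registry->set('user-alice', 'cs-2');

// The content stream based reference no longer matches the workspace's current stream ...
$staleReference = $byContentStreamId['contentStreamId'] !== $registry->resolve('user-alice');
// ... while the workspace based reference resolves to the current stream.
$resolved = $registry->resolve($byWorkspaceName['workspaceName']);
```
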

Upgrade instructions

Review instructions

#4943 (comment) but maybe manual refresh instead of implicit through handle.

TODOs as discussed with @mhsdesign and @kitsunet today:

add workspaceName <-> contentStreamId table to content graph projection
add workspaceName -> contentStreamId lookup to ContentGraph::getSubgraph() (with runtime cache in the ContentGraph – the same workspace will resolve to the same CS ID in one request, maybe reset runtime cache via markStale())
rewrite ContentGraph::find* queries to use workspaceName instead of contentStreamId (IMO it's totally fine to join the workspaceName <-> cs id table here for those queries. Alternatively we could add a dedicated lookup + runtime cache)
remove SubgraphIdentity
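
The second TODO (lookup plus runtime cache, reset via markStale()) could look roughly like the following sketch. `WorkspaceContentStreamLookup` and the closure standing in for the query against the new table are assumptions for illustration only; the actual ContentGraph implementation may differ.

```php
<?php
declare(strict_types=1);

// Hypothetical helper, not part of the real ContentGraph.
final class WorkspaceContentStreamLookup
{
    /** @var array<string,string> workspace name => content stream ID, valid until markStale() */
    private array $runtimeCache = [];

    /** @param \Closure(string): ?string $fetchContentStreamId stand-in for the table query */
    public function __construct(private readonly \Closure $fetchContentStreamId)
    {
    }

    public function contentStreamIdFor(string $workspaceName): string
    {
        if (!isset($this->runtimeCache[$workspaceName])) {
            $contentStreamId = ($this->fetchContentStreamId)($workspaceName);
            if ($contentStreamId === null) {
                throw new \InvalidArgumentException(sprintf('Unknown workspace "%s"', $workspaceName));
            }
            $this->runtimeCache[$workspaceName] = $contentStreamId;
        }
        return $this->runtimeCache[$workspaceName];
    }

    // Drops all cached mappings so the next lookup hits the table again.
    public function markStale(): void
    {
        $this->runtimeCache = [];
    }
}

// Usage: an array plays the role of the workspaceName <-> contentStreamId table.
$table = ['live' => 'cs-live-1'];
$queryCount = 0;
$lookup = new WorkspaceContentStreamLookup(
    function (string $workspaceName) use (&$table, &$queryCount): ?string {
        $queryCount++;
        return $table[$workspaceName] ?? null;
    }
);

$first = $lookup->contentStreamIdFor('live');
$second = $lookup->contentStreamIdFor('live'); // served from the runtime cache
$table['live'] = 'cs-live-2';                  // e.g. the workspace was rebased
$beforeMarkStale = $lookup->contentStreamIdFor('live');
$lookup->markStale();
$afterMarkStale = $lookup->contentStreamIdFor('live');
```

Within one request the same workspace keeps resolving to the same content stream ID until the cache is explicitly reset, which matches the behavior described in the TODO.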

Afterwards we have to find out whether there are places that need to be able to override the workspace name -> CS ID mapping for tests or CR constraints. If so, we could either:

Add a separate, internal, ContentGraph::getSubgraphInternal() – but that would be dangerous because it is part of the public interface (even if marked internal)
Introduce a ContentsubgraphFactoryInterface that can be used via CR service
Provide a way to hook into the runtime cache, e.g. $contentGraph->dangerousSetContentStreamIdForWorkspaceNameFooBar(...)
Extract workspaceName<->cs id mapping to external service that can be replaced/hooked into
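
As a rough sketch of the last option (an external, replaceable mapping service), assuming hypothetical names throughout: an interface for the mapping, a default implementation backed by the projection table, and a decorator that tests or constraint checks could use to override single workspaces.

```php
<?php
declare(strict_types=1);

// All names below are illustrative assumptions, not existing Neos classes.
interface WorkspaceContentStreamMapping
{
    public function contentStreamIdFor(string $workspaceName): string;
}

// Default: answers from the rows of the workspaceName <-> contentStreamId table.
final class ProjectionMapping implements WorkspaceContentStreamMapping
{
    /** @param array<string,string> $rows workspace name => content stream ID */
    public function __construct(private readonly array $rows)
    {
    }

    public function contentStreamIdFor(string $workspaceName): string
    {
        return $this->rows[$workspaceName]
            ?? throw new \InvalidArgumentException(sprintf('Unknown workspace "%s"', $workspaceName));
    }
}

// Decorator: overrides selected workspaces and delegates everything else.
final class OverridingMapping implements WorkspaceContentStreamMapping
{
    /** @param array<string,string> $overrides workspace name => content stream ID */
    public function __construct(
        private readonly WorkspaceContentStreamMapping $inner,
        private readonly array $overrides,
    ) {
    }

    public function contentStreamIdFor(string $workspaceName): string
    {
        return $this->overrides[$workspaceName] ?? $this->inner->contentStreamIdFor($workspaceName);
    }
}

$default = new ProjectionMapping(['live' => 'cs-live', 'user-alice' => 'cs-alice']);
$forConstraintChecks = new OverridingMapping($default, ['user-alice' => 'cs-alice-rebase-target']);
```

The override stays explicit and local to whoever wires up the decorator, instead of being a hidden setter on the content graph itself.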

Checklist

Code follows the PSR-2 coding style
Tests have been created, run and adjusted as needed
The PR is created against the lowest maintained branch
Reviewer - PR Title is brief but complete and starts with FEATURE|TASK|BUGFIX
Reviewer - The first section explains the change briefly for change-logs
Reviewer - Breaking Changes are marked with !!! and have upgrade-instructions

Neos.ContentRepository.Core/Classes/SharedModel/Node/NodeIdentity.php

bwaidelich

Some initial comments.

My main concern (not with this PR but in general regarding this topic) is the implicit distinction between the "content stream" and the "workspace" view.

E.g. ContentSubgraphIdentity vs NodeIdentity

bwaidelich

As recap from the weekly:
I like the NodeIdentity DTO and I think that it really makes sense.

NodeSerializer does not really make sense to me yet and I think it highlights some inconsistencies we currently have.

I could imagine that we could change the Node read model from:

subgraphIdentity:
  contentRepositoryId
  contentStreamId
  dimensionSpacePoint
  visibilityConstraints
nodeAggregateId
originDimensionSpacePoint
...

to

identity
  contentRepositoryId
  workspaceName
  dimensionSpacePoint
  nodeAggregateId
originDimensionSpacePoint
...

But I'm not sure about all implications yet

kitsunet · 2024-02-16T13:36:00Z

@bwaidelich the suggested properties for Node would prevent us from ever addressing a node that is in a content stream without workspace attachment. I think it's fine to not plan for this from outside, but we might need to load one, e.g. for conflict resolution?

bwaidelich · 2024-02-16T15:53:09Z

I just discussed this topic with @skurfuerst and he also agrees to getting rid of the CS id in the public APIs as much as possible.

We do probably still need visibilityConstraints in the Node read model because we use them to "stay in the same visibility context" when passing nodes via Fusion.
Personally I would prefer some required Fusion runtime variable for that (similar to request) but that would probably be too big of a change.
visibilityConstraints should not be part of the NodeIdentity though, but it could be a property of the Node itself (and SubgraphIdentity can probably be removed afterwards).

@kitsunet re

the suggested properties for node would prevent us from ever addressing a node that is in a contentstream without workspace attachment

That's true and might be a potential issue.
Sebastian suggested introducing something like a VirtualWorkspaceName (or TransientWorkspaceName or some similar name) that reflects a CS ID without a corresponding workspace (a bit similar to a detached HEAD in Git).
This could be a concept of the read model exclusively, i.e.:

class NodeIdentity
{
    private function __construct(
        // ...
        public WorkspaceName|VirtualWorkspaceName $workspaceName
// ...

For write operations, i.e. commands, we would always require a WorkspaceName
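
A small sketch of that split between read model and commands, using simplified stand-in classes (the real value objects have more behavior, and `NodeIdentitySketch` and `CommandSketch` are hypothetical names): the read-side identity accepts either kind of name, while a command's constructor only accepts a real WorkspaceName.

```php
<?php
declare(strict_types=1);

// Simplified stand-ins for illustration; not the real Neos value objects.
final class WorkspaceName
{
    private function __construct(public readonly string $value)
    {
    }

    public static function fromString(string $value): self
    {
        return new self($value);
    }
}

// Represents a content stream that has no workspace attached ("detached").
final class VirtualWorkspaceName
{
    private function __construct(public readonly string $contentStreamId)
    {
    }

    public static function fromContentStreamId(string $contentStreamId): self
    {
        return new self($contentStreamId);
    }
}

// Read model: either kind of workspace name is allowed.
final class NodeIdentitySketch
{
    public function __construct(
        public readonly WorkspaceName|VirtualWorkspaceName $workspaceName,
        public readonly string $nodeAggregateId,
    ) {
    }

    public function isAddressableByCommands(): bool
    {
        return $this->workspaceName instanceof WorkspaceName;
    }
}

// Write side: a command always requires a real WorkspaceName.
final class CommandSketch
{
    public function __construct(
        public readonly WorkspaceName $workspaceName,
        public readonly string $nodeAggregateId,
    ) {
    }
}

$attached = new NodeIdentitySketch(WorkspaceName::fromString('live'), 'homepage');
$detached = new NodeIdentitySketch(VirtualWorkspaceName::fromContentStreamId('cs-42'), 'homepage');
```

With this shape, passing a VirtualWorkspaceName to a command fails with a TypeError at construction time, so the restriction is enforced by the type system rather than by convention.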

mhsdesign · 2024-02-20T12:27:39Z

I implemented some WIP stuff. Edit: moved to https://github.com/mhsdesign/neos-development-collection/tree/feature/NodeIdentityDto-original-draft

Details

Along these lines (my goal was to not have to keep track of the workspaces in the content graph projection):

final class ContentRepository
{
    public function getSubgraph(NodeIdentity $nodeIdentity, VisibilityConstraints $visibilityConstraints): ContentSubgraphInterface
    {
        if ($nodeIdentity->workspaceName instanceof DetachedWorkspaceName) {
            return $this->getContentGraph()->getSubgraph(
                $nodeIdentity->workspaceName->contentStreamId,
                $nodeIdentity->dimensionSpacePoint,
                $visibilityConstraints
            );
        }
        $workspace = $this->getWorkspaceFinder()->findOneByName($nodeIdentity->workspaceName);
        // ...


interface ContentGraphInterface extends ProjectionStateInterface
{
    public function getSubgraph(
        ContentStreamId|Workspace $contentStreamReference,
        DimensionSpacePoint $dimensionSpacePoint,
        VisibilityConstraints $visibilityConstraints
    ): ContentSubgraphInterface;


final readonly class NodeIdentity implements \JsonSerializable
{
    private function __construct(
        public ContentRepositoryId $contentRepositoryId,
        public WorkspaceName|DetachedWorkspaceName $workspaceName,
        public DimensionSpacePoint $dimensionSpacePoint,
        public NodeAggregateId $nodeAggregateId,
    ) {
    }


final class ContentSubgraph implements ContentSubgraphInterface
{
    // todo what if there was already a subgraph with runtime cache constructed and different / no workspace name?
    public function __construct(
        private readonly ContentRepositoryId $contentRepositoryId,
        private readonly ContentStreamId $contentStreamId,
        private readonly ?WorkspaceName $workspaceName,


final class NodeFactory
{
    public function __construct(
        private readonly ContentRepositoryId $contentRepositoryId,
        private readonly NodeTypeManager $nodeTypeManager,
        private readonly PropertyConverter $propertyConverter
    ) {
    }

    /**
     * @param array<string,string> $nodeRow Node Row from projection (<prefix>_node table)
     */
    public function mapNodeRowToNode(
        array $nodeRow,
        DimensionSpacePoint $dimensionSpacePoint,
        VisibilityConstraints $visibilityConstraints,
        ?WorkspaceName $workspaceName,
    ): Node {
        $nodeType = $this->nodeTypeManager->hasNodeType($nodeRow['nodetypename'])
            ? $this->nodeTypeManager->getNodeType($nodeRow['nodetypename'])
            : null;

        return Node::create(
            NodeIdentity::create(
                $this->contentRepositoryId,
                $workspaceName ?? DetachedWorkspaceName::fromContentStreamId(
                    ContentStreamId::fromString($nodeRow['contentstreamid'])
                ),
                $dimensionSpacePoint,
                NodeAggregateId::fromString($nodeRow['nodeaggregateid'])
            ),

kitsunet · 2024-02-20T14:53:37Z

The VirtualWorkspaceName idea seems fine, I think that should cover most bases, I like it :)

Re the visibility constraints: IMHO it makes sense to know where exactly that node is from, to refer to other nodes. E.g. if I get one of those nodes handed in userland code without them being available on the node, I would need to pass that in parallel to not suddenly drop into a different visibility if I need a subtree for this node.

bwaidelich · 2024-02-20T16:40:27Z

eg. if I get one of those nodes handed in userland code without them being available on the node I would need to pass that in parallel

IMO that would be the "better" way to solve this: To traverse the graph from a given node, you'd need the visibility constraints, too.
Merging those into the node read model for convenience blurs the responsibilities (e.g. whether they should be part of some caching identifier depends on the scenario).

But the use case you describe is exactly why we would need it for now, because the alternative would be to introduce the visibility constraints as some kind of context to the Fusion runtime.

bwaidelich · 2024-03-18T21:17:10Z

Any update on this one?
It's one of the major outstanding changes AFAIR and it blocks some security-related features.
Let me know if I can help!

mhsdesign · 2024-03-18T23:04:15Z

Yup. #4790 should be merged first, as I was about to adjust the node factory and thought: wait a second, don't make it harder with the conflicts ;)

bwaidelich · 2024-03-19T12:18:34Z

@mhsdesign Thanks for the update and let me know if I can help with this one!

mhsdesign · 2024-03-19T22:22:50Z

Neos.ContentGraph.DoctrineDbalAdapter/src/Domain/Repository/NodeFactory.php

@@ -175,6 +196,7 @@ public function mapNodeRowsToNodeAggregate(
                // ... so we handle occupation exactly once ...
                $nodesByOccupiedDimensionSpacePoints[$occupiedDimensionSpacePoint->hash] = $this->mapNodeRowToNode(
                    $nodeRow,
+                    WorkspaceName::forLive(), // todo use workspace name in content graph api


NodeAggregates also need to operate on workspace names, but for that we would have to adjust the content graph API. That might go in a separate PR.

public function findNodeAggregateById( WorkspaceName $workspaceName, NodeAggregateId $nodeAggregateId ): ?NodeAggregate;

The CS ID would be looked up internally via the workspaces table.
And the cache would be flushed on markStale(), similar to the cached subgraphs.

The problem with this would be that the write side heavily relies on the ContentGraph and must work on a specific content stream ID (see ContentStreamIdOverride).

Unless we create a custom ContentGraph which uses our content stream, or resort to a hacky check against the state in ContentStreamIdOverride, we won't get around that.

mhsdesign · 2024-03-19T22:24:33Z

...entRepository.Core/Classes/Feature/DimensionSpaceAdjustment/DimensionSpaceCommandHandler.php

@@ -71,6 +71,8 @@ private function handleMoveDimensionSpacePoint(
        $streamName = ContentStreamEventStreamName::fromContentStreamId($command->contentStreamId)
            ->getEventStreamName();

+        // todo use WorkspaceName here as well but the command doesnt expose it
+        // https://github.com/neos/neos-development-collection/issues/4942
        self::requireDimensionSpacePointToBeEmptyInContentStream(


It would be great to find out why #4942 is the way it is.
These were the only commands untouched by Bernhard's refactoring.

mhsdesign · 2024-03-19T22:25:54Z

Neos.ContentRepository.Core/Classes/Feature/NodeMove/NodeMove.php

+                    // todo UNSAFE as override via ContentStreamIdOverride will NOT be taken into account!!!
+                    $command->workspaceName,


This MUST NOT be used, as a custom repoint via ContentStreamIdOverride would be ignored.
We possibly have to pass both as a tuple and create a new ContentGraph by hand.

mhsdesign · 2024-03-19T22:27:54Z

Neos.Neos/Classes/AssetUsage/Service/AssetUsageSyncService.php

+            // todo use workspace name instead!!!
            $usage->contentStreamId,


I guess adding the workspace to the AssetUsage will not be trivial, so should we just use the workspace finder beforehand and get a workspace that will be internally resolved again to a content stream?

mhsdesign · 2024-03-19T22:30:08Z

Neos.ContentRepository.Core/Classes/Feature/NodeTypeChange/NodeTypeChange.php

@@ -193,7 +193,7 @@ private function handleChangeNodeAggregateType(
                $tetheredNodeName = NodeName::fromString($serializedTetheredNodeName);

                $subgraph = $contentRepository->getContentGraph()->getSubgraph(
-                    $node->subgraphIdentity->contentStreamId,
+                    $node->identity->workspaceName,


This is illegal as well because of ContentStreamIdOverride.

mhsdesign · 2024-05-11T12:56:19Z

Thanks for all the discussion here, but this branch is super outdated. I cherry-picked all the changes to a new PR and we can continue with a blank page: #5042

github-actions bot added Feature 9.0 labels Feb 2, 2024

mhsdesign marked this pull request as draft February 7, 2024 16:23

mhsdesign force-pushed the feature/NodeIdentityDto branch 2 times, most recently from 4e56a48 to 8912655 Compare February 12, 2024 21:11

mhsdesign changed the title ~~FEATURE: Introduce NodeIdentity Dto~~ FEATURE: Introduce NodeIdentity and NodeSerializer Feb 12, 2024

mhsdesign force-pushed the feature/NodeIdentityDto branch from 8912655 to cd7ad38 Compare February 12, 2024 21:18

mhsdesign marked this pull request as ready for review February 14, 2024 12:52

bwaidelich reviewed Feb 14, 2024

This was referenced Feb 14, 2024

!!! TASK: Move ContentRepositoryId to SharedModel namespace #4891

Merged

BUGFIX: Serialised node fully qualified for fusion uncached mode #4734

Merged

!!! FEATURE: Overhaul node uri building #4892

Merged

mhsdesign force-pushed the feature/NodeIdentityDto branch from cd7ad38 to 1bfdef1 Compare February 16, 2024 08:16

bwaidelich reviewed Feb 16, 2024

mhsdesign mentioned this pull request Feb 16, 2024

9.0 Discussion Node property mapping in controllers #4873

Closed

mhsdesign marked this pull request as draft February 16, 2024 21:21

mhsdesign changed the title ~~FEATURE: Introduce NodeIdentity and NodeSerializer~~ FEATURE: Introduce NodeIdentity Feb 20, 2024

mhsdesign force-pushed the feature/NodeIdentityDto branch from ae02b4c to c9feaca Compare March 18, 2024 23:10

mhsdesign added 2 commits March 19, 2024 09:10

TASK: Deprecate NodeAddress

78848ee

The nodeadress was added 6 years ago without the concept of multiple crs. It will be replaced by the node identity

The ~bourne~ node identity

75aecf0

mhsdesign added 4 commits March 19, 2024 21:37

WIP: Prepare node factory to build nodes with node identity

ea65fdf

TASK: Add cr_default_p_graph_workspaces table

f6e4e7f

It will keep track of the current workspace name to content stream id mapping

!!! TASK: Require WorkspaceName in getSubgraph instead of `ContentStreamId`

53a1687

TASK: Adjust to WorkspaceName in getSubgraph (more complex usages)

e37a4b8

mhsdesign force-pushed the feature/NodeIdentityDto branch from c9feaca to e37a4b8 Compare March 19, 2024 22:13

mhsdesign mentioned this pull request Apr 2, 2024

TASK: Introduce (internal) low level content graph api for constraint checks and write side #4973

Closed

bwaidelich mentioned this pull request Apr 19, 2024

!!! FEATURE: Add workspaceName to relevant events #5002

Merged

mhsdesign mentioned this pull request May 11, 2024

FEATURE: workspace aware Node (introduce new NodeAdress) #5042

Merged

mhsdesign closed this May 11, 2024

mhsdesign mentioned this pull request Jun 12, 2024

TASK: Unify JSON de/encoding behavior #5093

Closed
