We're working on converting tests for the new harness, and without going into too much detail on that work-in-progress, we have noted that there is a lot of divergence on how the implementations deal with the tension between slash semantics and LDP Interaction Models. I think this means that the heuristics from #128 is not sufficiently clear. My main fear is that since there is quite a lot of assumptions connected to containers and resources in the access control, lack of clarity in this area could result in opening attack vectors that we can't imagine right now. So, the intention isn't to change the good path (i.e. these are errors an end-user should never see), but to define error behaviour in the other cases, e.g. when something is declared a Non-RDF resource but doesn't have a /.

In #40 (comment) I tried to introduce the two concepts of consistency and exactness. I still think they are useful and that we need them.

By consistency, I mean that different parts of the request carries the same semantics. If something has a Link header that says something different from the request URI, there are a conflicting statements and so, there is an error. There are several sources of information that could cause these conflicts, but to advance this issue, I think we should consider three, namely the possible conflict between LDP Interaction Models as given by a Link header, the Content-Type header that can distinguish between RDF Sources and Non-RDF Sources and the request URI, which can determine if the resource is a container.

As previously mentioned, by exactness I mean to constrain the freedom the server has to adapt the request. Exactness is a criterion that applies differently to POST than to other HTTP verbs. With PUT request, the HTTP spec affords very little freedom, from RFC7231:

Proper interpretation of a PUT request presumes that the user agent knows which target resource is desired. A service that selects a proper URI on behalf of the client, after receiving a state-changing request, SHOULD be implemented using the POST method rather than PUT. If the origin server will not make the requested PUT state change to the target resource and instead wishes to have it applied to a different resource, such as when the resource has been moved to a different URI, then the origin server MUST send an appropriate 3xx (Redirection) response; the user agent MAY then make its own decision regarding whether or not to redirect the request.

So, it is not appropriate to change a request URI to fit a different interaction model when PUT is used, I think, as that implies it is a different resource than the UA intended to make.

It is different with POST, but my opinion is that the server should nevertheless not make changes that changes the interaction models. If that needs to happen, it is most certainly a mistake by the programmer, and in my experience, it is much better to learn of such errors early, than far down the development cycle when the mistake has had noticable end-user consequences. Therefore, I think there are cases where exactness should be enforced, I'll return to the exact cases.

HTTP also uses the term "consistent":

An origin server SHOULD verify that the PUT representation is consistent with any constraints the server has for the target resource that cannot or will not be changed by the PUT. This is particularly important when the origin server uses internal configuration information related to the URI in order to set the values for representation metadata on GET responses. When a PUT representation is inconsistent with the target resource, the origin server SHOULD either make them consistent, by transforming the representation or changing the resource configuration, or respond with an appropriate error message containing sufficient information to explain why the representation is unsuitable. The 409 (Conflict) or 415 (Unsupported Media Type) status codes are suggested, with the latter being specific to constraints on Content-Type values.

In this case, I think the suggested transformation does not imply that it is legitimate to create a different type of resource (e.g. change the interaction model), the transformation applies when you can merely change some aspects of the representation. For these interaction-model-changing situations, I think this makes it very clear that an error is appropriate.

Again, I think it is better to have an error early than learn of a mistake later. Thus, my opinion is that we should specify consistency and exactness requirements, and that we should specify that errors should be thrown when these requirements aren't met.

I'll try to tabulate various combinations of situations that could cause consistency and/or exactness failures when creating resources (note that the use of text/turtle is just an example of a resource represented by an RDF serialization, and text/plain a generic Non-RDF resource):

Rel Request URI	Method	Link header	Content type	ok/fail	Remark
`foo.ttl`	`PUT`	`BasicContainer`	`text/turtle`	fail	Slash semantics dictates this is not a container, the content type makes it an RDF Source
`foo.ttl`	`PUT`	`NonRDFSource`	`text/turtle`	fail	Turtle makes this an RDF Source
`foo.ttl`	`PUT`	`RDFSource`	`text/turtle`	ok	Turtle makes this an RDF Source
`foo.ttl`	`PUT`	none	`text/turtle`	ok	Turtle makes this an RDF Source
`foo/`	`PUT`	`BasicContainer`	`text/turtle`	ok	Turtle is legitimate content type for container
`foo/`	`PUT`	`NonRDFSource`	`text/turtle`	fail	Slash semantics makes this a container but can't be a NonRDFSource, and content type is also inconsistent
`foo/`	`PUT`	`RDFSource`	`text/turtle`	ok	Turtle is legitimate content type for container and LDP says Container isa RDFSource
`foo/`	`PUT`	none	`text/turtle`	ok	Slash is sufficient to declare a container, and Turtle is legitimate content type for container
`foo.txt`	`PUT`	`BasicContainer`	`text/plain`	fail	Slash semantics dictates this is not a container, content type also inconsistent
`foo.txt`	`PUT`	`NonRDFSource`	`text/plain`	ok	Kinda obvious, isn't it? ;-)
`foo.txt`	`PUT`	`RDFSource`	`text/plain`	fail	Plain text isn't an RDF Source
`foo.txt`	`PUT`	none	`text/plain`	ok	Plain text makes this a Non RDF Source
`foo/`	`PUT`	`BasicContainer`	`text/plain`	fail	Plain text is inconsistent with this being a container
`foo/`	`PUT`	`NonRDFSource`	`text/plain`	fail	Slash semantics makes this a container but can't be a NonRDFSource
`foo/`	`PUT`	`RDFSource`	`text/plain`	fail	Plain text is inconsistent with this being a container as dictated by slash semantics
`foo/`	`PUT`	none	`text/plain`	fail	Plain text is inconsistent with this being a container as dictated by slash semantics
`foo/`	`POST`	`BasicContainer`	`text/turtle`	ok	Creates a new container under the `foo/` container
`foo/`	`POST`	`NonRDFSource`	`text/turtle`	fail	Slash semantics makes this a container but can't be a NonRDFSource, content type also inconsistent
`foo/`	`POST`	`RDFSource`	`text/turtle`	ok	Creates a new RDF source resource, but we could do more with `Slug`
`foo/`	`POST`	none	`text/turtle`	ok	Creates a new RDF source resource, but we could do more with `Slug`
`foo/`	`POST`	`BasicContainer`	`text/plain`	fail	Plain text is inconsistent with this being a container
`foo/`	`POST`	`NonRDFSource`	`text/plain`	ok	Creates a new Plain text resource under the `foo/` container
`foo/`	`POST`	`RDFSource`	`text/plain`	fail	Plain text is inconsistent with the RDF Source declaration
`foo/`	`POST`	none	`text/plain`	ok	Creates a new plain text resource, but we could do more with `Slug`

I believe PATCH for resource creation is identical to PUT. We really need to define POST on non-containers as an append operation, but I left that out because that's an orthogonal issue.

Finally, we need to decide what a failure should result in. Earlier, I advocated a simple 400, but then, the use of the term inconsistency in HTTP, as quoted above, has made me think that 409 is appropriate. In all these cases, there is a conflict either between different parts of the request, or between the request and the slash semantics of Solid (which could count as configuration information in HTTP's terms). There are also cases where 415 is appropriate, so I think we should allow that when appropriate, but generally, the fails in the table should result in a 409.

There are considerations around this in several older issues that I think was closed a bit prematurely. #121 deals with conflicting interaction models. There's also some discussion in #105 and others.

Slash semantics and conflicting requests on resource creation #301

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions