Specify existing practice for Container data about contents #343

timbl · 2021-11-07T17:33:20Z

The data about each contained resource that the containing Container provides has been stable for at least 6 years in NSS, so it is appropriate to take existing, well-established practice as the current Solid spec and as the default for any future versions.

This issue requires that the current situation be documented in the Solid Protocol before any additions or subtractions are considered.

jeff-zucker · 2021-11-07T17:50:15Z

It seems to me that a blob of a .png stored in a database still has the media-type "image/png". And that if the user wants to download it, its size is knowable.

timbl · 2021-11-07T18:07:26Z

The data about contained resource ?x currently seems to be:

| Subject | Predicate | Object | Object type |
|---|---|---|---|
| `?x` | rdf:type | A class whose URI is formed by concatenating `http://www.w3.org/ns/iana/media-types/`, the Internet Content Type of the resource, and `#Resource` | rdfs:Class |
| `?x` | stat:size | An integer giving the size of the resource in bytes | xsd:integer |
| `?x` | dct:modified | The time when the resource was last modified | xsd:dateTime |
| `?x` | stat:mtime | The Unix time when the resource was last modified | xsd:integer |

using the usual prefix conventions.
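As a rough illustration of the table, here is a minimal Python sketch that builds the four statements for one contained resource. The namespace URIs bound to `stat:` and `dct:`, the function names, and the example pod URL are assumptions made for illustration; they are not taken from NSS or from the spec text.

```python
from datetime import datetime, timezone

# Assumed namespace URIs for the "usual" prefixes.
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
STAT = "http://www.w3.org/ns/posix/stat#"
DCT = "http://purl.org/dc/terms/"
XSD = "http://www.w3.org/2001/XMLSchema#"
MEDIA_TYPES = "http://www.w3.org/ns/iana/media-types/"


def media_type_class(content_type: str) -> str:
    """Concatenate the base URI, the content type, and '#Resource'."""
    return MEDIA_TYPES + content_type + "#Resource"


def contained_resource_triples(uri, content_type, size, modified):
    """Return the four statements about one resource as N-Triples lines.

    `modified` must be a timezone-aware datetime.
    """
    iso = modified.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    mtime = int(modified.timestamp())  # Unix time in whole seconds
    subject = "<" + uri + ">"
    return [
        f"{subject} <{RDF}type> <{media_type_class(content_type)}> .",
        f'{subject} <{STAT}size> "{size}"^^<{XSD}integer> .',
        f'{subject} <{DCT}modified> "{iso}"^^<{XSD}dateTime> .',
        f'{subject} <{STAT}mtime> "{mtime}"^^<{XSD}integer> .',
    ]


if __name__ == "__main__":
    when = datetime(2021, 11, 7, 17, 33, 20, tzinfo=timezone.utc)
    for line in contained_resource_triples(
        "https://pod.example/photos/cat.png", "image/png", 1024, when
    ):
        print(line)
```

For a PNG, the class URI produced by `media_type_class` is `http://www.w3.org/ns/iana/media-types/image/png#Resource`.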

timbl · 2021-11-07T18:21:16Z

@RubenVerborgh I'd not be happy just making this a SHOULD, as basically that makes it something which a client could never depend on. Maybe something like "MUST unless that information does not exist or is really expensive to find out".

The "wider solid ecosystem" will still need to support HEAD on each of those contained resources, and will have to respond to that HEAD with content-type and content-length and last-modified surely -- so even if there is a triple store back end, the PNG images must be strings in there.

If this is a SHOULD, then client code which browses Solid containers could be modified to do an explicit HEAD on each resource where it is missing that information. (That would be a bit of a pain in rdflib.js, as the cache currently knows about GET but not HEAD, so we would have to introduce a new cache state "HEAD done but not GET".)
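The client-side fallback could look roughly like the following Python sketch. It is not rdflib.js code: the `listing` shape, the function names, and the injectable `head` callable are all hypothetical, chosen so the logic can be exercised without a network.

```python
import urllib.request
from typing import Callable, Dict, Mapping, Optional

Metadata = Dict[str, Optional[object]]


def http_head(url: str) -> Mapping[str, str]:
    """Issue a real HEAD request and return the response headers."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request) as response:
        return dict(response.headers.items())


def fill_missing_metadata(
    listing: Dict[str, Metadata],
    head: Callable[[str], Mapping[str, str]] = http_head,
) -> Dict[str, Metadata]:
    """Return a copy of `listing`, doing one HEAD per resource that
    lacks a content type or size in the container description."""
    completed: Dict[str, Metadata] = {}
    for url, known in listing.items():
        meta = dict(known)
        if meta.get("content_type") is None or meta.get("size") is None:
            # Header names are case-insensitive, so normalise them first.
            headers = {k.lower(): v for k, v in head(url).items()}
            if meta.get("content_type") is None:
                meta["content_type"] = headers.get("content-type")
            if meta.get("size") is None:
                length = headers.get("content-length")
                meta["size"] = int(length) if length is not None else None
        completed[url] = meta
    return completed
```

Resources that already carry both values cost no extra request, so the overhead is one round trip per resource the container left undescribed.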

I am sympathetic to a triple store worried about the size... would number of triples be an alternative for those? Or it could just fake a byte size by multiplying the number of triples by 100 (at least the resources would order by size OK).
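A tiny Python sketch of that idea, with made-up resource names and triple counts: the estimate is not a real byte length, but because the factor is constant, sorting by it gives the same order as sorting by triple count.

```python
BYTES_PER_TRIPLE = 100  # the rough factor floated above, not a measured value


def estimated_size(triple_count: int) -> int:
    """Fake a byte size for a resource that only exists as triples."""
    return triple_count * BYTES_PER_TRIPLE


# Hypothetical resources mapped to their triple counts.
triple_counts = {"profile": 42, "contacts": 7, "notes": 310}

by_size = sorted(triple_counts, key=lambda name: estimated_size(triple_counts[name]))
```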

bourgeoa · 2021-11-07T18:55:04Z

Could we not retain just the original contentType for an RDF resource? Is there any harm in that?
If the data has never been serialised (modified), the user can expect, at least on certain servers, that everything has been kept, including comments when allowed.

justinwb · 2021-11-08T11:20:55Z

Noting that Solid Editors organized a session last Friday (11/5) pertinent to this topic; see the minutes. Mime-type and last-modified didn't receive pushback in-session.

timbl · 2021-11-08T13:35:51Z

"That is the issue indeed; size and content type are just not meaningful in those cases."

Then what happens when I do a HEAD to it?

timbl · 2021-11-08T13:51:16Z

So in principle you could do conneg in each case, using the HTTP request we have.

Of course, in the case where you have a file-backed or string-store-backed system, there is an actual resource representation, so the server should return that metadata. Shall we leave it that if the server has a store where there is "no such thing" as internet content type or byte length, we allow the server to leave that data out? But otherwise it's a MUST?

In those cases, the client will have to iterate over the contained resources doing HEAD requests, so that would be a bunch of work for rdflib, but we could do it.

Of course, in the case of the server being able to supply many different translated forms with conneg, we could also suggest that the container return a complete documentation of all the options with a graph of the alternatives... probably not for this release.

timbl · 2021-11-08T13:51:57Z

If we can get this through I'd be happy to make it a MUST as a compromise.

justinwb · 2021-11-08T13:58:29Z

> …specifically for non-RDF resources though:

Yes, that's exactly right; I should've been clearer about that. The focus on Friday was to start with the least controversial items.

> Shall we leave it that if the server has a store where there is "no such thing" as internet content type, or byte length, that we allow the server to leave that data out? But otherwise it's a MUST?

This seems to be another use case where it would be useful for a client to be able to understand what a server does and doesn't support in a programmatic way. Otherwise it is hard to know if a client (in the scenario above) is dealing with a server that is non-conformant, vs. one who cannot produce a valid size for a legitimate reason (for example).

csarven · 2021-11-08T14:04:33Z

If ldp:RDFSource is to be interpreted as Ted explains, I strongly suggest that the Solid Protocol does not adopt it. Instead, do as suggested in #342 (comment):

If we need a class to distinguish RDF stuff from non-RDF stuff, we can use solid:RDFDocument or solid:RDFSource (based off rdf11-concepts)

To date, Solid servers/clients have neither adopted ldp:RDFSource nor interpreted it along the lines mentioned. We only really cared about it being serialized as a concrete RDF syntax as per RDF 1.1.

kjetilk · 2021-11-09T22:11:40Z

My main issue with the current behavior is that if you have predicates such as dct:modified and stat:mtime, then modifying a resource updates its container (as designed). But since that container has now also changed, its parent container has to be updated too, and that container's parent in turn, all the way up to the root. NSS does not do that now, which means these metadata do not reflect the actual changes in the container representation, and I think that is bad. You could not, for example, use them for conditional requests.
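To make the cost of that cascade concrete, here is a minimal Python sketch that lists every container whose description would need touching when one resource changes. The path-based model and the function name are assumptions for illustration; this is not how NSS stores or updates anything.

```python
from typing import List


def ancestor_containers(path: str) -> List[str]:
    """Return the containers above `path`, nearest parent first, ending at the root.

    A path ending in '/' is itself a container and is not included.
    """
    parts = [segment for segment in path.split("/") if segment]
    ancestors = []
    for depth in range(len(parts) - 1, -1, -1):
        ancestors.append("/" + "".join(segment + "/" for segment in parts[:depth]))
    return ancestors


if __name__ == "__main__":
    # One write to a deeply nested resource touches every level above it.
    for container in ancestor_containers("/photos/2021/november/cat.png"):
        print("would need a new modified time:", container)
```

A single write at depth n therefore implies n container updates if the metadata is to stay consistent.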

My proposal has been to have a .stat auxiliary resource or something like that. That would solve this problem elegantly, and be analogous to POSIX, which does not have this problem since metadata isn't on the directory, but on an inode. Alternatively, we might not add the problematic metadata to descriptions of children that are containers themselves, but that is a departure from current NSS, AFAICS.

RubenVerborgh mentioned this issue Nov 7, 2021

Specify container description #227

Open

RubenVerborgh mentioned this issue Nov 8, 2021

Content of Turtle and RDFa documents should be wholly and entirely preserved #342

Open

csarven self-assigned this Nov 10, 2021

csarven added this to the Release 0.9 milestone Nov 10, 2021

kjetilk added doc: Protocol topic: resource access labels Nov 10, 2021

csarven mentioned this issue Nov 17, 2021

Authoritative Contained Resource Data #352

Merged

kjetilk linked a pull request Nov 17, 2021 that will close this issue

Authoritative Contained Resource Data #352

Merged

csarven closed this as completed in #352 Dec 15, 2021

csarven added this to Specification Sep 25, 2022

csarven moved this to Done in Specification Sep 25, 2022
