[Gzz] Re: Content types in the URI?

gzz-dev

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Gzz] Re: Content types in the URI?

From:	Benja Fallenstein
Subject:	[Gzz] Re: Content types in the URI?
Date:	Sun, 23 Mar 2003 15:55:16 +0100
User-agent:	Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.3) Gecko/20030319 Debian/1.3-3


Hi Gordon, hi Justin,

been mulling over this problem some more, trying to look at it fromdifferent sides, but my position essentially hasn't changed. I thinkthat we won't agree on this point; I also think that this won't do greatharm, though-- our systems should still be able to interoperate withoutproblems. (See below.)


Gordon Mohr wrote:

But the point about URIs is that they do not need context to identify aresource. It would be nice to be able to use our URNs in the samecontext as a HTTP URL, for example in an <img> tag or an <a href>.
Basically, the idea about URIs is that you can use many different URIschemes in the same context, because they do not depend on a context, no?
Yep. So just do it without any type-labelling.
HTTP URLs don't include content-types. The URL...

    http://foobar.com/smiley

...could be a GIF or HTML or an executable. It's only the extra
(outside-the-URI) context provided over HTTP that sets its type --

Yes, but the point is that HTTP maps this URI to a content type plus abody, with the same 'trust level'. If I make a link tohttp://foobar.com/smiley, I expect that following that link will give acorrect body *and* a correct content type. Both are in the hand of thesame person (the server operator).

and I believe most browsers will even be tolerant of many kinds ofmistyping by the server, once they see the data themselves, andcoerce the file into its intended type.

Yes, but I'm sure the developers of those browsers will tell you they'drather be served correct content types by all servers. :)

In my opinion, content type guessing is a kludgy workaround, because itis hard to do, error-prone, and hard to extend. Why do all thoseoperating systems determine the content type of a file based on itsextension, rather that 'just' looking at it and guessing its type?

The data URL scheme (RFC 2397). (It also puts the data in the URL; myopinion is that content type plus data isn't so different from contenttype plus cryptographic hash...)
Aha. I forgot about that one.
There might be a good reason you have to specify the type-interpretation,
but so far I haven't seen a specific case where it's necessary. This
ought to work (in a properly extended browser)...

 <img src="urn:sha1:BLAH">

...even without advance knowledge of the format of the bitstream, as
long as it turns out to be a recognizable image format.

I'm thinking that we probably won't be able to reach an argument here. Ibelieve that providing a content type is essential for makingdevelopers' life easier; you don't think so.

(BTW, I think the analogy with data: runs deep-- why not guess the typeof the content in the data URI, instead of putting the type in the URI?)


To summarize the options we've discussed:

1. Give the content type in the URI. You don't like that.

2. Make an indirect reference: The URI points to a hashed blockcontaining a content type and the hash of the block with the actualdata. This means we wouldn't use the same hash for e.g. an MP3 as youdo, as we'd use the hash of the block *refering to* that MP3. I don'tthink you liked that.3. Guessing the content type. I don't like that, on grounds that itmakes life harder for client developers, and that may necessiatespecifying the content type in the context (e.g. for digital signatures,to ensure we know the correct interpretation of the bytes that were signed).4. Getting the content type through an out-of-bounds mechanism, like theBitzi database. I don't like that, on grounds that it requires anInternet connection, and it doesn't have the same 'trust level' ashaving the content type in the URI or hashed data.


I still think content type in URI is the best alternative for us.

Now, if we cannot agree on this, can our system (Storm) stillinteroperate with the ones you are developing?

I should think so. Given a Storm URI, we can easily create a bitprintfrom it by stripping the content type. This means we can find Stormblocks and metadata about them using bitprint-based systems. Given abitprint, we can generate a Storm URI by using either of the methods youproposed-- adding the content type from the Bitzi database, or guessingit from the content. (We could also use application/octet-stream if wedon't know what kind of data it is, for some reason.)

I would also be entirely comfortable with building the Storm storagelayer so that data is looked up by bitprint. For lookup, the contenttype doesn't matter, after all. It would only be used at the higherlevels building on Storm, to interpret the data that the lower levelshave retrieved. Thus, a Storm system could be used to look up bitprints,and a bitprint-based system could be used to lookup Storm blocks.

So, given all this, I do not see great harm if Storm proceeds in the waythat seems most natural to me, and you proceed in the way that seemsmost natural to you. Right?


- Benja

P.S. Gordon, is there a public domain Java version of the new bitprintcalculator, including source, already? Thanks, -b

[Prev in Thread]

Current Thread

[Next in Thread]

[Gzz] Re: Content types in the URI?, Benja Fallenstein <=

Prev by Date: Re: [Gzz] PEG 1013: Add clipping state to vob scenes
Next by Date: Re: [Gzz] The Fenfire/Loom test system
Previous by thread: [Gzz] Loom notes
Next by thread: [Gzz] hh gradu
Index(es):
- Date
- Thread