Ticket #21 (assigned defect)

Opened 3 years ago

Last modified 16 months ago

Website metadata allows invalid characters which break the RDF feed

Reported by: IVORW@…
Owned by: Kake
Priority: normal
Milestone:
Component: openguides
Version: 0.50
Severity: normal
Keywords: hackfestsummer2007-reviewed
Cc: ganesh@…

Description

Mon Oct 17 03:00:04 2005 IVORW - Ticket created

Currently, no syntax validation is performed on the "website" field. As a result, it accepts square bracket link syntax, as on this page:

http://london.openguides.org/index.cgi?Peckham_Rye_Common_And_Park

Unfortunately, this cleverness breaks anything parsing the RDF data (attached).

Either this metadata field should be validated more strictly in the commit function (probably a good idea), or we should at least cater for square bracket links in the RDF feed.
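
A rough sketch of the first option (stricter validation at commit time) is below. It is written in Python purely for illustration and is not taken from the OpenGuides code base; the function name, the set of rejected characters, and the restriction to http/https are all assumptions.

```python
from urllib.parse import urlsplit

# Assumption: characters that should never appear in a bare URL.
# Square brackets and whitespace are what a bracketed wiki-style link
# would introduce into the field.
FORBIDDEN = set("[]<>\"{}|\\^` \t\r\n")


def is_plain_url(value: str) -> bool:
    """Return True if value looks like a bare http(s) URL with no markup."""
    if any(ch in FORBIDDEN for ch in value):
        return False
    try:
        parts = urlsplit(value)
    except ValueError:
        # urlsplit raises ValueError on some malformed inputs.
        return False
    # Assumption: only http and https are acceptable for this field.
    return parts.scheme in ("http", "https") and bool(parts.netloc)
```

A commit handler could call such a check and reject or strip the field when it returns False, so that nothing downstream (the RDF feed included) ever sees a bracketed value.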

Attachments

Peckham_Rye_Common_And_Park.rdf (5.4 kB) - added by IVORW@… 3 years ago.
Peckham_Rye_Common_And_Park.2.rdf (1.7 kB) - added by IVORW@… 3 years ago.
This one isn't broken.

Change History

Changed 3 years ago by IVORW@…

Changed 3 years ago by IVORW@…

This one isn't broken.

Changed 3 years ago by earle

  • owner set to earle
  • status changed from new to assigned

Changed 3 years ago by dom

  • component set to openguides

Changed 3 years ago by ganesh

  • cc ganesh@… added

Changed 16 months ago by dom

  • owner changed from earle to Nobody
  • status changed from assigned to new

Changed 16 months ago by Kake

  • keywords hackfestsummer2007-reviewed added
  • owner changed from Nobody to Kake

Decision: We need to figure out whether square brackets in the website field should be supported or not. There are arguments on both sides. On the one hand, it's a handy way to let people supply text for the link, which avoids very long URLs spreading across the screen. On the other, it means that every time we want to output the website datum (e.g. in RDF) we have to parse the field. There's also the issue that if people have used this feature a lot, they might be annoyed if we remove it.

Decided that Kake will post to the list summarising the problem and asking for opinions on whether we should support this "feature" or not.

Changed 16 months ago by Kake

  • status changed from new to assigned

Changed 16 months ago by Kake

Consensus on the list is that we should remove this "feature" and use link text based on a truncated URL, like so:

* Website: http://this-url-is-really-long.com/and-it...
or
* Website: http://quiteshortsite.com/

The number of characters retained in the URL should be admin-configurable, and default to 20.
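
The truncation rule could look something like the sketch below. It is Python for illustration only, not the OpenGuides implementation; the function and constant names are hypothetical, and it assumes the limit counts every character of the URL, scheme included, which the ticket does not specify.

```python
# Default stated in this ticket; the constant name is hypothetical.
DEFAULT_MAX_CHARS = 20


def website_link_text(url: str, max_chars: int = DEFAULT_MAX_CHARS) -> str:
    """Return display text for a website link.

    URLs no longer than max_chars are shown in full; longer ones are cut
    to max_chars characters and suffixed with "...". The link target
    itself should still be the full, untruncated URL.
    """
    if len(url) <= max_chars:
        return url
    return url[:max_chars] + "..."
```

With an admin-configured limit of 41, a made-up URL such as http://this-url-is-really-long.com/and-it-keeps-going would display as http://this-url-is-really-long.com/and-it..., matching the first example above.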

Changed 16 months ago by dom

The offending code starts here.
