Bug#833779: devscripts: [uscan] please strip whitespace in href when matching URLs
James McCoy
jamessan at debian.org
Mon Aug 8 22:18:43 UTC 2016
On Mon, Aug 08, 2016 at 11:40:39AM -0400, Daniel Kahn Gillmor wrote:
> the HTML there is decidedly weird, with extra whitespace in the href:
>
> ---------
> <td class="name"><a class="execute" href="
> /skskeyserver/sks-keyserver/downloads/sks-1.1.6.tgz">sks-1.1.6.tgz</a>
> </td>
> ---------
>
> So uscan isn't able to find it.
>
> I reported the issue to bitbucket:
>
> https://bitbucket.org/site/master/issues/13130/downloads-page-has-spurious-whitespace
>
> and got this response on IRC (#bitbucket on freenode):
>
> 11:29 <+dwfreed> dkg: browsers ignore the whitespace
> 11:30 <+dwfreed> URLs can't have spaces in them anyway, which is what the field is
> 11:30 <+dwfreed> it must be encoded, either as + or %20
> 11:30 <+dwfreed> so teach the tracker to adhere to the spec
According to the HTML spec[0], the value of the href attribute is a
"valid URL potentially surrounded by spaces", so uscan does indeed need
to be fixed.

[0]: https://www.w3.org/TR/html5/links.html#links-created-by-a-and-area-elements
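
To illustrate the behaviour being asked for, here is a minimal sketch in
Python. It is not uscan's code (uscan is written in Perl); the names
HrefCollector and collect_links and the base URL in the usage note are
assumptions made up for this example. It strips the whitespace the HTML
spec allows around an href value before resolving the link against the
page URL.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# ASCII whitespace per the HTML spec: tab, LF, FF, CR, space.
HTML_WHITESPACE = "\t\n\f\r "


class HrefCollector(HTMLParser):
    """Hypothetical helper: collects cleaned, absolute hrefs from <a> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value is not None:
                # Drop leading/trailing whitespace before any matching.
                cleaned = value.strip(HTML_WHITESPACE)
                self.links.append(urljoin(self.base_url, cleaned))


def collect_links(html, base_url):
    """Return the absolute URLs of all <a href> values found in html."""
    parser = HrefCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

Fed the snippet quoted above with an assumed base URL of
https://bitbucket.org/skskeyserver/sks-keyserver/downloads, collect_links
should yield a single URL ending in /downloads/sks-1.1.6.tgz, which a
watch-file pattern could then match as usual.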
Cheers,
--
James
GPG Key: 4096R/91BF BF4D 6956 BD5D F7B7 2D23 DFE6 91AE 331B A3DB