Bug#773390: uscan: pagemangle

Osamu Aoki osamu at debian.org
Thu Aug 27 15:23:29 UTC 2015


Hi,

I just made an untested patch (attached) to solve the following reported
issues generically:
 * https://bugs.debian.org/773390 data-realurl ==> href
 * https://bugs.debian.org/705989 <a>...</a>   ==> <a href="$1">...</a>
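
As an illustration only (this is not the code in the attached patch), the
two rewrites above could be sketched with sed.  The function name
mangle_page and the sample anchors are hypothetical, and the second rule
assumes the link target is the anchor text itself:

```shell
# Hypothetical helper, not uscan code: apply generic substitution rules
# to a downloaded page (read from stdin) before scanning it for links.
mangle_page() {
    # Rule 1: rename the non-standard data-realurl attribute to href.
    # Rule 2: give a bare <a>TEXT</a> an href equal to its text.
    sed -e 's/data-realurl=/href=/g' \
        -e 's|<a>\([^<]*\)</a>|<a href="\1">\1</a>|g'
}

# Example input: one anchor of each non-standard kind.
printf '%s\n' \
    '<a data-realurl="foo-1.0.tar.gz">foo</a>' \
    '<a>bar-2.0.tar.gz</a>' | mangle_page
```

Both sample anchors come out as ordinary href links, which a normal link
scan can then match.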

Then I realized that none of the web pages reported as examples has such
non-standard code any more.

Did I miss something?  If so, please show me shell commands that reproduce
such a web page, for example:
 $ wget -O page.html http://www.example.org/page
 $ grep "pattern" page.html

Anyway, rare non-standard page problems like this will keep hitting us,
but we should not hard-code such fixes into uscan.  So having a generic
solution is a good idea.  (Even if scanning the whole page will be
slow.)

I see special-case code for s3.amazonaws.com.  I think this is an Ubuntu
thing, since Debian does not seem to have the package mentioned.  (Nor
could I find the package in Ubuntu.)

Anyway, I am posting this so it will not be lost.

Osamu

PS: Downloading from a git tag seems interesting ...
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-pagemangle-test.patch
Type: text/x-diff
Size: 1459 bytes
Desc: not available
URL: <http://lists.alioth.debian.org/pipermail/devscripts-devel/attachments/20150828/fb16fff5/attachment.patch>

