[libweb-scraper-perl] annotated tag upstream/0.38 created (now c22cdee)
gregor herrmann
gregoa at debian.org
Wed Oct 22 18:44:44 UTC 2014
This is an automated email from the git hooks/post-receive script.
gregoa pushed a change to annotated tag upstream/0.38
in repository libweb-scraper-perl.
at c22cdee (tag)
tagging f55fc14e4c2d324520a68abfe43d4991573880ef (commit)
replaces upstream/0.37
tagged by gregor herrmann
on Wed Oct 22 20:32:05 2014 +0200
- Log -----------------------------------------------------------------
Upstream version 0.38
Alexandr Ciornii (1):
correct docs
Andrew Whatson (1):
Fix content decoding and add a unit test for this issue
DQNEO (1):
fix SYNPOSIS to work
David Steinbrunner (1):
typo fix
Jay Shirley (1):
Just some pod
Lee Aylward (2):
add failing tests for <section> and <header> elems
don't ignore unknown tags
Perlover (1):
Added a comment node support to xpath selectors
Stanislaw Pusep (3):
added regexp filter
referenced the regexp filtering in the documentation
don't fail the regexp filter test on older perls
Tatsuhiko Miyagawa (127):
Directory for svk import.
import Web::Scraper
Checking in changes prior to tagging of version 0.01. Changelog diff is:
Checking in changes prior to tagging of version 0.02. Changelog diff is:
make use of 'TEXT' instead of content, to be more compatible with Scrapi
added unit tests
refactored get_value to the function. Callback takes HTML::Element, not ->as_HTML for speedups. Added tests
implemented process 'selector', sub { ... } and process_first for that.
Checking in changes prior to tagging of version 0.03. Changelog diff is:
API CHANGE: Now scraper {} returns Web::Scraper object, not the closure.
Checking in changes prior to tagging of version 0.04. Changelog diff is:
added less-DSLish constructor Web::Scraper->define(sub { ... });
rename .t
treat as UTF-8 if there's no encoding found
add HD trailer extraction code as an example of callback
store $node to $_ in the callback
if expression starts with /, it's treated as direct XPath expression, not CSS selector.
assume default as latin-1 per RFC
Checking in changes prior to tagging of version 0.05. Changelog diff is:
add live.t for Unicode testing
don't use decoded_content to work with new HTTP::Response::Encoding
Checking in changes prior to tagging of version 0.06. Changelog diff is:
better dependencies for XPath libraries
Checking in changes prior to tagging of version 0.07. Changelog diff is:
fixed live test
add tree->delete to avoid memeory leaks
Checking in changes prior to tagging of version 0.08. Changelog diff is:
Checking in changes prior to tagging of version 0.09. Changelog diff is:
Checking in changes prior to tagging of version 0.10. Changelog diff is:
requires YAML for scraper script and tests
Checking in changes prior to tagging of version 0.11. Changelog diff is:
Checking in changes prior to tagging of version 0.12. Changelog diff is:
added 'c' and 'c all' to scraper
added URI to the deps
added WARN handy sub to scraper
add search-cpan.pl example
added rel-tag extractor
Checking in changes prior to tagging of version 0.13. Changelog diff is:
fix Term::Readline usage
added =~ to the selector
no optional end tag
added URI absolutification and RAW/HTML getter
now url is absoltue. yay
Checking in changes prior to tagging of version 0.14. Changelog diff is:
don't escape utf-8 characters in WARN and 's' on scraper shell
make UserAgent variable accessible
make user_agent an accessor as well
simplify POD example
Checking in changes prior to tagging of version 0.15. Changelog diff is:
changes date
support TextNode. call Term::Encoding
Checking in changes prior to tagging of version 0.16. Changelog diff is:
Checking in changes prior to tagging of version 0.17. Changelog diff is:
fix absolute URI bug with nested scrapers
use as_XML instead of as_HTML in RAW
Checking in changes prior to tagging of version 0.18. Changelog diff is:
make absolute URI return value as a string, not URI object to be consistent.
revert the fix for now!
try to get encoding from META tags as well
Checking in changes prior to tagging of version 0.19. Changelog diff is:
add PS store news
Checking in changes prior to tagging of version 0.20. Changelog diff is:
Checking in changes prior to tagging of version 0.21. Changelog diff is:
add an experimental filter support!
fixed a bug in loop filters
fixed issues with non-match regexp sub. also handle undef return values
fixed example
Checking in changes prior to tagging of version 0.21_01. Changelog diff is:
look to see if textarea preserves newlines
print to PAGER if it's set. changed variable names for the generated code.
Checking in changes prior to tagging of version 0.22. Changelog diff is:
bump up HTML::Selector::XPath req
accept id() function as XPath not CSS selector
Checking in changes prior to tagging of version 0.23. Changelog diff is:
Checking in changes prior to tagging of version 0.24. Changelog diff is:
accept 0 as a return value
Added git
.gitignore
scrape() now accepts HTTP::Response as well
Added an unit test for HTTP::Response
Added a test to check if relative URI is absolutized
Checking in changes prior to tagging of version 0.25. Changelog diff is:
testing github config
use 5.8.1
display URL if request failed
Checking in changes prior to tagging of version 0.26. Changelog diff is:
no v-string
Add support for HTML::TreeBuilder::LibXML and tests
tests
Checking in changes prior to tagging of version 0.27. Changelog diff is:
Oops, should call replace_original to override
Should call ->eof thanks to tokuhirom++
create libxml tests if it supports eof() (since 0.04 I hope)
Checking in changes prior to tagging of version 0.28. Changelog diff is:
woot, docs
use Module::Install::Repository
POD fix
Merge branch 'master' of git at github.com:miyagawa/web-scraper
Add Web::Scraper::LibXML
Checking in changes prior to tagging of version 0.29. Changelog diff is:
no warnings when used multiple times
releng
adds HTML::TreeBuilder::LibXML 0.04 to feature recommends
more docs
LibXML can (obviously) handle XML document!
use content_charset (new with LWP 5.827) instead of H::R::Encoding
releng
Checking in changes prior to tagging of version 0.31. Changelog diff is:
Merge branch 'jshirley/master'
Checking in changes prior to tagging of version 0.32. Changelog diff is:
remove invalid XPath tests
Checking in changes prior to tagging of version 0.33.
skip xml_simple.t if libxml is not there
Checking in changes prior to tagging of version 0.34.
Merge pull request #3 from Perlover/fix/add-comment-nodes
Checking in changes prior to tagging of version 0.35.
Merge pull request #5 from leedo/html5-tags
Checking in changes prior to tagging of version 0.36.
Checking in changes prior to tagging of version 0.37.
Merge pull request #11 from dsteinbrunner/patch-1
Merge pull request #16 from DQNEO/fix_SYNOPSIS_to_work
gitignore
fix LIVE_TEST. Fix #15
Merge pull request #8 from vti/master
Merge pull request #10 from creaktive/regex
migrate to Milla
0.38
gregor herrmann (1):
Imported Upstream version 0.38
vti (1):
Improved documentation
-----------------------------------------------------------------------
No new revisions were added by this update.
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/pkg-perl/packages/libweb-scraper-perl.git
More information about the Pkg-perl-cvs-commits
mailing list