[libweb-scraper-perl] branch master updated (d3107aa -> b273e35)
gregor herrmann
gregoa at debian.org
Wed Oct 22 18:44:42 UTC 2014
This is an automated email from the git hooks/post-receive script.
gregoa pushed a change to branch master
in repository libweb-scraper-perl.
from d3107aa update changelog
new ce9b8f4 Add debian/upstream/metadata
new d038186 Update debian/changelog
adds 37ad448 Directory for svk import.
adds ce938d8 import Web::Scraper
adds 4507b25 Checking in changes prior to tagging of version 0.01. Changelog diff is:
adds e418fd3 Checking in changes prior to tagging of version 0.02. Changelog diff is:
adds 0d75fbd make use of 'TEXT' instead of content, to be more compatible with Scrapi
adds c95003a added unit tests
adds c2c271e refactored get_value to the function. Callback takes HTML::Element, not ->as_HTML for speedups. Added tests
adds 10da341 implemented process 'selector', sub { ... } and process_first for that.
adds 045ffcf Checking in changes prior to tagging of version 0.03. Changelog diff is:
adds d68c014 API CHANGE: Now scraper {} returns Web::Scraper object, not the closure. You should call ->scrape() method to get the data back.
adds 445ad10 Checking in changes prior to tagging of version 0.04. Changelog diff is:
adds b72bbfe added less-DSLish constructor Web::Scraper->define(sub { ... });
adds 01fc5c5 rename .t
adds 1d52e4e treat as UTF-8 if there's no encoding found
adds c772e27 add HD trailer extraction code as an example of callback
adds 0766ee7 store $node to $_ in the callback
adds 37188cb if expression starts with /, it's treated as direct XPath expression, not CSS selector.
adds 61e8f17 assume default as latin-1 per RFC
adds af6fb86 Checking in changes prior to tagging of version 0.05. Changelog diff is:
adds 03a4e74 add live.t for Unicode testing
adds 77091a6 don't use decoded_content to work with new HTTP::Response::Encoding
adds 200be64 Checking in changes prior to tagging of version 0.06. Changelog diff is:
adds 90a4768 better dependencies for XPath libraries
adds 441cc92 Checking in changes prior to tagging of version 0.07. Changelog diff is:
adds a9434fc fixed live test
adds 9158189 add tree->delete to avoid memeory leaks
adds 87b40b2 Checking in changes prior to tagging of version 0.08. Changelog diff is:
adds 398a973 Checking in changes prior to tagging of version 0.09. Changelog diff is:
adds 9afe7a4 Checking in changes prior to tagging of version 0.10. Changelog diff is:
adds a00a157 requires YAML for scraper script and tests
adds ead72c6 Checking in changes prior to tagging of version 0.11. Changelog diff is:
adds a6bcf0b Checking in changes prior to tagging of version 0.12. Changelog diff is:
adds f84f3ef added 'c' and 'c all' to scraper
adds 642f8ee added URI to the deps
adds ada8a4d added WARN handy sub to scraper
adds f482587 add search-cpan.pl example
adds 47c62b4 added rel-tag extractor
adds ee3fec8 Checking in changes prior to tagging of version 0.13. Changelog diff is:
adds ec55e2d fix Term::Readline usage
adds 26f0f4c added =~ to the selector
adds c9a4c91 no optional end tag
adds d8a787f added URI absolutification and RAW/HTML getter
adds e1d4e76 now url is absoltue. yay
adds 6fcae54 Checking in changes prior to tagging of version 0.14. Changelog diff is:
adds 420fb37 don't escape utf-8 characters in WARN and 's' on scraper shell
adds d7c4366 make UserAgent variable accessible
adds 8dd5601 make user_agent an accessor as well
adds 2a82ede simplify POD example
adds 0952246 Checking in changes prior to tagging of version 0.15. Changelog diff is:
adds e61dff1 changes date
adds f254134 support TextNode. call Term::Encoding
adds 9da689b Checking in changes prior to tagging of version 0.16. Changelog diff is:
adds 6663c50 Checking in changes prior to tagging of version 0.17. Changelog diff is:
adds 079186a fix absolute URI bug with nested scrapers
adds 7d8f2a8 use as_XML instead of as_HTML in RAW
adds 38d91bc Checking in changes prior to tagging of version 0.18. Changelog diff is:
adds 0bd2571 make absolute URI return value as a string, not URI object to be consistent.
adds 951df6a revert the fix for now!
adds 5dea6ab try to get encoding from META tags as well
adds 51f4f2f Checking in changes prior to tagging of version 0.19. Changelog diff is:
adds 4111806 add PS store news
adds 5ba7624 Checking in changes prior to tagging of version 0.20. Changelog diff is:
adds debb10b Checking in changes prior to tagging of version 0.21. Changelog diff is:
adds e6a2ed6 add an experimental filter support!
adds 666e30a fixed a bug in loop filters
adds 4935448 fixed issues with non-match regexp sub. also handle undef return values
adds 2780066 fixed example
adds d90ec4e Checking in changes prior to tagging of version 0.21_01. Changelog diff is:
adds 68942d4 look to see if textarea preserves newlines
adds aa3fa2b print to PAGER if it's set. changed variable names for the generated code.
adds 195824f Checking in changes prior to tagging of version 0.22. Changelog diff is:
adds 4d0687a bump up HTML::Selector::XPath req
adds e1138ef accept id() function as XPath not CSS selector
adds 766438a Checking in changes prior to tagging of version 0.23. Changelog diff is:
adds 9a67435 Checking in changes prior to tagging of version 0.24. Changelog diff is:
adds 1183954 accept 0 as a return value
adds 93e0f0f Added git
adds 6c9b66f .gitignore
adds 784a14d scrape() now accepts HTTP::Response as well
adds 5deae47 Added an unit test for HTTP::Response
adds a071ccc Added a test to check if relative URI is absolutized
adds 1e758a3 Checking in changes prior to tagging of version 0.25. Changelog diff is:
adds 5ea1141 testing github config
adds dec99a9 use 5.8.1
adds c5db596 display URL if request failed
adds 5ebbf37 Checking in changes prior to tagging of version 0.26. Changelog diff is:
adds 51ad7f6 no v-string
adds dfb668e Add support for HTML::TreeBuilder::LibXML and tests
adds c30cf30 tests
adds e2786ef Checking in changes prior to tagging of version 0.27. Changelog diff is:
adds 5d3879a Oops, should call replace_original to override
adds 4ac17f6 Should call ->eof thanks to tokuhirom++
adds 351a955 create libxml tests if it supports eof() (since 0.04 I hope)
adds dcb42cd Checking in changes prior to tagging of version 0.28. Changelog diff is:
adds 7658eee woot, docs
adds aae8b75 use Module::Install::Repository
adds 0eb36f5 POD fix
adds 2a536a8 correct docs
adds 8120d0b Merge branch 'master' of git at github.com:miyagawa/web-scraper
adds c18a039 Add Web::Scraper::LibXML
adds 8adf70b Checking in changes prior to tagging of version 0.29. Changelog diff is:
adds 0addf63 no warnings when used multiple times
adds eafa4b6 releng
adds dd1239d adds HTML::TreeBuilder::LibXML 0.04 to feature recommends
adds c2be61f more docs
adds 838d203 LibXML can (obviously) handle XML document!
adds baec21c use content_charset (new with LWP 5.827) instead of H::R::Encoding
adds 3947c12 releng
adds 74e2b0a Checking in changes prior to tagging of version 0.31. Changelog diff is:
adds bd1d680 Just some pod
adds fad055a Merge branch 'jshirley/master'
adds ea24deb Fix content decoding and add a unit test for this issue
adds 0a7115d Checking in changes prior to tagging of version 0.32. Changelog diff is:
adds cc9f22f remove invalid XPath tests
adds 8f5e2ec Checking in changes prior to tagging of version 0.33.
adds ac6ea5b skip xml_simple.t if libxml is not there
adds a7af7a5 Checking in changes prior to tagging of version 0.34.
adds dcf7436 Added a comment node support to xpath selectors
adds 8a17877 Merge pull request #3 from Perlover/fix/add-comment-nodes
adds bc06509 Checking in changes prior to tagging of version 0.35.
adds 5e22119 add failing tests for <section> and <header> elems
adds 491af52 don't ignore unknown tags
adds 028e81c Merge pull request #5 from leedo/html5-tags
adds 5b96791 Checking in changes prior to tagging of version 0.36.
adds 431cd10 Checking in changes prior to tagging of version 0.37.
adds 085c7e4 typo fix
adds eda532a Merge pull request #11 from dsteinbrunner/patch-1
adds d67e4d6 fix SYNPOSIS to work
adds 986edd5 Merge pull request #16 from DQNEO/fix_SYNOPSIS_to_work
adds 65b4bb9 gitignore
adds 0cc0b3a fix LIVE_TEST. Fix #15
adds 686f36c Improved documentation
adds 38e1ff0 Merge pull request #8 from vti/master
adds ab1db88 added regexp filter
adds 33f5f3f referenced the regexp filtering in the documentation
adds e1e6c4c don't fail the regexp filter test on older perls
adds e15a2cd Merge pull request #10 from creaktive/regex
adds 2fbb581 migrate to Milla
adds e03b666 0.38
adds f55fc14 Imported Upstream version 0.38
new d1a9970 Merge tag 'upstream/0.38'
new 5faa4ba Update debian/changelog
new 3523a46 Update debian/copyright.
new 8e1ac7c Declare compliance with Debian Policy 3.9.6.
new 744aea4 Build-depend on libmodule-build-tiny-perl.
new 5dfde66 debian/rules: drop manual removal of script.
new 9b82d4b Add a spelling patch.
new c89021b Mark package as autopkgtest-able.
new b273e35 releasing package libweb-scraper-perl version 0.38-1
The 11 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "adds" were already present in the repository and have only
been added to this reference.
Summary of changes:
.gitignore | 7 -
Build.PL | 7 +
Changes | 5 +
LICENSE | 379 ++++++++++++++++++++
MANIFEST | 31 +-
META.json | 90 +++++
META.yml | 65 ++--
Makefile.PL | 47 ---
README | 191 +++++++++-
cpanfile | 19 +
debian/changelog | 19 +-
debian/compat | 2 +-
debian/control | 10 +-
debian/copyright | 16 +-
debian/patches/series | 1 +
debian/patches/spelling.patch | 18 +
debian/rules | 9 -
debian/upstream/metadata | 5 +
dist.ini | 3 +
inc/Module/Install.pm | 470 -------------------------
inc/Module/Install/AuthorTests.pm | 59 ----
inc/Module/Install/Base.pm | 83 -----
inc/Module/Install/Can.pm | 154 --------
inc/Module/Install/Fetch.pm | 93 -----
inc/Module/Install/Makefile.pm | 418 ----------------------
inc/Module/Install/Metadata.pm | 722 --------------------------------------
inc/Module/Install/Repository.pm | 88 -----
inc/Module/Install/Scripts.pm | 29 --
inc/Module/Install/Win32.pm | 64 ----
inc/Module/Install/WriteAll.pm | 63 ----
lib/Web/Scraper.pm | 108 ++++--
lib/Web/Scraper/Filter.pm | 7 +-
t/07-live.t | 4 +-
t/18_http_response.t | 11 +-
t/19_decode_content.t | 11 +-
t/22_filter_regex.t | 54 +++
t/release-pod-syntax.t | 14 +
xt/97_podspell.t | 9 -
xt/99_pod.t | 4 -
39 files changed, 945 insertions(+), 2444 deletions(-)
delete mode 100644 .gitignore
create mode 100644 Build.PL
create mode 100644 LICENSE
create mode 100644 META.json
delete mode 100644 Makefile.PL
create mode 100644 cpanfile
create mode 100644 debian/patches/series
create mode 100644 debian/patches/spelling.patch
create mode 100644 debian/upstream/metadata
create mode 100644 dist.ini
delete mode 100644 inc/Module/Install.pm
delete mode 100644 inc/Module/Install/AuthorTests.pm
delete mode 100644 inc/Module/Install/Base.pm
delete mode 100644 inc/Module/Install/Can.pm
delete mode 100644 inc/Module/Install/Fetch.pm
delete mode 100644 inc/Module/Install/Makefile.pm
delete mode 100644 inc/Module/Install/Metadata.pm
delete mode 100644 inc/Module/Install/Repository.pm
delete mode 100644 inc/Module/Install/Scripts.pm
delete mode 100644 inc/Module/Install/Win32.pm
delete mode 100644 inc/Module/Install/WriteAll.pm
create mode 100644 t/22_filter_regex.t
create mode 100644 t/release-pod-syntax.t
delete mode 100644 xt/97_podspell.t
delete mode 100644 xt/99_pod.t
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/pkg-perl/packages/libweb-scraper-perl.git
More information about the Pkg-perl-cvs-commits
mailing list