[newmaint-site] Bug reporters to DC
Enrico Zini
enrico at enricozini.org
Mon Apr 21 08:21:42 UTC 2014
On Sun, Apr 20, 2014 at 07:46:59PM +0200, Simó Albert i Beltran wrote:
> Firstly, thanks for your work.
> I pushed initial data of bug reporters to DC:
> https://contributors.debian.org/source/bugs.debian.org
> I am coding the following utility to post this data:
> https://gitorious.org/sim6/udd2dc
> Please, feel free to comment it or do merge requests.
In https://contributors.debian.org/contributors/site_status
the number of mapped email identifiers went from about 2713 to 5065.
The number of not-mapped identifiers went from about 5000 to 35717. That
is an order of magnitude more! I'm excited thinking where this can take
the perception Debian has of itself.
I've tried turning that into a dc-tool mining configuration. It can
*almost* be done, except dc-tool does not support having SQL queries
return a min-max date range. It shouldn't be difficult to have dc-tool
deal correctly with a query like this, but it's not there yet:
contribution: bug-submission
method: postgres
db: service=udd
query:
select
submitter_email as id,
min(arrival) as since,
max(arrival) as until
from
all_bugs
where
arrival != 'epoch'
and
arrival <= 'now'
group by
submitter_email
having
count(*) > 1
and
max(arrival) >= '-infinity'
url: https://bugs.debian.org/cgi-bin/pkgreport.cgi?correspondent={id}
Would it make sense to add something along the lines of:
"HAVING until - since > two weeks" ?
Ciao,
Enrico
--
GPG key: 4096R/E7AD5568 2009-05-08 Enrico Zini <enrico at enricozini.org>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: Digital signature
URL: <http://lists.alioth.debian.org/pipermail/newmaint-site/attachments/20140421/97629333/attachment.sig>
More information about the newmaint-site
mailing list