[newmaint-site] Bug reporters to DC

Enrico Zini enrico at enricozini.org
Mon Apr 21 08:21:42 UTC 2014


On Sun, Apr 20, 2014 at 07:46:59PM +0200, Simó Albert i Beltran wrote:

> Firstly, thanks for your work.
> I pushed initial data of bug reporters to DC:
> https://contributors.debian.org/source/bugs.debian.org
> I am coding the following utility to post this data:
> https://gitorious.org/sim6/udd2dc
> Please, feel free to comment it or do merge requests.

In https://contributors.debian.org/contributors/site_status
the number of mapped email identifiers went from about 2713 to 5065.

The number of not-mapped identifiers went from about 5000 to 35717. That
is an order of magnitude more! I'm excited thinking where this can take
the perception Debian has of itself.

I've tried turning that into a dc-tool mining configuration. It can
*almost* be done, except dc-tool does not support having SQL queries
return a min-max date range. It shouldn't be difficult to have dc-tool
deal correctly with a query like this, but it's not there yet:

  contribution: bug-submission
  method: postgres
  db: service=udd
  query:
  	select
  	    submitter_email as id,
  	    min(arrival) as since,
  	    max(arrival) as until
  	from
  	    all_bugs
  	where
  	    arrival != 'epoch'
  	    and
  	    arrival <= 'now'
  	group by
  	    submitter_email
  	having
  	    count(*) > 1
  	    and
  	    max(arrival) >= '-infinity'
  url: https://bugs.debian.org/cgi-bin/pkgreport.cgi?correspondent={id}

Would it make sense to add something along the lines of:
"HAVING until - since > two weeks" ?

Ciao,

Enrico

-- 
GPG key: 4096R/E7AD5568 2009-05-08 Enrico Zini <enrico at enricozini.org>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: Digital signature
URL: <http://lists.alioth.debian.org/pipermail/newmaint-site/attachments/20140421/97629333/attachment.sig>


More information about the newmaint-site mailing list