[Popcon-developers] Popcon time-series data points
Lucas Nussbaum
lucas at debian.org
Wed Feb 14 11:47:17 UTC 2018
(Adding popcon developers to Cc, and reordering paragraphs so that it's
easier to follow)
On 14/02/18 at 10:47 +0000, Nikolaos Alexopoulos wrote:
> On Tue, 2018-02-13 at 19:36 +0100, Lucas Nussbaum wrote:
> > On 12/02/18 at 13:23 +0000, Nikolaos Alexopoulos wrote:
> > > Hi everyone,
> > >
> > > As part of our research here at TU Darmstadt, we would like to have
> > > access to the time points used to generate the Debian popcon
> > > graphs. Is
> > > there a person we could talk to in order to gain access to the
> > > data,
> > > while of course respecting the submitters' privacy?
> >
> > I don't think that Debian has a policy in place that would allow to
> > share the raw data.
> >
> > Maybe it would be easier if you explained what you are trying to do,
> > and
> > someone generates the required processed data for you?
>
> In a nutshell, what we want to do is find correlations between the
> (changes in) popularity of Debian (source) packages and the
> bugs/vulnerabilities reported for them over time. Therefore, we do not
> need raw data, rather we just need snapshots of the popcon public data
> in monthly intervals for the past years. I would assume that sharing
> this data does not infringe on popcon's privacy statement.
OK, I looked into how the popcon graphs are implemented.
If nobody objects by 2018-02-21, I'll send you the output of
select p.day, package, p.vote, p.old, p.recent, p.no_files,
submissions
from popcon_package pp
join popcon p on (pp.id = p.package_id)
join popcon_day pd on (pd.day = p.day)
where in_debian order by p.day;
Which results in something like:
day | package | vote | old | recent | no_files | submissions
------------+---------+--------+-------+--------+----------+-------------
2004-01-24 | bash | 1600 | 48 | 33 | 0 | 1684
2004-01-25 | bash | 1727 | 49 | 35 | 0 | 1814
2004-01-26 | bash | 1731 | 52 | 41 | 0 | 1828
2004-01-27 | bash | 1724 | 53 | 54 | 0 | 1835
2004-01-28 | bash | 1727 | 52 | 61 | 0 | 1844
2004-01-29 | bash | 1620 | 50 | 65 | 0 | 1739
2004-01-30 | bash | 1631 | 48 | 67 | 0 | 1750
2004-01-31 | bash | 1639 | 49 | 74 | 0 | 1766
2004-02-01 | bash | 1871 | 43 | 103 | 0 | 2020
2004-02-02 | bash | 1890 | 34 | 112 | 0 | 2039
2004-02-03 | bash | 1910 | 33 | 106 | 0 | 2052
2004-02-04 | bash | 1909 | 33 | 103 | 0 | 2047
(Please remind me if I forget to send this to you)
- Lucas
More information about the Popcon-developers
mailing list