[newmaint-site] contributors.d.o source for git.debian.org

Martín Ferrari tincho at debian.org
Wed Jan 7 03:08:57 UTC 2015


Hi!

I finally had some time to work on the data collection for git.d.o. It
is finally working!!

You can see the current results at
https://contributors.debian.org/source/git.debian.org

As I am writing this, a definitive run is executing, but it will take a
few hours until is ready.

Some notes I gather while working on this:

* There are about 15k repositories in alioth, and shell globs take
minutes to process, so I am using find to find the repositories and
generate a config that is then used to run dc-tool.

* There seems to be some git repos inside other git repo directories,
which is pretty weird. Most of them seem to be non-bare repositories
with bare-repo-looking names (e.g.
/srv/git.debian.org/git/debian-med/liboptions-java.git/.git). Since find
takes 2 orders of magnitude more time if I don't stop processing at the
first occurence, I chose to ignore them.

* Processing 15k repositories takes a few hours, so I tried to trim this
down. About 6k repositories are inside pkg-perl and collab-maint, so I
have excluded them from processing. I have also excluded user
directories (*/users/*.git), and other projects already tracked
separately (nm, DSA, debbits). There are still about 9k repositories. As
we add more specialised sources, we can exclude more from this thing.

And a few notes/questions for Enrico:

* I noticed that the NM source is not picking up commits from the
python-debiancontributors repo, of if it is doing it, it is only looking
at file owners, since I am not there :-)

* Also, the NM source has not been updated in the past 2 weeks, so
something must be broken there..

* I have another patch for the python code (it croaked on Unicode commit
logs), and I don't even remember how I was exporting patches to be sent
by email.. Can you pull from my repo? (commit
b8dfde9555a074488e64ecda96c7259afd3fcd09 at
/srv/home/users/tincho/python-debiancontributors)

There are a few groups with over 100 repos that I would like to treat
separately, to make this lighter, and because they clearly deserve their
own source. Any takers?

103 pkg-common-lisp
115 pkg-vdr-dvb
131 pkg-grass
133 debian-med
145 pkg-horde
171 d-i
208 pkg-fonts
226 pkg-php
246 pkg-xorg
260 pkg-ocaml-maint
264 openstack
268 pkg-javascript
293 pkg-games
504 pkg-multimedia
524 debian-science
597 pkg-java
633 pkg-kde
801 pkg-ruby-extras


-- 
Martín Ferrari (Tincho)



More information about the newmaint-site mailing list