[Daca-general] Statistics about source languages

Raphael Geissert geissert at debian.org
Sun Dec 26 03:25:22 UTC 2010


Hi,

On 21 December 2010 15:58, Michael Tautschnig <mt at debian.org> wrote:
> Hi all,
>
> Does anybody know of statistics on how source code is distributed over the
> various programming languages? I'm asking this question on the DACA list as it
> might be valuable information to guide the choice of analysis tools and to make
> more informed guesses about, e.g., the importance of Java analysis tools.

Not really, but read below.

> What I have in mind is something like a sloccount run over all source packages,
> but probably excluding some auto-generated files like configure.

I've run ohcount (the software used for ohloh.net) on the archive and
will soon publish the results on the DACA website. Note that they are
on a per-source-package basis and are in a format that needs a bit of
work to make it machine-readable. If anyone wants to work on it, that
would be great (ohcount also has an option to list the licences of
every file).

As for configure and other autoconf-generated files, ohcount treats
them as a separate "language", so the thousands of lines of code
in configure files are not counted as shell script.
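
Once the per-package results are in a machine-readable form, summing them
per language is straightforward. The following is only a rough sketch in
Python: the tab-separated input layout (package, language, lines of code)
is a hypothetical intermediate format, not ohcount's actual output, and
the language name "autoconf" used for the exclusion is likewise an
assumption to be checked against the real data.

```python
from collections import Counter


def aggregate(records, exclude=("autoconf",)):
    """Sum lines of code per language across all source packages.

    Each record is assumed (hypothetically) to look like
    "package<TAB>language<TAB>code_lines". Languages named in
    `exclude` are skipped, e.g. generated configure scripts.
    """
    totals = Counter()
    for line in records:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        _package, language, code_lines = line.split("\t")
        if language in exclude:
            continue
        totals[language] += int(code_lines)
    return totals


def shares(totals):
    """Return each language's fraction of the total line count."""
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}
    return {lang: count / grand_total for lang, count in totals.items()}


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    sample = [
        "foo\tc\t100",
        "foo\tautoconf\t5000",
        "bar\tjava\t50",
        "bar\tc\t25",
    ]
    for lang, count in aggregate(sample).most_common():
        print(f"{lang}\t{count}")
```

Keeping the exclusion list as a parameter makes it easy to compare totals
with and without generated files.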

Cheers,
-- 
Raphael Geissert - Debian Developer
www.debian.org - get.debian.net


