Home > gentoo, gsoc, planet-gentoo > GSoC midterm report

GSoC midterm report

Hi all,

Welcome to the midterm report of the ‘Package statistics’ project.

Summary

The goal of this project is to implement a client-server architecture

for reporting and querying package statistics of Gentoo based machines.
The client program will be used to collect package statistics from
Gentoo installations and submit them to a central server. The server
will calculate useful statistics based on the global dataset, that
developers as well as end users have access to, via an intuitive web
interface.

For the past few days, I’ve been working on the webUI, adding pages for stats.
We’ve also managed to get the webapp running (finally :D ) on vulture. Thanks
to my mentor antarus, robbat2, and the rest of the infra team for helping out.
We hit a few snags, but managed to ease them out in the end. Also, apologies for
making a stupid mistake of committing my mysql password to git (:P).

What works

  • Submitting host stats using a client script
  • Accessing host stats at /host/≤uuid>
  • Arch stats: /arch
  • Package stats:
    • /package/<category>
    • /package/<category>/<pkgname>
    • /package/<category>/<pkgname>-<version>
      (An optional ?top=N can be added to the url for the no. of top items)
  • Repository stats: /repo
  • Keyword stats: /keyword
  • Useflag stats:
    • /use
    • /use/≤useflag>
  • Portage FEATURES stats: /feature
  • Language stats: /lang
  • Mirror stats: /mirror
  • Profile stats: /profile

What doesn’t work (yet)

  • Package search
  • Rating of packages
  • Graphs
  • Bugzilla, tinderbox integration
  • Export the stats to JSON

What needs work

  • The webUI should be prettier
  • The repository and useflag stats could be improved

I think I can finish the remaining goals in another 2-3 weeks. After that, I’ll consider working on some of my stretch goals.

I’m also working on improving the packages.gentoo.org api, so that there’s an easy way to access the portage tree state, and enrich the package stats.

Help me out by submitting your stats to the server. An ebuild for the client is available in the repo. Please report bugs, exceptions etc.

Got any feature suggestions/ideas ?

About these ads
Categories: gentoo, gsoc, planet-gentoo
  1. July 14, 2011 at 11:37 am | #1

    You should add the overlay containing app-portage/gentoostats to http://www.gentoo.org/proj/en/overlays/repositories.xml. Keep up the good work! I’ve been looking for something like gentoostats for years. :-)

    • August 8, 2011 at 3:33 am | #2

      Okay, thanks to sping, my ebuild has now been added to betagarden.

  2. July 19, 2011 at 8:38 pm | #3

    Would you mind including app-portage/pfl fucntionality?
    If it will collect statistics that utility can be very useful imho.

  3. Patryk
    August 3, 2011 at 3:00 pm | #4

    Hi, are you planning to provide info for “which package holds file xxx”? I was about to do something like that by just renting a virtual server with gentoo somewhere, emerge all packages and provide a web frontend to equery, but your attempt makes more sense provided it would deliver such information?

    • August 8, 2011 at 3:35 am | #5

      Since pfl already provides this functionality, it’s not really my priority right now. I might do this outside the SoC, classifying the files installed by a package using useflags or something.

  4. August 8, 2011 at 6:36 am | #6

    That’s a best Gentoo GSoC project I’ve seen… keep it up!

  1. August 8, 2011 at 4:04 am | #1
  2. August 23, 2011 at 2:33 am | #2

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

%d bloggers like this: