KDnuggets : News : 2004 : n21 : item3 | PREVIOUS | NEXT |
FeaturesFrom: Aleks JakulinDate: 27 Oct 2004 Subject: Data Mining of Political Data We have taken the US Senate roll call voting data, which disclose how each of the senators voted in a particular issue. There is a lot of this: there are the 100 senators and there are almost 500 issues per year. Several organizations examine how "friendly" individual senators are to them, but for an ordinary voter, there is just too much hassle. Political scientists, however, regularly observe these datasets with special-purpose models. Our objective was to check if the "usual" algorithms It turns out that the data mining tools are quite complementary to those already used in political science. We can do lots of things:
We have used the general-purpose Orange data mining toolkit (http://www.ailab.si/orange/) which is Python and GPL. Furthermore, we have used the MPCA discrete probabilistic principal components modeling kit (http://cosco.hiit.fi/search/MPCA), also under GPL, to identify the blocs in the senate. Our scripts and data are all freely available. We also have two working papers there that discuss everything in more detail.
mag. Aleks Jakulin
|
KDnuggets : News : 2004 : n21 : item3 | PREVIOUS | NEXT |
Copyright © 2004 KDnuggets. Subscribe to KDnuggets News!