|
| View previous topic :: View next topic |
| Author |
Message |
acszs
Joined: 02 May 2013 Posts: 1
|
Posted: Thu May 02, 2013 8:23 am Post subject: An up to date keyword set on global news |
|
|
My question is not strictly binds to the topic of text mining, but maybe you can help.
I am hunting for a keyword set, which has the following criterions:
- contains only english words/n-gramms or named entities
- manual (tagged by human) tags of global news
- has some main topics (5-10), e.g. tech, business, sport, ...
- has to be relatively big (10000+ tags)
- has be up to date
- it would be nice that the keywords have frequency weights.
I thought that couple of the biggest news portals has tag cloud which (partly) fits on the criterions. But I didn't find anything on BBC News, CNN News, Reuters, ...
Interestingly I found some portals in my mother language (hungarian), I cant believe that, there isnt anything on global level.
I dont need API, I can parse the HTML if necessary.
Maybe a corpora can be useful.
Thanks. |
|
| Back to top |
|
 |
editor Site Admin
Joined: 04 Oct 2005 Posts: 120 Location: Boston, MA
|
|
| Back to top |
|
 |
|
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
Powered by phpBB © 2001, 2005 phpBB Group
|
|
|