Part of my research includes the scraping and visualization of bulk Tweets. I end up seeing a lot of sentences this way, far more than I will ever actually read. I use Mallet to text mine my Twitter collections.
I have noticed a number of groups attempting to create what appear to be wire services either with regional handles or candidate specific Twitter streams, such as the candidate_news_network and POLS. These are messing up my analysis as they pump huge volumes of uniform text with little relevance.
Here is my line on these enterprises: they are a false start at best and generally junk.

Does anyone really care what a brand new wire service writes in a microblog format? I can’t believe that any real fan of a candidate needs a curated Tweet stream when they can have the real thing.
This is Twitters problem in a nutshell: the news services are a poor substitute for a poor substitute for real news and analysis. Are these services intended for “novice” users? Newsflash: there aren’t many of those, Twitter is slowly burning out, not building new. My best guess is that these news services are intended to accumulate followers and then sell out. Sort of like Twitter should have long before that IPO thing.
What does this mean for me? Finding ways to clean this stuff out of my dataset, blerg.
