There are about 200 web sites listed in an excel file.
Here is a sample row of the excel file:
Name url article 1 article 2 article 3 article 4 article 5
Dallas Morning News [url removed, login to view]
1.) You need to go to the web site, select 5 article urls (any articles will do, but they must be articles) and record them in the excel columns.
2.) Create a directory with the name of the source (for example "[url removed, login to view]")
3.) For each url you select, extract the article text carefully!!!!
4.) Save the text for each article in the appropriate directory where the article url is the title of the file.
"[url removed, login to view]" (see an example)
5.) Place all 5 articles in the directory with the domain name as the directory name.
I'll provide the excel file of the 200 to the winning bidder.
Sorry -- the 200 blogs are on the project clarification board itself as an attached file.