I need spiders created for 3 websites. They should be separate applications, each written in C# with SQL Server for the DBMS. Will need for you to compile the apps since I don't have Visual Studio, but will require finished source code when done.
The 3 sites to spider are:
1. Press Releases on [url removed, login to view]
2. [url removed, login to view]
3. ttp://[url removed, login to view]
Use Proxies (ability to provide a list of private proxies)
Need two modes: mode 1 - spider all available pages of press releases until stopped; mode 2 - only spider new releases added, and then stop (this would be for running as a scheduled task)
spider every page
capture publication time
capture email (if email is obscured by using " at " instead of "@", change back to "@")
capture phone via Regular Expression
capture article URL
capture URL of the company being promoted in the press release
Output the program's activity to the screen and to a log file. Both activity outputs should indicate the proxy that is currently being used. Also should output URL of page being loaded at the moment. This should make it easier to understand what's going on if the program gets stuck or stops working for any reason.
Hello I'm interesting your project very well I'm a Good C#, Scrap, DB, Algorithm expert. I m quite well experienced in these jobs. Let's go ahead with me I want to service for you continously. Thanks