A bot to collect articles from 3rd party websites, and..

進行中 投稿 May 16, 2011 着払い
進行中 着払い

I need a bot that will crawl websites (e.g huffington post, gizmodo, ect) for new articles:

- Then it would break the article up into the most commonly used 10 words

- Then it would run that data against a mysql field contained with keywords associated with a username

- Then it will compile a list of usernames that have more then 3 keywords associated with that article

- Then it will create a mysql entry posting that the usernames have been mentioned in the artical URL, which will have to be collected.

Can be done in any programming language, but it has to run every 10 minutes or so (If in PHP i'll use a cron job)

MySQL

プロジェクトID: #1062175

プロジェクトについて

5個の提案 リモートプロジェクト アクティブ May 17, 2011

5人のフリーランサーが、平均£161 で、この仕事に入札しています。

mantislin

Hi sir, Please check PM.

£250 GBP 6日以内
(26件のレビュー)
4.9
greggfletcher

Hello. Check PM. Thanks.

£175 GBP 5日以内
(3件のレビュー)
4.2
golekduit

let start parsing using PHP DOM easy

£30 GBP 1日以内
(5件のレビュー)
2.8
StiiCeva

I've done this before.

£150 GBP 3日以内
(0件のレビュー)
0.0
Diiinnovation

*** We can do it as per Your requirement *** Please check PM ***

£200 GBP 5日以内
(0件のレビュー)
0.0