I need a web scraper written for the following url:
[login to view URL]
All of the information needed is available on the main page; the number of rows will vary.
The output should be a pipe (|) delimited file with the following column mappings:
origin_city --> data located in the "Load City" column
origin_state --> data located in the "Load St" column
ship_date --> data located in the "Load Date" column, changed to the YYYY-MM-DD format
destination_city --> data located in the "Dest City" column
destination_state --> data located in the "Dest St" column
receive_date --> data located in the "Drop Date" column, changed to the YYYY-MM-DD format
trailer_type --> data located in the "Equip Type" column
load_size --> add text "full" to the column
weight --> leave blank
length --> leave blank
width --> leave blank
height --> leave blank
trip_miles --> data located in the "Miles" column
pay_rate --> leave blank
contact_phone --> data located in the "Phone" column
contact_name --> data located in the "Contact" column
tarp_required --> leave blank
comment --> add data from the "Email" column
load_number --> data located in the "Order No" column
commodity --> leave blank
The first line of the output should contain all of the column headers.
Any field that contain no data should be left blank.
Please do not use words like "null" or "blank" in blank columns.
Below is a sample output of the first 5 columns using sample data:
The deliverable will be a Perl .pl file that must run on
Ubuntu Linux and must use Modern::Perl. The Perl .pl file
should be called '[login to view URL]' and the output file should be
called '[login to view URL]'
It will be scheduled in cron to run unattended every 15 minutes.
We suggest WWW::Mechanize but you are free to use other Perl libraries.
Please specific what language/OS/tools you will be using in your bid.
Also, please include the word "raccoon" in your bid so I know that
you read this description.
First of all I know about "raccoon". I'm talking about same scraping script as before: I'll use following Perl modules WWW::Mechanize, HTML::TreeBuilder::LibXML etc.