Every day the Government Accountability Office (GAO) publishes 1-4 cases in PDF on its website [login to view URL] <[login to view URL]> 1) I would like to download those each day. Each case has a separate number such as B-41804, unfortunately the GAO labels the file 706776, I'd like the file saved as a B-414804 rather than 706776 if possible, in PDF format; 2) once downloaded with a sensible file name, I would like data extracted from the PDF file into a CSV or MS Excel, this data would be "Matter of: [Company Name]"; "File: B--[6 digit number sometimes separated by a semicolon with other B-Numbers]; "Date:" [Month, Day, YYYY"[Between two Lines is the names of the lawyers]; "Office of the General Counsel, GAO, participated in the preparation of the decision."; "DIGEST" [a text box varying from a few lines to almost one page]. I'd like a CSV file or MS Excel file with Matter of, File, Date, Counsel, and Digest as headings; and 3) I'd like to see a weekly report with all the cases for the week properly named in PDF and with the CSV or MS Excel file with the key data; and 4) if this works out I'd like to get every case we can find on the GAO website, which goes back probably about 30 years and tens of thousands of cases.
I can do this for you. I will download all the files and will rename accordingly. And then I will extract data into csv files as per your requirements. message me so we can discuss it further. Thanks
Hi, I have read your project and would like to offer a detailed-oriented job to get the .CVS files you need. I have run a test to make sure I am able to complete the task. Thank you for reading my proposal.
I've good typing skills, so I believe I can complete this project in proper time manner. If you require, you can just send me a small project to prove my skills.
I am able to extract the data to do a conversion and help write a macro for the format you want for all future extractions Relevant Skills and Experience Basic macro programming Data processing Data analysis