Dear Software Developers,
My company needs a Python script that can manipulate various types of data in Excel files (read and write in Excel sheets according to simple rules).
It needs to be able to open web pages and parse the text for keywords.
The input would be an Excel sheet with a column containing links to clinicaltrials.gov. The script would need to read each web page, look for the keyword "*investigator: " and copy the string of characters that follows it, up to the end of the line, into a pre-defined column in the same Excel sheet (or a copy of it). Principal investigators and sub-investigators need to go into different columns to differentiate them.
Here are some examples:
- with only 1 principal investigator: [login to view URL]
- with several principal investigators: [login to view URL]
- with sub-investigators as well: [login to view URL]
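To make the extraction rule concrete, here is a minimal sketch in Python. It assumes the page has already been converted to plain text and that the labels read "Principal Investigator: " and "Sub-Investigator: " (our reading of the "*investigator: " keyword; the real page layout may differ). The function name `extract_investigators` is only illustrative.

```python
import re

# Assumption: labels look like "Principal Investigator: <name>" or
# "Sub-Investigator: <name>", and the value runs to the end of the line.
INVESTIGATOR_RE = re.compile(
    r"(principal|sub)[- ]?investigator:[ \t]*([^\r\n]*)",
    re.IGNORECASE,
)


def extract_investigators(page_text):
    """Return the investigator strings found in page_text, grouped by role."""
    found = {"principal": [], "sub": []}
    for match in INVESTIGATOR_RE.finditer(page_text):
        role = match.group(1).lower()      # "principal" or "sub"
        value = match.group(2).strip()     # text after the label, up to end of line
        if value:                          # ignore labels with nothing after them
            found[role].append(value)
    return found
```

Each list could then be joined (for example with "; ") and written to its own column, one for principal investigators and one for sub-investigators.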
The aim is to be able to list all the investigators of some specific clinical studies, and we will provide the script with an Excel sheet with direct links to all the studies we are interested in.
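As a rough illustration of the "read each web page" step, the sketch below downloads a page and flattens its HTML into plain text lines, using only the Python standard library. The helper names (`html_to_text`, `fetch_text`), the User-Agent string and the choice of which tags start a new line are all assumptions made for this example. If a page builds its content with JavaScript, a plain download like this will not contain the text and another approach would be needed. Reading the links from the workbook and writing the results back is not shown here.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen

# Assumption: these tags mark line boundaries; table cells only add a space,
# so a label cell and its value cell end up on the same text line.
BLOCK_TAGS = {"br", "p", "div", "tr", "li", "h1", "h2", "h3", "h4", "table", "ul", "ol"}
CELL_TAGS = {"td", "th"}


class _TextExtractor(HTMLParser):
    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")
        elif tag in CELL_TAGS:
            self.parts.append(" ")

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self._skip_depth = max(0, self._skip_depth - 1)
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(html):
    """Strip tags and return the visible text, one non-empty line per block."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    lines = (" ".join(line.split()) for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)


def fetch_text(url, timeout=30):
    """Download one page and return its text (requires network access)."""
    request = Request(url, headers={"User-Agent": "investigator-list-script"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return html_to_text(response.read().decode(charset, errors="replace"))
```

The text returned for each link can then be searched line by line for the investigator labels.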
If this project is successful, we have another project that is no more complex and will be very useful for us. It is also data manipulation in Excel files (even simpler than this one, in fact, but this one is more urgent). We will have a lot of repeat business for you!
Thanks in advance for your help!
Hello, how are you? I read your description carefully. I have good experience in web scraping with Python and C#, so I can finish quickly. I would like to discuss the details via chat. Thanks.
Hello. I am a Python developer. I have solid experience in both areas you need: web scraping and Excel data processing. You can find reviews of similar past tasks on my profile page. Best regards, Pavlo