I'm looking for a web scraper for a one-off project to extract all example sentences and phrases from a free online bilingual dictionary, for personal use in a language-learning flashcard app.
I need all the example phrases and sentences from www.[***].com/language1-language2, plus a separate collection of all sentences from www.[***].com/language2-language1, as the two databases aren't exact mirror images of each other. That means two separate databases: one going from language 1 to language 2, and another going from language 2 to language 1. I require every single example sentence from the entire dictionary, which I estimate at 500,000-1,500,000 sentences. The only pieces of information I require are the phrase, its translation, and the headword (dictionary entry) to which it belongs.
I've included an example .xls file to show what I want. I need the original phrases in the first column, the translations in the second column, and the headword in the final column. Note that the contextual information (the words in round brackets) goes in the translation column. The output could be two Excel files or, if the files end up too big, text files (UTF-8 encoded) with the columns separated by tabs.
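To make the text-file option concrete, here is a minimal sketch in Python of the layout I have in mind: one line per sentence, three tab-separated columns (phrase, translation, headword), UTF-8 encoded. The function names `clean` and `write_tsv` and the sample row are hypothetical and only for illustration; the scraping itself is not shown.

```python
def clean(field: str) -> str:
    """Collapse all whitespace runs (including tabs and newlines) to single
    spaces, so a field can never break the tab-separated layout."""
    return " ".join(field.split())


def write_tsv(rows, path):
    """Write (phrase, translation, headword) tuples to a UTF-8 text file,
    one row per line with columns separated by tabs. Returns the row count."""
    count = 0
    with open(path, "w", encoding="utf-8", newline="\n") as f:
        for phrase, translation, headword in rows:
            columns = (clean(phrase), clean(translation), clean(headword))
            f.write("\t".join(columns) + "\n")
            count += 1
    return count


if __name__ == "__main__":
    # Hypothetical sample row; the contextual note in round brackets
    # stays in the translation column.
    sample = [("à bientôt", "see you soon (informal)", "bientôt")]
    write_tsv(sample, "language1-language2.txt")
```

Note that an Excel worksheet holds at most 1,048,576 rows (65,536 in the older .xls format), so if a direction turns out to be at the upper end of my estimate, the text-file route is likely the safer choice.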
I’m less concerned about the time that the project takes, and more about the accuracy, completeness, and cost.
Hello, I am ready to start work anytime. The price for the project is £50, with fast delivery: complete in 5 days. By the way, will you provide the headword list? Regards, Santos