Please read the ENTIRE description; otherwise you will be ignored (I will NOT reply to you if you have not read the description).
I need an experienced MySQL developer to create an algorithm and MySQL code that checks the similarity percentage between a text submitted via a form and a number of texts stored in a MySQL database.
READ THIS, IMPORTANT!
I am looking for a MySQL stored procedure that works the same way as the PHP similar_text() function. A MySQL Levenshtein distance procedure already exists, so you may use it to help you create the similar_text() equivalent.
When the user submits a text, the algorithm should return every entry in the database whose similarity to the submitted text exceeds a given percentage (you will compare only one column in the database), e.g. return all entries that have similarity > 40% with the text submitted by the user.
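For reference, here is a minimal Python sketch of the algorithm that PHP's similar_text() is generally understood to use: find the longest common substring, then recurse on the parts to its left and right, and compute the percentage as 2 * sim / (len1 + len2) * 100. The function names similar_text and similar_percent are only illustrative, and the sketch compares Python characters where PHP compares bytes, so results may differ for multi-byte text. It is not the requested MySQL code, only a description of the behaviour to reproduce.

```python
def similar_text(a, b):
    """Count matching characters the way PHP's similar_text() is understood to:
    take the first longest common substring, then recurse left and right of it."""
    if not a or not b:
        return 0
    best_len = best_i = best_j = 0
    for i in range(len(a)):
        for j in range(len(b)):
            k = 0
            # Extend the match starting at a[i], b[j] as far as it goes.
            while i + k < len(a) and j + k < len(b) and a[i + k] == b[j + k]:
                k += 1
            # Strict '>' keeps the first longest match found.
            if k > best_len:
                best_len, best_i, best_j = k, i, j
    if best_len == 0:
        return 0
    left = similar_text(a[:best_i], b[:best_j])
    right = similar_text(a[best_i + best_len:], b[best_j + best_len:])
    return best_len + left + right


def similar_percent(a, b):
    """Similarity as a percentage: 2 * sim / (len(a) + len(b)) * 100."""
    if not a and not b:
        return 0.0
    return similar_text(a, b) * 2.0 * 100.0 / (len(a) + len(b))


print(similar_text("World", "Word"))  # 4
```

Note that the result can depend on argument order, because the first longest match wins; a faithful MySQL version would need to keep the same order of comparison.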
Example table:
TABLE - Articles
id, article_body, article_title
The code should return all rows whose article_body has a similarity percentage > 40% (or another given value) with the text the user has submitted.
Only ONE column (the article_body column) will need to be compared.
The algorithm must be optimized as much as possible.
When you write the code, you will need to explain each line of the MySQL code.
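One possible optimization, sketched here in Python under the assumption that the percentage is computed as 2 * sim / (len1 + len2) * 100, as in PHP's similar_text(): since sim can never exceed the length of the shorter text, the percentage can never exceed 200 * min(len1, len2) / (len1 + len2). Rows whose length alone rules out the threshold can be skipped before the expensive comparison. The helper names and the (id, body_length) row shape are hypothetical; in MySQL the same idea could become a length condition in the WHERE clause.

```python
def max_possible_percent(len_a, len_b):
    """Upper bound on the similarity percentage, given only the two lengths."""
    total = len_a + len_b
    if total == 0:
        return 0.0
    return 200.0 * min(len_a, len_b) / total


def candidate_ids(rows, query_len, threshold):
    """Keep only rows that could still exceed the threshold.

    rows: iterable of (id, body_length) pairs (hypothetical shape).
    """
    return [row_id for row_id, body_len in rows
            if max_possible_percent(query_len, body_len) > threshold]


# Hypothetical article lengths; the submitted text is 1000 characters long.
rows = [(1, 1000), (2, 250), (3, 100), (4, 3000)]
print(candidate_ids(rows, 1000, 40.0))  # [1, 4]
```

Only the surviving rows would then need the full similarity calculation.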
Exclusive and complete copyrights to all programming code will belong to me. You may not distribute the programming code in any way.
Dear sir,
I am strong in programming, especially in algorithms, and in data mining and information retrieval. I am very familiar with similarity measures such as Levenshtein distance and Jaccard similarity. I have implemented Levenshtein distance in several languages using dynamic programming, which greatly improves efficiency.
I have also developed stored procedures in MySQL. I can deliver this project with high quality.
I look forward to your response.
Thank you
BR
I have read the description and am ready to start. We develop tools for database server administration, so I have the necessary experience with MySQL and its procedural language.