Data Processing/Scraping from Standard Format txt Files

進行中 投稿 Oct 8, 2013 着払い
進行中 着払い

Hi, we are looking to hire someone to manipulate already existing data files (will be given web link) that are in a standard .txt file format with numeric and text entries to a format used for computing.

1) We would like you to start with taking 100 of the entries (randomly selected with random number generator) in one of the 30 files we will give you.

2) We would like you to transform these 100 entries into a matrix in .csv form based on pre-specified categories given by us. Two of the columns are word and word count. Another is entry ID.

3) We also would like a sparse representation of the two columns of word and word count where there is a new matrix (rows are entry #, columns are word label - filled with the count) and that depends on size of file. We can talk about this.

4) The deliverable should be in manageable csv file sizes, which won't be a problem for this data...

But, we will definitely have more work if this is done successfully (over all files and more entries needed), so scalable routines are highly encouraged. Thinking about a million entries with a higher budget, if this goes well.

Thank you very much.

Please note that we will only hire someone who has the ability to do this automatically since we are looking for FUTURE work primarily. This is just a pilot.
Once we go from 100 entries to 1 million, manual typing will not work. We realize that file size will be an issue depending on the matrix, so if things eventually need to be broken apart into let's say 1000 files of 1000 entries, we will then use this with parallel computing routines for our computations. Thank you so much and we look forward to working with you.

ビッグデータ営業 データ入力 データマイニング 情報処理 ウェブ記事のスクラップ

プロジェクトID: #5006785

プロジェクトについて

40個の提案 リモートプロジェクト アクティブ Oct 9, 2013

40人のフリーランサーが、平均$141 で、この仕事に入札しています。

jaylancer43

Hello - I am an expert techno-functional analyst having vast experience in lots of arenas of IT industry including Excel Macros. I am an Engineering Graduate with an MBA degree. If you see, I am among the niche bid もっと

$111 USD 3日以内
(414件のレビュー)
8.0
Toperfection

Dear "statsphd" Hope you are doing well. I have reviewed the project details and would like to offer our services. We have completed many Research/Data collection/Product add/Data mining assignments on [login to view URL] もっと

$151 USD 3日以内
(168件のレビュー)
7.8
uumairkhalid

Hi.. Expert web scraper/Data Minor here. Interested in your project. I assure you 100% accurate and good quality work. Regards

$105 USD 3日以内
(189件のレビュー)
7.1
tjawad17

Hello Sir, We are a professional company specialized in Data Mining and Web Scraping. We have our own server, team and tools for data mining and scraping efficiently and accurately. We can parse your given text もっと

$155 USD 4日以内
(165件のレビュー)
6.9
happy2helpp

Respected sir, We saw project description and got complete idea about project. We are expert in Big Data, Data Entry, Data Mining, Data Processing and Web Scraping!!! We have worked on many similar tasks before and もっと

$231 USD 4日以内
(84件のレビュー)
6.9
diamond247

Hello Sir, We are a big set up company with excellent skilled operator who have a lot of experience in this segment, our employee complete more than 300 similar job, i have gone through your project specification, i もっと

$144 USD 3日以内
(243件のレビュー)
7.1
ashok7925

Hi, I am much interested in this work. Please share me more details with sample text file and describe me what would like to do. I can automate all of the process once I get understood your requirement. Please sha もっと

$100 USD 3日以内
(33件のレビュー)
5.3
elMancha

Hello there. I have high Excel and Visual Basic skills with great professionalism. I study electronics and computer engineering at Oporto university and I'm looking for work to fill the blanks on my schedule. I' もっと

$60 USD 3日以内
(40件のレビュー)
5.0
arvt

Hi I'm interested and I like to know more details about your project to bid accordingly. I have experience doing programs and scripts in some projects here and in other freelancer site. I have Skype, Gtalk, MS もっと

$35 USD 3日以内
(12件のレビュー)
4.9
mohanlg

Hi, I am interested to do these project work. Expert in data conversion work. Please send me more details of work to start. Thanks sunny

$35 USD 2日以内
(25件のレビュー)
4.3
RajakScripts

Hi, Please attach the .txt file AND a matrix in .csv form based on your given pre-specified categories for a review, so I can adjust my bid & delivery time precisely. Yes, I aware that you want this to be perform もっと

$88 USD 3日以内
(7件のレビュー)
4.3
gokhanonal

Dear Sir / Madam, I'm a computer engineer (with BS Degree), working freelance in Istanbul, Turkey. I can complete your project as fast & accurate. Please let me know. Looking forward to hearing from you soon, もっと

$35 USD 1日以内
(13件のレビュー)
3.6
signo

Hello, I am experienced in working with large files and back-end processing in general. I will definitely finish this project in the next 24 hours. I still need some clarifications before getting started, regardi もっと

$133 USD 1日以内
(32件のレビュー)
4.2
thanhhungqb

Dear sir, I have read your requirement carefully and interested in it. I am expert on data entry, data scrapping and process data. I usually to do it automatic. For your project, I think I can automatic by a prog もっと

$126 USD 3日以内
(15件のレビュー)
3.6
sunil440

Good day! I would like to submit my application as Data Collector. I shall be pleased to consider me as a qualified applicant.I believe my qualifications would make me an outstanding asset to your organization. I woul もっと

$100 USD 3日以内
(16件のレビュー)
3.4
GurpreetSngh220

Hi, I am very much interested in your project. I would like to discuss with you more regarding the project. You can rely on me because i am serious on my work and not sitting here to waste time (both of us). you もっと

$188 USD 5日以内
(7件のレビュー)
3.2
FernandoCanizo

Hello, I'm interested, I'd to give it a try. Can you provide a sample file so I can send you my attempt? No compromises. Also send me any other information I should need to build a proper processing script, I'm t もっと

$30 USD 2日以内
(2件のレビュー)
3.4
inoussakabore

Hi i have almready do this kind of job. You can see that in my profile. I am ready to start it. I can do that in about one week.

$250 USD 7日以内
(3件のレビュー)
3.3
igors233

Greetings, I'm professional software developer with 15+ years of experience in similiar tasks. I will produce a standalone exe (no dependecies) that will take as input given txt file (it could be downloaded automatical もっと

$147 USD 10日以内
(4件のレビュー)
3.5
szymszteinsl

Hi! I am professional C/C++/C#/Java programmer. I can do this project with highest quality, Best Regards, Szymszteinsl

$144 USD 3日以内
(2件のレビュー)
3.3