Data Mining - Decision Tree Classifier and Pattern Mining.

Completed, posted 4 years ago, paid on delivery

Develop a single [login to view URL] file to perform the following 3 tasks. Each task is invoked with a different set of command-line arguments.

Task 1: Build a decision tree classifier for an input dataset in the form of redwine.csv. Assume that all attributes except the class are numerical and that the last attribute is the class. The output should be written to a file called [login to view URL], describing the accuracy of the classifier when tested on the input dataset and the parameters used to build the classifier. [login to view URL] will perform this task when given these 2 command-line arguments: 1, [login to view URL]; here [login to view URL] is the name of an input dataset.

Task 2: Find the top 10 patterns from an input dataset D similar in form to the itemized data file produced in the previous project (refer to [login to view URL]), except that the class values may be categorical. Assume the values of the last column are the classes. For convenience, we refer to one class as P and the other as N. If the input file uses other class names, treat the lexically smaller as P and the other as N. (Assume there are just two classes.) The top 10 patterns should contain the top 5 single-item patterns plus the top 5 non-singleton patterns. The expectation is that you can find the correct top 5 single-item patterns, but you might not be able to find the true top 5 among all non-singleton patterns. The ranking is determined by the function GR(X) * sup(X,D), where X is a pattern and GR(X) is defined as (sup(X,P)+1)/(sup(X,N)+1). The top 10 patterns must be written to an output file called top10EPs.csv. For each of the top 10 patterns X={x1,x2,…,xk}, there should be a line in this file containing the following:

GR(X) * sup(X,D), GR(X), sup(X,D), x1,x2,…,xk
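As a rough illustration of the ranking function, the sketch below scores candidate patterns and keeps the best k. The class name EpScore and its methods are hypothetical, not part of the brief. It assumes that sup denotes an absolute count and that, with only two classes, sup(X,D) = sup(X,P) + sup(X,N); if the course material defines sup as a relative frequency, the arithmetic would need adjusting.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical helper: scores a pattern by GR(X) * sup(X,D). */
final class EpScore {
    final String items;  // pattern items joined by commas, e.g. "x1,x2"
    final double gr;     // GR(X) = (sup(X,P)+1) / (sup(X,N)+1)
    final double supD;   // sup(X,D); ASSUMPTION: a count equal to supP + supN
    final double score;  // GR(X) * sup(X,D)

    EpScore(String items, int supP, int supN) {
        this.items = items;
        this.gr = (supP + 1.0) / (supN + 1.0);
        this.supD = supP + supN;
        this.score = gr * supD;
    }

    /** One output line in the order: score, GR, sup, items. */
    String toCsvLine() {
        return score + "," + gr + "," + supD + "," + items;
    }

    /** Returns the k highest-scoring candidates, best first. */
    static List<EpScore> topK(List<EpScore> candidates, int k) {
        List<EpScore> sorted = new ArrayList<>(candidates);
        sorted.sort(Comparator.comparingDouble((EpScore e) -> e.score).reversed());
        return new ArrayList<>(sorted.subList(0, Math.min(k, sorted.size())));
    }
}
```

Calling topK once on the single-item candidates and once on the non-singleton candidates, each with k = 5, would give the two halves of the required top 10.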

[login to view URL] will perform this task when given these 2 command-line arguments: 2, [login to view URL]; here [login to view URL] is the name of an input dataset.

Task 3: Given an input dataset D3 similar in form to [login to view URL] and a pattern X, build a decision tree classifier on mds(X,D3). Recall that mds(X,D) is the subset of instances in D satisfying all single-attribute conditions in X. The output is written to a file called [login to view URL], describing the accuracy of the classifier when tested on mds([login to view URL]) as a test dataset, the number of instances in mds([login to view URL]), and the parameters used to build the classifier. [login to view URL] should perform this task when given these 3 command-line arguments: 3, [login to view URL], X.

Here [login to view URL] is the name of an input dataset and X is a pattern. When specifying patterns, we will refer to the attributes as A1,…,An, in the left to right order as they appear in the input data. Any pattern will be described in the following form:

Ai1 CMP Val1, Ai2 CMP Val2, …, Aik CMP Valk

Here CMP is either < or >, Ai1, Ai2, …, Aik are attributes, and Val1, …, Valk are numerical values. A pattern can contain one or more attributes. An example pattern is A3<5, A6>4.
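The pattern syntax and the definition of mds(X,D) can be sketched as follows. This is a minimal illustration, not the required implementation: PatternFilter, Condition, parse and mds are hypothetical names, rows are assumed to be already loaded as arrays of doubles holding only the numerical attributes, and A1 is mapped to array index 0.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: parse a pattern such as "A3<5, A6>4" and compute mds(X,D). */
final class PatternFilter {

    /** One single-attribute condition: 0-based attribute index, comparator, threshold. */
    static final class Condition {
        final int attr;
        final char cmp;
        final double value;

        Condition(int attr, char cmp, double value) {
            this.attr = attr;
            this.cmp = cmp;
            this.value = value;
        }

        boolean matches(double[] row) {
            return cmp == '<' ? row[attr] < value : row[attr] > value;
        }
    }

    /** Splits the pattern on commas and parses each "Ai CMP Val" condition. */
    static List<Condition> parse(String pattern) {
        List<Condition> out = new ArrayList<>();
        for (String part : pattern.split(",")) {
            String s = part.trim();
            int pos = Math.max(s.indexOf('<'), s.indexOf('>'));
            if (!s.startsWith("A") || pos < 2) {
                throw new IllegalArgumentException("Bad condition: " + s);
            }
            int attr = Integer.parseInt(s.substring(1, pos).trim()) - 1; // A1 -> index 0
            double value = Double.parseDouble(s.substring(pos + 1).trim());
            out.add(new Condition(attr, s.charAt(pos), value));
        }
        return out;
    }

    /** Keeps only the rows that satisfy every condition in the pattern. */
    static List<double[]> mds(List<Condition> pattern, List<double[]> data) {
        List<double[]> subset = new ArrayList<>();
        for (double[] row : data) {
            boolean all = true;
            for (Condition c : pattern) {
                if (!c.matches(row)) {
                    all = false;
                    break;
                }
            }
            if (all) {
                subset.add(row);
            }
        }
        return subset;
    }
}
```

Because < and > are redirection operators in most shells, a pattern passed on the command line generally has to be quoted, for example "A3<5, A6>4".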

All outputs are written to the folder where the jar file is located.
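The three invocations described above (task 1 and task 2 each take a dataset name, task 3 takes a dataset name and a pattern) could be validated and dispatched along these lines. TaskArgs is a hypothetical helper, and rejoining a pattern that arrives split across several arguments is an assumption about how the program might be called, not something the brief specifies.

```java
import java.util.Arrays;

/** Hypothetical helper: validates the command-line arguments for tasks 1, 2 and 3. */
final class TaskArgs {
    final int task;        // 1, 2 or 3
    final String dataset;  // name of the input dataset
    final String pattern;  // only set for task 3, otherwise null

    private TaskArgs(int task, String dataset, String pattern) {
        this.task = task;
        this.dataset = dataset;
        this.pattern = pattern;
    }

    static TaskArgs parse(String[] args) {
        if (args.length < 2) {
            throw new IllegalArgumentException("Usage: <task> <dataset> [pattern]");
        }
        int task;
        try {
            task = Integer.parseInt(args[0]);
        } catch (NumberFormatException e) {
            throw new IllegalArgumentException("Task must be 1, 2 or 3", e);
        }
        if (task == 1 || task == 2) {
            if (args.length != 2) {
                throw new IllegalArgumentException("Task " + task + " takes exactly one dataset");
            }
            return new TaskArgs(task, args[1], null);
        }
        if (task == 3) {
            if (args.length < 3) {
                throw new IllegalArgumentException("Task 3 needs a dataset and a pattern");
            }
            // ASSUMPTION: a pattern may arrive split on spaces, so rejoin the remaining arguments.
            String pattern = String.join(" ", Arrays.copyOfRange(args, 2, args.length));
            return new TaskArgs(3, args[1], pattern);
        }
        throw new IllegalArgumentException("Task must be 1, 2 or 3");
    }
}
```

A main method would then switch on the task field and call the code for the corresponding task.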

Correctness, efficiency, and quality of the findings (for Task 2) will be important factors for marking and evaluation.

You should use Weka for classification and you can use any frequent pattern mining programs written in Java in your program. You cannot use emerging/contrast pattern mining programs written by others; of course you can implement your own.

Include a [login to view URL] file indicating which frequent pattern mining program you are using (state who developed it and give the source URL).

All the above information is included in [login to view URL]

Data Mining, Data Processing, Data Science, Data Analytics, Java

Project ID: #24443923

About the project

11 proposals, remote project, active 4 years ago

Awarded to:

apr159

Hello, my name is Alejandro Pasos. I have an MS in computer science from the University of California. Right now I am working with Java and Weka, so I am clear about what is required for your task. I am ready to start. I read your …

$250 USD in 14 days
(46 reviews)
5.6

11 freelancers are bidding on this job, at an average of $195.

dvcontact

Hi, would you like to see a quick demo of your decision tree classifier created for pattern mining and analysis before you award the project? You can consider it professionally done in a few hours. I can continue tell …

$150 USD in 5 days
(34 reviews)
6.1
anchalsingh0005

Hello, hope you are doing well. I have checked all your requirements, and we are able to do this and deliver on time. I have 5 years of experience in this type of work, so I believe we can do that work with your support. Reg …

$100 USD in 7 days
(30 reviews)
4.9
MohammedSulhi

I am an experienced Data Scientist and Machine Learning Engineer. Deep learning, artificial intelligence, machine learning, data structures, and algorithms are my major fields. I finished specializations on Data Scienc …

$100 USD in 2 days
(6 reviews)
3.5
hayat38402

I have extensive experience in the domains of: Neural Network/Natural Language Processing; Machine Learning/Data Mining; Deep Learning and Computer Vision; Image Recognition & Artificial Intelligence; AI text a …

$280 USD in 7 days
(2 reviews)
2.8
merinsinha

Hi, I am a senior developer. For professional work, hire me. I have 10+ years of experience in this field. I can give good quality work. I have read the guidelines of your work. I believe that I can provide you the best qualit …

$250 USD in 5 days
(2 reviews)
0.4
ajcsgi82893181

plz give me a chance. plzzzzz

$155 USD in 3 days
(0 reviews)
0.0
sednalondon

My proposal is founded on my long experience with C++ code and the collection of Machine Learning tools that I have to apply to this kind of problem. Classification task using Supervised/Unsupervised Machine Learning …

$290 USD in 3 days
(0 reviews)
0.0
DjSalman

I am a Java expert and have plenty of experience in Java, data processing, data mining, data science, and data analytics kinds of work. I assure you that I can do this work, as I have recently done this sort of work. Looki …

$218 USD in 7 days
(0 reviews)
0.0
ranjithkumargkp

I have good experience in the machine learning field, and believe me, I'm going to deliver to you on time. My skills are SQL, Python, R, SAS, and Tableau for visualization. I have good command over machine learning algorithms o …

$105 USD in 15 days
(0 reviews)
0.0