En cours

Java Text Classification II

I require a Java command line program that does a binary classification from a paragraphs of text using a Support Vector Machine (SVM), K-Nearest Neighbor, or other machine learning approach.

I would like to replicate this but using Java.

[url removed, login to view]

[url removed, login to view]

I would like two examples using two different SDKs.

[url removed, login to view]

[url removed, login to view] (but not using their CsvIterator reader)

The program should have the following parts (clearly commented in the code):

Train on 70% of the Data, Test on 30% (random samples)

Can also segment Training/Test data by Year column (i.e. Train on years 2000-2010, Test on 2011-2017)

Clean the Corpus

Remove Punctuation

Strip Whitespace

Convert to Lowercase

Remove Stopwords

Word Stemming

Create Term Document Matrix

Remove Sparse Terms

Apply Support Vector Machine (SVM)/K-Nearest Neighbor Algorithm

Print Results

The program should have the following parameters:

Filename (CSV filename)

Algorithm (SVM or KNN)

Outcome Column Name

Text Column Name

Example Command Line:

Java -jar TextClassify <filename> <SVM | KNN> <OutcomeColumn> <TextColumn>

I have attached a sample CSV file (with 3 outcome columns to test on) and can create more if needed.

I would like the program to loop through the CSV file - as I will connect it to a JDBC interface later. (so don’t use the Mallet CsvIterator reader)

Final deliverables are:

Jar file(s)

Source Code with clean code and basic documentation (i.e. comments in the source code). Preference is IntelliJ with Maven.

Compétences : Java

en voir plus : java text classification, java code text classification, text classification java, bag of words algorithm in java, naive bayes classifier java source code, weka java text classification, svm text classification java, classification of java language, text classification java code, text classification java library, naive bayes text classification java code, php, java, mysql, software development, data processing, c# programming, software architecture, c++ programming, data science

Concernant l'employeur :
( 71 commentaires ) Calgary, Canada

Nº du projet : #15014502

Décerné à:

kappDev

Thanks for the project :) Relevant Skills and Experience Java Proposed Milestones $35 USD - project with jatecs $35 USD - project with mallet

%selectedBids___i_sum_sub_7% %project_currencyDetails_sign_sub_8% USD en 3 jours
(4 Commentaires)
2.2

15 freelance font une offre moyenne de $167 pour ce travail

NovaSofts

Hello Sir/ Ma’am We are a group of Software Engineers having more than 5+ years of experience. Expert in java, C, C++. Please check our profile for reference. Thank you Relevant Skills and Experience java , C++ Prop Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(178 Commentaires)
7.0
%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(45 Commentaires)
6.5
yassine008

A proposal has not yet been provided

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 5 jours
(38 Commentaires)
6.5
%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(15 Commentaires)
5.7
iridescent2x15

I m software engineer. I have read the description and I would like to work for you. For further details please inbox me. Thank you Relevant Skills and Experience java Proposed Milestones $350 USD - m

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(44 Commentaires)
5.6
IMdaystar

Relevant Skills and Experience Java,Software Architecture Proposed Milestones $155 USD - as work I want to char with you

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(5 Commentaires)
4.9
%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(8 Commentaires)
3.9
dungtpvw

Hello guy. I've experienced in Machine learning so I did a lot of project like this. So please contact me directly for more discussion. thanks Relevant Skills and Experience Machine learning, java Proposed Milestones Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(12 Commentaires)
3.8
enobil

Hi, I made text classification before on hackerrank. I used weka library and support vector machine. I know the data preparation steps you mentioned and also I have good experience in Java. Relevant Skills and Experie Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(1 Évaluation)
2.3
lukatchumburidze

Once you will reply I will start learning algorithms in Youtube links which you have provided, for this and then converting logic into Java I will need 3 days, I am sure it will be enough for me. Relevant Skills and E Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(1 Évaluation)
2.0
afifa427

Hi sir im software engineering and i have done similar project successfully. i can show you sample right now Relevant Skills and Experience i know java and machine learning and all skills that are required for your ta Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 10 jours
(2 Commentaires)
1.8
dharmjitsingh

I have placed bid on this project as I have good experience in Java/Machine Learning techniques/StandfordNLP Relevant Skills and Experience Java/Machine Learning techniques/StandfordNLP Proposed Milestones $20 USD - Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(3 Commentaires)
1.8
alhassanlatif

I will develop the model in Java and also work on the pre-processing of the data set before applying the classification step. Please send me more information about the data set Relevant Skills and Experience Hey, I am Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 4 jours
(1 Évaluation)
0.5
MaryumAkhter1

i can develop this for you Relevant Skills and Experience i did mental health project using python interaction, painting tool android, quiz application handling huge database and much more! Proposed Milestones $150 U Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% USD en 3 jours
(0 Commentaires)
0.0