NLTK document classifier with big dataset

* Output should be saved as a .pickle file

* I should be able to input the percentage of documents used for testing and training

* The classifier can be NaiveBayes

* I want to be able to use Ngrams

* I want some way to control noise (such as minimum information gain)

* The classifier should output the most informative features as well as the accuracy rate

* The classifier should be capable of dealing with large datasets (over 350 MB and 100,000 text files)

I can only pay for results. You will show me screenshots of your progress and I will then release a milestone.

Please get back to me with budget and deadline.

Compétences : Langage Naturel, Python

Voir plus :

Concernant l'employeur :
( 38 commentaires ) EDINBURGH, United Kingdom

N° du projet : #8518455

13 freelance ont fait une offre moyenne de 726 £ pour ce travail


Hi, I have read your post and understood your requirement. I have great experience in handling /Python/Django/PHP/MySQL/HTML5/jQuery/Wordpress/Magento/Joomla/Drupal/AngularJS/node.js/CSS3/Java/Javascript/iOS/Andr Plus

824 £ GBP en 12 jours
(2 Commentaires)

I am an NLTK expert. I have implemented classifiers with big datasets in the past. I am python2 and python3 expert. I would need a sample of the input and expected output so that I can provide a more accurate effort Plus

555 £ GBP en 10 jours
(1 Commentaire)

I will guarantee delivery within 10 days with proof of work prior to payment for £1000. And you'll be able to know that it's finally done, and you can put an end to the ever growing costs and weeks.

500 £ GBP en 10 jours
(1 Commentaire)

I am experienced python developer with over 5 years experience in developing applications I use python for machine learning, nlp and data mining tasks. I have experience in processing large datasets. Infact has applie Plus

750 £ GBP en 10 jours
(2 Commentaires)

Hello! We are software developers and have strong experience with similar projects, we would love to work on it , lets discus it further via chat Thanks Kind regards Robin

736 £ GBP en 10 jours
(0 Commentaires)

★★ Machine learning expert and software architect based in the United States - feel free to message me to discuss requirements prior to the project being awarded. ☞ How I Can Help You: I have built text classifying Plus

1015 £ GBP en 14 jours
(0 Commentaires)

Hi, I am Sourabh Jain, I am interested in the project. I am an experienced programmer, I have written a full scale molecular simulation tool using python. I have worked on AI projects involving computer vision and ma Plus

700 £ GBP en 14 jours
(0 Commentaires)

Dear Sir. We claim to get it done perfectly for you EXACTLY in the way you want it - Kindly give we a chance and we will prove myself - Ready to prove our words, let's get it done right away and I mean RIGHT AWAY !! Plus

1263 £ GBP en 30 jours
(0 Commentaires)

Most of the features, you are required, may by implemented in just few days. But there are two features, that escape this rule: addining new parameters to output and huge file processing. I have expirience in both of t Plus

555 £ GBP en 15 jours
(0 Commentaires)

Hi, I can do this project i have in depth knowledge in python. Please let me know more details about the project.

666 £ GBP en 10 jours
(1 Commentaire)

If the size of each file is large (~350 MB) and the number of files is growing, we can think of using Spark (Python/Scala). Yes, you pay for the results rather than for hours spent. ========= We are a team of Data Plus

750 £ GBP en 15 jours
(0 Commentaires)

Hi there, I am a researcher with a PhD in the fields of machine learning and natural language processing. Who is also a kaggler, once listed in the top 500 among the world wide kagglers. Please have a look at my profil Plus

555 £ GBP en 10 jours
(0 Commentaires)

Hi, I have experience with statistics and data mining , using NLTK, Weka. I have published some papers in the area (http://www.jonathasmagalhaes.com/publicaes). Could you send me more details about this project?

555 £ GBP en 10 jours
(0 Commentaires)

Regular contestant of machine learning competition/challenge platform Kaggle and crowdanalytix & top ranked. Independently worked on many machine learning projects mainly in recommendation system, Forecasting, text min Plus

750 £ GBP en 14 jours
(0 Commentaires)