NLTK document classifier with big dataset

* Output should be saved as a .pickle file

* I should be able to input the percentage of documents used for testing and training

* The classifier can be NaiveBayes

* I want to be able to use Ngrams

* I want some way to control noise (such as minimum information gain)

* The classifier should output the most informative features as well as the accuracy rate

* The classifier should be capable of dealing with large datasets (over 350 MB and 100,000 text files)

I can only pay for results. You will show me screenshots of your progress and I will then release a milestone.

Please get back to me with budget and deadline.

Compétences : Langage Naturel, Python

Voir plus : test case document adding large integers, generate word document dataset using aspose, translate large french document, pages spell check large document, large photoshop document comic book, proofread large word document, large sample dataset association rules apriori, potential large big huge project, sharepoint big document size

Concernant l'employeur :
( 38 commentaires ) EDINBURGH, United Kingdom

N° du projet : #8518455

13 freelance font une offre moyenne de £726 pour ce travail


Hi, I have read your post and understood your requirement. I have great experience in handling /Python/Django/PHP/MySQL/HTML5/jQuery/Wordpress/Magento/Joomla/Drupal/AngularJS/[url removed, login to view] Plus

824 £ GBP en 12 jours
(2 Commentaires)

I am an NLTK expert. I have implemented classifiers with big datasets in the past. I am python2 and python3 expert. I would need a sample of the input and expected output so that I can provide a more accurate effort Plus

555 £ GBP en 10 jours
(1 Commentaire)

I will guarantee delivery within 10 days with proof of work prior to payment for £1000. And you'll be able to know that it's finally done, and you can put an end to the ever growing costs and weeks.

500 £ GBP en 10 jours
(1 Commentaire)

I am experienced python developer with over 5 years experience in developing applications I use python for machine learning, nlp and data mining tasks. I have experience in processing large datasets. Infact has applie Plus

750 £ GBP en 10 jours
(2 Commentaires)

Hello! We are software developers and have strong experience with similar projects, we would love to work on it , lets discus it further via chat Thanks Kind regards Robin

736 £ GBP en 10 jours
(0 Commentaires)

★★ Machine learning expert and software architect based in the United States - feel free to message me to discuss requirements prior to the project being awarded. ☞ How I Can Help You: I have built text classifying Plus

1015 £ GBP en 14 jours
(0 Commentaires)

Hi, I am Sourabh Jain, I am interested in the project. I am an experienced programmer, I have written a full scale molecular simulation tool using python. I have worked on AI projects involving computer vision and ma Plus

700 £ GBP en 14 jours
(0 Commentaires)

Dear Sir. We claim to get it done perfectly for you EXACTLY in the way you want it - Kindly give we a chance and we will prove myself - Ready to prove our words, let's get it done right away and I mean RIGHT AWAY !! Plus

1263 £ GBP en 30 jours
(0 Commentaires)

Most of the features, you are required, may by implemented in just few days. But there are two features, that escape this rule: addining new parameters to output and huge file processing. I have expirience in both of t Plus

555 £ GBP en 15 jours
(0 Commentaires)

Hi, I can do this project i have in depth knowledge in python. Please let me know more details about the project.

666 £ GBP en 10 jours
(1 Commentaire)

If the size of each file is large (~350 MB) and the number of files is growing, we can think of using Spark (Python/Scala). Yes, you pay for the results rather than for hours spent. ========= We are a team of Data Plus

750 £ GBP en 15 jours
(0 Commentaires)

Hi there, I am a researcher with a PhD in the fields of machine learning and natural language processing. Who is also a kaggler, once listed in the top 500 among the world wide kagglers. Please have a look at my profil Plus

555 £ GBP en 10 jours
(0 Commentaires)

Hi, I have experience with statistics and data mining , using NLTK, Weka. I have published some papers in the area ([url removed, login to view]). Could you send me more details about this project?

555 £ GBP en 10 jours
(0 Commentaires)

Regular contestant of machine learning competition/challenge platform Kaggle and crowdanalytix & top ranked. Independently worked on many machine learning projects mainly in recommendation system, Forecasting, text min Plus

750 £ GBP en 14 jours
(0 Commentaires)