I have a data set of news headlines extracted from various online news sources.
Since the data comes from different sources, it contains duplicates that convey the same information, so I want you to build an algorithm that clusters them into one.
For example, data from website A: Narendra modi favorite is tajmahal.
Data from website B: 7th world wonder Taj mahal is Narendra modi's favorite.
Both sentences mean the same thing, but they produce an additional record. I want to cluster them into one.
I tried k-means, but I don't want to sit and work out the required number of clusters every time; I want the algorithm to decide it automatically.
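One way to avoid fixing the number of clusters up front is to link any two headlines whose similarity exceeds a threshold and treat each connected group as one cluster. The sketch below is only an illustration, not a finished solution: it assumes character-trigram Jaccard similarity (purely lexical, so it will miss paraphrases that share few words), and the function names and the 0.3 threshold are hypothetical choices that would need tuning on real data.

```python
import re
from itertools import combinations


def trigrams(text):
    """Set of character trigrams, ignoring case, spaces and punctuation."""
    s = re.sub(r"[^a-z0-9]", "", text.lower())
    return {s[i:i + 3] for i in range(len(s) - 2)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 if either set is empty."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_headlines(headlines, threshold=0.3):
    """Group headlines whose trigram similarity is >= threshold.

    The threshold is an assumed value; the number of clusters is not
    supplied, it falls out of which headlines end up linked.
    """
    grams = [trigrams(h) for h in headlines]
    parent = list(range(len(headlines)))  # union-find forest

    def find(i):
        # Follow parent links to the root, shortening the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Link every pair that is similar enough.
    for i, j in combinations(range(len(headlines)), 2):
        if jaccard(grams[i], grams[j]) >= threshold:
            parent[find(i)] = find(j)

    # Collect headlines by the root of their group.
    clusters = {}
    for i, h in enumerate(headlines):
        clusters.setdefault(find(i), []).append(h)
    return list(clusters.values())
```

Called on the two example headlines above plus an unrelated one, `cluster_headlines` should put the two Taj Mahal headlines in one group and leave the unrelated headline on its own. The pairwise comparison is quadratic in the number of headlines, so a large data set would need a faster candidate search or a density-based method with a similar threshold idea.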
8 freelancers are bidding an average of ₹16805 for this job
Greetings sir, I can help you, and your 100% satisfaction is assured if you allow me to serve. I can do this task as per your requirements. Relevant Skills and Experience: I have more than 5 years of experience with exce...
I can assist you in finding similar/duplicate news articles using NLP in R. Relevant Skills and Experience: I have experience using R for NLP and recommender engines, which also use the concept of similarity in text. Prop...
This is a very interesting project which requires the application of intent analysis and word2vec; I would love to take this up. More about me: Hello, I am Prajwal Bhatt, a final year undergraduate student at IIT...
Hello, I am an analytics consultant with more than 6 years of experience working with clients of 2 major consulting firms. I have worked in data analytics all through my career. Relevant Skills and Experience: With over 5...
Data Scientist with prior experience in clustering using DBSCAN and K-Means. Relevant Skills and Experience: Python, Tensorflow, Neural Networks, 3 years. Proposed Milestones: ₹15000 INR - Model training and verificatio...
I am a professional machine learning developer looking to expand my skill set. I know how to solve this problem and it would be a fun project to do during this weekend. Relevant Skills and Experience: NLP, Computer Visio...