
Closed
Paid on delivery
I need to submit an end-to-end machine-learning classification model as part of my coursework. I will provide the assignment brief and any data-handling instructions the instructor has given; everything else, from preprocessing through final evaluation, needs to be completed for me. Follow the instructions below:

Assignment 2025 - 2026
Title: Online News Popularity Data Set
Training data file: [login to view URL], [login to view URL]
Test data file: [login to view URL]

The data for this assignment refer to characteristics of the popular website Mashable ([login to view URL]). Hence, this dataset does not share the original content but some statistics associated with it. The original content can be publicly accessed and retrieved using the provided URLs. All sites and related data were downloaded on January 8, 2015. The relative performance values were estimated by the authors using a Random Forest classifier with rolling windows as the assessment method; see Fernandes et al. (2015) for more details on how the relative performance values were set. The main variable of the study is the number of shares, which measures the popularity of the site/post. We are interested in identifying the ingredients of a successful post and what it takes for a post to go viral. Each student will work with a random sub-sample of 3000 observations, used for training their model and for inference. All students will use a common evaluation/test dataset of 10000 observations.

1. You should first do some exploratory data analysis. Visualizing the data should give you some insight into certain particularities of this dataset. Pairwise comparisons will also help you learn about the associations implied by the data.
2. The main aim is to identify the best model for predicting the popularity of a post. Select the appropriate features for your model. Be careful, your model should not be overparameterized.
3. Check the assumptions of the model and revise your procedure.
4. Use 10-fold cross-validation to select your model and assess the out-of-sample predictive ability of the model.
5. Use the test dataset to select your model and assess the out-of-sample predictive ability of the model.
6. Compare results obtained by different methods under 2, 3 and 4.
7. Select your final model and features and justify your choice.
8. Interpret the parameters and the predictive performance of the final model.
9. Describe the typical profile of a post and the characteristics of a viral post.
10. Write a report summarizing your results (see attached directions for this).

Source:
✓ Kelwin Fernandes - INESC TEC, Porto, Portugal/Universidade do Porto, Portugal
✓ Pedro Vinagre - ALGORITMI Research Centre, Universidade do Minho, Portugal
✓ Paulo Cortez - ALGORITMI Research Centre, Universidade do Minho, Portugal
✓ Pedro Sernadela - Universidade de Aveiro

Relevant Paper:
K. Fernandes, P. Vinagre and P. Cortez. (2015). A Proactive Intelligent Decision Support System for Predicting the Popularity of Online News. Proceedings of the 17th EPIA 2015 - Portuguese Conference on Artificial Intelligence, September, Coimbra, Portugal.

Number of Attributes: 61 (58 explanatory attributes, 2 non-explanatory, 1 goal field response)

Attribute Information:
0. url: URL of the article (non-explanatory)
1. timedelta: Days between the article publication and the dataset acquisition (non-explanatory)
2. n_tokens_title: Number of words in the title
3. n_tokens_content: Number of words in the content
4. n_unique_tokens: Rate of unique words in the content
5. n_non_stop_words: Rate of non-stop words in the content
6. n_non_stop_unique_tokens: Rate of unique non-stop words in the content
7. num_hrefs: Number of links
8. num_self_hrefs: Number of links to other articles published by Mashable
9. num_imgs: Number of images
10. num_videos: Number of videos
11. average_token_length: Average length of the words in the content
12. num_keywords: Number of keywords in the metadata
13. data_channel_is_lifestyle: Is data channel 'Lifestyle'?
14. data_channel_is_entertainment: Is data channel 'Entertainment'?
15. data_channel_is_bus: Is data channel 'Business'?
16. data_channel_is_socmed: Is data channel 'Social Media'?
17. data_channel_is_tech: Is data channel 'Tech'?
18. data_channel_is_world: Is data channel 'World'?
19. kw_min_min: Worst keyword (min. shares)
20. kw_max_min: Worst keyword (max. shares)
21. kw_avg_min: Worst keyword (avg. shares)
22. kw_min_max: Best keyword (min. shares)
23. kw_max_max: Best keyword (max. shares)
24. kw_avg_max: Best keyword (avg. shares)
25. kw_min_avg: Avg. keyword (min. shares)
26. kw_max_avg: Avg. keyword (max. shares)
27. kw_avg_avg: Avg. keyword (avg. shares)
28. self_reference_min_shares: Min. shares of referenced articles in Mashable
29. self_reference_max_shares: Max. shares of referenced articles in Mashable
30. self_reference_avg_sharess: Avg. shares of referenced articles in Mashable
31. weekday_is_monday: Was the article published on a Monday?
32. weekday_is_tuesday: Was the article published on a Tuesday?
33. weekday_is_wednesday: Was the article published on a Wednesday?
34. weekday_is_thursday: Was the article published on a Thursday?
35. weekday_is_friday: Was the article published on a Friday?
36. weekday_is_saturday: Was the article published on a Saturday?
37. weekday_is_sunday: Was the article published on a Sunday?
38. is_weekend: Was the article published on the weekend?
39. LDA_00: Closeness to LDA topic 0
40. LDA_01: Closeness to LDA topic 1
41. LDA_02: Closeness to LDA topic 2
42. LDA_03: Closeness to LDA topic 3
43. LDA_04: Closeness to LDA topic 4
44. global_subjectivity: Text subjectivity
45. global_sentiment_polarity: Text sentiment polarity
46. global_rate_positive_words: Rate of positive words in the content
47. global_rate_negative_words: Rate of negative words in the content
48. rate_positive_words: Rate of positive words among non-neutral tokens
49. rate_negative_words: Rate of negative words among non-neutral tokens
50. avg_positive_polarity: Avg. polarity of positive words
51. min_positive_polarity: Min. polarity of positive words
52. max_positive_polarity: Max. polarity of positive words
53. avg_negative_polarity: Avg. polarity of negative words
54. min_negative_polarity: Min. polarity of negative words
55. max_negative_polarity: Max. polarity of negative words
56. title_subjectivity: Title subjectivity
57. title_sentiment_polarity: Title polarity
58. abs_title_subjectivity: Absolute subjectivity level
59. abs_title_sentiment_polarity: Absolute polarity level
60. shares: Number of shares (target response)

For more details concerning the variables see file [login to view URL]
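The 10-fold cross-validation named in step 4 of the brief can be sketched in a few lines. This is only an illustration under stated assumptions: the labels are synthetic stand-ins for a 3000-row sample, and `k_fold_indices`, `cross_validate`, `fit_majority`, and `predict_majority` are hypothetical helper names, not part of the assignment. The "model" is a majority-class baseline that ignores features. In practice a library routine such as scikit-learn's `KFold` or `cross_val_score` would normally replace the hand-written split.

```python
import random
from statistics import mean

def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle indices 0..n_samples-1 and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at i, so sizes differ by at most 1.
    return [indices[i::k] for i in range(k)]

def cross_validate(labels, fit, predict, k=10, seed=0):
    """Return the held-out accuracy of each fold for a model given as fit/predict callables."""
    scores = []
    for held_out in k_fold_indices(len(labels), k, seed):
        held_out_set = set(held_out)
        train = [i for i in range(len(labels)) if i not in held_out_set]
        model = fit(train)  # fit only on the other k-1 folds
        correct = sum(predict(model, i) == labels[i] for i in held_out)
        scores.append(correct / len(held_out))
    return scores

# ASSUMPTION: synthetic binary labels standing in for a 3000-observation training sample.
rng = random.Random(42)
labels = [1 if rng.random() < 0.5 else 0 for _ in range(3000)]

def fit_majority(train_idx):
    """Hypothetical baseline: remember the majority class of the training folds."""
    ones = sum(labels[i] for i in train_idx)
    return 1 if ones * 2 >= len(train_idx) else 0

def predict_majority(model, i):
    """Predict the remembered majority class for every observation."""
    return model

fold_scores = cross_validate(labels, fit_majority, predict_majority, k=10)
print(f"mean 10-fold accuracy: {mean(fold_scores):.3f}")
```

Each observation is held out exactly once, so the mean of the ten fold scores estimates out-of-sample accuracy without touching the common test set.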
Project ID: 40182387
64 proposals
Remote project
Active 2 mos ago
64 freelancers are bidding on average €133 EUR for this job

Hi Angelos, Thank you for considering my proposal for the Academic Classification Model Development project. With over 8 years of experience in Research Writing, I have the expertise to assist you in developing an end-to-end machine-learning classification model as per your coursework requirements. I have carefully reviewed the project details and would like to discuss further in chat to understand your specific needs and provide a tailored solution. Your project on predicting the popularity of online news posts seems intriguing, and I am eager to collaborate with you to achieve the desired outcomes. Looking forward to connecting with you to delve deeper into the project requirements. Regards.
€30 EUR in 1 day
9.1

I am a Ph.D. writer with 12+ years of experience and would have no problem providing you with the HIGH-QUALITY work you need. All my work is 100% my own and never copied, spun, or plagiarized, so you won’t have to worry about that at all. My three core values are EFFICIENCY, QUALITY, and EXPERTISE. I will deliver this work within the stipulated DEADLINE, with a guarantee of NON-PLAGIARIZED work. Hire me, and you will get value for your money. Thank you.
€30 EUR in 1 day
7.6

Hello, I have carefully reviewed your project requirements and understand that you need a full end-to-end machine learning classification model for predicting online news popularity, including preprocessing, model training, evaluation, and reporting. I can confidently handle this assignment and deliver a technically rigorous solution. My approach begins with exploratory data analysis using Python’s pandas and matplotlib/seaborn libraries to identify trends, correlations, and anomalies. I will perform feature engineering and scaling, ensuring that only meaningful predictors are included to avoid overfitting. For modeling, I will implement multiple algorithms including Random Forest, Gradient Boosting, and Logistic Regression using scikit-learn, with 10-fold cross-validation to evaluate out-of-sample performance. Hyperparameter tuning will optimize predictive accuracy. The test dataset will validate the final model, and I will interpret feature importance, model parameters, and prediction quality. Finally, I will compile a clear report summarizing the methodology, model selection rationale, and characteristics of posts likely to go viral. Would you like me to include visualizations of feature importance and distribution patterns in the report to enhance interpretation? Let's chat and discuss further! Best Regards, Aneesa
€175 EUR in 1 day
7.6

Hello, I trust you're doing well. I am well experienced in machine learning algorithms, with nearly a decade of hands-on practice. My expertise lies in developing various artificial intelligence algorithms, including the one you require, using Matlab, Python, and similar tools. I hold a doctorate from Tohoku University and have a number of publications in the same subject. My portfolio, which showcases my past work, is available for your review. Your project piqued my interest, and I would be delighted to be part of it. Let's connect to discuss in detail. Warm regards. please check my portfolio link: https://www.freelancer.com/u/sajjadtaghvaeifr
€500 EUR in 7 days
7.9

As a highly proficient Biostatistician and Data Scientist, I possess the exact skill set to execute your academic classification model project excellently. With seven years’ experience in handling data from the initial stage to final evaluation, as you require, I am familiar with multiple data analysis tools and languages, Python being one of my fortes. In Python specifically, I am skilled in machine learning (ML) as well as deep learning and natural language processing (NLP) models, which are all crucial aspects of your project. In consulting work on ML algorithms for prediction models, I have used random forest classifiers like the one referenced in your dataset and delivered the accurate correlations necessary for an insightful analysis. Guided by the project's 10-step directive, I will carry out rigorous exploratory data analysis and feature selection/preprocessing, evaluate models using strict 10-fold cross-validation, assess the out-of-sample predictive ability of the final model using both training and test datasets, and present a detailed report interpreting the final model’s parameters and performance in line with your directives. By choosing me for this project, you can expect resilient commitment backed by a satisfaction guarantee within the time limits. Choose Wyclife_Stats today!
€140 EUR in 7 days
7.1

Greetings, Thank you for considering my application for this project. As an AI Engineer and Python Developer with over 8 years of experience, I bring a wealth of knowledge and expertise in Python and deep learning. I have carefully reviewed the project description and am eager to discuss your specific needs and requirements in more detail. My commitment is to provide dedicated support and consistent follow-up throughout the project's lifecycle. Please feel free to reach out to me to further discuss how I can contribute to the success of your project. Looking forward to the opportunity of working together. Best regards, KuroKien
€120 EUR in 1 day
6.6

Hey there, Glane here, hope you're doing well. I can help you build a classification model that covers the 10 points mentioned in the PDF, from EDA through computing the accuracy, precision, recall, and F1 score for each model, and then interpreting the results and providing recommendations. Feel free to get in touch.
€180 EUR in 2 days
6.2

Affordable, Early Delivery. ★★★★★★★★★★★★★★ I hold a Master's degree, which gives me the requisite background to handle writing on various subjects. I am highly committed to my work. You can rely on QualityXenter for quality and consistency in writing. We never violate copyright rules. I have a vast amount of experience in this industry, having worked as a professional writer since 2015. I provide as many modifications as needed until you are satisfied. I have access to enough journals to use in your research project. I always produce quality work at VERY LOW RATES, so don't worry if you have a low budget for your work; I will be very happy to gain a new client like you. I produce quality work for my clients, including ARTICLE WRITING, REPORT WRITING, ESSAY WRITING, RESEARCH PAPERS, BUSINESS PLANS, TECHNICAL WRITING, MATLAB, THESES, ACCOUNTING & FINANCE work, etc. Go through my profile link: https://www.freelancer.com/u/qualityxenter
€30 EUR in 1 day
6.2

Hello there, I have advanced experience in Data Mining, Statistics, Statistical Analysis and Data Science. With my vast background in data analysis and management, I am confident in my ability to handle your categorical data project effectively and efficiently. I have extensive experience in collecting, cleaning, analyzing, and visualizing data using Python programming, an invaluable asset for a project of this nature. Additionally, I am well-versed in the CRISP-DM framework and adept at identifying patterns within datasets. Choosing me means benefitting from not only my expertise but also my personal approach to projects. I understand that each task is unique, requiring tailored skills, and so I'm willing to go the extra mile to provide you with results that meet and exceed your expectations. Let's join forces on this project, as our combined strengths will surely produce a result that's efficient, elegant and insightful! Let's not waste any more time! Together, we can mine this data efficiently and answer the questions to achieve your goals. Best Regards, Thanks
€30 EUR in 1 day
5.9

Hello, I understand you’re looking for an end-to-end machine learning classification model developed strictly according to your academic coursework requirements using the Online News Popularity dataset. I have strong experience in data science, machine learning, and academic model development, delivering complete, submission-ready projects that cover exploratory data analysis, preprocessing, feature selection, model training, and evaluation. My approach follows established academic standards to ensure reproducibility, methodological clarity, and compliance with grading criteria. The workflow begins with structured exploratory data analysis using visualizations, summary statistics, and pairwise comparisons to uncover patterns influencing article popularity. I apply principled feature selection to avoid over-parameterization, validate model assumptions, and compare multiple classification techniques using 10-fold cross-validation to assess robustness and out-of-sample performance. The final model is evaluated on the common 10,000-observation test dataset. Deliverables include a validated model, comparative results across methods, and a professionally written academic report aligned with the provided guidelines. The report interprets parameters, explains predictive performance, and clearly describes the characteristics of typical versus viral posts using data-driven insights and relevant literature. Thanks, Asif
€250 EUR in 10 days
5.8

Hi there, I'm ready to start working on your project Academic Classification Model Development. I’ve reviewed your description carefully, and as a creative & academic content writer with extensive experience in Research Writing, I’m confident I can deliver a solution that meets your expectations and aligns with your vision. Check out my profile here: ✨ https://www.freelancer.com/u/saifsolutions ✨ Feel free to reach out via chat or Freelancer call so we can discuss your project in more detail. Best regards, Saifullah
€30 EUR in 2 days
5.2

Hello there, As an experienced researcher, data scientist, and data analyst, my qualitative analysis skills perfectly align with your job requirements. My profound knowledge of Python and R Studio guarantees fast learning and adaptation to new tools. Moreover, my advanced skills in Excel make me highly competent in handling large datasets efficiently, making me proficient in extracting the best insights from your transcripts. I fully comprehend the importance of working papers and meticulously preparing financial statements, especially within strict timelines. My sharp analytical skills and extensive knowledge of Excel ensure that I leave no stone unturned in making sure every detail is covered under evaluation. My passion for quality, originality and meeting deadlines makes me an excellent choice for this project. I cannot wait to prove my extensive skills to you through providing actionable insights that will help guide your decision making regarding domestic charter flights. Best Regards
€30 EUR in 1 day
5.4

I have worked on many academic projects, assignments, and master's/PhD projects, including classification, regression, clustering, NLP, and deep learning projects. My techniques focus on feature engineering and on optimising evaluation metrics based on the outcome. My code will be fully commented, easy to understand, and easy to reproduce. Let me know.
€140 EUR in 3 days
5.2

I am an expert statistician, research writer, and data analyst with more than eight years of experience. I have full command of Excel analysis, SPSS, STATA, R, and Python. I am an expert in creating time series prediction models, working with survey data, conducting marketing analysis, building estimators, and medical analysis. I am a perfect match for your project. Share the other details of the work so I can start working on it. I will complete the task on time.
€100 EUR in 1 day
5.2

Hello sir! We are a group of experts, each with a master's degree in economics and statistics. We have good experience and strong skills in statistical analysis and machine learning using Python, and we promise you quality work and clean code at an affordable price, on time. We are waiting for good news from you. Best regards
€150 EUR in 7 days
4.8

Hi there, I can develop a complete end-to-end machine learning classification model for your Online News Popularity dataset. I’ll handle everything from preprocessing and exploratory data analysis to feature selection, model training, validation, and final evaluation according to your assignment instructions. Using Python and relevant ML libraries (scikit-learn, pandas, NumPy, matplotlib/seaborn), I’ll implement 10-fold cross-validation to assess predictive performance, compare multiple algorithms, tune hyperparameters, and select the best model. The final deliverables will include a fully reproducible script or notebook, detailed performance metrics (accuracy, precision, recall, F1-score), and a clear, reader-friendly report describing insights, feature importance, and the characteristics of viral posts. You’ll receive a complete solution ready for submission, fully aligned with your assignment requirements and replicable on the test dataset. I can start immediately and ensure the workflow is organized, accurate, and well-documented. Regards, Ahmad
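The four metrics this proposal lists (accuracy, precision, recall, F1-score) all follow from the counts of a binary confusion matrix. The sketch below is an illustration only: the function name `classification_metrics` and the toy labels are hypothetical, and zero-denominator cases are assumed to score 0.0. Library routines such as those in scikit-learn's `sklearn.metrics` module would normally be used instead.

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall, and F1 for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    accuracy = (tp + tn) / len(pairs)
    # ASSUMPTION: a metric with a zero denominator is reported as 0.0.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}

# Toy labels, invented purely for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

With these toy labels there are 3 true positives, 1 false positive, 1 false negative, and 3 true negatives, so all four metrics come out to 0.75.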
€80 EUR in 7 days
4.7

Hi. I will deliver the complete end-to-end machine learning project per your brief. This includes EDA with visualizations, feature selection, 10-fold cross-validation, model testing, and a final report. I’ll implement best practices in Python (scikit-learn) to avoid overfitting and compare methods like Random Forest, XGBoost, and regularized regression. I have an MSc in Data Science and have completed 10+ similar classification assignments. I can deliver the full solution—code, report, and results—within 5 days. Please share the training data files and any specific formatting instructions.
€140 EUR in 1 day
4.8

Hello, I am willing to help you with this assignment. I have achieved 100 percent success in working on similar projects and would love to replicate that success in your project. By collaborating with me, you are assured of results that will exceed your expectations as well as free 1-2 week post-project support for bug fixes or revisions. I have some questions that would help me get a clearer picture of what you're looking for, so please initiate a chat with me and let's take your project to the next level, Fahad.
€100 EUR in 2 days
4.4

Hello, Interesting research. I am the only one here with Google AI foobar achievements. Consider this already done. I recommend this trio:
Model A: Logistic Regression (The Baseline). Reason: It is the standard for Step 3 (Checking Assumptions). We can check for multicollinearity (using VIF) and linearity. It provides clear coefficients that allow you to say, "For every extra image, the odds of virality increase by X%."
Model B: Random Forest (The Benchmark). Reason: The original authors of the dataset used this. It’s a "black box" but handles the non-linear nature of social media data perfectly. It’s also the best for Step 9 (identifying characteristics of viral posts) via "Feature Importance" scores.
Model C: AdaBoost (The "Viral" Specialist). Reason: Research papers on this specific Mashable dataset often find that AdaBoost slightly outperforms Random Forest because it focuses on the "hard to predict" articles, which is exactly what you need for spotting outliers like viral posts.
2. The Execution Plan
To do this right, we will follow these technical steps:
Binarization: Since the prompt asks for a "classification model" but the shares variable is a numeric count, we will set a threshold (standard is 1,400 shares). Anything above is "Viral/Popular" (1), anything below is "Unpopular" (0).
Contact me for more.
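Two ideas from this proposal lend themselves to a short sketch: binarizing `shares` at the quoted 1,400 threshold, and reading a logistic-regression coefficient as a percentage change in odds. The helper names `binarize_shares` and `odds_change_percent`, the sample share counts, and the coefficient 0.05 are all hypothetical. The proposal does not say how a post with exactly 1,400 shares is labeled; the sketch assumes it falls in the "Unpopular" class.

```python
import math

POPULARITY_THRESHOLD = 1400  # threshold quoted in the proposal

def binarize_shares(shares, threshold=POPULARITY_THRESHOLD):
    """Map share counts to 1 (popular) if strictly above the threshold, else 0.

    ASSUMPTION: a count exactly equal to the threshold is labeled 0.
    """
    return [1 if s > threshold else 0 for s in shares]

def odds_change_percent(coefficient):
    """Percentage change in odds for a one-unit increase in a predictor,
    given its logistic-regression coefficient (log-odds scale)."""
    return (math.exp(coefficient) - 1.0) * 100.0

# Invented share counts, for illustration only.
shares = [593, 1400, 1401, 8200]
labels = binarize_shares(shares)
print(labels)

# Hypothetical coefficient for a predictor such as num_imgs.
beta = 0.05
print(f"odds change per extra unit: {odds_change_percent(beta):.2f}%")
```

Because a logistic-regression coefficient lives on the log-odds scale, exponentiating it gives the odds ratio, and subtracting 1 turns that ratio into the "increase by X%" phrasing used above.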
€140 EUR in 2 days
4.5

Hi, I can complete this end-to-end machine learning project for you, including EDA, feature selection, model building, 10-fold cross validation, test evaluation, and a full written report that follows your assignment instructions. I have experience with Python, scikit-learn, and academic ML reports, so I can deliver clean code, results, and interpretation. Share the data and brief, and I can start right away and confirm timeline and scope.
€100 EUR in 4 days
3.9

Athens, Greece
Payment method verified
Member since Feb 27, 2025