Match datasets based on precalculated principal components (R programming)

This needs to be done in R, so we need R code as a result. Attached is the file "[login to view URL]"

The attached file consists of 12 columns and 1000 rows (incl. header) , The headline identifies each row with ID, cohort, PC1 to PC10. Cohort contains one of the 6 uniq values ("Control", "set_a", "set_b", "set_c", "set_d", "set_e")

Take each "set" one-by-one and find the closest match to the PCs by finding the control with the minimum of sum over i of square (PCi-PCi) where i stands for PC1, 2 ... 10 and the difference is between the value for the "Control" and "set". So one needs to calculate these for case/set pairs. Once a control is selected it needs to be removed completely so it won't be selected for another case.

Start with a case chosen from one series, and determine the best control. Then switch to another case series and find the best control for a chosen case. Continue until the end of all the cases. Then, start again finding a new control for each case until you reach controls for each case.

Thank you!

Our goal is to select 5 controls for each "set" that are closely matched.

Compétences : Traitement de Données, Langage de programmation R, Statistiques

en voir plus : match com based drupal, principal components factor matlab, principal components analysis matlab, data analytics hadoop r programming, freelance r programming, help with r programming project -- 3, help with r programming project 3, r programming and, r programming language, r programming machine learning, r programming project, R programming, r-programming language, short r programming project, short r programming, test on r programming, freelancer r programming, r programming freelance, r programming freelance job, remote source r programming

Concernant l'employeur :
( 0 commentaires ) Montreal, Canada

Nº du projet : #17973076

7 freelance font une offre moyenne de $131 pour ce travail


This is Vibrant Webtech and I was glad to see that you're looking for help for project Match datasets based on precalculated principal components. I've delivered more than 400 + projects in the last 5 years and this Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% CAD en 3 jours
(183 Commentaires)

I am a data scientist by profession with more than 4 years of programming experience in R and have completed more than 35 projects in R. I can finish the task within 24 hrs

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% CAD en 3 jours
(46 Commentaires)

I have Masters degree in Economics and Statistics with 7years of professional experience working as a Quantitative Analyst (in the field of Statistics). A professional statistical analyst seeking opportunity to provid Plus

%bids___i_sum_sub_32% %project_currencyDetails_sign_sub_33% CAD en 1 jour
(22 Commentaires)

I possess exceptional data and statistical analysis experience. I use Excel, STATA, R-Programming and SPSS software’s in qualitative and quantitative research and report writing. I hold MBA (Strategic Management) under Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% CAD en 3 jours
(33 Commentaires)

Hello, i have read the details provided..please contact me to discuss more on the project deadline and some other few things

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% CAD en 3 jours
(11 Commentaires)

Feel fee to contact me for Match datasets based on precalculated principal components .Shoot me message to discuss further more details .We provide the commments,images,videos,demos and live sessions in order to he Plus

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% CAD en 3 jours
(9 Commentaires)

I am expert in R Programming pls check my reviews and you can trust me and I will offer u the best price and the best quality work

%bids___i_sum_sub_35% %project_currencyDetails_sign_sub_36% CAD en 3 jours
(9 Commentaires)