I have a dataset of approximately 14 columns, all of them categorical, with total number of rows more than [login to view URL] I need is to understand which columns are contributing to my target [login to view URL] all being categorical, I want to convert them to numerical using get_dummies. However, some of the columns have more than 10000 unique values, so across 13-14 columns I expect the number of dummy columns to exceed 11K, which makes it impractical to develop a model.
So I am looking for help with building this model and identifying which features, out of the 13, have the most impact on my output/target feature.
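One way to rank categorical columns against a target without creating dummy columns at all is to score each column by its mutual information with the target. The sketch below is only an illustration under assumptions: it uses the standard library only, the function names `mutual_information` and `rank_columns` are hypothetical, and the toy rows are made up, not taken from the dataset described above.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # counts of (category, target) pairs
    px = Counter(xs)               # counts per category
    py = Counter(ys)               # counts per target value
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log(c * n / (px[x] * py[y]))
    return mi

def rank_columns(rows, target):
    """rows: list of dicts. Returns [(column, score)] sorted from high to low."""
    ys = [r[target] for r in rows]
    cols = [c for c in rows[0] if c != target]
    scores = {c: mutual_information([r[c] for r in rows], ys) for c in cols}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Toy data (made up): "city" determines the target, "colour" is unrelated.
rows = [
    {"city": "A", "colour": "red",  "target": 1},
    {"city": "A", "colour": "blue", "target": 1},
    {"city": "B", "colour": "red",  "target": 0},
    {"city": "B", "colour": "blue", "target": 0},
]
ranking = rank_columns(rows, "target")
```

A caveat on this design choice: raw mutual information is biased towards high-cardinality columns (a column with a unique value per row scores as high as possible), so columns with thousands of levels would probably need their rare values grouped first, or the scores checked on held-out data.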
11 freelancers are bidding an average of ₹2341 for this job
Hi, I am Ibrahim and I am a data scientist. I can help you with feature engineering. What algorithms are you trying to use? Please tell me more about the dataset via chat. Regards, Ibrahim Anjum
Hi, hope you are fine. So you want to do feature engineering to get rid of columns which have no impact or minimal impact on the target variable. I have more than 3 years of experience in ML and feature engineering. Let's con...
Hey! I have 4+ years of industry experience in machine learning, deep learning, natural language processing, and computer vision applications. My skills: OpenCV, TensorFlow, PyTorch, Keras, NLTK, Supervised and Unsu...
Hi, can you share the data? I have worked on such problems. We have to do a statistical analysis of the unique values in each column and their significance with respect to the target column, then club the values and maybe convert them to ordinal.
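The "club the values and convert to ordinal" idea in this bid could look roughly like the following. This is a sketch under stated assumptions, not the bidder's actual method: the target is assumed to be binary (0/1), the helper names `club_rare` and `ordinal_by_target_rate`, the `min_count` threshold and the `"OTHER"` label are all hypothetical, and the sample values are invented.

```python
from collections import Counter, defaultdict

def club_rare(values, min_count, other="OTHER"):
    """Replace categories seen fewer than min_count times with one shared label."""
    counts = Counter(values)
    return [v if counts[v] >= min_count else other for v in values]

def ordinal_by_target_rate(values, targets):
    """Map each category to a rank, ordered by the mean of a 0/1 target."""
    sums = defaultdict(float)
    counts = Counter()
    for v, t in zip(values, targets):
        sums[v] += t
        counts[v] += 1
    # Sort by target rate; ties are broken by the category name for stability.
    ordered = sorted(counts, key=lambda v: (sums[v] / counts[v], str(v)))
    return {v: rank for rank, v in enumerate(ordered)}

# Toy column and binary target (made up for illustration).
values  = ["a", "a", "a", "b", "b", "c", "d"]
targets = [1, 1, 0, 0, 0, 1, 0]

clubbed = club_rare(values, min_count=2)            # "c" and "d" become "OTHER"
mapping = ordinal_by_target_rate(clubbed, targets)  # category -> ordinal rank
encoded = [mapping[v] for v in clubbed]             # one numeric column
```

Each original column stays a single numeric column this way, instead of thousands of dummies. Because the ranks are derived from the target, the mapping should be fitted on the training split only and then applied to validation data, otherwise the encoding leaks target information.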
Experienced machine learning scientist, with experience building large, massive-scale systems, extreme feature engineering, dimensionality reduction, etc.
Hey! I went through your details, and it looks like an easy task for me. I have 3 years of experience in machine learning and feature engineering, and have completed 50+ projects in data science. Thank you.
Dear client, I understand you would like to reduce the number of features in your dataset in order to get a better machine learning result. I would suggest we use PCA to do this. I can work on it now. Sincerely, DUAN.
1. More than 20 years of IT industry experience 2. Worked with multiple global customers at various levels 3. Committed to and passionate about Data Science
Hello, I'm a Data Scientist and Machine Learning Engineer. I have worked on many feature engineering projects and can help you choose the most important features in your data.
Hi, I have 6 years of experience in data analysis using statistical and machine learning tools in Python. I can help you with your project.