Fermé

Data Analysis

Function for Calculating Z-scores for data in a Numpy Array

Concept:

As a preliminary step in data analysis, certain types of data are transformed to ensure “good behavior” and “compatibility with other data.” One such transformation is the Z-score. Two series that has been transformed using the Z-score are more easily compared: [login to view URL]

[login to view URL]

In this project, you calculate Z-scores. You will be using your knowledge of Numpy. As well, you learn about the nuances of constructing a function. Also, if you check the links above, you will learn a useful concept in statistics.

Requirement:

The basic requirement is that you produce a function that takes a numpy array as input and outputs an array of the same shape in which the data is transformed through Z-scores. This means that if the array is (5,2), then for each of the two columns, 5 values are used to calculate the mean and sigma and for each of the values in the column, you calculate: (value – mean)/sigma.

A google search “z-score numpy array” will give you plenty of ideas about implementation and you could achieve an efficient implementation with just one line of code! So let me throw in an enhanced requirement: the user also inputs a scalar value (the second input) to indicate the desired column for transformation. Thus, only the specified column is transformed and the other(s) are left alone. For example, if the user inputs a (10,4) array and inputs a column-indicating scalar with a value of 3, this means that only the third column requires transformation (the three remaining columns are left untouched). If the user does not input this scalar, the default is that all columns are transformed. (Hint: a low-level familiarity with Linear Algebra can quickly guide you to an elegant answer here; you can use 1D arrays of ones for sigma and zeros for means and modify them appropriately to complete your calculation. To elaborate, if you use mean = 0 and sigma = 1, the transformation does not change value. Alternatively, use loops and brute force! Whatever works!)

Finally, I want you to implement checks on whether the inputs make sense. For example, is the first input a numpy array (print error message if it is something else) and is the second input appropriate (print error message if scalar does not correspond to a column number)?

Submission:

You are required to code in an ipython notebook and create a pdf. Show your function in one cell – call the function myz. In another cell, create inputs using the following code:

x1 = [login to view URL]([[4,3,12],[1,5,20],[1,2,3],[10,20,40],[7,2,44]])

x2 = 3

x3 = 6

Run the function four times – myz(x1), myz(x1,x2), myz(x1,x3), and myz(x2,x3) – and generate results. In the next cell, please bullet-list things you learned by doing this project. Print to pdf and submit pdf.

Skills: Data Science, Python, Statistical Analysis, Statistics

See more: z score definition, how to calculate z score in excel, z score probability, z score to percentile, z scores table, z value calculator normal distribution, raw score calculator, how to find z score without standard deviation, low level bluetooth programming, low level network problem, net low level programming, low level network interface, preparing low level design document project net, desktop sharing low level, low level programming, low level programming cnet, low level network card, low level access network, money programming low level, low level ways network

Compétences : Science des données, Python

en voir plus : data analysis pdf, data analysis methods, data analysis in research example, data analysis example, data analysis in research methodology, data analysis course, what is data analysis in research, types of data analysis, website test brute force, data entry job want bangladesh, proposal data analysis project, resume data analysis, collating data analysis, brute force website, php admin brute force, brute force websites, brute force test site, perl brute force web login, data analysis job spss outsourcing, brute force web site

Concernant l'employeur :
( 0 commentaires ) India

Nº du projet : #17799369

15 freelance font une offre moyenne de ₹348/heure pour ce travail

anupkelkar02

I am a Python data science expert with experience in Classification and partitioning, Numpy, Pandas, Neural Networks, Association rules, Machine Learning Model, Tensorflow. Also I am a Oracle Certified profession Plus

₹500 INR / heure
(42 Commentaires)
6.0
davidngo817

Hello, dear! I am interested in your project. I have extensive experience in Python and Numpy library. I am confident in this field. If you are interested to work with me, please feel free to contact me. Thanks.

₹400 INR / heure
(10 Commentaires)
5.1
suyashdhoot

Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp Plus

₹1000 INR / heure
(17 Commentaires)
5.2
iamajayjadhav

Hello, Greetings of the day.!! Your project attracted my attention at first glance, because I've really rich experience in Machine Learning & Python Programming. I have worked on several similar projects before! Plus

₹277 INR / heure
(11 Commentaires)
4.0
julio2Cesar32

I worke witch Numpy more that 4 years and i know how to make the z-score transformation. I can make this faster.

₹776 INR / heure
(11 Commentaires)
2.9
NoelReubenMuturi

Hello, I am Noel, a proficient statistician. I am proficient in statistical analysis and statistical software such as Excel, SPSS, and R Statistics. I am qualified to take up the project and would love to speak with yo Plus

₹277 INR / heure
(0 Commentaires)
1.1
aamirk306

I have worked on several similar projects before! I'm really confident about your project, and very eager to join your project.

₹277 INR / heure
(0 Commentaires)
0.0
Amitkr333

i have sound knowledge on z score and other hypothesis testing methods . worked on multiple data set to find t score /z score depending on population size and parameters

₹277 INR / heure
(0 Commentaires)
0.0
siaditya

It is very similar to the work I have done before

₹333 INR / heure
(0 Commentaires)
0.0
₹333 INR / heure
(0 Commentaires)
0.0
iamarebel15

1)competitor on Kaggle and Hackaearth 2)good analytical and statistical knowledge 3)Anomaly Detection project which I had done required a lot of knowledge in Confidence Interval,Z-score,Gaussian estimation etc

₹111 INR / heure
(0 Commentaires)
0.0
rma491

Hey! Your project looks very interesting. I have experience with python and coding in general and I am very looking forward to learn new things. I am doing this work not for the money but to learn more skills for m Plus

₹277 INR / heure
(0 Commentaires)
0.0
jayeshthukarul

Hands on experience on numpy. Good with linear algebra. Sound knowledge of statistics.

₹100 INR / heure
(0 Commentaires)
0.0
lantern111

I have good hands on knowledge in python numpy array and stat functions.

₹111 INR / heure
(0 Commentaires)
0.0
ThamizhD

in currently doing post graduate programme in machine learning.... I'm more interested in this... had worked on small projects with numpy , statistics etc...

₹166 INR / heure
(0 Commentaires)
0.0