Find Jobs
Hire Freelancers

hadoop, spark , cloud service, cluster

$10-30 USD

Fermé
Publié il y a environ 2 ans

$10-30 USD

Payé lors de la livraison
deadline 16th april Project : Hadoop and Spark The purpose of this project is to support your in-class understanding of how data analytics stacks work and get some hands-on experience in using them. You will need to deploy Apache Hadoop as the underlying file system and Apache Spark as the execution engine. You will then develop several small applications based on them. Task 1: Launch a cluster of virtual machines in a cloud environment (e.g., AWS, Azure, or GCP). You will need to have one node as the master and at least two nodes as workers (slaves). Task 2: Deploy the HDFS service on the cluster. Task 3: Download the text version of Pride and Prejudice from Project Gutenberg, and save it to the HDFS cluster. Task 4: Deploy the Spark service on the cluster. Task 5: Use the file in HDFS as input, run a wordcount program in Spark to count the number of occurrences of each word. Sort the words by count, in descending order, and return a list of the (word, count) pairs for the 20 most used words. Task 6: Write a Spark program that uses Monte Carlo methods to estimate the value of $π$. Since the area of a circle of radius r is $A = πr^2$ , one way to estimate π is to estimate the area of the unit circle. A Monte Carlo approach to this problem is to uniformly sample points in the square $[−1, 1] × [−1, 1]$ and then count the percentage of points that land within the unit circle. The percentage of points within the circle approximates the percentage of the area occupied by the circle. Multiplying this percentage by 4 (the area of the square $[−1, 1] × [−1, 1]$) gives an estimate for the area of the circle What to submit: a report on describing the commands you run, code in any file(s), your observations, and output from all the steps in each task. Also explain the purpose of each step in your report.
N° de projet : 33456766

Concernant le projet

5 propositions
Projet à distance
Actif à il y a 2 ans

Cherchez-vous à gagner de l'argent ?

Avantages de faire une offre sur Freelancer

Fixez votre budget et vos délais
Soyez payé pour votre travail
Surlignez votre proposition
Il est gratuit de s'inscrire et de faire des offres sur des travaux
5 freelances proposent en moyenne $162 USD pour ce travail
Avatar de l'utilisateur
I have a hadoop cluster in my organization with 28 nodes. I use this for my research work and student project. I can do this work. I will provide public URL to access the cluster via Ambari as well as ssh.
$100 USD en 7 jours
0,0 (0 commentaires)
0,0
0,0
Avatar de l'utilisateur
Hi, I am an experienced Big Data Engineer with hands-on knowledge of HADOOP and SPARK. I can help you with this project, we can setup 1 master node and 2 workers nodes for hadoop and spark services. I can write efficient code to solve WordCount and $π$ value problem with Monte Carlo methods. I will finally develop a report with solution, their result and explanation of each step. Lets discuss details over chat. Regards, Safi
$280 USD en 3 jours
0,0 (0 commentaires)
0,0
0,0
Avatar de l'utilisateur
Hello Sir. I just saw your project in my freelancer feeds and I carefully read it's description and I am really interested in completing it in fast deadline and reasonable budget. My name is Ashish Yadav and I am DevOps Engineer as well as Bigdata hadoop expert. experience with following techs: - Containers: Docker, kubernetes - Cloud: AWS, GCP - DevOps Automation: Jenkins, Maven, Gitlab, Terraform, Ansible - Monitoring: ELK - Hadoop - System Administration: Linux, Debian, Redhat, Centos I have done what you have required project in automation that can be setup only one click using ansible in aws instance at any os. Be sure, Sir, that I am very excited to talk further about this project and be awarded delivery quality solution for your needs. But I have few questions to ask you before we start. So please reach me over chat so we can discuss about it. Thank you very much.
$30 USD en 7 jours
0,0 (0 commentaires)
0,0
0,0

À propos du client

Drapeau de UNITED STATES
woodside, United States
5,0
2
Méthode de paiement vérifiée
Membre depuis mai 24, 2021

Vérification du client

Merci ! Nous vous avons envoyé un lien par e-mail afin de réclamer votre crédit gratuit.
Une erreur a eu lieu lors de l'envoi de votre e-mail. Veuillez réessayer.
Utilisateurs enregistrés Total des travaux publiés
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Chargement de l'aperçu
Permission donnée pour la géolocalisation.
Votre session de connexion a expiré et vous avez été déconnecté. Veuillez vous connecter à nouveau.