En cours

Build Hadoop/Hive System

We want to find a partner to help us get started in building a big data / datawarehousing system in Hadoop and Hive (or suggested alternative) to run alongside our operational system. This new big data store will provide api, reporting and datawarehousing functions, the latter to drive a tool like Tableau. This data store will then develop to receive streamed and historical batch data and generate metrics from map-reduce calls.

We are a real time vehicle tracking solutions provider collecting information about vehicle positions and their adherence to scheduled journeys.

We are new to the Hadoop world and want to get up to speed ourselves during this development. Therefore we want a full end to end development including initial system setup, development of a data loading/streaming method and provision of a small set of data output methods (RESTful API calls, reports and an initial datawarehouse structure ).

We envisage the following tasks :-

1. Deploy Hadoop/Hive (or suggested alternative) on a virtual Debian server which we will provide access to. We want this performed in such a way that it can easily be expanded into new nodes and would want to see data distributed across more than 1 system/node.

2. Develop a data load process to pull information from our transactional system and load into Hadoop/Hive. This initial data set will be a block of data per day containing a couple of metrics but with quite a few decriptive fields as a vehicle code, location_code. Timestamp, customer code. If this was a traditional star schema then there would be about dimensions. This data has a time based aspect and has a geospatial aspect. We would be able to provide this in Csv format or from an API call. In

3. Develop some map reduce functions to generate some useful metrics and agregations which we can agree.

4. Make some of the metrics available using the existing HADOOP/Hive/RESTful technologies in order to provide an API.

5. It would be nice from us to access the datastore from PHP using perhaps a Hive/ODBC driver not sure if this is possible but it would be good to try this.

6. Organize the data so that an OLAP tool can be used against it. For example, use Pentaho or Tableau to generate some queries to be able to pivot ad drill down. Especially important is to be able to show aggregate data for say a year and drill into month, day etc. Also would be good to be able to show geographical data.

We are interested in working with someone who can recommend the paths to take to make this system expandible, fast and easily accessible and to help us make the best choices. For example help in deciding which database to use would be welcome.

Please bid only if yu have experience of this in the past. If you interested in bidding for this week we would like to hear about your experience in similar projects and your views on whether this is a sensible approach.

Compétences : Big Data, Stockage de données, Hadoop, Hive, PHP

Voir plus : real time data solutions, pentaho 5, olap database example, olap database, node data structure, node csv, full time positions available, example olap, data structure node, map data structure, big data development php, best olap database, hadoop email system, groovy hadoop hive, using nutch hadoop website, nutch hadoop help

Concernant l'employeur :
( 3 commentaires ) Crawley, United Kingdom

N° du projet : #8477793

Décerné à :

vnrteja

Dear Client, We, Fragma Data Systems, are a Big Data Consulting and Solutions startup based out of Bangalore. Our team comprises of Big Data Architects who have more than 10+yrs hands on experience in designing and Plus

2666 £ GBP en 12 jours
(0 Commentaires)
0.0

40 freelance ont fait une offre moyenne de 2693 £ pour ce travail

meet2amitvw

Let's discuss over freelancer Personal Message Box for the proper estimation of cost and time. I am myself developer so you will directly work with me. No mediators. No managers. No subcontractors. see my recent Plus

2938 £ GBP en 30 jours
(121 Commentaires)
8.7
kchg

Hi! I am TOP 6th freelancer. I am interest your project and hope to work with you. I have enough skills to carry out any sort of work load. I will complete your project perfectly in your deadline. Plus

2886 £ GBP en 30 jours
(278 Commentaires)
8.8
buraqtech

1. FeedbackFive Clone Amazone MWS API's In these days we are near to finalize another API's based feedback management system for sellers on Amazon.com by using the MWS API's of amazon whose details are currently confi Plus

2994 £ GBP en 45 jours
(97 Commentaires)
8.0
sritechnocrat

Hello Sri Technocrat will provide fully interactive website for your project. As per the detail, Sri Technocrat will provide three template functional schemes and sample pages to make your choice for layout. It will Plus

1666 £ GBP en 30 jours
(62 Commentaires)
7.4
iDCreativeUK

Hi, Mark here, I would be interested in discussing this project with you. Thanks for the consideration, I hope to hear from you soon Thanks, Mark

2886 £ GBP en 35 jours
(21 Commentaires)
6.7
logicscreators

A proposal has not yet been provided

2368 £ GBP en 30 jours
(118 Commentaires)
7.0
tretanz

Hi, We are professional web development company. We have team of web designer and developer who developed 100+ web site with different technology. Our developer and designer will give you satisfactory solution bas Plus

2368 £ GBP en 30 jours
(164 Commentaires)
6.8
Thesynapses

Hello There, Greetings From Synapse! Hope this finds you well. Coming straight to point: Firstly, I would like to share that we are having our own Cloud foundry. We have Hadoop Experts in our team and they have Plus

5263 £ GBP en 30 jours
(11 Commentaires)
6.5
iqbalsingh

Hi, Yes,i can help you to build a big data / datawarehousing system in Hadoop and Hive to be more modern as you want.It would be great if you can give me some time to discuss this project. We offer custom web s Plus

2947 £ GBP en 55 jours
(46 Commentaires)
6.7
gkws

Dear Hiring Manager, Greetings !! I hope your day is going well and all is good with you. We would like to discuss the project in details before confirming the bid, so kindly let me know when you are available. Plus

3092 £ GBP en 30 jours
(50 Commentaires)
6.2
sushant003

Dear Sir, http://www.pvsysgroup.us/carhire/ http://pvsysgroup.us/datingwebsite http://pvsysgroup.us/SportsChallengeEurope/ http://www.pvsysgroup.us/lawfirmsmanagement/ http://www.pvsysgroup.us/theroyaldeals/ ht Plus

2319 £ GBP en 300 jours
(61 Commentaires)
6.4
TeamValens

Hello, My experiences with performance testing / tuning of Mapreduce or Hive jobs before 01) "For a Company with innovative medical solutions" My Job Functions: Developed MapReduce programs to parse the raw da Plus

2989 £ GBP en 30 jours
(1 Commentaire)
5.8
WorkXpressPaaS

Hello! We are a US company with a 13 year history of developing cloud-based custom software solutions for a diverse client list. Our rapid development platform is able to save our clients significant time and money Plus

2319 £ GBP en 30 jours
(1 Commentaire)
5.5
AppDuniya

Dear Client, Hope you are doing well..!! I have carefully gone through your posted job description and understood your requirements (mentioned tasks) very well. I will do the complete development to make t Plus

3157 £ GBP en 30 jours
(22 Commentaires)
5.5
kimit1234

Hello, I will be honest with you. I run a company which is into data warehousing solutions. We have worked with various clients across US in building DW and Hadoop systems. You can visit thinklayer[dot]com for more det Plus

1500 £ GBP en 30 jours
(73 Commentaires)
5.6
webbookstudio

Hello, my name is Olya. I represent Ukrainian IT Company «Webbook». We provide website design and web development services for organizations, public and government institutions, company or private web-pages. We got Plus

2200 £ GBP en 30 jours
(11 Commentaires)
5.4
craftbynick

US PROVIDER - NO PAYMENT RELEASE TILL PROJECT IS FINISHED (for projects estimated to complete within 20 days or less): Hi! I’m Nick from MadGrizly a US based Digital Studio specializing in Web and Mobile innovative de Plus

3333 £ GBP en 30 jours
(12 Commentaires)
5.3
winnow1

1. Setup and configure a Hadoop cluster on Debian. ETL the batch data to HDFS using Pentaho DI or SSIS 2. Map reduce the files to create aggregates. Store the aggregates in Hive tables. Optionally we can store them Plus

3000 £ GBP en 45 jours
(4 Commentaires)
4.7
amazeinfoway

Hi, I have read your requirement and i am too much interested in this job ,I have an expert team , Only 10 people and working since 5 years and developed many websites , web applications e commerce websites , plugin d Plus

2368 £ GBP en 45 jours
(13 Commentaires)
4.6
norberto108vw

Hi, I am a senior developer with the skills that you need for the job. I can partner with you in this project. I hope we can talk better soon. Best regards, Norberto

2500 £ GBP en 30 jours
(23 Commentaires)
4.8