This project involves devising a method to match inconsistently coded data. We have a dataset of construction site inspections. Entries describing the location of the same work site are often recorded differently. For example, one entry might have an "address" cell recorded as "123 Elm Rd" while another might have "123 Elm Road". In other cases, the same company's "company name" cells might be recorded differently. For example, "Acme Inc." might be misspelled as "Amce Inc." in one entry. We would like a program that matches inconsistently coded entries by comparing each entry to every other entry. A successful match occurs when there is a high probability that the two entries refer to one and the same site or company. This must be an automated process because our dataset contains around 1.5 million observations.
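To make the matching idea above concrete, here is a minimal sketch in Python using only the standard library. It is one possible approach, not the poster's method: the abbreviation table, the 0.85 threshold, and the function names (`normalize`, `similarity`, `is_match`, `find_matches`) are all assumptions chosen for illustration. Because comparing 1.5 million entries pairwise means roughly a trillion comparisons, the sketch also groups entries by a cheap "blocking" key (here, the first token) and only compares within each group.

```python
import re
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical abbreviation table; a real one would be built from the data.
ABBREVIATIONS = {"rd": "road", "st": "street", "ave": "avenue",
                 "inc": "incorporated"}


def normalize(text):
    """Lowercase, drop punctuation, and expand known abbreviations."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return " ".join(ABBREVIATIONS.get(t, t) for t in tokens)


def similarity(a, b):
    """Return a 0..1 similarity score between two normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def is_match(a, b, threshold=0.85):
    """Treat two entries as the same when the score reaches the threshold.

    The threshold is an assumed starting point and needs tuning on real data.
    """
    return similarity(a, b) >= threshold


def find_matches(records, threshold=0.85):
    """Compare entries only within blocks that share the same first token."""
    blocks = defaultdict(list)
    for record in records:
        key = normalize(record).split(" ")[0]  # assumed blocking key
        blocks[key].append(record)
    matches = []
    for group in blocks.values():
        for a, b in combinations(group, 2):
            if is_match(a, b, threshold):
                matches.append((a, b))
    return matches


addresses = ["123 Elm Rd", "123 Elm Road", "987 Oak Ave"]
print(find_matches(addresses))
print(is_match("Acme Inc.", "Amce Inc."))
```

Blocking on the first token suits addresses that begin with a street number, but it would put "Acme Inc." and "Amce Inc." in different blocks, so a company-name column would likely need a different key (for example a phonetic code or a sorted-letter signature).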
Thank you for all of the responses. I am going to provide a limited sample of the data which I hope will allow for some clarification.
Hi there, I have understood the project requirements and am ready to start. I am quite new to this site and was lucky to win the first project I bid for. That project is exactly similar to what you need. Pl...
18 freelancers are bidding an average of $235 for this job
InfogateSoftware is the only ISO 9001 certified company listed here. We have been in the field since 1998. We have a large and highly experienced team of programmers, designers, technical writers and others to do your work i...
Recent experience includes working with an outsourcing service provider as an operations manager, which gave him exposure to working with cross-functional and international clients, providing them quality service with...
Very interesting project. We will do it with pleasure. We have done similar ETL projects. Our solution will use several comparison methods which can be parameterized. This allows the end user to tune the matching threshol...
Hi! I am very happy to see this project. I will do it in Perl using regular expressions. Give me more details. Thanks.
Outsource Web Development, Website Design, Logo Design, Banner Design, Scripting and Programming, Flash Animations, Corporate Identity, E-Commerce Payment Gateway Integration. Serving on the web for 7 years, Fictio...
We will do everything you want (and more... :-)). Quick, professional, quality: that is our answer to you and your organization. We have been working for more than 10 years. Any questions?
Dear Sir or Madam: I am a Sun Certified Java Developer with over 4 years of software engineering experience. I have a good understanding of how to implement your list of requirements as I've worked on similar data proc...
Although I am not sure of the structure of the data to be scanned, I would imagine a VBScript would be sufficient to write a procedure to compare different spellings of the same item of data. I do this type of work fre...
After looking at your data, I'll be able to provide a solution a little quicker and for a little less than I previously thought. mlgrow
I will do it if SQL Server + C# (ASP.NET) is acceptable for this project. The estimated time is 2 days, though it actually depends on the detailed specification.
Hello, based on your requirements, our solution is as follows: we create a small data processing application, written in MS Visual FoxPro for speed. The application will consist of a maintenance option for a maintenance...