In progress

Open Access Harvest Project II (data gathering scripting)

Data Extraction / Transformation

We are looking to build an Open Access archive of freely available scholarly journals. [url removed, login to view] is a good explanation of the content and the field this project relates to.

Requirements:

A. Create a harvesting engine in the programming language of your choice (parallel processing has given the best results) that can:

1.) Crawl specific Internet sites (targets). We will help choose the targets; OAI is one method that some sites support.

2.) If not crawling, read from an input file to glean the data; some sites supply one.

3.) Ensure the data is accurate and test URLs for correctness

4.) Dump the defined data to a delimited text file

5.) Transfer the data to us via FTP
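
As an illustration of steps 1 and 3, here is a minimal Python sketch. It assumes that "OAI" refers to the OAI-PMH protocol with Dublin Core (`oai_dc`) metadata, which the posting does not confirm. The function names and the `example.org` endpoint are hypothetical, and the URL check is purely syntactic; a real harvest would also request each URL to confirm that it resolves.

```python
from urllib.parse import urlencode, urlparse
import xml.etree.ElementTree as ET

# Namespaces used in OAI-PMH responses that carry Dublin Core metadata.
OAI_NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def build_list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL for an OAI-PMH endpoint (assumed protocol)."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is sent without the other arguments.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def extract_titles(xml_text):
    """Return the text of every dc:title element found in a response body."""
    root = ET.fromstring(xml_text)
    return [element.text for element in root.iterfind(".//dc:title", OAI_NS)]


def looks_like_url(value):
    """Syntactic check only: an http(s) scheme and a host must be present."""
    parts = urlparse(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


if __name__ == "__main__":
    # Hypothetical endpoint, shown only to illustrate the request format.
    print(build_list_records_url("https://example.org/oai"))
    print(looks_like_url("https://example.org/article.pdf"))
```

Because each target can be requested independently, calls like these could be spread across worker processes or threads, which fits the note above about parallel processing.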

B. Work with us to find new resources and refresh existing sources on a monthly basis.

C. Provide new and updated data feeds continually

D. Provide your own platform to run the harvests; a multi-core processor should be sufficient

E. The data provided will be article-level data for each journal. Each record will need these output fields:

"Publisher", "Journal Title", "Article Title", "ISSN", "Alternate ISSN", "Journal Year", "JournalVol", "JournalIssue", "HTML URL", "PDF URL", "Start Page", "End Page"

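A delimited output file with the fields listed in requirement E could be produced along the following lines. This is only a sketch: the posting does not name the delimiter or the quoting rules, so the comma delimiter, the quoting of every field, and the `write_records` helper are assumptions made for illustration, and the sample record is invented.

```python
import csv
import io

# Output fields, in the order given in requirement E.
FIELDS = [
    "Publisher", "Journal Title", "Article Title", "ISSN", "Alternate ISSN",
    "Journal Year", "JournalVol", "JournalIssue", "HTML URL", "PDF URL",
    "Start Page", "End Page",
]


def write_records(records, out, delimiter=","):
    """Write a header row and one row per record to a file-like object.

    The delimiter and the quote-everything style are assumptions; missing
    fields are written as empty strings.
    """
    writer = csv.DictWriter(
        out,
        fieldnames=FIELDS,
        delimiter=delimiter,
        quoting=csv.QUOTE_ALL,
        lineterminator="\n",
        restval="",
    )
    writer.writeheader()
    for record in records:
        writer.writerow(record)


if __name__ == "__main__":
    # Invented sample record, used only to show the output shape.
    buffer = io.StringIO()
    write_records(
        [{"Publisher": "Example Press", "Journal Title": "Journal of Examples"}],
        buffer,
    )
    print(buffer.getvalue())
```

Passing `delimiter="\t"` would give a tab-delimited file instead, if that is the preferred format.
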
Skills: C Programming, Java, Perl, Ruby on Rails, Web Scraping


About the employer:
(5 reviews) Windsor, United States

Project ID: #1640419

Awarded to:

MagedGazzar

Hi, Please check PMB

$400 USD in 30 days
(2 reviews)
4.7

7 freelancers are bidding an average of $457 for this job

SigmaVisual

We can help in your project, please check PMB and our ratings/reviews to get idea of our experience.

$250 USD in 7 days
(49 reviews)
6.6
dolphin3456

I can start on this project. Please check my PM for more information.

$400 USD in 4 days
(1 review)
1.5
keepsense13

check PM for details......

$250 USD in 7 days
(1 review)
1.4
ketanmuneshwar

this would be good work for me!

$500 USD in 14 days
(0 reviews)
0.0
F5c8WV3aY

Custom software development - Removed by Admin

$750 USD in 1 day
(0 reviews)
0.0
Hagr1d

Hello. I have experience with similar projects.

$650 USD in 5 days
(0 reviews)
0.0