Annulé

Wikipedia Parser

Schedule of conditions for the technical implementation of the WikiMedia Parser in C#

Introduction

This is a small program parser to make a static dump (HTML output) of Wikipedia which is based on WikiMedia ([url removed, login to view])

The program is designed to extract WikiMedia tags (including the template [url removed, login to view]:Template_messages/All) from text to transform onto html output. The Html must comply with the W3C's HTML specifications.

The parser must be written in C# language. The main class should have an easy to use method for getting the text parsed.

I would like to use the API something like this:

String OrignalText = “wiki text”;

WikimediaParser parser = new WikimediaParser();

String textParsed = [url removed, login to view]( OrignalText);

You can take as a starting point this site:

[url removed, login to view]

Job types

• .NET C#

• Regular expression

• HTML/CSS

Resume

Wikitext language or wiki markup is a markup language that offers a simplified alternative to HTML and is used to write pages in wiki websites.

Wikitext is text in this language.

There is no commonly accepted standard wikitext language. The grammar, structure, features, keywords and so on are dependent on the particular wiki software used on the particular website. For example, all wikitext markup languages have a simple way of hyperlinking to other pages within the site, but there are several different syntax conventions for these links.

Some wiki programs allow extensive optional use of HTML tags within wikitext, others a smaller subset, and still others no HTML at all. Other wiki programs allow the restrictions on HTML to be set by the particular site.

MediaWiki's wikitext allows you to freely mix wiki format and HTML, but it provides a simple, readable syntax that allows users to not even know HTML

Project

Wiki markup

I would like to translate all wiki markup that is on this page:

[url removed, login to view]:How_to_edit_a_page

Wiki template

Wiki markup templates on this page:

[url removed, login to view]:Template_messages/All

I don’t need “User talk namespace”.

Log Message:

I want to use Logger4Net to log each error and accurate debug message when debug message is enabled.

Flexible code

I want flexible code to add future Wiki Markup or Wiki Template. The code must be commented very clearly.

Platform

The API must be run on Windows and with the .NET Framework 1.1 or more. The API must be written with C# language.

Budget

We pay only at the end of the project. Any method payment is accepted ( Paypal, wire, etc…)

Data I/O

• Input

It must be string, text file or xml file.

• Output

The output must be complying with HTML specifications.

Methods

I need 2 methods, you can implement this Interface.

Public interface IWikiParser

{

String Parse( string wikitext);

String Parse( string wikitext, int length);

}

For the second method, be carefully don’t split between two html tags.

Test

You can test all Wikipedia articles with this database dump:

[url removed, login to view]

I give you also smaller files for testing the parser.

Release

The API must be on production release the mid January. But I would like to see every x days a working parser to check the quality of the dump.

Compétences : .NET, XML

Voir plus : wiki parser, wikipedia parser, wikipedia dump parser, wikipedia xml parser, wikitext parser, parse wikitext, net wikipedia parser, wikimedia parser, wikipedia parsing, parse wikipedia pages, parse wikipedia dump, wikipedia parser net, logger4net, net parse wiki api, parse wiki, wiki api parse net, parser wikipedia, wiki markup parser, wikitext html, java wikipedia parser, wiki parser net, wikipedia parse, wikitext parser php, parse wikipedia, wiki html parser

Concernant l'employeur :
( 0 commentaires ) Boulogne-billancourt, France

N° du projet : #37619

8 freelance font une offre moyenne de $1006 pour ce travail

webexpertz

Dear Sir, I am interested in your project. I request you to check your PM. Thanks. Regards, Webexpertz

1200 $ USD en 30 jours
(8 Commentaires)
7.0
websoftinfo

Our bid is for really very high quality work for your Wikipedia Parserthat will be made to be upgradable in case you need some upgrades in future. We will always be available for upgrades. Our bid includes six week fre Plus

1500 $ USD en 30 jours
(7 Commentaires)
5.9
bruzli2005

I'm very experienced in parsing data. I can deliver this for you on time. Serious Bid.

800 $ USD en 14 jours
(9 Commentaires)
3.2
provatitechno

Dear Sir! We are an efficient and dedicated team of professionals. We offer our large experience and professionalism to make all qualitatively. We provide post-developing support until all Your requirements are complet Plus

1000 $ USD en 35 jours
(0 Commentaires)
0.0
lambdagroup

Please see PMB. Thanks.

300 $ USD en 15 jours
(0 Commentaires)
4.4
paker

i am a professional in web design graphics design logo design,java ,java script ,php,translation so give me the job and consider it done

1500 $ USD en 10 jours
(0 Commentaires)
0.0
niaterra

Hi, We could easily do this job for you. Please visit our site at [url removed, login to view] to get a detailed quote for your needs. It's worth looking into.

1000 $ USD en 10 jours
(0 Commentaires)
0.0
akka

Very interesting project.

750 $ USD en 30 jours
(0 Commentaires)
0.0