Python .docx Parser and Python Database Builder.
Budget £10-15 GBP / heure
Job Description:
The python tool should do the following:
• Text (ONLY) Extraction (No tables, no Images)
o Text Clean-up (space removal, page removal, document header removal, document footer removal, …)
o Headers identification (level 1, 2, 3, 4, 5)
o Sentences extraction
o Linked Image/Table extraction
o Creation of a list of structures like this:
Sentence ID
Sentence Header Hierarchy (e. g. “5.7.2”) tag
Sentence content (pure text)
Referenced figures list
Referenced tables list
• Images extraction
o Creation of a list of structures like this:
Image ID
Image Header Hierarchy (e. g. “5.7.2”) tag
Image contents (e.g. .png)
Referencing sentences list
• Tables extraction
o Creation of a list of structures like this:
Table ID
Table Header Hierarchy (e. g. “5.7.2”) tag
Table contents (e.g. python table)
Referencing sentences list
• Report creation:
o Word count in the DB vs Word count in the docx
o Number of H1/H2/H3/H4/H5 extracted
o Number of sentences extracted
o Number of pictures extracted
o Number of tables extracted
32 freelances font une offre moyenne de 14 £/heure pour ce travail
Python .docx Parser and Python Database Builder. Good morning , Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced stati Plus
Greetings, I have read initial high level requirements of your project & discussed with my technical team lead. I was wondering, if you can please spare few minutes to discuss requirements thru' chat? I can then also a Plus
Hello, Warm Greetings! I am pretty sure I can produce high-quality and perfect results for your project. By using Python, I used to make AI engines, BOT, Web Scraping Tools, and so on. PHP and Python are my majors, so Plus
Hi, I have seen your project details very carefully and I am very interested. I have rich experience with your project. I am sure I can do your project as perfect and faster than others. I can start your project immed Plus
OK, I got what you want exactly. As a senior developer and AI engineer, I have done many projects using ML/DL, and used various AI models, fine-tuned it per the client's requirements. Of course, I'm very strong at Dat Plus
Hello , I'm a python developer With experience in word and pdf parsing I've done similar projects recently please check my profile Let's discuss more details in chat looking forward to working with you Best regards, Plus
Hello, I am very familiarized with the requirements of your projects. And it can be done really fast. Let's connect over chat to discuss more on this. Thanks
Hi, Dear Employer, I have read your whole Project description carefully and title "Python .docx Parser and Python Database Builder..". And understand your requirements well. Now I can say with confident that I can do i Plus
Hi I am really interested in your project I`m new to freelancer, so I`ve placed a bid in a low price But I have full experience of python programming I could make python program for doc parser as you want I could satis Plus
Hi, after reading over your post, I have come to know you want to parse .docx file by Python. I can implement your requirements by python-docx library. I am a Python expert who is well-versed in Python and many librari Plus
Hello, I have excellent experience in Python(+4 years), can you provide me with extra details about your project? Let's chat to discuss further specifications. Thanks
Hi there, I am able to build Database builder based on python as I am expert in Python, Web Development (Django, Flask). I have also knowledge about javascript, AJAX, JQuery, Bootstrap * REST API application develop Plus
I have experience with extracting data from different sources using Python. It is interesting challenge and I am happy to help you.