
Closed
Posted
Paid on delivery
Task: Handwritten OCR and Coordinate Extraction for Chinese Language Exam Papers We need a developer to build an OCR module for Chinese language exam papers. The module should take an image of a Chinese language exam paper and return the student’s handwritten Chinese answers with exact position coordinates. Requirements: 1. Image cleanup - Correct tilted or skewed photos - Remove shadows and uneven lighting - Reduce noise, stains, wrinkles, and background marks - Improve handwriting clarity 2. Remove printed content The system should ignore printed parts of the Chinese language exam paper, including: - Question text - Reading passages - Grid lines - Answer lines - Borders Only student handwriting should be recognized. 3. Handwritten Chinese recognition - The system should support: - Chinese handwriting - Messy student handwriting - Chinese essays - Short-answer questions - Open-ended Chinese language questions 4. Coordinate output The system must return: - Each line of text with its bounding box - Each Chinese character with its own bounding box Example output: { "lines": [ { "text": "今天天气很好", "bbox": [x1, y1, x2, y2], "words": [ {"char": "今", "bbox": [x1, y1, x2, y2]}, {"char": "天", "bbox": [x1, y1, x2, y2]} ] } ] } Important: - Every recognized character must have a coordinate box - Text and coordinates must match one by one - Printed exam content should not be included - Reading order must be correct Input: JPG, PNG, or scanned Chinese language exam paper image Output: JSON with handwritten text and coordinates Testing: We will test with: - Chinese language exam papers - Phone photos - Tilted papers - Uneven lighting - Messy handwriting - Scanned papers Expected result: - Around 90%+ handwriting recognition accuracy - Coordinates should be accurate enough for marking on the original paper Deliverables: - Working API - Source code - Deployment instructions - Simple test demo - Must support private/local deployment
Project ID: 40412185
37 proposals
Remote project
Active 2 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
37 freelancers are bidding on average $270 USD for this job

Hello, I trust you're doing well. I am well experienced in machine learning algorithms, with nearly a decade of hands-on practice. My expertise lies in developing various artificial intelligence algorithms, including the one you require, using Matlab, Python, and similar tools. I hold a doctorate from Tohoku University and have a number of publications in the same subject. My portfolio, which showcases my past work, is available for your review. Your project piqued my interest, and I would be delighted to be part of it. Let's connect to discuss in detail. Warm regards. please check my portfolio link: https://www.freelancer.com/u/sajjadtaghvaeifr
$140 USD in 7 days
6.7
6.7

&& YOLO, OCR, OpenCV, Tensorflow, PyTorch, Keras, ML/DL model && Hi, How are you?. I have full skills and full experiences of this field. I have developed many Image Processing project and I am expert in these fields I can finish your project with high quality and on time. Please send me your message to discuss more about your project. I am waiting your reply now. Thanks.
$140 USD in 7 days
5.8
5.8

Hello, I will develop an OCR module for Chinese language exam papers. It will take the image of a Chinese language exam paper and return the student's handwritten Chinese answers with exact position coordinates. I delivered a similar project last week with a 5-star review and would love to show that in private. Message me and let's talk more about your project and I will share my approach today. Cheers, Fahad.
$100 USD in 2 days
5.4
5.4

With experience in OCR development and image processing, I understand the need for accurate Chinese handwriting extraction from exam papers. My past projects involved similar tasks, ensuring precise character recognition and coordinate extraction. How critical is real-time processing for your project requirements? Regards, Yogesh Kumar
$80 USD in 8 days
5.1
5.1

You need per-character bounding boxes for messy Chinese handwriting while completely ignoring printed questions and gridlines — that’s the core challenge and I get why reading order and coordinate accuracy matter. The real difficulty is reliably separating printed content from handwriting under phones/tilt/shadows; once masked, line/char segmentation and a tuned recognizer give the accuracy you want. I recently built an exam-grading OCR for a local school that returned per-character boxes for Chinese handwriting and hit ~92% accuracy on mixed phone photos and scans. Plan: preprocess with OpenCV (deskew, illumination correction, denoise), detect and mask printed elements using layout/template detection plus a small CNN classifier, segment lines/chars using connected components and stroke-aware heuristics, then run a CRNN/transformer recognizer fine-tuned on Chinese handwriting; return the JSON schema you showed. I’ll deliver a FastAPI endpoint, source, Docker deployment, and a simple test demo for local use. Do you have sample images (phone photos vs scans) and how many paper templates/variants should I expect? My bid is $140.
$140 USD in 7 days
4.8
4.8

Hi,I am a seasoned Applied ML Experience(6+ yoe)& I can build this as a local/private Chinese exam-paper OCR module with a proper document-AI pipeline,not just a generic OCR call My approach would be: >>Image cleanup: OpenCV-based deskew,perspective correction,shadow removal,denoising,contrast enhancement & background normalization,while preserving the coordinate mapping back to the original image >>Printed-content removal: if blank templates are available,I’ll use template alignment/subtraction to remove question text,grids,answer lines and borders. For unknown templates,I’ll add line/grid removal plus a handwriting-vs-printed-content segmentation model >>Handwriting detection: detect only student-written regions,group components into correct reading order,& create line-level bounding boxes. >>Chinese handwriting OCR: use strong open-source/local models such as PaddleOCR/PP-OCRv5,Surya for layout/reading order if useful,or fine-tune a Chinese handwriting recognizer on your sample exam papers >>Character coordinates: since most OCR models return line boxes,I’ll add a custom character-level alignment layer using connected components/projection methods,& later improve it with a trained character detector if annotated data is available >>API output: FastAPI endpoint returning clean JSON with line text,line bbox,each Chinese character,character bbox,confidence scores & optional debug overlays >>Deployment: Dockerized local/private deployment with CPU fallback & GPU support
$100 USD in 3 days
4.2
4.2

Hi, I'm interested in your project and ready to start immediately. I have solid, proven experience in this type of work. Project Analysis: ✓ Complexity: 7/10 ✓ Timeline: 25 days ✓ Technologies: Python, OCR, JSON What I bring: ✓ Clean, well-documented, maintainable code ✓ Proactive daily communication ✓ On-time delivery — always ✓ Free revisions until you're satisfied Happy to discuss the requirements in more detail before you decide. Best regards, Abdullah Z Freelancer: redspector
$91 USD in 7 days
3.9
3.9

Hello There!!! ★★★★ (Chinese handwritten OCR with character-level coordinate extraction system) ★★★★ I’ve carefully reviewed your project. You need a high-accuracy OCR system for Chinese exam papers that extracts only handwritten content, removes printed text, and returns structured JSON with precise bounding boxes per character and line. ⚜ Image preprocessing (deskew, denoise, lighting correction) ⚜ Printed text removal (questions, grids, borders filtering) ⚜ Chinese handwriting recognition (messy student writing) ⚜ Character-level bounding box detection ⚜ Line-level structure reconstruction & reading order ⚜ JSON output formatting with coordinates ⚜ API development + local/private deployment support I have strong experince in computer vision, OCR pipelines and deep learning systems using OpenCV, PyTorch and transformer-based OCR models. I’ve worked on similar document extraction and structured recognition tasks. For this, I would likely use a hybrid model (detector + recognizer like TrOCR/PP-OCR) plus custom segmentation for printed vs handwritten separation. Let’s discuss dataset and sample papers so we can target that 90%+ accuracy goal. Warm Regards, Farhin B.
$110 USD in 10 days
3.9
3.9

Hi, there, I'm Kris from McKinney, Texas. I've worked on similar projects before, and with over 15 years of experience as a senior full-stack and AI engineer, I'm fully qualified to delivery this properly. I’m available to start right away and would be happy to discuss the details whenever it’s convenient for you. I would love to connect in chat to discuss details. Talk soon, Kris Kramer
$30 USD in 1 day
4.5
4.5

As a seasoned full-stack web developer with over 12 years of experience, I am excited about the prospect of applying my extensive skillset to your Chinese Handwritten OCR & Coordinate System project. My proficiency in AI, especially in Machine Learning and Image Processing, is fundamental for developing the precise OCR module you need. Leveraging my problem-solving mindset, I will adeptly handle the challenging aspects of your project such as image cleanup, removing printed content while identifying and distinguishing even messy student handwriting. My track record showcases my excellence in working with Chinese characters and languages, so ensuring accurate recognition of Chinese essays and open-ended questions is second nature to me. Most notably, I am proficient in Python, TensorFlow, PyTorch, and scikit-learn - essential tools for deep learning and data preprocessing which are critical to meeting your output requirements efficiently. Additionally, my experience with AWS Cloud means I can assist in deploying the system for private/local use. Trust me to deliver an impressive API with source code, deployment instructions, a simple test demo and above all at least 90% accuracy on handwriting recognition along with precise-coordinate outputs
$140 USD in 3 days
2.5
2.5

I implement preprocessing for tilted photos, shadows, and smudges, then use region detection to isolate handwritten responses. I apply a Chinese character recognition model optimized for messy handwriting and long essays. The output will be JSON with a line-character-coordinate structure. Do handwritten responses always appear in fixed areas of the exam (like bounded boxes) or can they be scattered anywhere on the page?
$50 USD in 7 days
2.6
2.6

Hello, I have experience in developing OCR systems and can build a module that extracts handwritten Chinese answers from exam papers along with precise coordinates for each character. The system will clean up the images, remove printed content, and recognize messy handwriting with high accuracy. I will deliver a working API, source code, deployment instructions, and a simple demo, ensuring the module can be privately deployed and meets the expected 90%+ recognition accuracy. Looking forward to working with you. Best regards
$140 USD in 7 days
2.2
2.2

Hello, I have carefully reviewed your project and recently developed an OCR pipeline for handwritten documents where I handled image preprocessing, removed structured printed layouts, and returned text with precise bounding boxes for downstream analysis. The system supported noisy inputs and maintained strong alignment between text and coordinates. Your requirements involve advanced image cleanup, isolating handwritten Chinese content, and producing accurate character-level and line-level bounding boxes. I will design a pipeline combining preprocessing (deskew, denoise, contrast normalization), layout filtering to remove printed elements, and a handwriting recognition model optimized for Chinese scripts. The output will be structured JSON with strict one-to-one text and coordinate mapping, ensuring correct reading order and reliable accuracy. I am available to begin immediately and committed to delivering a robust, locally deployable API within a short timeframe. Best regards, Pedro
$150 USD in 7 days
1.8
1.8

Lets chat, a free consultation and no obligation. I understand you need a clean, professional, and user-friendly solution for your "Chinese Handwritten OCR & Coordinate System Developer" project. My skills in PHP, Java, JavaScript are a perfect fit for this project. While I am new to freelancer.com, my extensive experience delivers integrated, automated solutions. Regards, Jason McLachlan
$188 USD in 3 days
1.4
1.4

As a seasoned full-stack developer with a strong proficiency in Python – the programming language often utilized in OCR projects – I can confidently take on the task of building an OCR module for Chinese language exam papers. My experience extends to mastering messy handwriting recognition and developing Chinese character extraction algorithms, hallmarks of what this project demands. Employing my background working with REST APIs, microservices, and databases (SQL/NoSQL), I will design a powerful system that cleans up images, disregards irrelevant information, reduces noise and enhances handwriting clarity. To ensure the authenticity of your unique requirements, the handwritten text will be accurately coordinated with line-by-line identification and every recognized character allocated its coordinate box as you stated. With my automation & AI skills, we can guarantee not only superior recognition accuracy above 90% but also speed and precision as well. There's no denying that this project demands high flexibility, scalability, and efficiency which are core strengths in my programming repertoire (Python, JavaScript) alongside being solution-oriented, results-driven plus dedicated to clear communication. I'm thrilled at the prospect of us bringing this timely project into fruition together!
$200 USD in 7 days
0.0
0.0

Hi, I can fix your Chinese Handwritten OCR & Coordinate Extraction module. I've solved this exact problem many times. Here is what I will do: 1) Clean and normalize exam images: deskew, denoise, shadow removal, and enhance handwriting clarity. 2) Separate printed content from student handwriting, then run Chinese handwriting OCR with character-level bounding boxes. 3) Build the API to return ordered JSON with line and character coordinates, ready for local/private deployment. 10 days free support after delivery Milestone-based payment Reply "YES" and . Best regards, syed ribal
$250 USD in 5 days
0.0
0.0

Hello I have read your project carefully and understand what’s the problem described in Chinese Handwritten OCR & Coordinate System Developer based on Task: Handwritten OCR and Coordinate Extraction for Chinese Language Exam Papers We need a developer to build an OCR module for Chinese language exam papers. From what I understand, you are looking for a solution involving Computer Vision, Deep Learning, API Development, OCR, OpenCV, Image Processing, Python and JSON, and I can help you with that. My approach to solving this would be: • First, analyze the requirements in Chinese Handwritten OCR & Coordinate System Developer in detail to avoid missing anything • Design a clear and scalable solution based on Computer Vision, Deep Learning, API Development, OCR, OpenCV, Image Processing, Python and JSON • Implement step by step and ensure everything is tested and stable before delivery This approach ensures the final result directly solves the issue described in your project without unnecessary revisions. If needed, I can also suggest improvements to make the system more efficient and scalable. Looking forward to your response and happy to clarify anything about Task: Handwritten OCR and Coordinate Extraction for Chinese Language Exam Papers We need a developer to build an OCR module for Chinese language exam papers. Best regards
$30 USD in 5 days
0.0
0.0

Hello, I’d love to help you build a robust Chinese handwritten OCR and coordinate extraction system for exam papers. I’ve worked extensively with Python, OpenCV, deep learning OCR models, and noisy real‑world image preprocessing, which fits well with your need for shadow removal, tilt correction, and printed‑text filtering. I can create a pipeline that isolates only the student’s handwriting, then outputs precise JSON bounding boxes for each line and each Chinese character. I’ll incorporate character‑level detection that handles messy handwriting and irregular layouts while ensuring the reading order stays accurate. Before moving forward, I’d like to clarify a few details to ensure the model fits your workflow.
$200 USD in 2 days
0.0
0.0

Hi there! You are building a Chinese handwritten OCR system and the real challenge is separating messy student handwriting from dense printed exam layouts while still preserving accurate per-character bounding boxes. I recently worked on a document AI pipeline where I combined OpenCV preprocessing with deep-learning OCR models to extract structured text from noisy scanned forms, including handling skew correction, layout filtering, and coordinate-level text mapping for downstream grading systems. I will build a Python-based OCR pipeline that cleans exam images, removes printed exam content, extracts handwritten Chinese text using a deep learning model, and returns precise line-level and character-level bounding boxes in structured JSON format via an API. Check our work: https://www.freelancer.com/u/ayesha86664 Which deployment environment do you prefer for the API, local Docker-based setup for offline use, or a cloud-hosted endpoint for scalable processing? I am ready to start — just say the word. Best Regards, Ayesha
$125 USD in 5 days
0.0
0.0

I can design and implement a robust Chinese handwritten OCR and coordinate extraction system tailored specifically for exam papers. This includes accurately recognizing handwritten Chinese characters and mapping each answer region to precise coordinates for downstream processing and grading. I have worked on OCR pipelines using engines like PaddleOCR and Tesseract, combined with custom deep learning models for East Asian scripts. My experience includes page layout analysis, detecting answer boxes, and exporting structured JSON/CSV data with bounding boxes and confidence scores. My approach would be to first define a clear page template for your exam papers, then train or fine-tune OCR models on your handwriting samples, and finally build a coordinate extraction module that integrates cleanly with your existing workflow. I would love to chat more about your project! Regards
$140 USD in 7 days
0.0
0.0

Malabon, Philippines
Payment method verified
Member since Feb 6, 2025
$8-15 USD / hour
$8-15 USD / hour
$10-30 USD
₹1500-12500 INR
$250-750 USD
$30-250 USD
€250-750 EUR
$30-250 USD
£10-15 GBP / hour
$30-250 USD
$250-750 USD
₹750-1250 INR / hour
₹100-400 INR / hour
₹400-750 INR / hour
$5000-10000 USD
₹12500-37500 INR
₹600-1500 INR
€18-36 EUR / hour
$250-750 AUD
₹12500-37500 INR
₹600-1500 INR
$250-750 USD