
Closed
Posted
We run a production-grade order automation platform built on Python, Playwright, FastAPI, Celery, and Docker. The system automates order submission across 16+ third-party vendor portals via browser automation workers. The challenge is not building — it's keeping things stable. Vendor portals change their UI overnight, sessions expire, selectors drift, and iframes behave unpredictably. We need someone who has dealt with exactly this in production before. What you'll work on: — Stabilising existing Playwright automation scripts for vendor portals — Improving selector strategy (role-based locators over fragile CSS) — Adding retry logic, session recovery, and structured failure logging — Debugging Celery worker queues and async task failures — Maintaining Docker-based worker containers To filter bids, answer this in your proposal: "When a vendor portal updates its DOM overnight and your Playwright script breaks silently, what is your first debugging step and how do you prevent it from happening again?" Do not send a generic proposal. If you cannot answer the question above specifically, please do not bid. Required skills : Playwright (Python) Python async / asyncio Celery workers Docker / Docker Compose Session & retry handling iframe-heavy portals Nice to have: FastAPI MongoDB / Redis Celery Beat Linux server ops Do not apply if: Your only automation tool is Selenium or Scrapy You've never worked on a production system Your stack is PHP / Laravel / WordPress
Project ID: 40444723
30 proposals
Remote project
Active 2 hours ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
30 freelancers are bidding on average ₹678 INR/hour for this job

As a seasoned Python developer with a specific focus on scripting and automation, I have had ample experience dealing with the very challenges your project description highlights. Whether it's identifying and rectifying issues arising from changes in UIs, managing session expirations, or addressing unpredictable iframe behaviors, I am well-versed in navigating through these domains. My extensive familiarity using Playwright leverages Python and asyncio to provide stable automation solutions - a perfect match for your project's platform built on Playwright. I prioritize role-based locators over fragile CSS selectors to improve stability and will incorporate reliable session recovery mechanisms alongside structured failure logging to mitigate downtime. Additionally, I have significant expertise working with Docker and Docker Compose ensuring reliable maintenance of containers. To address your question about handling vendor portal updates overnight comprehensively, my debugging step is an immediate comparison of the previous DOM with the current DOM that led to script breakage – this enables prompt identification of the specific problem areas. Once resolved, I implement robust monitoring tools to catch any future discrepancies proactively. My proven ability to leverage automation to streamline processes in production environments makes me the ideal candidate for ensuring the stability of your order automation platform. Let's improve your platform's efficacy together!
₹575 INR in 40 days
4.9
4.9

Having been in the data analytics and science field for over 8 years, I have faced my fair share of unexpected challenges – just like what you're experiencing with vendor portal stability. My specialization in Python will prove invaluable, but beyond that, my proficiency in Docker will enhance the maintenance of your existing Playwright automation scripts. I understand firsthand how processes and selector strategies must be meticulously tailored to constantly changing environments. To address the specific question in your proposal: If a vendor portal changes its DOM overnight, and the Playwright script breaks silently, my approach would be a rigorous debugging process. I'd start by identifying if the issue is due to the change in selectors, session failure, or any other root cause. Then, I would trace the root cause and fix it by adjusting the script's elements accordingly. For prevention, I'll implement periodic script audits aligned with vendor upgrades to anticipate potential changes proactively. Throughout my career, I've exhibited consistent commitment towards optimizing operations and enhancing user experiences. My understanding of **end-to-end solutions** and Python-driven **data storytelling** can seamlessly translate into better governance across 16+ third-party vendor portals through your **automation platform**. Let's collaborate to ensure stability and efficiency in every step of order submission process
₹575 INR in 40 days
3.1
3.1

I can help stabilise your Playwright automation infrastructure by improving locator strategy, session persistence, retry recovery, and Celery worker reliability for large-scale vendor portal automation running in Docker. When a vendor portal updates its DOM overnight, and the Playwright script breaks silently, my first debugging step is reviewing the latest failure logs, screenshots, HAR traces, and Playwright trace viewer output to identify exactly which locator or iframe context failed. I then compare the DOM diff against the previous stable version and replace fragile selectors with resilient role/text/data-testid based locators combined with strict iframe targeting and fallback handling. To prevent it from happening again, I implement layered selector strategies, health-check monitoring, automatic screenshot capture on failure, retry policies with session recovery, and alerting for silent failures. I also add validation checkpoints before critical actions so the worker fails loudly instead of continuing with a corrupted state. I have experience working with Python async workflows, Playwright automation, Celery queues, Docker worker containers, and production-grade browser automation systems where stability and recovery matter more than just initial script development. Available to start immediately and can work directly on stabilising your existing infrastructure.
₹575 INR in 40 days
3.0
3.0

Hi, I can see you're dealing with reliability issues in your vendor portal automation—production Playwright workflows with Celery dependencies tend to fail silently at scale, especially under async load. I'll start by instrumenting your Playwright runners with structured logging to identify where things break. Specifically, I'll add browser context pools with retry logic using Playwright's built-in expect() waits rather than hardcoded sleeps, which is where most instability comes from. For Celery, I'll review your task timeout configs and queue structure to ensure jobs don't orphan when the browser hangs. First 24 hours: I'll run diagnostics on your current Playwright sessions—capture failure patterns, log browser crashes, and identify which vendor endpoints are flakiest. You'll get a root cause breakdown that pinpoints where fixes matter most. Given the $400 scope, this diagnostic phase also reveals whether this is a quick config adjustment or needs deeper refactoring. Best regards, Val --- **Notes on the proposal:** - **Description gap:** The brief cuts off at "Celery, an"—you may want to ask the client for the full requirements before submitting, to refine technical details. - **Budget realism:** $400 is tight for production debugging work. The proposal positions the diagnostics phase as value-adding clarity that helps clarify scope expansion needs. - **Competition angle:** Low budget attracts many bidders; this pitch stands out by immediately demonstrating deep production-system knowledge rather than generic "I can help" posturing.
₹400 INR in 7 days
1.8
1.8

First, I pull the Playwright Trace Viewer artifact from the failed Celery task to see exactly what the headless bot saw. To prevent recurrences, I replace fragile CSS with getByRole locators and enforce strict state assertions before interaction, ensuring the script fails loud and alerts the queue, rather than failing silently. I am Kalana, a Systems Integrator. I engineer resilient "Supply Line" extraction pipelines using Python, Playwright, FastAPI, and Docker. I can stabilize your 16 portals to handle UI drift automatically. Best, Kalana
₹2,500 INR in 30 days
1.6
1.6

Hi, I have strong experience working with Playwright for browser automation, website testing, and workflow automation across multiple platforms. I have automated complex user journeys, including login flows, form submissions, data extraction, multi-step workflows, file uploads/downloads, and interactions with dynamic web elements. I am also comfortable handling authentication, cookies, sessions, selectors, retries, timeouts, and anti-flaky test patterns. I can work with Playwright using JavaScript/TypeScript or Python, depending on your existing setup. I can also help structure the automation cleanly, add error handling, logging, and make the scripts reliable for repeated execution. You can check my profile for more information about my experience. Please let me know the exact requirements and how you would like to proceed. Thanks, Pranshu
₹575 INR in 40 days
1.0
1.0

Being familiar with the combination of Python, Playwright, and FastAPI makes me stand out as a great fit for your project. Over the years, I've honed my skills in navigating intricate automation scenarios just like yours. For instance, my profound understanding of locating elements using robust strategies such as role-based locators rather than relying on fragile CSS selectors ensures stability even when vendor portals update their DOM overnight, minimizing silent script breaks. Additionally, I'll bring to the table invaluable experience in debugging Celery worker queues and async task failures and ensuring seamless session recovery. My extensive use of Docker and Docker Compose would also come into play in maintaining your Docker-based worker containers. It involves thorough monitoring, troubleshooting and implementing targeted updates that maximize automation efficiency. To me, project warranty is vital. In line with this, I have a structured approach towards failure logging which means I don't just address critical points in real-time but leave traceable footprints for a future reference. With my comprehensive knowledge of Python async / asyncio, Celery Beat and Linux server ops among other combined techstack you require offering your project stability won't be an issue. Let's get started.
₹575 INR in 40 days
0.0
0.0

Hi, I’ve reviewed your setup carefully, and this is clearly a stability problem, not a “build from scratch” problem. The biggest issue in systems like this is usually silent selector drift combined with weak observability inside async workers. When a vendor portal changes its DOM overnight and a Playwright script fails silently, my first step is checking structured logs + Playwright traces/screenshots/video for the exact failing action, then comparing the current DOM against the last stable state. After identifying the broken selector or iframe/session issue, I replace fragile selectors with role/text/data-testid-based locators and add fallback strategies plus explicit failure alerts so silent failures become visible immediately. What I’ll help with: ◆ Stabilize Playwright vendor automations ◆ Improve resilient locator strategy ◆ Add retry/session recovery handling ◆ Debug Celery queue + async worker failures ◆ Improve logging/monitoring inside Docker workers I’m comfortable working in production-grade automation systems where portals constantly change and reliability matters more than quick hacks. The focus will be making failures observable, recoverable, and easier to maintain long term. Rahul
₹575 INR in 40 days
0.0
0.0

First debugging step when a Playwright script breaks silently overnight: check structured logs for the last successful selector hit, then run the script in headed mode with slowMo against the live portal to visually confirm what changed — nine times out of ten it's a class rename or a new iframe wrapper. To prevent recurrence I replace fragile CSS selectors with role-based and data-testid locators, add a lightweight DOM snapshot diff job (Celery Beat, nightly) that alerts before the main workers even run, and wrap every critical interaction in explicit expect assertions so failures are loud, not silent. I've done exactly this — production Playwright pipelines with session recovery, iframe handling, and Celery worker queues — building browser automation systems that survive real-world portal drift. I'm comfortable with the full stack here: async Playwright, Celery + Redis, Docker Compose, FastAPI, and structured failure logging to MongoDB. Ready to dig into your codebase from day one — can you share the rough breakdown of which portals are currently most unstable?
₹450 INR in 40 days
0.0
0.0

As an experienced Software Engineer with over 8+ years in the industry, I am confident that I have cultivated the skills and knowledge necessary to not only resolve but also prevent issues with your vendor portal automation. Having implemented similar endeavors, I understand the fragility and volatility these systems face daily. Consequently, I developed a robust set of methodologies to deftly handle and sidestep these changes. My approach is to have an overnight monitoring system in place that performs an automated nightly DOM diff analysis, comparing the previous version with the most recent in order to detect changes. Armed with these insights from my nightly diff examination, I employ agile techniques like Continuous Integration (CI) coupled with comprehensive test coverage to identify any potential issue that arose, even if the specific change was obscure or non-addressed compartmentalized endpoints. This enables us to tackle such inconsistencies proactively and resiliently. My specialization with Docker and Docker Compose ensures a streamlined integration of various components in your infrastructure - keeping worker containers optimized and up-to-date. Dexterity with Python asynchronous concepts such as asyncio enables me to handle task-based implementations efficiently. Furthermore, my extensive pantheon of backend skills including FastAPI and Celery makes me an ideal candidate for your project requirements.
₹575 INR in 40 days
0.0
0.0

Hi, I’ve worked on production Playwright automation systems using Python, Celery, Docker, and async workflows. When a vendor portal changes its DOM and scripts fail, my first step is enabling Playwright trace/screenshot logging to compare the updated DOM and identify selector/session issues quickly.
₹655 INR in 40 days
0.0
0.0

I'm a Python automation and QA engineer with 5+ years of experience. I specialize in: • Playwright/Selenium test automation • pytest frameworks and CI/CD integration • API testing and automation scripting • Clean, well-documented code delivery I've built similar QA/automation solutions and understand what quality testing requires. I can start immediately, deliver on time, and maintain clear communication throughout. Let me know if you'd like to discuss the specific requirements!
₹450 INR in 7 days
0.0
0.0

First debugging step when a Playwright script breaks silently overnight: Check the structured failure log for the last successful selector snapshot. If we're not capturing those, that's the first fix. Then I replay the session in headed mode against a saved HAR or screenshot trace to confirm DOM change vs. session expiry vs. iframe re-nest. Prevention: I implement a selector health-check layer that runs on a cron before the main workers. Role-based and aria-label locators instead of CSS paths. If a portal's key element score drops below threshold, Celery fires an alert before any order submission runs that day. Built this discipline across Python automation systems at British Petroleum — where a silent failure in BioVerse's vendor data pipelines had direct financial impact. Same stakes as your 16-portal order system. What I'd stabilise in Phase 1 (Days 1–7): — Audit all 16 portal scripts; classify selectors by fragility risk — Swap brittle CSS selectors → role/aria locators — Add per-portal retry + exponential backoff with failure classification (DOM change / session expired / iframe timeout) Phase 2 (Days 8–14): — Structured JSON failure logs per portal per run — Celery Beat health-check worker with pre-run selector validation — Docker Compose stability pass (restart policies, worker isolation) — Brijesh, Sonraj Labs
₹575 INR in 40 days
0.0
0.0

Answering your screening question directly: When a Playwright script breaks silently after an overnight DOM change, my first step is to check stored screenshots and structured logs from the last successful run, then replay the failed task in headed mode (PWDEBUG=1) to see exactly where execution diverges. I compare old vs. new DOM using Playwright's built-in inspector and identify whether it's selector drift, a new overlay/iframe, or session expiry. To prevent recurrence: - Role-based locators (getByRole, getByText, getByLabel) instead of fragile CSS/XPath - Self-healing fallback chain: primary locator, semantic fallback, then logged alert - Screenshots at each critical step stored for diff comparison - Scheduled smoke tests that alert before production breaks My background: I've built and maintained Python + Playwright automation pipelines handling multi-portal workflows, iframe-heavy forms, session expiry loops, and Celery task failure recovery. I use Docker Compose for containerised workers and FastAPI for status webhooks. Available 20 hrs/week, can start immediately. Happy to share a code sample from a similar multi-portal project.
₹650 INR in 20 days
0.0
0.0

When a vendor portal updates its DOM overnight and my script breaks silently, my first debugging step is to check the Playwright trace Viewer and network logs to pinpoint exactly which selector failed and whether the DOM structure changed or an element was moved into an iframe/shadow DOM. I prevent recurrence by using robust selector strategies (data-testid attributes, text-based selectors as fallbacks, and multi-step locator chains) combined with visual regression snapshots that alert on unexpected changes. I also schedule a daily health-check run that sends a report so you know immediately if something broke. I can build this automation framework for you using Python+Playwright with async support, Celery workers for queue management, and Docker Compose for easy deployment. Full delivery in 7 days. My rate is ₹575/hr with a 20-hour weekly limit.
₹1,000 INR in 20 days
0.0
0.0

Hi, I am a Python developer and I can help you with your Playwright automation project efficiently. I have knowledge in Python scripting, web automation, debugging, and handling browser tasks using Playwright. I can write clean and reliable code, complete the work on time, and provide regular updates during the project. I am eager to learn and give my best to deliver quality results. Looking forward to working with you.
₹575 INR in 40 days
0.0
0.0

Hi I can help stabilize your Python Playwright vendor portal workers. My first debugging step when a vendor changes the DOM is to reproduce the failing path with tracing enabled capture screenshots video and console logs then compare the old locator assumptions against the new accessible roles network waits and iframe boundaries. After that I would patch the selector with role based or data anchored locators add a targeted retry and add a regression check so the same silent failure is caught early. I can start with a small portal fix first.
₹400 INR in 10 days
0.0
0.0

Hi, I can help stabilize this Python/Playwright/FastAPI/Celery automation stack. For the DOM-change failure case: my first step is to pull the failing run artifact (Playwright trace, video/screenshot, worker logs), compare it with the last successful run, and identify whether the break is auth/session, iframe context, selector drift, or async timing. I would then patch the locator at a role/text/data-anchored level instead of relying on fragile CSS chains, add an explicit failure reason, and add a regression/page-object check so the worker fails loudly before orders are silently skipped. What I would do in the first pass: - review worker logs, Celery retries, Docker env, and Playwright trace settings - add structured logging around portal, account, step, locator, iframe, retry count, and terminal failure - replace brittle selectors with page objects and resilient locators - add session-expiry detection and safe re-auth handoff where allowed - add retry/backoff only for recoverable errors, not blind loops - document each portal-specific assumption so future UI changes are faster to patch I will not propose captcha or anti-bot bypass. If these are legitimate vendor accounts and you can share code access plus one failing portal example, I can start with a stability audit and the first fixes quickly.
₹750 INR in 15 days
0.0
0.0

Hello, I’m interested in stabilizing your Python + Playwright automation platform. I understand this is not a normal CRUD/scraping task. The real challenge is keeping vendor portal automation reliable when DOM changes, iframes shift, sessions expire, selectors drift, and Celery workers fail silently. Answer to your question: If a vendor portal updates its DOM overnight and the Playwright script breaks silently, my first step is to replay the failed task with Playwright trace viewer, screenshots, video, console/network logs, HTML snapshot, accessibility tree snapshot, and iframe tree captured. I would compare the failed run with the last successful run to identify whether the issue is selector drift, iframe context change, session expiry, hidden modal, timing issue, or vendor UI change. To prevent recurrence, I would add a self-healing reliability layer with role-based locators, fallback selector chains, explicit assertions before critical actions, failure screenshots, DOM snapshots, structured logs, retry/session recovery, and error categories such as AUTH_EXPIRED, SELECTOR_DRIFT, IFRAME_CHANGED, MODAL_BLOCKED, and TIMEOUT. I can also help with Celery task debugging, async workflow failures, Docker worker issues, and stabilizing vendor-specific automation modules. I would start with one unstable vendor flow, reproduce the issue, fix the root cause, and add reusable reliability improvements.
₹600 INR in 40 days
0.0
0.0

Hi! I have experience working with Python automation, browser automation workflows, Selenium-style debugging, and handling unstable web flows. If a Playwright script suddenly breaks after a DOM update, my first step is: 1. Reproduce the failure locally with detailed logging and screenshots enabled 2. Inspect whether selectors, iframe hierarchy, dynamic rendering, or timing conditions changed 3. Use Playwright trace viewer/devtools to identify where the execution diverges 4. Check for stale selectors, shadow DOM changes, async loading issues, or session/token expiration To prevent it from happening again, I usually: • Replace fragile selectors with stable locators • Add explicit wait conditions and fallback logic • Improve retry/session recovery handling • Add structured logging + screenshots on failure • Isolate vendor-specific logic into maintainable modules • Monitor recurring failures for portal-specific changes I’m comfortable working with Python automation systems and improving reliability in unstable browser environments. Best regards, Sebastián
₹600 INR in 40 days
0.0
0.0

India
Member since May 15, 2026
₹100-400 INR / hour
₹37500-75000 INR
₹12500-37500 INR
$250-750 USD
$750-1500 USD
₹400-750 INR / hour
₹37500-75000 INR
$15-25 USD / hour
₹1500-12500 INR
₹12500-37500 INR
₹37500-75000 INR
$30-250 USD
₹1500-12500 INR
$2-8 USD / hour
£250-750 GBP
₹12500-37500 INR
₹12500-37500 INR
$500-1000 USD / hour
₹750-1250 INR / hour
₹12500-37500 INR