
@wdio/ocr-service@1.0.0


@github-actions github-actions released this 23 May 04:26
· 503 commits to main since this release
06cadef

Major Changes

  • a924dfc: 🚀 New Feature

    Sometimes it is hard to find an element in a mobile native app, or on a desktop site with an interactable Canvas, using the default WebdriverIO selectors. In those cases, OCR (Optical Character Recognition) can be used to interact with elements on your device or screen.

    The new @wdio/ocr-service service lets you interact with elements based on their visible text. It provides multiple commands to:

    • wait
    • search
    • and interact

    with an element, all based on text.

    The following commands are added:

    • ocrGetText
    • ocrGetElementPositionByText
    • ocrWaitForTextDisplayed
    • ocrClickOnText
    • ocrSetValue
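
    As a rough illustration of how these commands might be combined, here is a minimal sketch of a text-driven login flow. The command names come from the list above; the option shapes (`{ text }`, `{ text, value }`), the return types, the `OcrCommands` interface and the `loginByVisibleText` helper are assumptions made for this example, not the service's confirmed API.

```typescript
// Minimal sketch. Command names are taken from the release notes; the
// option shapes and return types below are ASSUMPTIONS for illustration.
interface OcrCommands {
  ocrWaitForTextDisplayed(options: { text: string }): Promise<void>;
  ocrSetValue(options: { text: string; value: string }): Promise<void>;
  ocrClickOnText(options: { text: string }): Promise<void>;
  ocrGetText(): Promise<string>;
}

// Hypothetical helper: drives a login screen using only visible text.
async function loginByVisibleText(
  browser: OcrCommands,
  username: string,
): Promise<string> {
  // Wait until the label is recognized on screen before interacting.
  await browser.ocrWaitForTextDisplayed({ text: "Username" });
  // Type into the field that is located via the "Username" text.
  await browser.ocrSetValue({ text: "Username", value: username });
  // Click the button whose visible text is "Login".
  await browser.ocrClickOnText({ text: "Login" });
  // Return all text recognized on the resulting screen.
  return browser.ocrGetText();
}
```

    In a real test you would pass the WebdriverIO `browser` object, assuming the service registers these commands on it with compatible signatures. Typing the helper against a small interface also makes it possible to exercise the flow with a stub, without running a device or an OCR engine.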

    A CLI command is also provided to pre-check the text recognized in an image. Run it with npx ocr-service. For a demo, check this video:

    ocr-service-cli.mp4

Committers: 1