Very cool, couple of additional ideas #1

@lonnylundsten

This is great! I've been looking for something like this for quite a while. I have three suggestions:

  1. Output results in YOLO txt format, similar to RectLabel:
    https://rectlabel.com

  2. Allow for inference from a live video feed:
    https://developer.apple.com/documentation/vision/recognizing_objects_in_live_capture
    a. a webcam, for testing purposes
    b. a video capture card, such as a Blackmagic Design (BMD) DeckLink or UltraStudio: https://www.blackmagicdesign.com/products

  3. Allow the user to select how often inference is run on video, e.g., one frame per second instead of every frame.
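
To make suggestions 1 and 3 concrete, here is a rough Python sketch of what I have in mind. It is not based on this project's code. The function names, the pixel-based box layout `(x_min, y_min, width, height)` with a top-left origin, and the frame-stride approach are all my own assumptions for illustration. The only part taken from the YOLO convention itself is the label line: `class_id x_center y_center width height`, with the four box values normalized to the image size.

```python
def to_yolo_line(class_id, box, image_width, image_height):
    """Format one detection as a YOLO txt label line.

    Assumption: `box` is (x_min, y_min, width, height) in pixels,
    measured from the top-left corner of the image.
    """
    x_min, y_min, w, h = box
    # YOLO stores the box center and size, normalized to [0, 1].
    x_center = (x_min + w / 2) / image_width
    y_center = (y_min + h / 2) / image_height
    norm_w = w / image_width
    norm_h = h / image_height
    return f"{class_id} {x_center:.6f} {y_center:.6f} {norm_w:.6f} {norm_h:.6f}"


def frames_to_process(total_frames, video_fps, inference_fps):
    """Return the frame indices to run inference on.

    Hypothetical helper: picks every Nth frame so that roughly
    `inference_fps` frames per second of video are processed.
    """
    if video_fps <= 0 or inference_fps <= 0:
        raise ValueError("frame rates must be positive")
    # Never use a stride below 1, so a request faster than the
    # video frame rate simply falls back to every frame.
    stride = max(1, round(video_fps / inference_fps))
    return list(range(0, total_frames, stride))


if __name__ == "__main__":
    # One detection of class 0 in a 400x200 image.
    print(to_yolo_line(0, (100, 50, 200, 100), 400, 200))
    # A 3-second clip at 30 fps, sampled at 1 frame per second.
    print(frames_to_process(90, 30, 1))
```

One label line per detected object would go into a `.txt` file named after the image or frame. If the detector reports boxes in a different coordinate system (for example normalized with a bottom-left origin), the conversion would need a y-axis flip before this step.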
