Interactive plots with bounding boxes #4917

BradyJ27 · 2023-10-26T19:17:44Z

In computer vision, specifically object detection, it is common for a pipeline to output images with bounding boxes displaying the area of interest for specific objects. When the image(s) are relatively small or packed with multiple objects, it can be hard to view these images.

It would be nice to have some sort of interactive plots where the user can toggle on/off different objects based on labels.

This may require some dvc or dvc-render changes first, but I am opening the issue here because it would be beneficial to have this implemented within the VS Code extension.

Related issues:

iterative/dvc#10198

mattseddon · 2023-10-26T22:53:41Z

To clarify:

Are there multiple images created by the pipeline or do you have an original image that you want to compare to the output? Can you give an example of what is produced by the model?

BradyJ27 · 2023-10-26T23:52:27Z

Usually it is multiple images. For example, we would print out the labels for the detections that we have made on the validation set.

The following example is from YOLOv8's default output. The DVCLive YOLO demo notebook is a good place to reproduce this.

YOLO actually outputs both the validation labels (the ground truths) and the predicted values. This could be useful for comparing and contrasting.

mattseddon · 2023-10-27T01:36:39Z

Let me clarify the question. In the example above, are there multiple images available for 000000000042.jpg? Do you have an image available for each of the combinations of labels? I.e., one for each of:

baseline
dog
motorcycle
dog + motorcycle

I am not an expert in image manipulation but AFAIK removing these labelling boxes from an image is not a trivial task.

Have you seen this done elsewhere?

BradyJ27 · 2023-10-27T01:46:15Z

So the image is just a copy of the training or validation image with bounding boxes added using some library (usually matplotlib). The bounding boxes are stored in a formatted file (xml, csv, json, or some custom format) normally like "label,x1,y1,x2,y2" (one for each label, i.e. "person,..." \n "dog,...")

So the approach would not be to manipulate the image with the boxes already on it, but rather set the image as the original image from the validation set, then place the interactive bounding boxes over the original image.

In other words, we have 2 files:

image.png
image_labels.xml

And the above images are generated by ~~combining the two files~~ reading the labels and placing them on top of a copy of the original image, thus creating the third file which is the image with bounding boxes displayed. My suggestion is that we take this step and turn it into some interactive format within dvc.
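To illustrate the two-file setup described above, here is a minimal Python sketch that reads rows in the "label,x1,y1,x2,y2" layout into box records that a viewer could later draw over the original image. The `Box` type, the `parse_label_rows` helper, and the sample coordinates are hypothetical and exist only for this example; real label files (XML, JSON, custom formats) would need their own reader.

```python
import csv
import io
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    label: str
    x1: float
    y1: float
    x2: float
    y2: float


def parse_label_rows(text: str) -> list[Box]:
    """Parse rows of the form "label,x1,y1,x2,y2" (one box per row)."""
    boxes = []
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip blank lines
            continue
        label, *coords = (cell.strip() for cell in row)
        x1, y1, x2, y2 = map(float, coords)
        boxes.append(Box(label, x1, y1, x2, y2))
    return boxes


# Hypothetical label data; the coordinates are made up for illustration.
sample = "person,10,20,110,220\ndog,150,80,260,200\n"
boxes = parse_label_rows(sample)
```

Keeping the boxes as data rather than pixels baked into a copy of the image is what makes toggling by label possible later on.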

dberenbaum · 2023-10-27T13:32:54Z

See https://docs.wandb.ai/guides/track/log/media#image-overlays for ideas on how others do this

BradyJ27 · 2023-10-30T18:19:26Z

One option for implementation would be a custom plot template, right?

Or is this something that's a little more in depth and actually a bigger feature?

mattseddon · 2023-10-31T02:56:51Z

> One option for implementation would be a custom plot template, right?

No, I do not believe that you could shoehorn the required data/image into the current DVC plots engine.

> Or is this something that's a little more in depth and actually a bigger feature?

My opinion is that this is a larger feature given the current state of plots.

BradyJ27 · 2023-10-31T16:36:00Z

> My opinion is that this is a larger feature given the current state of plots.

Ok, that makes sense. I'm sure some more discussion needs to be had regarding implementing something like this, but I would be happy to help contribute!

mattseddon · 2023-11-02T22:43:56Z

@BradyJ27 can you provide a concrete example of one of the XML files that you mentioned here? Is this the only format available?

mattseddon · 2023-11-02T23:06:13Z

Looks like we might be able to get away without using a plotting library for this. One potential way would be to use https://github.com/lovell/sharp in the clients and generate SVG bounding boxes based on the definitions (XML or other files). Loading the original image with that package gives us the option to call `image.overlayWith(svgElementBuffer, {top:0, left:0}).toBuffer()`, where `svgElementBuffer` is an SVG full of `<rect>` elements.
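As a rough illustration of the "SVG full of `<rect>` elements" idea, the following Python sketch builds such an SVG string from a list of boxes. It does not use sharp and is not the extension's implementation; the `boxes_to_svg` function, the `data-label` attribute, and the styling values are assumptions chosen for this example.

```python
from xml.sax.saxutils import quoteattr


def boxes_to_svg(width, height, boxes, stroke="red"):
    """Return an SVG document with one <rect> per (label, x1, y1, x2, y2) box."""
    rects = []
    for label, x1, y1, x2, y2 in boxes:
        rects.append(
            # quoteattr escapes the label and wraps it in quotes
            f'<rect data-label={quoteattr(label)} x="{x1}" y="{y1}" '
            f'width="{x2 - x1}" height="{y2 - y1}" '
            f'fill="none" stroke="{stroke}" stroke-width="2"/>'
        )
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" '
        f'width="{width}" height="{height}" viewBox="0 0 {width} {height}">'
        + "".join(rects)
        + "</svg>"
    )


# Hypothetical 640x480 image with a single made-up box.
svg = boxes_to_svg(640, 480, [("dog", 150, 80, 260, 200)])
```

The resulting string could be encoded to bytes and handed to an image library as an overlay, or rendered directly on top of an `<img>` in a frontend.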

BradyJ27 · 2023-11-03T00:41:28Z

> @BradyJ27 can you provide a concrete example of one of the XML files that you mentioned here? Is this the only format available?

I can share an example of the default yolo labels. This is just a text file, but the idea is the same in txt, csv, xml, json, etc. It can technically be any type of file, depending on what architecture you are using, but the above are the most common.

mattseddon · 2023-11-03T01:03:32Z

How do you determine which class the provided data relates to?

This is the contents of the file (for anyone else reading the issue):

45 0.479492 0.688771 0.955609 0.5955
45 0.736516 0.247188 0.498875 0.476417
50 0.637063 0.732938 0.494125 0.510583
45 0.339438 0.418896 0.678875 0.7815
49 0.646836 0.132552 0.118047 0.0969375
49 0.773148 0.129802 0.0907344 0.0972292
49 0.668297 0.226906 0.131281 0.146896
49 0.642859 0.0792187 0.148063 0.148062

BradyJ27 · 2023-11-03T01:58:32Z

@mattseddon the first number corresponds to a dictionary containing the classes.

It's something like:

...
44: "dog",
45: "person",
46: "car",
...

This is found in a dataset configuration file (specifically for yolo), which is data.yaml.

There is often some configuration similar to this whether it be a dictionary in a training script, a data configuration file, or sometimes the labels are hard coded in the labels file.

I will say that the above is YOLO-specific. More often, the labels file contains the actual label rather than a number that corresponds to a dictionary entry.
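A minimal Python sketch of turning such a labels file into named pixel boxes follows. It assumes the usual YOLO text layout of `class x_center y_center width height`, with all four values normalized to the image size; the thread itself does not spell out the column meaning. The `names` mapping reuses the illustrative ids from the comment above, and the 640x480 image size, `PixelBox`, and `parse_yolo_labels` are hypothetical.

```python
from typing import NamedTuple


class PixelBox(NamedTuple):
    label: str
    x1: float
    y1: float
    x2: float
    y2: float


def parse_yolo_labels(text, names, img_w, img_h):
    """Convert normalized center/size rows into labeled corner boxes in pixels."""
    boxes = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 5:
            continue  # skip blank or malformed lines
        class_id = int(parts[0])
        cx, cy, w, h = (float(p) for p in parts[1:])
        # Fall back to the raw id when the class is not in the mapping.
        label = names.get(class_id, str(class_id))
        boxes.append(PixelBox(
            label,
            (cx - w / 2) * img_w,
            (cy - h / 2) * img_h,
            (cx + w / 2) * img_w,
            (cy + h / 2) * img_h,
        ))
    return boxes


# Two rows taken from the file shown earlier in the thread.
rows = "45 0.479492 0.688771 0.955609 0.5955\n50 0.637063 0.732938 0.494125 0.510583\n"
names = {44: "dog", 45: "person", 46: "car"}  # illustrative mapping only
boxes = parse_yolo_labels(rows, names, img_w=640, img_h=480)
```

In practice the mapping would come from the dataset configuration (for YOLO, `data.yaml`) rather than being hard coded.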

BradyJ27 · 2023-12-15T23:10:51Z

I was just coming here to revisit this (I was busy for the past month) and create some issues in the data and render repos, but it looks like you guys have maybe taken another look. Should I go ahead and create some additional issues and start looking into this, or is this in progress already?

julieg18 · 2023-12-18T15:11:44Z

> I was just coming here to revisit this (I was busy for the past month) and create some issues in the data and render repos, but it looks like you guys have maybe taken another look. Should I go ahead and create some additional issues and start looking into this, or is this in progress already?

@BradyJ27, feel free to do that, thanks. I've started to look into how Studio and VSCode are going to render these images but I'm currently not looking into dvc/dvc-render side of things.

julieg18 · 2024-01-09T17:44:03Z

While researching on UX, I took into account that while both Studio and VSCode use React for the frontend, Studio has a backend based in Python and VSCode has a backend based in NodeJS. So far, I've come up with two ideas on how the clients (VSCode/Studio) would handle this.

Ideas

1. Rely on the client backend to create images with the needed bounding boxes. The frontend would render these images. (See Matt's comment)
2. Send the box coordinates to the frontend and have the frontend render the bounding boxes onto an image using SVGs or HTML canvas (I believe W&B uses Canvas to create the bounding boxes)

Details

Rely on the client backend to create images with the needed bounding boxes. The frontend would render these images.

Pros

Both NodeJS and Python have multiple image manipulation libraries that we could use for creating images with bounding boxes. Matt has already mentioned sharp for NodeJS.

Cons

Studio and VSCode have different backends, so we would have to go about creating images in different ways. This would make keeping things consistent across products more difficult.

Send the box coordinates to the frontend and have the frontend render the bounding boxes onto an image using SVGs or HTML canvas (I believe W&B uses Canvas to create the bounding boxes)

Pros

Since both Studio and VSCode use React in the frontend, it will be easier to have consistent plots in both clients. React also has some libraries for Canvas (KonvaJS, FabricJS) and SVGs that would simplify the solution compared with using only vanilla APIs.

Cons

The solution for rendering the bounding boxes will probably be a bit more complicated than using the methods that backend libraries offer.

What do we think?

dberenbaum · 2024-01-09T18:15:55Z

> It would be nice to have some sort of interactive plots where the user can toggle on/off different objects based on labels.

We will probably want some level of interactivity like this at some point, so I think it makes sense to go with option 2.

julieg18 · 2024-01-17T16:40:29Z

Started working on implementing this and, after trying HTML Canvas and SVGs, decided on using SVGs to render the plots since they are easier to create and will be more performant, especially when it comes to resizing the plots.

Design

Next, I started working on the UI design for the togglable boxes. Here is what I have so far (created in storybook):

Looking at Studio, either version could fit there as well:

Questions About Implementation

Do we want to toggle classes in all revision plots for a specific image path at once or have the toggles per single plot? I tried designs for both for now. There's also the option of toggling classes across all images in the webview at once.
What colors are we going to be using for the bounding boxes? I just chose red and blue for now but I'm assuming we want a pre-set of more muted colors?

What do we think? cc @shcheklein @iterative/vs-code

shcheklein · 2024-01-17T21:59:26Z

Looks cool, @julieg18!

> Do we want to toggle classes in all revision plots for a specific image path at once or have the toggles per single plot? I tried designs for both for now. There's also the option of toggling classes across all images in the webview at once.

My 2 cents: I think we should toggle all images per path at once, for now.
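One way to picture "toggle all images per path at once" is to key the hidden labels by image path, so that every revision of that path is filtered by the same state. The Python sketch below only illustrates that idea; the `BoxToggles` class, the revision names, and the box tuples are hypothetical, and the actual clients would hold this state in their React frontends.

```python
from collections import defaultdict


class BoxToggles:
    """Tracks hidden labels per image path; shared by all revisions of that path."""

    def __init__(self):
        self._hidden = defaultdict(set)

    def toggle(self, path, label):
        """Flip the visibility of one label for the given image path."""
        hidden = self._hidden[path]
        if label in hidden:
            hidden.remove(label)
        else:
            hidden.add(label)

    def visible(self, path, boxes):
        """Return the (label, x1, y1, x2, y2) boxes whose label is not hidden."""
        hidden = self._hidden.get(path, set())
        return [box for box in boxes if box[0] not in hidden]


# Hypothetical boxes for one image path in two revisions.
revisions = {
    "workspace": [("dog", 1, 2, 3, 4), ("motorcycle", 5, 6, 7, 8)],
    "main": [("dog", 2, 2, 4, 4)],
}
toggles = BoxToggles()
toggles.toggle("000000000042.jpg", "dog")  # hides "dog" in every revision
shown = {
    rev: toggles.visible("000000000042.jpg", boxes)
    for rev, boxes in revisions.items()
}
```

Moving to per-plot toggles later would mean keying the state by (path, revision) instead of by path alone.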

> What colors are we going to be using for the bounding boxes? I just chose red and blue for now but I'm assuming we want a pre-set of more muted colors?

Let's take a look at how YOLO generates colors / boxes and take it from there?
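Until a palette is chosen, a stable color per class can be derived from the class index. The Python sketch below spreads hues with a golden-ratio step; this is a generic technique and is not how YOLO picks its colors. The `class_color` function and the saturation and value defaults are assumptions for illustration.

```python
import colorsys


def class_color(class_id, saturation=0.65, value=0.9):
    """Map a class index to a stable '#rrggbb' color."""
    # A golden-ratio step keeps consecutive class ids far apart in hue.
    hue = (class_id * 0.61803398875) % 1.0
    r, g, b = colorsys.hsv_to_rgb(hue, saturation, value)
    return "#{:02x}{:02x}{:02x}".format(
        round(r * 255), round(g * 255), round(b * 255)
    )


# The same class id always yields the same color across plots and revisions.
palette = {class_id: class_color(class_id) for class_id in (45, 49, 50)}
```

Lowering `saturation` gives more muted colors without changing which class maps to which hue.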

mattseddon · 2024-01-17T22:52:19Z

Is the HTML produced by the CLI (i.e. plots diff) out of scope for this?

dberenbaum · 2024-01-18T16:17:52Z

> Is the HTML produced by the CLI (i.e. plots diff) out of scope for this?

I don't think CLI support is a requirement unless it's helpful to consolidate the VS Code and Studio implementation (similar to images per step).

julieg18 · 2024-01-18T16:35:16Z

> Is the HTML produced by the CLI (i.e. plots diff) out of scope for this?
>
> I don't think CLI support is a requirement unless it's helpful to consolidate the VS Code and Studio implementation

Are we referring to the DVC CLI being able to create these plots with bounding boxes?

If so, if it is doable for the CLI to create the bounding box plot SVGs, that could help with consolidation since Studio and VS Code would only need to create logic for toggling boxes. Currently, both Studio and VSCode need to create the SVG elements from the image src and bb coordinates as well as the toggle logic.

mattseddon added the triage label Oct 26, 2023

mattseddon added the story Product feature aka epic. Discussion, progress, checkboxes for implementation, etc label Oct 30, 2023

shcheklein added priority-p1 Regular product backlog and removed triage labels Dec 12, 2023

BradyJ27 mentioned this issue Dec 23, 2023

plots: interactive plots with toggling bounding box iterative/dvc#10198

Open

shcheklein changed the title ~~plots: Interactive plots with bounding boxes~~ Interactive plots with bounding boxes Jan 16, 2024

shcheklein added the A: plots Area: plots webview, side panel and everything related label Jan 16, 2024

shcheklein assigned julieg18 Jan 16, 2024

julieg18 mentioned this issue Jan 23, 2024

Add bounding boxes plot frontend components #5227

Closed

4 tasks

dberenbaum mentioned this issue Jan 23, 2024

log_image: log bounding boxes iterative/dvclive#766

Open

4 tasks

shcheklein assigned mattseddon, shcheklein and dberenbaum and unassigned julieg18 Mar 1, 2024

mattseddon removed their assignment Jun 24, 2024

0x2b3bfa0 unassigned dberenbaum Sep 14, 2024
