Skip to content

Latest commit

 

History

History
18 lines (16 loc) · 1.15 KB

README.md

File metadata and controls

18 lines (16 loc) · 1.15 KB

segment-anything.java

Meta's Segment Anything (SAM) model, ported to Java SE 17.

Eventual goal is to use this in a JOSM plugin for segmentation of aerial imagery for OpenStreetMap. Further versions from that will be finetuned on aerial imagery for better results.

Roadmap

  • Reproduce ONNX export of encoder and decoder (ref: https://github.com/visheratin/segment-anything)
  • Image loading and preprocessing
  • Encoder forward pass
  • OnnxTensor to primitive matrix type conversion
  • Decoder forward pass, basic coordinate-based prompt
  • Decoder mask post-processing (mapping back to the input image)
  • Multi-mask support (currently T/F value is baked into the ONNX model at export-time)
  • Error handling TODOs
  • Improve upon ONNX export method
  • Figure out a place to host the ONNX models (~5 GB total)
  • FastSAM? It's such a different model though architecturally, might be out of the scope of this repository.
  • GPU runtimes?