Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OSSD source is missing defillama, crates, and pypi columns #3209

Open
ccerv1 opened this issue Mar 10, 2025 · 1 comment
Open

OSSD source is missing defillama, crates, and pypi columns #3209

ccerv1 opened this issue Mar 10, 2025 · 1 comment
Labels
c:data Gathering data (e.g. indexing)

Comments

@ccerv1
Copy link
Member

ccerv1 commented Mar 10, 2025

Which area(s) are affected? (leave empty if unsure)

No response

To Reproduce

View the expected columns from oss-directory:

from typing import Dict, List
from ossdirectory import fetch_data
from ossdirectory.fetch import OSSDirectory
data: OSSDirectory = fetch_data()
projects: List[dict] = data.projects
cols = {k for p in projects for k in p}

We see:

blockchain, crates, defillama, description, display_name, github, go, name, npm, open_collective, pypi, social, version, websites

Now view the columns in the dagster asset here: https://admin-dagster.opensource.observer/assets/ossd/projects

Image

Describe the Bug

There are several artifact types that are included in the latest OSSD schema and that show when querying. These include:

  • defillama
  • crates
  • pypi

As result, anything downstream from OSSD (including artifacts_by_projects models) does not capture all the artifacts

Expected Behavior

These artifacts should be getting pulled in via ossd.py and available in the source dataset

@ccerv1 ccerv1 added the c:data Gathering data (e.g. indexing) label Mar 10, 2025
Copy link

linear bot commented Mar 10, 2025

@github-project-automation github-project-automation bot moved this to Backlog in OSO Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
c:data Gathering data (e.g. indexing)
Projects
Status: Backlog
Development

No branches or pull requests

1 participant