Optimize repo size #455

Open
aspeake opened this issue Jan 24, 2025 · 2 comments

aspeake commented Jan 24, 2025

The .git directory is really large, making a fresh clone > 10 GB. This may be due to the large savings shapes .gz files and binary files in the git history.

We may need to address:

  • Reducing or dropping the savings_shapes .gz files on the repo
    • Which will then require downloading these for each CI run
    • Which will then probably require a more targeted selection of ECMs for an integration test (separate issue)
  • Figuring out ways to clean up the .git directory and remove objects that are no longer needed
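
To decide which objects are worth removing, it may help to list the largest blobs in the history first. The following is a minimal sketch, not a tested tool for this repo: the function names `parse_batch_check` and `largest_blobs` are hypothetical, and it assumes `git` is on the PATH and that it is pointed at a full clone. It relies only on `git rev-list --objects --all` and `git cat-file --batch-check`.

```python
import subprocess


def parse_batch_check(output):
    """Turn `git cat-file --batch-check` output into blob records.

    Assumes the format '%(objecttype) %(objectname) %(objectsize) %(rest)'.
    Returns (size_in_bytes, object_id, path) tuples, largest first.
    """
    blobs = []
    for line in output.splitlines():
        # Split at most 3 times so that paths containing spaces stay intact.
        parts = line.split(" ", 3)
        if len(parts) < 3 or parts[0] != "blob":
            continue  # skip commits, trees, tags, and "missing" lines
        path = parts[3] if len(parts) == 4 else ""
        blobs.append((int(parts[2]), parts[1], path))
    return sorted(blobs, reverse=True)


def largest_blobs(repo=".", top=20):
    """Return the `top` largest blobs reachable from any ref in `repo`."""
    # Every object reachable from any ref, as "<object id> <path>" lines.
    objects = subprocess.run(
        ["git", "-C", repo, "rev-list", "--objects", "--all"],
        check=True, capture_output=True, text=True,
    ).stdout
    # Ask git for the type and size of each object; %(rest) echoes the path.
    sizes = subprocess.run(
        ["git", "-C", repo, "cat-file",
         "--batch-check=%(objecttype) %(objectname) %(objectsize) %(rest)"],
        input=objects, check=True, capture_output=True, text=True,
    ).stdout
    return parse_batch_check(sizes)[:top]
```

Calling `largest_blobs(".")` from inside a clone would return the 20 largest blobs with their paths, which should show whether the .gz files or other binaries dominate. The reported sizes are uncompressed blob sizes, so they only approximate each file's contribution to the packed .git directory.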

Note that this PR adds binary files which could make this problem worse: #440

@aspeake aspeake added this to the v1.1.0 milestone Feb 26, 2025
@jtlangevin jtlangevin modified the milestones: v1.1.0, v1.2.0 Mar 24, 2025

jmythms commented Mar 26, 2025

Updates:

Running git gc without changing history results in a ~0.5% reduction in the size of the .git directory.
Running bfg:

  • without deleting anything ($ java -jar bfg.jar --strip-blobs-bigger-than 100M scout.git): 20% reduction in the size of the .git directory.
  • after deleting savings_shapes: 90% reduction in the size of the .git directory: https://github.com/jmythms2/scout
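
For anyone wanting to reproduce these percentages, one option is to compare the output of `git count-objects -v` before and after each cleanup step. The sketch below is an assumption about how such a measurement could be done, not the method used for the numbers above; the helper names (`parse_count_objects`, `repo_size_kib`, `percent_reduction`) are hypothetical. Note that the BFG documentation recommends running `git reflog expire --expire=now --all && git gc --prune=now --aggressive` after the rewrite, so the "after" measurement is only meaningful once that has been done.

```python
import subprocess


def parse_count_objects(output):
    """Parse `git count-objects -v` output ('key: value' lines) into a dict of ints."""
    stats = {}
    for line in output.splitlines():
        key, sep, value = line.partition(":")
        value = value.strip()
        if sep and value.isdigit():
            stats[key.strip()] = int(value)
    return stats


def repo_size_kib(repo="."):
    """Size of loose plus packed objects in KiB, as reported by git."""
    out = subprocess.run(
        ["git", "-C", repo, "count-objects", "-v"],
        check=True, capture_output=True, text=True,
    ).stdout
    stats = parse_count_objects(out)
    # 'size' covers loose objects and 'size-pack' covers packfiles (both in KiB).
    return stats.get("size", 0) + stats.get("size-pack", 0)


def percent_reduction(before_kib, after_kib):
    """Percent by which the object store shrank between two measurements."""
    return 100.0 * (before_kib - after_kib) / before_kib
```

Recording `repo_size_kib("scout.git")` before running bfg and again after the follow-up gc, then passing both values to `percent_reduction`, gives a figure comparable to the reductions listed above.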


jmythms commented Mar 31, 2025
