Skip to content

Why the compiled of the new llama-gguf-split is way bigger than the old gguf-split #9536

Answered by ggerganov
andibuwono asked this question in Q&A
Discussion options

You must be logged in to vote

I just tested on my Linux box building static binaries with make without CUDA or other backends and the resulting static binaries are much larger than the ones that I get on MacOS:

$ make clean
$ make -j
$ ls -lh

-rwxrwxr-x  1 ggerganov ggerganov  45M сеп 19 17:47 llama-baby-llama
-rwxrwxr-x  1 ggerganov ggerganov  45M сеп 19 17:47 llama-batched
-rwxrwxr-x  1 ggerganov ggerganov  45M сеп 19 17:47 llama-batched-bench
-rwxrwxr-x  1 ggerganov ggerganov  48M сеп 19 17:47 llama-bench
-rwxrwxr-x  1 ggerganov ggerganov 4,1M сеп 19 17:47 llama-benchmark-matmult
-rwxrwxr-x  1 ggerganov ggerganov  46M сеп 19 17:47 llama-cli
-rwxrwxr-x  1 ggerganov ggerganov  45M сеп 19 17:47 llama-convert-llama2c-…

Replies: 1 comment 6 replies

Comment options

You must be logged in to vote
6 replies
@slaren
Comment options

@andibuwono
Comment options

@slaren
Comment options

@ggerganov
Comment options

Answer selected by andibuwono
@slaren
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants