I guess the -7B might run on my 16GB AMD card?
Also it's entirely possible to run a model that doesn't fit in available GPU memory, it will just be slower.