Mixtral-8x7b-instruct support #3862

Open
scoute opened this issue Feb 17, 2025 · 2 comments
Labels: enhancement (New feature or request)


scoute commented Feb 17, 2025

Hi, guys!

First of all, I would like to express my gratitude: I have been using TabbyML with Qwen-2.5-Coder-32b-instruct for several months now, and it brings me joy and excitement every day!

Recently, I discovered another model, Mixtral-8x7b-instruct, which is roughly 3-4 times faster than Qwen in response speed! I can run Mixtral with KoboldCPP, but no matter how hard I tried, I could not get it to run with TabbyML.

I even tried passing Mixtral-8x7b-instruct off as Mistral-7b, but that did not work either.

Perhaps you could suggest a way to launch it through parameters in the config, or add support for this model. Although it is slightly larger than Qwen-32b, it runs significantly faster, even on CPU.

scoute added the enhancement (New feature or request) label on Feb 17, 2025
zwpaper (Member) commented Feb 18, 2025

Hi @scoute, thank you for the information. We are pleased that Tabby has been helpful.

KoboldCPP is a fork of llama.cpp, and llama.cpp itself has implemented Mixtral support; see ggml-org/llama.cpp#4406. I believe it should not be challenging to integrate, and we will investigate this further.
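
In the meantime, one possible workaround (untested, just a sketch) is to keep serving Mixtral from KoboldCPP and point Tabby's HTTP chat backend at its OpenAI-compatible endpoint via `~/.tabby/config.toml`. The model name and port below are illustrative and must match your KoboldCPP instance:

```toml
# ~/.tabby/config.toml -- untested sketch; adjust to your setup.
[model.chat.http]
kind = "openai/chat"                      # OpenAI-compatible chat backend
model_name = "mixtral-8x7b-instruct"      # illustrative; use the name your server reports
api_endpoint = "http://localhost:5001/v1" # KoboldCPP listens on port 5001 by default
api_key = ""                              # KoboldCPP requires no key by default
```

This would only cover the chat side, not code completion, but it may tide you over while we look into native support.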

Additionally, could you please share your hardware information for running Qwen 32B? We would also like to hear about your user experience and the specific scenario in which you are using Tabby.

scoute (Author) commented Feb 18, 2025

I just write code using Tabby's chat prompts.
My hardware: AMD Ryzen 9 7950X, 128 GB DDR5 RAM

KoboldCPP showed this difference:
Mixtral-8x7b-instruct (Q5, 28 GB): 213.0 ms/token = 4.69 tokens/s
Qwen-2.5-Coder-32b-instruct (Q8, 33 GB): 638.5 ms/token = 1.57 tokens/s
