[RFC] Local models, remote install, and looser dependencies #14
Hello,
I was playing around with mix_eval and noticed a few things, which I addressed in a fork so that I could use it properly in my setup. First, as #11 also reports, some `__init__.py` files are missing, which prevents remote installs with `pip install git+https://github.com/Psycoy/MixEval --upgrade`. Additionally, the dependencies are very strict, which makes it harder to integrate MixEval into existing environments. I created a fork to make usage easier.
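To illustrate the `__init__.py` problem, here is a minimal sketch that lists directories containing Python files but no `__init__.py`. setuptools' `find_packages()` only picks up directories that contain an `__init__.py`, so such directories can be left out of an installed package; whether that is the exact mechanism behind the failure here is an assumption. The helper name `find_dirs_missing_init` is hypothetical and not part of MixEval.

```python
from pathlib import Path


def find_dirs_missing_init(package_root):
    """Return directories under package_root that hold .py files but no __init__.py.

    Hypothetical helper for illustration; not part of MixEval.
    """
    root = Path(package_root)
    candidates = [root] + sorted(p for p in root.rglob("*") if p.is_dir())
    missing = []
    for directory in candidates:
        # Bytecode caches are not packages, so skip them.
        if directory.name == "__pycache__":
            continue
        has_python_files = any(
            f.is_file() and f.suffix == ".py" for f in directory.iterdir()
        )
        has_init = (directory / "__init__.py").exists()
        if has_python_files and not has_init:
            missing.append(directory)
    return missing


# Example usage from a checkout of the repository (the path is an assumption):
#   for directory in find_dirs_missing_init("mix_eval"):
#       print(directory)
```

Running this against a local checkout before installing gives a quick list of the places where an `__init__.py` may need to be added.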
This is a fork of the original MixEval repository. The original repository can be found here. I created this fork to make it easier to integrate and use MixEval during the training of new models. The fork includes several improvements that make usage easier and more flexible, including:
transformers
Getting started
    # Fork with looser dependencies
    pip install git+https://github.com/philschmid/MixEval --upgrade
Note: If you want to evaluate models that are not included, take a look here. A Zephyr example is available here.
Evaluating open LLMs
Remote Hugging Face model with existing config:
Using vLLM/TGI with hosted or local API:
    MODEL_PARSER_API=$(echo $OPENAI_API_KEY) API_URL=http://localhost:8000/v1 python -m mix_eval.evaluate \
        --data_path hf://zeitgeist-ai/mixeval \
        --model_name local_api \
        --model_path alignment-handbook/zephyr-7b-dpo-full \
        --benchmark mixeval_hard \
        --version 2024-06-01 \
        --batch_size 20 \
        --output_dir results \
        --api_parallel_num 20
Takes around 5 minutes to evaluate.
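Since the fork is meant to be used during training, the command above could also be launched from a Python script. The following is a rough sketch, not part of the fork: the function names `build_mixeval_command` and `run_mixeval` are hypothetical, and the flags and environment variables are copied from the shell example rather than taken from a documented Python API.

```python
import os
import subprocess
import sys


def build_mixeval_command(model_path, output_dir="results", benchmark="mixeval_hard",
                          version="2024-06-01", batch_size=20, api_parallel_num=20):
    """Build the argument list for the local-API evaluation shown in the shell example.

    Hypothetical helper; every flag and default mirrors the command in this description.
    """
    return [
        sys.executable, "-m", "mix_eval.evaluate",
        "--data_path", "hf://zeitgeist-ai/mixeval",
        "--model_name", "local_api",
        "--model_path", model_path,
        "--benchmark", benchmark,
        "--version", version,
        "--batch_size", str(batch_size),
        "--output_dir", output_dir,
        "--api_parallel_num", str(api_parallel_num),
    ]


def run_mixeval(model_path, api_url="http://localhost:8000/v1", **kwargs):
    """Run the evaluation as a subprocess against a vLLM/TGI endpoint (assumed to be up)."""
    env = dict(os.environ)
    env["API_URL"] = api_url
    # The shell example sets MODEL_PARSER_API from OPENAI_API_KEY;
    # this raises KeyError if OPENAI_API_KEY is not set.
    env["MODEL_PARSER_API"] = os.environ["OPENAI_API_KEY"]
    command = build_mixeval_command(model_path, **kwargs)
    return subprocess.run(command, env=env, check=True)


# Example usage (requires a running endpoint and OPENAI_API_KEY):
#   run_mixeval("alignment-handbook/zephyr-7b-dpo-full")
```

Passing the arguments as a list avoids shell quoting issues, and `check=True` makes a failed evaluation raise an exception instead of passing silently.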
Local Hugging Face model from path:
Remote Hugging Face model without config and defaults
Note: We use the model name `local_chat` to avoid the need for a config file and to load the model from the Hugging Face model hub.