Align README.md with docs/ and minor fixes / improvements (#214)

* Remove `license_header.txt` in favour of `LICENSE` * Remove `.cache/**/*` * Update `.gitignore` from `Python.gitignore` See https://github.com/github/gitignore/blob/main/Python.gitignore * Update `description` in `index.md` to match `README`'s * Update `README.md`
argilla-io · Jan 5, 2024 · 3160a09 · 3160a09
1 parent 854e33e
commit 3160a09
Show file tree

Hide file tree

Showing 56 changed files with 103 additions and 99 deletions.
diff --git a/.cache/plugin/social/0996050c15d38684e49df59e0a486d5a.png b/.cache/plugin/social/0996050c15d38684e49df59e0a486d5a.png
diff --git a/.cache/plugin/social/0c444ce31177c04da404ef4007686864.png b/.cache/plugin/social/0c444ce31177c04da404ef4007686864.png
diff --git a/.cache/plugin/social/0f8298b45055d1e7e6d9024fc6eb8f41.png b/.cache/plugin/social/0f8298b45055d1e7e6d9024fc6eb8f41.png
diff --git a/.cache/plugin/social/0fd21192a2bf8f4699a3e81b794b49df.png b/.cache/plugin/social/0fd21192a2bf8f4699a3e81b794b49df.png
diff --git a/.cache/plugin/social/190bb622d06dfcb4edbad730e175b75b.png b/.cache/plugin/social/190bb622d06dfcb4edbad730e175b75b.png
diff --git a/.cache/plugin/social/1b86809fd6477ffd4fa4c2de8a44f737.png b/.cache/plugin/social/1b86809fd6477ffd4fa4c2de8a44f737.png
diff --git a/.cache/plugin/social/25c216e55c7579c65fc5160aece49b7f.png b/.cache/plugin/social/25c216e55c7579c65fc5160aece49b7f.png
diff --git a/.cache/plugin/social/2f0d20c696c00a973de2e19085cdcd22.png b/.cache/plugin/social/2f0d20c696c00a973de2e19085cdcd22.png
diff --git a/.cache/plugin/social/347988768de9ccf31fe986247021dc11.png b/.cache/plugin/social/347988768de9ccf31fe986247021dc11.png
diff --git a/.cache/plugin/social/39088edb59ad4bb87e288a9c90bdf63a.png b/.cache/plugin/social/39088edb59ad4bb87e288a9c90bdf63a.png
diff --git a/.cache/plugin/social/5470f3754486a9c84ad9951105cadf01.png b/.cache/plugin/social/5470f3754486a9c84ad9951105cadf01.png
diff --git a/.cache/plugin/social/56ee2920338f407aabbde0f70353c86d.png b/.cache/plugin/social/56ee2920338f407aabbde0f70353c86d.png
diff --git a/.cache/plugin/social/60cd35f4b6c9eac8a258a1eb2dc189b0.png b/.cache/plugin/social/60cd35f4b6c9eac8a258a1eb2dc189b0.png
diff --git a/.cache/plugin/social/624f403776bfaef5cbfb1fa7cb8fc7c5.png b/.cache/plugin/social/624f403776bfaef5cbfb1fa7cb8fc7c5.png
diff --git a/.cache/plugin/social/6c070f1fe38d98fb31b937389328edc4.png b/.cache/plugin/social/6c070f1fe38d98fb31b937389328edc4.png
diff --git a/.cache/plugin/social/70527f357a5fda0c8fd50b62a2bc10df.png b/.cache/plugin/social/70527f357a5fda0c8fd50b62a2bc10df.png
diff --git a/.cache/plugin/social/73072d4d982e6125b065108aa4596311.png b/.cache/plugin/social/73072d4d982e6125b065108aa4596311.png
diff --git a/.cache/plugin/social/790235402351a03da7370dc64c692b8b.png b/.cache/plugin/social/790235402351a03da7370dc64c692b8b.png
diff --git a/.cache/plugin/social/7904977e796fecbb4f1e3f2d9593c212.png b/.cache/plugin/social/7904977e796fecbb4f1e3f2d9593c212.png
diff --git a/.cache/plugin/social/7939f5a6a902226ee28f0c720180e0b3.png b/.cache/plugin/social/7939f5a6a902226ee28f0c720180e0b3.png
diff --git a/.cache/plugin/social/7f6cda5067d4a056913ef35fa2c2bc2f.png b/.cache/plugin/social/7f6cda5067d4a056913ef35fa2c2bc2f.png
diff --git a/.cache/plugin/social/80fae5521d7afd57a08bc56b611f6a00.png b/.cache/plugin/social/80fae5521d7afd57a08bc56b611f6a00.png
diff --git a/.cache/plugin/social/88281aae3b3fbe796ad78ca3e81290a4.png b/.cache/plugin/social/88281aae3b3fbe796ad78ca3e81290a4.png
diff --git a/.cache/plugin/social/8e2cbfbd4ac745a8e931065f467386cc.png b/.cache/plugin/social/8e2cbfbd4ac745a8e931065f467386cc.png
diff --git a/.cache/plugin/social/9186c7732b2c035cf64d16425e7ef6a8.png b/.cache/plugin/social/9186c7732b2c035cf64d16425e7ef6a8.png
diff --git a/.cache/plugin/social/983cc182dee00ddcdd201690d853dd4b.png b/.cache/plugin/social/983cc182dee00ddcdd201690d853dd4b.png
diff --git a/.cache/plugin/social/Roboto-Black.ttf b/.cache/plugin/social/Roboto-Black.ttf
diff --git a/.cache/plugin/social/Roboto-BlackItalic.ttf b/.cache/plugin/social/Roboto-BlackItalic.ttf
diff --git a/.cache/plugin/social/Roboto-Bold.ttf b/.cache/plugin/social/Roboto-Bold.ttf
diff --git a/.cache/plugin/social/Roboto-BoldItalic.ttf b/.cache/plugin/social/Roboto-BoldItalic.ttf
diff --git a/.cache/plugin/social/Roboto-Italic.ttf b/.cache/plugin/social/Roboto-Italic.ttf
diff --git a/.cache/plugin/social/Roboto-Light.ttf b/.cache/plugin/social/Roboto-Light.ttf
diff --git a/.cache/plugin/social/Roboto-LightItalic.ttf b/.cache/plugin/social/Roboto-LightItalic.ttf
diff --git a/.cache/plugin/social/Roboto-Medium.ttf b/.cache/plugin/social/Roboto-Medium.ttf
diff --git a/.cache/plugin/social/Roboto-MediumItalic.ttf b/.cache/plugin/social/Roboto-MediumItalic.ttf
diff --git a/.cache/plugin/social/Roboto-Regular.ttf b/.cache/plugin/social/Roboto-Regular.ttf
diff --git a/.cache/plugin/social/Roboto-Thin.ttf b/.cache/plugin/social/Roboto-Thin.ttf
diff --git a/.cache/plugin/social/Roboto-ThinItalic.ttf b/.cache/plugin/social/Roboto-ThinItalic.ttf
diff --git a/.cache/plugin/social/abe7fe958897ac2f79775b4bd6a2378a.png b/.cache/plugin/social/abe7fe958897ac2f79775b4bd6a2378a.png
diff --git a/.cache/plugin/social/ad99588f3e598c48ea76ca1fabe8060a.png b/.cache/plugin/social/ad99588f3e598c48ea76ca1fabe8060a.png
diff --git a/.cache/plugin/social/b33b84d5e78c9bcf70b8253892a2c354.png b/.cache/plugin/social/b33b84d5e78c9bcf70b8253892a2c354.png
diff --git a/.cache/plugin/social/b47e610316d75b0e21a017adbba9b817.png b/.cache/plugin/social/b47e610316d75b0e21a017adbba9b817.png
diff --git a/.cache/plugin/social/bf90aba6d296f4f9fa4ff1741cf295c1.png b/.cache/plugin/social/bf90aba6d296f4f9fa4ff1741cf295c1.png
diff --git a/.cache/plugin/social/d3c80c661fe824d6f00f2733e96d04cc.png b/.cache/plugin/social/d3c80c661fe824d6f00f2733e96d04cc.png
diff --git a/.cache/plugin/social/d61181e428c686f8c62d7814e4fd2947.png b/.cache/plugin/social/d61181e428c686f8c62d7814e4fd2947.png
diff --git a/.cache/plugin/social/d9afa3607cba0851e593b377dbf41f56.png b/.cache/plugin/social/d9afa3607cba0851e593b377dbf41f56.png
diff --git a/.cache/plugin/social/ddc294cba74f433e4c7f89529327a5bd.png b/.cache/plugin/social/ddc294cba74f433e4c7f89529327a5bd.png
diff --git a/.cache/plugin/social/e0a7d7eabd32925a2570663747ae9984.png b/.cache/plugin/social/e0a7d7eabd32925a2570663747ae9984.png
diff --git a/.cache/plugin/social/ea1614f4e981e056e3139a51469180d2.png b/.cache/plugin/social/ea1614f4e981e056e3139a51469180d2.png
diff --git a/.cache/plugin/social/ec10edb2769f30ccd2de90da91682169.png b/.cache/plugin/social/ec10edb2769f30ccd2de90da91682169.png
diff --git a/.cache/plugin/social/f95d7edaa460c11de9ec67313b0ee243.png b/.cache/plugin/social/f95d7edaa460c11de9ec67313b0ee243.png
diff --git a/.gitignore b/.gitignore
@@ -3,37 +3,78 @@ __pycache__/
 *.py[cod]
 *$py.class
 
-# C extensions
-*.so
-
 # Distribution / packaging
-dist/
+.Python
 build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+share/python-wheels/
 *.egg-info/
-
-# Virtual environments
-venv/
-env/
-ENV/
-.venv/
-.ENV/
+.installed.cfg
+*.egg
+MANIFEST
 
 # IDEs and editors
 .idea/
 .vscode/
 *.sublime-project
 *.sublime-workspace
 
-# Test files
-.pytest_cache/
-coverage.xml
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
 htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+cover/
+
+# Sphinx documentation
+docs/_build/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+
+# pyenv
+#   For a library or package, you might want to ignore these files since the code is
+#   intended to run in multiple environments; otherwise, check them in:
+.python-version
+
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
+__pypackages__/
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+
+# mkdocs documentation
+/site
 
 # Other
 *.log
 *.swp
 .DS_Store
-site/
-
-# Jupyter Notebook
-*.ipynb_checkpoints
diff --git a/.pre-commit-config.yaml b/.pre-commit-config.yaml
@@ -8,7 +8,7 @@ repos:
         exclude: ^docs/snippets/
         args:
           - --license-filepath
-          - license_header.txt
+          - LICENSE
           - --fuzzy-match-generates-todo
           # - --remove-header
 

diff --git a/README.md b/README.md
@@ -1,122 +1,98 @@
 <div align="center">
   <h1>⚗️ distilabel</h1>
-  <p><em>AI Feedback (AIF) framework for building datasets and labellers with LLMs</em></p>
+  <p><em>AI Feedback (AIF) framework for building datasets with and for LLMs.</em></p>
 </div>
 
-![overview](https://github.com/argilla-io/distilabel/assets/36760800/360110da-809d-4e24-a29b-1a1a8bc4f9b7)
-
 > [!TIP]
 > To discuss, get support, or give feedback [join Argilla's Slack Community](https://join.slack.com/t/rubrixworkspace/shared_invite/zt-whigkyjn-a3IUJLD7gDbTZ0rKlvcJ5g) and you will be able to engage with our amazing community and also with the core developers of `argilla` and `distilabel`.
 
-## What's `distilabel`?
-
-`distilabel` is a framework for AI engineers to align LLMs using RLHF-related methods (e.g. reward models, DPO).
-
-The initial focus is LLM fine-tuning and adaptation but we'll be extending it for predictive NLP use cases soon.
+![overview](https://github.com/argilla-io/distilabel/assets/36760800/360110da-809d-4e24-a29b-1a1a8bc4f9b7)
 
-Main use cases are:
+## Features
 
-1. As an AI engineer I want to **build domain-specific instruction datasets** to fine-tune OSS LLMs with increased accuracy.
-2. As an AI engineer I want to **build domain-specific and diverse preference datasets** to use RLHF-related methods and align LLMs (e.g, increase the ability to follow instructions or give truthful responses).
+- Integrations with the most popular libraries and APIs for LLMs: HF Transformers, OpenAI, vLLM, etc.
+- Multiple tasks for Self-Instruct, Preference datasets and more.
+- Dataset export to Argilla for easy data exploration and further annotation.
 
 > [!WARNING]
 > `distilabel` is currently under active development and we're iterating quickly, so take into account that we may introduce breaking changes in the releases during the upcoming weeks, and also the `README` might be outdated the best place to get started is the [documentation](http://distilabel.argilla.io/).
 
-## Motivation
-
-🔥 Recent projects like [Zephyr](https://huggingface.co/collections/HuggingFaceH4/zephyr-7b-6538c6d6d5ddd1cbb1744a66) and [Tulu](https://huggingface.co/collections/allenai/tulu-v2-suite-6551b56e743e6349aab45101) have shown it's possible to **build powerful open-source models with DPO and AI Feedback** (AIF) datasets. 
-
-👩‍🔬 There's a lot of exciting research in the AIF space, such as [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback) (the dataset leveraged by Zephyr and Tulu), [JudgeLM](https://github.com/baaivision/JudgeLM), or [Prometheus](https://huggingface.co/kaist-ai/prometheus-13b-v1.0). 
-
-🚀 However, going beyond research efforts and applying AIF at scale it's different. For enterprise and production use, we need framework that implements **key AIF methods on a robust, efficient and scalable way**. This framework should enable AI engineers to build custom datasets at scale for their own use cases. 
-
-👩‍🎓 This, combined with humans-in-the-loop for improving dataset quality is the next big leap for OSS LLM models. 
-
-⚗️ `distilabel` aims to bridge this gap.
-
-## Key features
+## Installation
 
-* 🤖 **Leverage OSS models and APIs**: 🤗 transformers, OpenAI, 🤗 Inference Endpoints, vLLM, llama.cpp, and more to come.
+```sh
+pip install distilabel --upgrade
+```
 
-* 💻 **Scalable and extensible**: Scalable implementations of existing methods (e.g. UltraFeedback). Easily extensible to build and configure your own labellers.
+Requires Python 3.8+
 
-* 🧑‍🦱 **Human-in-the-loop**: One line of code integration with Argilla to improve and correct datasets.
+In addition, the following extras are available:
 
-## Quickstart
+- `hf-transformers`: for using models available in [transformers](https://github.com/huggingface/transformers) package via the `TransformersLLM` integration.
+- `hf-inference-endpoints`: for using the [HuggingFace Inference Endpoints](https://huggingface.co/inference-endpoints) via the `InferenceEndpointsLLM` integration.
+- `openai`: for using OpenAI API models via the `OpenAILLM` integration.
+- `vllm`: for using [vllm](https://github.com/vllm-project/vllm) serving engine via the `vLLM` integration.
+- `llama-cpp`: for using [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) as Python bindings for `llama.cpp`.
+- `argilla`: for exporting the generated datasets to [Argilla](https://argilla.io/).
 
-### Installation
+## Example
 
-Install with `pip` (requires Python 3.8+):
+To run the following example you must install `distilabel` with both `openai` and `argilla` extras:
 
-```bash
-pip install distilabel[openai,argilla]
+```sh
+pip install "distilabel[openai,argilla]" --upgrade
 ```
 
-### Try it out
-
-After installing, you can immediately start experimenting with `distilabel`:
-
-- **Explore Locally**: Follow the example below to build a preference dataset for DPO/RLHF.
-- **Interactive Notebook**: Prefer an interactive experience? Try our Google Colab Notebook!
-
-  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1rO1-OlLFPBC0KPuXQOeMpZOeajiwNoMy?usp=sharing)
-
-### Example: Build a preference dataset for DPO/RLHF
+Then run the following example:
 
 ```python
 from datasets import load_dataset
 from distilabel.llm import OpenAILLM
 from distilabel.pipeline import pipeline
 from distilabel.tasks import TextGenerationTask
 
-# Load a dataset with instructions from the Hub
 dataset = (
-    load_dataset("HuggingFaceH4/instruction-dataset", split="test[:5]")
+    load_dataset("HuggingFaceH4/instruction-dataset", split="test[:10]")
     .remove_columns(["completion", "meta"])
     .rename_column("prompt", "input")
 )
 
-# Use `OpenAILLM` (running `gpt-3.5-turbo`) to generate responses for given inputs
-generator = OpenAILLM(
-    task=TextGenerationTask(),
-    max_new_tokens=512,
-    # openai_api_key="sk-...",
-)
+# Create a `Task` for generating text given an instruction.
+task = TextGenerationTask()
+
+# Create a `LLM` for generating text using the `Task` created in
+# the first step. As the `LLM` will generate text, it will be a `generator`.
+generator = OpenAILLM(task=task, max_new_tokens=512)
 
+# Create a pre-defined `Pipeline` using the `pipeline` function and the
+# `generator` created in step 2. The `pipeline` function will create a
+# `labeller` LLM using `OpenAILLM` with the `UltraFeedback` task for
+# instruction following assessment.
 pipeline = pipeline("preference", "instruction-following", generator=generator)
 
-# Build a preference dataset comparing two responses focused on the instruction-following skill of the LLM
 dataset = pipeline.generate(dataset)
 ```
 
-The resulting dataset can already be used for preference tuning (a larger version of it). But beware these AIF dataset are imperfect. To get the most out of AIF, push to Argilla for human feedback:
+Additionally, you can push the generated dataset to Argilla for further exploration and annotation:
 
 ```python
 import argilla as rg
 
-rg.init(
-    api_key="<YOUR_ARGILLA_API_KEY>",
-    api_url="<YOUR_ARGILLA_API_URL>"
-)
+rg.init(api_url="<YOUR_ARGILLA_API_URL>", api_key="<YOUR_ARGILLA_API_KEY>")
 
+# Convert the dataset to Argilla format
 rg_dataset = dataset.to_argilla()
+
+# Push the dataset to Argilla
 rg_dataset.push_to_argilla(name="preference-dataset", workspace="admin")
 ```
 
-https://github.com/argilla-io/distilabel/assets/1107111/be34c95c-8be4-46ef-9437-cbd2a7687e30
-
-### More examples
+## More examples
 
 Find more examples of different use cases of `distilabel` under [`examples/`](./examples/).
 
-## Roadmap
+Or check out the following Google Colab Notebook:
 
-- [x] Add Critique Models and support for Prometheus OSS
-- [x] Add a generator with multiple models
-- [ ] Train OSS labellers to replace OpenAI labellers
-- [ ] Add labellers to evolve instructions generated with self-instruct
-- [ ] Add labellers for predictive NLP tasks: text classification, information extraction, etc.
-- [ ] Open an issue to suggest a feature!
+[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1rO1-OlLFPBC0KPuXQOeMpZOeajiwNoMy?usp=sharing)
 
 ## Contribute
 

diff --git a/docs/index.md b/docs/index.md
@@ -1,5 +1,5 @@
 ---
-description: Distilabel is an AI Feedback (AIF) framework to build datasets with and for LLMs.
+description: Distilabel is an AI Feedback (AIF) framework for building datasets with and for LLMs.
 ---
 # distilabel
 

diff --git a/license_header.txt b/license_header.txt