Refactor and improve docs #134

plaguss · 2023-11-30T13:27:11Z

Refactor and improve docs.

Closes #120

This PR is a refactor of the docs settings, including new sections, ased on the ideas from diátaxis.
We now have a Learn section that contains tutorials and how-to guides (work in progress), and a technical reference that contains both concept guides and the API reference.

alvarobartt · 2023-11-30T15:37:37Z

Hi here @plaguss, nice idea! I like the approach, feel free to open the PR for review once you're happy with it / is more mature 👍🏻

plaguss · 2023-12-01T09:23:17Z

You can take a look and let me know whatever. I think all the current examples can work perfectly as user guides with some explanation of what they do, I've let an example for one of the scripts . If you think so I can split them with @ignacioct and move them there.

ignacioct · 2023-12-01T12:46:25Z

@plaguss I also like the structure, and happy to help and split it between you and me. I'm still learning about distilabel, so I might need a little guidance, but eager to help.

plaguss · 2023-12-01T13:38:52Z

@plaguss I also like the structure, and happy to help and split it between you and me. I'm still learning about distilabel, so I might need a little guidance, but eager to help.

we are together in that 😉

dvsrepo · 2023-12-03T20:12:42Z

docs/learn/index.md

@@ -0,0 +1,5 @@
+# Learn
+
+This is the reference section to learn how to use the power that `distilabel` has to offer. Here you can find the tutorials and guides to learn how to use `distilabel`.


For distilabel docs, I'd highly recommend a more sober, technical and concise style.

dvsrepo · 2023-12-03T20:34:34Z

Hi @plaguss I like the structure too and the ref you make to diataxis. The only thing is that I feel the proposal of using the current examples as the main topics for the user guides is not very aligned. They are very quick examples and not so meaningful. The line between those and the tutorial is not so clean.

Things we need to document well:

Pipeline and pipeline

Tasks: what are the current tasks, their parameters, how to use these parameters

For preference a description of the different options (UF, etc.), parameters, etc.

For text generation, how to use prompt templates, parameters, and how to use principles.

LLM: what are the current LLMs, their parameters, etc. Here for example we should help users to choose and option, to explain how llamacpp is used (which models?), how to setup vLLM with multiple GPUs (cc @gabrielmbmb )

Happy to do a call when I'm back from my holidays but I'd recommend keeping a mindset of adding parts describing key things in the most direct, descriptive, but simple language.

docs/api/pipeline.md

Co-authored-by: Ignacio Talavera <[email protected]>

…ilabel into docs/update-docs

dvsrepo · 2023-12-15T13:01:20Z

@plaguss can you replace the alembic svg with this:

dvsrepo · 2023-12-15T13:04:54Z

good job, this is a very good version.

My only comment is that I think the noqa comments if displayed in the docs can be annoying and out of place (never seen this in other docs), don't we have a general way to exclude those code snippets from the qa analysis?

plaguss · 2023-12-15T15:14:25Z

good job, this is a very good version.

My only comment is that I think the noqa comments if displayed in the docs can be annoying and out of place (never seen this in other docs), don't we have a general way to exclude those code snippets from the qa analysis?

Done! my bad, I just removed the /docs folder from the ruff formatter (should have done that the first time).
I added the new alembic.svg, but I left the previous one because it's not working for the favicon (I don't know why).

alvarobartt

LGTM! Thanks for this awesome (and much needed) PR @plaguss 🎉 I just have some comments that you can find below, but great work so far, it's been really straight forward to go through the docs!

P.S. Also I'm unsure about the value of keeping the scripts in separate files and then injecting them in the Markdown files, could you elaborate on that decision? How's that better to keeping those within the Markdown files? Thanks in advance 👍🏻

mkdocs.yml

docs/assets/alembic.svg

mkdocs.yml

docs/technical-reference/llms.md

docs/technical-reference/pipeline.md

docs/technical-reference/tasks.md

…docs/update-docs

plaguss · 2023-12-19T07:49:41Z

LGTM! Thanks for this awesome (and much needed) PR @plaguss 🎉 I just have some comments that you can find below, but great work so far, it's been really straight forward to go through the docs!

P.S. Also I'm unsure about the value of keeping the scripts in separate files and then injecting them in the Markdown files, could you elaborate on that decision? How's that better to keeping those within the Markdown files? Thanks in advance 👍🏻

Thank you!!

The decision for keeping the scripts in the snippets directory instead of the Markdown files because we already had it that way for a couple of examples. I would be useful if we reused the code snippets in different parts of the docs (which is not the case currently, and if that was the case we could do it for a single case specifically). In this case I think it's more a matter of preference, but I'm fine with any decision, I can place them directly in the markdown files and remove the code snippets.

Co-authored-by: Alvaro Bartolome <[email protected]>

…ilabel into docs/update-docs

Co-authored-by: Alvaro Bartolome <[email protected]>

alvarobartt · 2023-12-19T08:04:07Z

LGTM! Thanks for this awesome (and much needed) PR @plaguss 🎉 I just have some comments that you can find below, but great work so far, it's been really straight forward to go through the docs!
P.S. Also I'm unsure about the value of keeping the scripts in separate files and then injecting them in the Markdown files, could you elaborate on that decision? How's that better to keeping those within the Markdown files? Thanks in advance 👍🏻

Thank you!!

The decision for keeping the scripts in the snippets directory instead of the Markdown files because we already had it that way for a couple of examples. I would be useful if we reused the code snippets in different parts of the docs (which is not the case currently, and if that was the case we could do it for a single case specifically). In this case I think it's more a matter of preference, but I'm fine with any decision, I can place them directly in the markdown files and remove the code snippets.

Thanks for the answer! I'm happy keeping those within the docs instead, but may be opinionated, so lets double-check with @gabrielmbmb and @dvsrepo to see what are their thoughts 👍🏻

Co-authored-by: Alvaro Bartolome <[email protected]>

…ilabel into docs/update-docs

gabrielmbmb · 2023-12-19T14:39:42Z

Hi @plaguss! Looks good to me, I saw there are some scripts using in the docs that got the license header injected. I think we should skip adding the license in those scripts.

@alvarobartt regarding the decision on having the code snippets in different files, I think it's better to have them in a .py file because is better for the development of them (you get assistance from the IDE and they can be formatted using tools) and also you can quickly execute them to test they are working (for this to be true, they have to be fully working examples)

plaguss added 3 commits November 29, 2023 17:03

chore: add requirement to run mkdocs serve

7078e81

docs: add draft for learning section

27847ee

refactor: guides as a part of the learn section

3006955

plaguss requested review from dvsrepo, ignacioct, alvarobartt and gabrielmbmb November 30, 2023 13:27

plaguss added 4 commits November 30, 2023 18:09

chore: add support to render notebooks

81ddb99

docs: copy tutorial from gabri

b269caf

docs: update learn section

6045f13

docs: let navigable section and new api reference

8826315

plaguss marked this pull request as ready for review December 1, 2023 09:17

docs: add small overview of api reference

c4ea466

Merge branch 'main' into docs/update-docs

4cde7ac

ignacioct approved these changes Dec 1, 2023

View reviewed changes

dvsrepo reviewed Dec 3, 2023

View reviewed changes

ignacioct reviewed Dec 4, 2023

View reviewed changes

docs/api/pipeline.md Outdated Show resolved Hide resolved

plaguss and others added 8 commits December 7, 2023 11:31

Merge branch 'main' into docs/update-docs

1703837

Update docs/api/pipeline.md

eb54298

Co-authored-by: Ignacio Talavera <[email protected]>

Merge branch 'docs/update-docs' of https://github.com/argilla-io/dist…

fe76cfc

…ilabel into docs/update-docs

wip

fd1dcd6

docs: rewrite for the llm concept guides

d727cfb

chore: renamed files

f05cb5d

refactor: updated user-guides content

7510475

chore: allow adding footnotes and docs layout

0b5baf5

plaguss added 2 commits December 15, 2023 16:05

docs: removed noqa and avoid checking the docs with ruff

046416a

chore: add new alembig image

1e2ad18

alvarobartt reviewed Dec 18, 2023

View reviewed changes

alvarobartt added documentation Improvements or additions to documentation and removed size:XL labels Dec 18, 2023

alvarobartt assigned plaguss Dec 18, 2023

alvarobartt added this to the 0.2.0 milestone Dec 18, 2023

Merge branch 'main' of https://github.com/argilla-io/distilabel into …

8089bd2

…docs/update-docs

plaguss and others added 6 commits December 19, 2023 08:55

Remove commented line

2c82ea8

Co-authored-by: Alvaro Bartolome <[email protected]>

Update logo.svg new name

e25a1b4

Co-authored-by: Alvaro Bartolome <[email protected]>

Merge branch 'docs/update-docs' of https://github.com/argilla-io/dist…

cce4335

…ilabel into docs/update-docs

Rename to HuggingFace

eaf5db7

Co-authored-by: Alvaro Bartolome <[email protected]>

Rename to HuggingFace

b441c06

Co-authored-by: Alvaro Bartolome <[email protected]>

Rename to HuggingFace

9096994

Co-authored-by: Alvaro Bartolome <[email protected]>

Point doc reference to main branch

43fe493

Co-authored-by: Alvaro Bartolome <[email protected]>

alvarobartt changed the title ~~docs: idea for docs section~~ Refactor and improve docs Dec 19, 2023

plaguss and others added 4 commits December 19, 2023 09:05

Fixed reference

0e0bac5

Co-authored-by: Alvaro Bartolome <[email protected]>

Remove comment

4b0ca60

Co-authored-by: Alvaro Bartolome <[email protected]>

Merge branch 'docs/update-docs' of https://github.com/argilla-io/dist…

850c3f4

…ilabel into docs/update-docs

Add grid with different sections

dfd9952

gabrielmbmb approved these changes Dec 19, 2023

View reviewed changes

Update colours

0b33704

plaguss merged commit ac492ee into main Dec 19, 2023
4 checks passed

plaguss deleted the docs/update-docs branch December 19, 2023 16:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor and improve docs #134

Refactor and improve docs #134

plaguss commented Nov 30, 2023 •

edited

Loading

alvarobartt commented Nov 30, 2023

plaguss commented Dec 1, 2023

ignacioct commented Dec 1, 2023

plaguss commented Dec 1, 2023

dvsrepo Dec 3, 2023

dvsrepo commented Dec 3, 2023

dvsrepo commented Dec 15, 2023

dvsrepo commented Dec 15, 2023

plaguss commented Dec 15, 2023

alvarobartt left a comment

plaguss commented Dec 19, 2023

alvarobartt commented Dec 19, 2023

gabrielmbmb commented Dec 19, 2023

		@@ -0,0 +1,5 @@
		# Learn

		This is the reference section to learn how to use the power that `distilabel` has to offer. Here you can find the tutorials and guides to learn how to use `distilabel`.

Refactor and improve docs #134

Refactor and improve docs #134

Conversation

plaguss commented Nov 30, 2023 • edited Loading