Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

is there any chance you can help to implement the source code of this tinystores paper? #1

Open
edisondeng opened this issue Sep 16, 2023 · 1 comment
Assignees
Labels
help wanted Extra attention is needed

Comments

@edisondeng
Copy link

the content of this tinystories paper is promising. But lacking of the code stops me. Any chance if you can help to implement some of the code at least as a starting point.

tks in advance

@sleepingcat4 sleepingcat4 self-assigned this Sep 18, 2023
@sleepingcat4 sleepingcat4 added the help wanted Extra attention is needed label Sep 18, 2023
@sleepingcat4
Copy link
Owner

TinyStories, paper is not necessarily a code intensive paper. If you read the paper, the main focus was to limit GPT model with very low params and much importance was given to dataset. In this repository, the have shown a very basic way, how Huggingface models can be defined to a certain param and be used on TinyStories dataset.

There's much more, complicated methods to call a Large Language model both using third-party libraries as well as writing from scratch. What method to choose depends on you. Certainly, there're cool projects to be made but that was out-of-scope for this repository and my time.

What models and how to call them, depends entirely upon you as that's something a creative endeavor as well as research. Hope that answers your question.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants