Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Q: How should I train my personal dataset? #11

Open
WindyHu001 opened this issue Apr 9, 2024 · 2 comments
Open

Q: How should I train my personal dataset? #11

WindyHu001 opened this issue Apr 9, 2024 · 2 comments

Comments

@WindyHu001
Copy link

How should I train my personal dataset? I have doubts about the real datasets of the three cities mentioned in the project. What aspects of transportation data do they record? For example, I have some real-world intersection turning data, and in Sumo, I can use the jtcrouter interface to construct traffic flow based on turning data.

@Gungnir2099
Copy link
Collaborator

You can follow our proposed method laid out in the paper. Follow steps like 1) utilize GPT-4 to interact with the environment and collect its inference trajectories; 2) run the imitation fine-tuning ./finetune/run_imitation_finetune.py by leveraging the collected inference trajectories of GPT-4; 3) train the critic model within the same environment; 4) run the critic-guided policy refinement ./finetune/run_policy_refinement_data_collection.py to optimize the LLM's control action.

@Gungnir2099
Copy link
Collaborator

The prompt design is provided in the paper.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants