Skip to content

A small vision language model meant for research

License

Notifications You must be signed in to change notification settings

tyler-romero/seahorse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Seahorse: A small VLLM for research

This repo contains all of the code used in the entire development process for Seahorse, a VLLM based on Phi3.5 and CLIP.

Features | Experiments | Evaluation | Project Structure

License: MIT

Features

  • Built on Phi3.5 and CLIP, supports arbitrary interleaved images and text
  • Optimized for training on a single GeForce RTX 4090
  • Easily extensible for new research experiments, supports optuna
  • Comprehensive, standardized evaluation using lmms-eval

Running an experiment

This project uses a Makefile to manage tasks. Tasks in the Makefile rely on uv for dependency management.

To run an experiment, use the run-experiment task. For example:

make run-experiment pretrain

Tip: This is equivalent to running:

uv run python seahorse/experiments/run_experiment.py pretrain

If you prefer not to use uv, you can manually install the project dependencies and then run:

python seahorse/experiments/run_experiment.py pretrain

This will look for the function pretrain() in the experiment registry and execute it to create a (set of) experiment configuration(s). Then for each of those configurations, a training run will be launched.

Evaluation

Evaluation is performed via the lmms-eval library.

Project Structure

seahorse/
├── seahorse/         # Main Python package for the project
│   ├── config/       # HF-style configuration files for SeahorseModel
│   ├── data/         # Data preprocessing and loading (e.g. datasets, collators, etc.)
│   ├── eval/         # Evaluation code and utilities (e.g. benchmark scoring, etc.)
│   ├── experiments/  # Experiment configuration and launching
│   ├── models/       # Model architectures and construction
│   ├── train/        # Training script and custom HF Trainer class
│   └── utils/        # Misc utility tools (rng, profiling, etc.)
├── tests/            # Sanity-preserving unit tests for the project
└── Makefile          # Simple task management (`run-experiment`, `test`, etc.)

Unit Tests

To run the unit tests for the project, use the test task:

make test

To run a specific test, run

make test TEST_ARGS="-k test_seahorse_tokenizer"

About

A small vision language model meant for research

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published