Food Type Detection using Vision Transformer (ViT)

This repository contains an implementation of a Vision Transformer (ViT) model for food type detection. The project aims to reproduce the original ViT study on this task and to compare the performance of a model trained from scratch against a transfer learning-based model, along with an EfficientNetB0 model.

Introduction

Food type detection is a challenging computer vision task that involves classifying food items by type from images. The Vision Transformer (ViT) model has shown promising results in various image classification tasks, including food type detection. This project implements ViT and evaluates its performance on a food type detection dataset.
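To make the input side of ViT concrete, the sketch below computes how many tokens the transformer encoder receives once an image is cut into fixed-size patches. The 224x224 image size and 16x16 patch size are the common ViT-Base/16 settings and are assumed here for illustration only; the notebook in this repository may use different values. The function names are hypothetical and not part of this repository.

```python
def vit_sequence_length(image_size: int, patch_size: int, class_token: bool = True) -> int:
    """Return the number of tokens fed to the transformer encoder.

    Assumes a square image split into non-overlapping square patches.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2
    # ViT prepends one learnable class token to the patch sequence.
    return num_patches + (1 if class_token else 0)


def patch_vector_dim(patch_size: int, channels: int = 3) -> int:
    """Return the length of one flattened patch before linear projection."""
    return patch_size * patch_size * channels


if __name__ == "__main__":
    # Assumed settings: 224x224 RGB input, 16x16 patches.
    print(vit_sequence_length(224, 16))  # 196 patches + 1 class token
    print(patch_vector_dim(16))          # 16 * 16 * 3 values per patch
```

Smaller patches give a longer sequence, so attention cost grows quickly as the patch size shrinks.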

Dataset

The dataset used for this project is a custom food type detection dataset. Due to copyright and licensing restrictions, we cannot share it publicly. You can instead use your own dataset or a suitable openly available food image dataset.
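If you bring your own dataset, one simple way to prepare it is to index a folder that holds one subdirectory per food class. The sketch below is a minimal example under that assumption. The directory layout, the function name `index_image_folder`, and the list of accepted extensions are all hypothetical and do not describe how this repository's notebooks load data.

```python
from pathlib import Path

# Assumed set of image file extensions; adjust to your data.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def index_image_folder(root):
    """Return (samples, class_names) for a folder with one subdirectory per class.

    samples is a list of (file_path, class_index) tuples, and class_names
    maps each class index back to its folder name.
    """
    root = Path(root)
    # Skip hidden folders such as .ipynb_checkpoints.
    class_names = sorted(
        p.name for p in root.iterdir() if p.is_dir() and not p.name.startswith(".")
    )
    class_to_index = {name: i for i, name in enumerate(class_names)}

    samples = []
    for name in class_names:
        for path in sorted((root / name).rglob("*")):
            if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
                samples.append((str(path), class_to_index[name]))
    return samples, class_names
```

The returned list of path and label pairs can then be fed to whichever data loading pipeline your training code uses.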

Installation

To set up the environment and install the required dependencies, follow these steps:

Clone the repository:

git clone https://github.com/PG-9-9/Food-Net.git
cd Food-Net

