Skip to content

Latest commit

 

History

History
250 lines (179 loc) · 14.7 KB

File metadata and controls

250 lines (179 loc) · 14.7 KB

PorscheInsight-CarClassification-AI

GitHub release (latest by date including pre-releases) GitHub Workflow Status GitHub repo size

Architecture of the Web UI

Description

This repository contains scripts to train models to classify pictures of Porsche cars. Check out the PowerPoint presentation here.

The following different model types are available:

🚗 Model 📝 Description
pre_filter This model is responsible for identifying whether an image contains a Porsche car or not. It is the first step in the classification process, and if the model determines the image to be a Porsche, it proceeds to the next models for further classification.The model is used for the new architecture in the web_ui.
model type This model predicts 10 broad Porsche car model types, such as 911, Cayman, and Panamera, among others. It is the second step in the classification process and helps to categorize the cars based on popular models.
car series Trained to predict 30 classes, this model groups several years together to imitate Porsche car series like the 911 991 or 911 992. It is a more detailed classification that focuses on car series rather than specific years.
all car series This model classifies images into 88 classes, corresponding to specific Porsche build years. It is the most detailed classification level, predicting not only the car model type but also the specific build year, e.g., a 911 from 2008. This model is suitable for users who want to identify and classify Porsche cars down to the finest level of detail.

Web UI

Get it on Google Play

The Web UI is a simple local website that enables users to upload images of Porsche cars and get classification results from the models. The app is built using Eel, which allows Python and HTML/JS to communicate with each other.

Tip

For a free online version of the Web UI, check out PorscheInsight.

With the Web UI, non-car images tend to lead to more random predictions. I trained a model to classify Porsche, other car brands, and others, resulting in a two-step Web UI architecture for Porsche Identification and Classification.

Installation

pip install -m build_requirements.txt

Usage

python web_app/main.py

or download the executable.

Demo

Screenshot of the Web UI

Architecture

The Web UI employs a two-step process involving two models. Initially, the pre_filter model determines if an image contains a Porsche. If a Porsche is detected, the image proceeds to the second model, which classifies the car according to the user's input.

Architecture of the Web UI

To see the architecture of the experimental ensemble models or the local web UI, click the arrow below.

Ensemble models Architecture of the ensemble models
App UI Architecture of the Web UI

Experimental Ensemble Models 🧪

To improve accuracy and prediction capabilities, I have experimented with two ensemble models. Ensemble models combine predictions from multiple models to give a final verdict, which often results in better prediction accuracy. These ensemble models are still in the experimental phase and can be found in the ensemble.

Note

The ensemble models are experimental and may not always provide accurate predictions. They are still in the testing phase and may be subject to change.

  1. Car Type Ensemble Model (weighted average)
    • Description: Trained two models on the same objective. Each with its on strengths and weaknesses. The results get weighted (correct weights through intensive testing). The models also differ in model architecture.
    • Achievements:
      • More balanced predictions
      • Better prediction accuracy
  2. Car Type hierarchy with car series (specific car type) (weighted average)
    • Description: Created a hierarchy of models. The first models predicts the car type. The second model predicts the car series. After this the results get aligned with the car type model.
    • Achievements:
      • much less outlier predictions
      • better prediction accuracy

They can be found at PorscheInsight-Ensemble.

ToDos

  • Add (explaining) Popup on first start of website
  • Retrain all models with better dataset
  • Improve pre_filter model/Use Grounded SAM
  • Add Taycans to images/models
Completed Tasks
  • Add ONNX export
  • Implement better Testing
  • Implement shap for feature importance
  • Add confusion matrix
  • Try a deeper/wider or different pretrained model to improve accuracy on the more classes dataset
  • Add a (web) app to use the models
  • Train on cleaned classes
  • Add onnx models to web UI to speedup and reduce size
  • Isnet model for bg remove
  • Bundle in one Binary
  • Add django web app
  • Train on cleaned classes with Vision Transformer
  • Add Android App
  • Implement new Architecture: One model classifies if a car is present and a second model classifies the car
  • Implement new architecture to online version
  • Add release 1.0.0
  • Add docker support (AWS Lambda)
  • Display the mask of the predicted car
  • Retrain car type model with efficientnet model
  • Build an ensemble prediction
  • Train a meta-learner
  • Try autokeras
  • Improve model predictions overall
  • Evaluate feature engineering/ More data augmentation
  • Build an ensemble model with car_type and car_series
  • Experiment with EfficientNet-Lite4
  • Compress images on upload

Installation

Requirements

  • Python >3.8, <3.11
  • [optional] CUDA 11.1
  • [optional] cuDNN 8.0.5

Install tensorflow, keras and the other dependencies with pip:

pip install -m requirements.txt

Models

Model Total params Trainable params Non-trainable params Batch size Accuracy Train % Accuracy Val % Number of classes
VGG16 pretrained* 20,027,082 12,390,538 7,636,544 32 99 95 10
efficientnetv2-b1(new head & faster) 7,106,956 993,416 6,113,640 32 47 46 88
efficientnetv2-b1 (cleaned classes) 7,099,474 985,934 6,113,540 32 82 85 30
efficientnetv2-b1-pre-filter 7,095,991 982,451 6,113,540 32 98 99 3
Open to view older models
Model Total params Trainable params Non-trainable params Batch size Accuracy Train % Accuracy Val % Number of classes
without augmentation* 11,239,850 11,239,850 0 32 98 78 10
with augmentation* 11,239,850 11,239,850 0 32 79 74 10
old_pretrained* 20,027,082 5,311,114 14,715,968 32 74 72 10
VGG16 pretrained 20,027,082 12,390,538 7,636,544 32 80 46 88
efficientnetv2-b1 7,099,474 1985,934 6,113,540 32 49 46 88
vit_b16 (cleaned classes) 85,901,470 102,558 85,798,912 32 45 49 30
efficientnetv2-b1-pre-filter-refactored 7,095,991 982,451 6,113,540 32 97 98 3
efficientnetv2-b1-car-type 7,096,894 1,329,386 5,767,508 32 96 97 10

Important

The models with * were trained on the pre cleaned dataset.

Have a look at the releases to see the models and results folder to see the training results. You can have a look at the models structure with Netron or look at the model_structure folder to see the model insights.

Have a look at an example Architecture of the Web UI

Usage

Train a model

To train a model you can use the train folder. You can choose the model, the dataset and the number of epochs. You can use the build in Discord Callback to get notfications on Discord after every epoch. You need to change the discord webhook url in the training file.

Predict with a model (Inference)

To predict with a model you can use the test_tf_model.py script. You can choose the model and the image you want to predict. If you want to predict with an onnx model you can use the test_onnx_model.py script.

I recommend to prepare the images with prepare_images.py before. Thus, an error-free and improved prediction is guaranteed.

Explain a model

To explain a model you can use the explainer.py script. You can choose the model and the image(folder) you want to get explanations. I recommend to prepare the images with prepare_images.py before.

After using shap values on the new efficientnet model and the vgg16 model, both on the old head, I found out that the vgg16 model found "better" spots to distinguish between classes, at least sometimes.

Confusion matrix of a model

The confusion matrix shows the performance of a classification model. It shows the number of correct and incorrect predictions made by a classifier.

To get a confusion matrix of a model you can use the confusion_matrix.py script. You can choose the model and the test set you want to get the confusion matrix from.

Confusion Matrix Confusion matrix for cat types

The confusion matrix of the specific model variants is too big to show it here. You can find it in the results folder.

Sample images from the shap values

Shap values for 911_1980

Explained images Shap values for Cayman_2009

Convert a model to ONNX

You can use the convert_to_onnx.py script to convert a keras(.h5) model to ONNX. You can choose the model you want to convert and a save path.

Dataset

The dataset is from Github and contains ~32.000 pictures of potential Porsche cars. Since the source of the dataset is public the quality and the arrangement of the images was also not great. After the data was cleaned, there are ~30.300 pictures left. Several pictures were removed because they were not of a Porsche car or the picture was not clear enough.

Have a look at the data: Sample images from Dataset

For the training of the pre_filter model a mixture of the porsche-pictures dataset, other Open Source datasets like cifar-10 were used.

Caution

This is not an official Porsche product and is not affiliated with or endorsed by Porsche AG

Get it on Google Play