Tensorflow 1 models + gunicorn workers - strange issue #1191

gregd33 · 2020-10-20T15:56:42Z

gregd33
Oct 20, 2020

I know tensorflow 1 models aren't supported (or at least I think so). However, I have gotten things to work by building a service with a TF1 artifact - the environment I build in is TF2, but TF1 is in the docker environment.

The reason for this is certain models of interest are only easily available in TF1 (e.g. https://github.com/zhang0jhon/AttentionOCR).

So everything seems to work okay with one exception - for a large model (e.g. the one above) the docker container will crash when run. We managed to fix this by only using one gunicorn worker. But this obviously a weird and not ideal solution.

So two questions:

Any ideas what might be causing the crashes and why it is fixed with only one guinicorn worker?
Is there a more ideal way to make use of a TF1 model?

Answered by bojiang

Oct 29, 2020

Hi! I'm working on the TensorFlow integration of bentoML. This information may be helpful to you.

I know TensorFlow 1 models aren't supported (or at least I think so).

In fact, bentoML was designed to supports both TensorFlow 1 & 2 models. An example using tf1 is included in the gallery:
https://github.com/bentoml/gallery/blob/master/tensorflow/fashion-mnist/tensorflow_1_fashion_mnist.ipynb

Any ideas what might be causing the crashes and why it is fixed with only one guinicorn worker?

In most cases, it was caused by insufficient system resources, especially memory.
For example, for any classifier using BERT, each worker would take more than 700M memory. Even on an EC2 c5-large instanc…

View full answer

bojiang · 2020-10-29T01:30:13Z

bojiang
Oct 29, 2020
Maintainer

Hi! I'm working on the TensorFlow integration of bentoML. This information may be helpful to you.

I know TensorFlow 1 models aren't supported (or at least I think so).

In fact, bentoML was designed to supports both TensorFlow 1 & 2 models. An example using tf1 is included in the gallery:
https://github.com/bentoml/gallery/blob/master/tensorflow/fashion-mnist/tensorflow_1_fashion_mnist.ipynb

Any ideas what might be causing the crashes and why it is fixed with only one guinicorn worker?

In most cases, it was caused by insufficient system resources, especially memory.
For example, for any classifier using BERT, each worker would take more than 700M memory. Even on an EC2 c5-large instance, we can only run three workers at most.

It would be great if you could provide the logs.
docker logs <your-container>

We managed to fix this by only using one gunicorn worker. But this obviously a weird and not ideal solution.

We all know that if it is because of the insufficient system resources, limiting the number of workers is a good solution. We can achieve higher throughput with the same number of workers by micro-batching.

0 replies

gregd33 · 2020-10-29T18:24:34Z

gregd33
Oct 29, 2020
Author

Hi! Thanks for the reply.

I'm glad to hear that TF1 models are officially supported. I had been doing a system where I save a model in TF1 but use TF2 in the environment in which I'm predicting. This is the only way I was able to get it working before. One the issues is that I'm using pretrained weights from others' models which is why I have less flexibility in their format. However, I will try using TF1 exclusively and see how that goes.

Could you clarify/explain what exactly a gunicorn worker is? Is each worker loading up the model in memory so that it can handle a request? If so then it makes sense that the memory issues are happening.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BentoML

Tensorflow 1 models + gunicorn workers - strange issue #1191

{{title}}

Replies: 2 comments

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

BentoML

Tensorflow 1 models + gunicorn workers - strange issue #1191

gregd33 Oct 20, 2020

Replies: 2 comments

bojiang Oct 29, 2020 Maintainer

gregd33 Oct 29, 2020 Author

gregd33
Oct 20, 2020

bojiang
Oct 29, 2020
Maintainer

gregd33
Oct 29, 2020
Author