Torchserve 23mt v0.8.0 by simonschoelly · Pull Request #13 · textshuttle/pytorch-serve

simonschoelly · 2023-05-24T11:59:16Z

This PR rebases our changes on v0.8.0 of torch-serve

This PR shoud not be merged into based-on-v0.8.0 - I just made this a base branch here, so that one can see the changes during code review.

…h#2104)

)

Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

…2118) * Added an example serving models using TorchServe Docker Container * Updated examples README with a link to the new readme * added link to example in docker readme

* Create codeql

…ytorch#2154) * Changes to support different python versions * Updated default version to 3.9 * spellcheck lint

* Update model file docs to be more accurate * model-archiver: Improve docs --------- Co-authored-by: Ankith Gunapal <agunapal@meta.com>

…nts using glob (pytorch#2142) Co-authored-by: Ankith Gunapal <agunapal@meta.com> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>

* Upload benchmark artifacts for auto-validation

* Make torch_handler tests callable with pytest * Add handler test to sanity check * Replace resnet 152 against 18 to speedup unit test; also fixes issue with old weights version on gpu * Fix lint error

* feat: nvida dali torchserve integration Signed-off-by: jagadeeshi2i <jagadeeshj@ideas2it.com> * typos and naming changes in dali integration * fix: change env var to file check in vision handler Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: dali imports Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: set prefetch queue depth as constant Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: vision handler Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: readme doc Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: spellcheck Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: lint error Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: serializer script Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * Added unit test for dali integration * fix: lint errors Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix typo Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: move dali preprocessor to custom handler Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> * fix: unit test Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> --------- Signed-off-by: jagadeeshi2i <jagadeeshj@ideas2it.com> Signed-off-by: jagadeesh <jagadeeshj@ideas2it.com> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

* BERT nightly benchmark on Inferentia1 * Consolidate neuron benchmark model config files into a single file for BERT Set the NEURON_RT_NUM_CORES value as a string in the inf1 nightly benchmark workflow file * Update trnsformer model downloader documentation * test workflow before merge * Consolidate benchmark workflows * Update runs-on syntax * Remove hardware specific benchmark workflow files * Consolidate install dependencies step Add necessary env variables for cpu and inf1 Disable fail-fast to enable all benchmarks to run even if one of them fail * update documentation --------- Co-authored-by: Naman Nandan <namannan@amazon.com>

* Make model-archiver tests independent from caller directory * Make model-archiver tests callable from any directory

* Support for PyTorch 2.0 * Ignoring DALI pytest * Updateregression tests to run weekly

Co-authored-by: Naman Nandan <namankt55@gmail.com> Co-authored-by: Ankith Gunapal <agunapal@meta.com>

* feat:grpc server side streaming * add regression test for grpc stream * add word in lint spell list * fmt * fmt * update doc * add LLMs in wordlist * update doc

…onftest file

…ytorch#2154) * Changes to support different python versions * Updated default version to 3.9 * spellcheck lint

Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>

pytorch#2371) Co-authored-by: Naman Nandan <namannan@amazon.com>

* Enable ONNX test in CI

* update cpuinfo * update * lint * lint * lint * install numactl --------- Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

* fix tcp port in model_service_worker.py * update torchrun parameter name * fmt

* update cpuinfo * update * lint * lint * lint * install numactl * update num workers * skip if gpu --------- Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

…h#2374) torch == 2.0.1 torchvision == 0.15.2 torchtext == 0.15.2 torchaudio == 2.0.2 Co-authored-by: Naman Nandan <namannan@amazon.com> Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>

… server to stop

* adding type annotations to functions

* reflect the model config from properties * honor model config in config.properties

Bumps [transformers](https://github.com/huggingface/transformers) from 4.28.1 to 4.30.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](huggingface/transformers@v4.28.1...v4.30.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:development ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

…ytorch#2350) * add jobQsize * add jobQueueSize in model_config.yaml for mnist_scripted.mar * feature job tickets * rename hasJobTickets to getJobTickets * fmt doc * update doc * fmt doc * fmt doc * update doc

…:pytorch/serve into torchserve-23mt-v0.8.0-fix-1

Torchserve 23mt v0.8.0 fix 1

lxning and others added 30 commits February 9, 2023 10:06

remove workflow from nightly benchmark (pytorch#2114)

c851274

updated walmart blog link (pytorch#2117)

48c8153

Changing to previous version of mdutils (pytorch#2127)

30ec515

Add configuration option to disable system metrics collection (pytorc…

7e65972

…h#2104)

Freeze pynvml version to avoid crash in nvgpu pytorch#2136 (pytorch#2138

c84312b

)

Bump isort dependency for pre-commit to 5.11.5 (pytorch#2132)

7a7cc69

Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

Update CI GPU to run on CUDA 11.7 (pytorch#2137)

8b3ae1e

Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

Enable batch processing in scriptable tokenizer example (pytorch#2130)

485ebf8

Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

README for serving models using TorchServe Docker Container (pytorch#…

c417b4a

…2118) * Added an example serving models using TorchServe Docker Container * Updated examples README with a link to the new readme * added link to example in docker readme

Add Github Code Scanning codeql.yml (pytorch#2149)

92657b2

* Create codeql

upgrade CI to run on Ubuntu 20.04 (pytorch#2153)

b8f52d8

Add support for building docker images with specified python version (p…

26089d4

…ytorch#2154) * Changes to support different python versions * Updated default version to 3.9 * spellcheck lint

Update model file docs to be more accurate (pytorch#2148)

87359c4

* Update model file docs to be more accurate * model-archiver: Improve docs --------- Co-authored-by: Ankith Gunapal <agunapal@meta.com>

Added feature that allows for wildcard search in --extra-files argume…

b05c0ea

…nts using glob (pytorch#2142) Co-authored-by: Ankith Gunapal <agunapal@meta.com> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

Fix Dead kubectl links (pytorch#2160)

86d4400

Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>

Push benchmark artifacts for auto-validation (pytorch#2157)

fd8f1b3

* Upload benchmark artifacts for auto-validation

Make handler tests callable with pytest (pytorch#2173)

9412c0a

* Make torch_handler tests callable with pytest * Add handler test to sanity check * Replace resnet 152 against 18 to speedup unit test; also fixes issue with old weights version on gpu * Fix lint error

Make model-archiver tests runnable from any directory (pytorch#2191)

c164374

* Make model-archiver tests independent from caller directory * Make model-archiver tests callable from any directory

Upgrade to PyTorch 2.0 (pytorch#2194)

890d09e

* Support for PyTorch 2.0 * Ignoring DALI pytest * Updateregression tests to run weekly

NVIDIA MPS documentation (pytorch#2184)

82acbbf

Fix regex formatting (pytorch#2172)

41a3af3

Co-authored-by: Naman Nandan <namankt55@gmail.com> Co-authored-by: Ankith Gunapal <agunapal@meta.com>

Feature/grpc streaming (pytorch#2186)

d0510ba

* feat:grpc server side streaming * add regression test for grpc stream * add word in lint spell list * fmt * fmt * update doc * add LLMs in wordlist * update doc

Add zip-store option to model-archiver tool

c70c501

Moved test loading function of model-archiver integration test into c…

b6ad061

…onftest file

Added additional test to model-archiver unit test

8111e29

Added zip-store help to README.md

afe8a9b

Add NVIDIA MPS documentation to doc index

7d23cb2

Add mps doc to content + change title

6b5d8c8

mpoemsl and others added 4 commits May 28, 2023 15:50

Update custom images to new branch naming

4bc5e46

Add support for building docker images with specified python version (p…

10b81b5

…ytorch#2154) * Changes to support different python versions * Updated default version to 3.9 * spellcheck lint

Make dockerfile work

71518bb

Fix merge conflict in Dockerfile

4492505

simonschoelly requested a review from pypae May 30, 2023 07:49

mreso and others added 12 commits May 30, 2023 13:22

Avoid running processes in integration test

a568291

Fix error case

d229611

Remove test cross-talking

c816465

Enable Regression Test in CI (pytorch#2370)

81c3d59

Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>

Fix regression test failures associated with fixes in PR: pytorch#2175 (

130a29e

pytorch#2371) Co-authored-by: Naman Nandan <namannan@amazon.com>

Remove evil session_mocker usage to prevent test cross talking

1f863f9

Enable ONNX test in CI (pytorch#2363)

27a6ed5

* Enable ONNX test in CI

Update cpuinfo (pytorch#2372)

c685b5c

* update cpuinfo * update * lint * lint * lint * install numactl --------- Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

fix tcp port in model_service_worker.py (pytorch#2377)

a81fc52

* fix tcp port in model_service_worker.py * update torchrun parameter name * fmt

Remove cu118 from regression tests (pytorch#2380)

77f8c0b

Update cpuinfo (pytorch#2379)

e5004b2

* update cpuinfo * update * lint * lint * lint * install numactl * update num workers * skip if gpu --------- Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu> Co-authored-by: Mark Saroufim <marksaroufim@fb.com>

Upgrade torch, torchvision, torchtext and torchaudio versions (pytorc…

e205e6b

…h#2374) torch == 2.0.1 torchvision == 0.15.2 torchtext == 0.15.2 torchaudio == 2.0.2 Co-authored-by: Naman Nandan <namannan@amazon.com> Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>

pypae changed the base branch from based-on-v0.8.0 to torchserve-23mt June 5, 2023 07:23

mreso and others added 12 commits June 5, 2023 11:20

Remove wait time when stopping and starting torchserve in tests

9542c45

Remove 10 second wait time for start/stop torchserve

9a23ef7

Prepare log folder + remove spaces in command + obsolete kwarg

fe27e07

Allow --stop to be combined with --forward argument to wait for model…

28a2525

… server to stop

Ts type annotations (pytorch#2384)

08a9093

* adding type annotations to functions

fix: reflect model config in config.properties (pytorch#2382)

43d84ff

* reflect the model config from properties * honor model config in config.properties

fix model cpu config (pytorch#2408)

7f9967e

support jobQueueSize and job ticket per model in model config YAML (p…

89fbd88

…ytorch#2350) * add jobQsize * add jobQueueSize in model_config.yaml for mnist_scripted.mar * feature job tickets * rename hasJobTickets to getJobTickets * fmt doc * update doc * fmt doc * fmt doc * update doc

Merge commit '89fbd885b11e4d2f40bea779d8cee7c6a87f9c20' of github.com…

6eea9e1

…:pytorch/serve into torchserve-23mt-v0.8.0-fix-1

Use /pytorch-serve insteaf of /serve directory in dockerfile

f01fab6

Merge pull request #14 from textshuttle/torchserve-23mt-v0.8.0-fix-1

5124b67

Torchserve 23mt v0.8.0 fix 1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Torchserve 23mt v0.8.0#13

Torchserve 23mt v0.8.0#13
simonschoelly wants to merge 174 commits intotorchserve-23mt-v0.7.1from
torchserve-23mt-v0.8.0

simonschoelly commented May 24, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

17 participants

Conversation

simonschoelly commented May 24, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

17 participants

simonschoelly commented May 24, 2023 •

edited

Loading