
Tutorial_compile_llama2_with_mlc_llm.ipynb does not work #32

@chosen-ox


I followed this notebook in Colab with a T4 GPU.
There are several problems:

  • The CUDA version is now 12.2 instead of 11.8, so mlc-ai-nightly-cu118 mlc-chat-nightly-cu118 should be replaced by mlc-ai-nightly-cu122 mlc-chat-nightly-cu122.
  • The prebuilt_libs do not work on the Colab T4 GPU (I get a CUDA_ERROR_NO_BINARY_FOR_GPU error). I compiled the model lib on Colab with the command !mlc_chat compile ./dist/Llama-2-7b-chat-hf-q4f32_1-MLC/mlc-chat-config.json --device cuda -o dist/Llama-2-7b-chat-hf-q4f32_1-cuda.so and it works for me. Maybe consider updating the prebuilt_libs.
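Putting the two fixes together, a Colab setup cell along these lines is what worked for me. The package names and the compile command come from the report above; the wheel index URL is an assumption based on the usual MLC install instructions, so check the current MLC LLM docs before relying on it.

```shell
# Install nightly wheels built against CUDA 12.2 (Colab now ships CUDA 12.2,
# not 11.8, so the cu118 packages from the notebook no longer match).
# NOTE: the -f index URL is an assumption; verify against the MLC docs.
pip install --pre -f https://mlc.ai/wheels \
  mlc-ai-nightly-cu122 mlc-chat-nightly-cu122

# The prebuilt model lib fails on the T4 with CUDA_ERROR_NO_BINARY_FOR_GPU,
# so compile the model lib on the Colab machine itself:
mlc_chat compile ./dist/Llama-2-7b-chat-hf-q4f32_1-MLC/mlc-chat-config.json \
  --device cuda -o dist/Llama-2-7b-chat-hf-q4f32_1-cuda.so
```

In a Colab notebook cell, prefix each command with `!` as in the original report.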
