fix(cloud-edge-llm): fix environment setup crashes and dependency typos by rakshaak29 · Pull Request #394 · kubeedge/ianvs

rakshaak29 · 2026-04-12T17:32:22Z

Resolves #393

Description

This PR resolves critical setup failure issues in the cloud-edge-collaborative-inference-for-llm example that blocked macOS and CPU-only users from installing dependencies and running the benchmark.

Changes Made:

requirements.txt:
- Removed vllm (CUDA/Linux only) to make it an optional, conditionally loaded backend.
- Corrected datamodel_code_generator to datamodel-code-generator.
- Added missing runtime dependency retry.
test_queryrouting.yaml:
- Commented out the vllm backend in the default configuration so the example executes successfully out-of-the-box falling back to huggingface.

Testing Done

Validated pip install -r requirements.txt completes successfully without CUDA.
Ensured CloudModel successfully runs utilizing the imported retry mechanics without ImportError.

kubeedge-bot · 2026-04-12T17:32:31Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: rakshaak29
To complete the pull request process, please assign moorezheng after the PR has been reviewed.
You can assign the PR to them by writing /assign @moorezheng in a comment when ready.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

gemini-code-assist

Code Review

This pull request enhances the portability of LLM examples by replacing absolute paths with relative ones, updating the BaseModel to use a public Hugging Face model ID, and implementing dynamic device selection (CUDA/MPS/CPU). It also updates project dependencies and ignores local development artifacts. Review feedback suggests refining the MPS availability check for better compatibility, deduplicating the model identifier string, and specifying a minimum version for the torch dependency.

…ix basemodel - Replace all hardcoded absolute paths with relative paths in YAML configs - Use HuggingFace Hub model ID (Qwen/Qwen2.5-0.5B-Instruct) instead of local path - Add missing preprocess() method to BaseModel (required by sedna) - Add MPS/CPU device detection for macOS compatibility - Update .gitignore to exclude local dev artifacts Signed-off-by: rakshaak29 <rakshak29@gmail.com>

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

…r torch versions Signed-off-by: rakshaak29 <rakshak29@gmail.com>

…ainability Signed-off-by: rakshaak29 <rakshak29@gmail.com>

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

Rishabh-git10

Hi @rakshaak29, thanks for putting this together. Shifting the default config to huggingface is a great improvement for accessibility, but I ran into a blocker during setup.

The example fails to launch because colorlog is missing from requirements.txt. Please add this dependency to examples/cloud-edge-collaborative-inference-for-llm/requirements.txt.

Reproduced error:

...$ pip install -r examples/cloud-edge-collaborative-inference-for-llm/requirements.txt
...
...$ ianvs -f examples/cloud-edge-collaborative-inference-for-llm/benchmarkingjob.yaml
...
File ".../core/common/log.py", line 18, in <module>
    import colorlog
ModuleNotFoundError: No module named 'colorlog'

A few other notes:

This PR mixes fixes for cloud-edge-llm with path fixes for llm_simple_qa. I strongly recommend isolating the llm_simple_qa path fixes into a separate PR. Keeping the history clean makes it much easier for the maintainers to track what actually changed and why.
I noticed venv311/ was added. Please use a generic venv/ pattern instead; it ensures portability since different contributors use different Python versions.

Chidroopakanaparthy · 2026-05-23T06:46:51Z

Reproduction result:

Checked out pr-394. Changes confined to
examples/cloud-edge-collaborative-inference-for-llm/ only.

Key diff:

-vllm 
-datamodel_code_generator
+datamodel-code-generator
+retry
+torch>=2.0.0
+vllm # Uncomment for Linux + CUDA environments only

Removing vllm is a critical fix, its wheel build fails on macOS entirely, blocking pip install -r requirements.txt for all Mac contributors.

Note for smart_coding: this example has no requirements.txt at all. This PR establishes the minimum viable pattern. A dedicated PR should create examples/smart_coding/requirements.txt with at minimum:
torch>=2.0.0, transformers, openai, accelerate, tqdm, numpy.

Recommendation: Merge.

kubeedge-bot requested review from Poorunga and hsj576 April 12, 2026 17:32

kubeedge-bot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Apr 12, 2026

gemini-code-assist Bot reviewed Apr 12, 2026

View reviewed changes

Comment thread examples/llm_simple_qa/testalgorithms/gen/basemodel.py Outdated

Comment thread examples/llm_simple_qa/testalgorithms/gen/basemodel.py Outdated

Comment thread examples/cloud-edge-collaborative-inference-for-llm/requirements.txt Outdated

rakshaak29 added 5 commits April 12, 2026 23:05

fix(llm_simple_qa): update BaseModel.preprocess to return data

8249840

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

feat: enable Apple Silicon (MPS) acceleration fallback for llm_simple_qa

2b64550

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

fix(preprocess): make data parameter optional

db8e08a

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

fix(cloud-edge-llm): fix environment setup crashes and dependency typos

67e1eab

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

rakshaak29 force-pushed the fix/cloud-edge-env-setup branch from fc45dbf to 67e1eab Compare April 12, 2026 17:35

rakshaak29 added 3 commits April 12, 2026 23:07

fix(llm_simple_qa): safe check for torch.backends.mps to support olde…

e2c8393

…r torch versions Signed-off-by: rakshaak29 <rakshak29@gmail.com>

refactor(llm_simple_qa): deduplicate model_id string to improve maint…

78334bd

…ainability Signed-off-by: rakshaak29 <rakshak29@gmail.com>

build(cloud-edge-llm): specify minimum torch version and fix EOF newline

723530e

Signed-off-by: rakshaak29 <rakshak29@gmail.com>

Rishabh-git10 mentioned this pull request May 20, 2026

Issue summarization for example of cloud-edge-collaborative-inference-for-llm #430

Open

Rishabh-git10 suggested changes May 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(cloud-edge-llm): fix environment setup crashes and dependency typos#394

fix(cloud-edge-llm): fix environment setup crashes and dependency typos#394
rakshaak29 wants to merge 8 commits into
kubeedge:mainfrom
rakshaak29:fix/cloud-edge-env-setup

rakshaak29 commented Apr 12, 2026

Uh oh!

kubeedge-bot commented Apr 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Rishabh-git10 left a comment •

edited

Loading

Uh oh!

Chidroopakanaparthy commented May 23, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

rakshaak29 commented Apr 12, 2026

Description

Changes Made:

Testing Done

Uh oh!

kubeedge-bot commented Apr 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Rishabh-git10 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Chidroopakanaparthy commented May 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Rishabh-git10 left a comment •

edited

Loading

Chidroopakanaparthy commented May 23, 2026 •

edited

Loading