Skip to content

added git workflow for unittest#23

Open
quic-rishinr wants to merge 15 commits into
quic-rishinr:mainfrom
quic:unit_test_workflow
Open

added git workflow for unittest#23
quic-rishinr wants to merge 15 commits into
quic-rishinr:mainfrom
quic:unit_test_workflow

Conversation

@quic-rishinr
Copy link
Copy Markdown
Owner

No description provided.

@quic-rishinr quic-rishinr force-pushed the unit_test_workflow branch 3 times, most recently from f29d4bf to 078b0b9 Compare March 27, 2026 12:24
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
quic-rishinr and others added 7 commits April 3, 2026 10:19
…efill/Decode/Vision/Encoder/Embedding)" (#902)

Reverts #894

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
… align the subfunction changes. (#882)

This PR rebases dev/rebase_transformers_v4_57_3 onto main and
consolidates our transformer rebase changes with the PR #880
subfunction/KV alignment so we keep the branch simpler and unify the
subfunction approach.

  ### What changed


  - KV/subfunction alignment:
- Applied PR #880-style wrapper changes for causal model families to
reduce divergence from mainline.
- Removed local resolve_kv_seq_len usage from remaining wrappers
(grok_1, molmo) to match the cache-native pattern used elsewhere.
- Removed now-unused helper resolve_kv_seq_len from
QEfficient/utils/_utils.py.
  - Unit test updates:
- Added a new quickcheck unit test for use_onnx_subfunctions=True that
validates decoder-block subfunction cardinality per causal model.
- Important: test counts only decoder model block functions (via
get_submodules_for_export()), not all ONNX helper functions, so the
assertion tracks the intended behavior.

  ### Decoder-block subfunction status (causal model list)

- Single decoder-block subfunction: falcon, gpt2, gptj, granite, llama,
mistral, mpt, olmo2, phi3, qwen2
- Multiple decoder-block subfunctions: codegen, gpt_oss, mixtral, phi
(Phi-1), starcoder2

  ### Tests verified

```
python -m pytest -q tests/unit_test/models/test_model_quickcheck.py -n auto
Result after subfunction-count + KV-helper cleanup: 75 passed, 1 skipped
```

cc: @anujgupt-github @quic-hemagnih

---------

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: Abhishek Kumar Singh <sabhis@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>
Signed-off-by: Mamta Singh <mamtsing@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Co-authored-by: Abhishek Kumar Singh <sabhis@qti.qualcomm.com>
Co-authored-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>
Co-authored-by: Mamta Singh <mamtsing@qti.qualcomm.com>
Co-authored-by: vtirumal <vtirumal@qti.qualcomm.com>
1. Added Support for Qwen3-VL
2. Added Support for Qwen3-VL-Moe
3. Added Support for Disaggregated VL MOE Model. (Three QPC )
4. CB support

---------

Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>
Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com>
Signed-off-by: Onkar Chougule <ochougul@qti.qualcomm.com>
Signed-off-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Co-authored-by: Onkar Chougule <168134249+ochougul@users.noreply.github.com>
Co-authored-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com>
Co-authored-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Updating wav2vec2 for batched_input and adding args for parsing
batch_size, ctx_len and num_devices

    model.compile(batch_size=args.batch_size,
                  num_devices=args.num_devices,
                  seq_len=args.seq_len,
                  num_cores=args.num_cores)

---------

Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>
  to allow authenticated access to gated HuggingFace models
- Wrap model/tokenizer loading in try/except blocks in quickcheck tests
  to gracefully skip tests when model fetch fails (e.g., auth errors)
- Fix HybridChunkedCache backward compatibility for transformers>=4.57
  which no longer exposes key_cache/value_cache attributes
- Fix Gemma3 config compatibility by mapping _sliding_window_pattern
  to sliding_window_pattern when the public attribute is missing

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants