added git workflow for unittest by quic-rishinr · Pull Request #23 · quic-rishinr/efficient-transformers

quic-rishinr · 2026-03-18T09:17:59Z

No description provided.

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

…efill/Decode/Vision/Encoder/Embedding)" (#902) Reverts #894 Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

@anujgupt-github

… align the subfunction changes. (#882) This PR rebases dev/rebase_transformers_v4_57_3 onto main and consolidates our transformer rebase changes with the PR #880 subfunction/KV alignment so we keep the branch simpler and unify the subfunction approach. ### What changed - KV/subfunction alignment: - Applied PR #880-style wrapper changes for causal model families to reduce divergence from mainline. - Removed local resolve_kv_seq_len usage from remaining wrappers (grok_1, molmo) to match the cache-native pattern used elsewhere. - Removed now-unused helper resolve_kv_seq_len from QEfficient/utils/_utils.py. - Unit test updates: - Added a new quickcheck unit test for use_onnx_subfunctions=True that validates decoder-block subfunction cardinality per causal model. - Important: test counts only decoder model block functions (via get_submodules_for_export()), not all ONNX helper functions, so the assertion tracks the intended behavior. ### Decoder-block subfunction status (causal model list) - Single decoder-block subfunction: falcon, gpt2, gptj, granite, llama, mistral, mpt, olmo2, phi3, qwen2 - Multiple decoder-block subfunctions: codegen, gpt_oss, mixtral, phi (Phi-1), starcoder2 ### Tests verified ``` python -m pytest -q tests/unit_test/models/test_model_quickcheck.py -n auto Result after subfunction-count + KV-helper cleanup: 75 passed, 1 skipped ``` cc: @anujgupt-github @quic-hemagnih --------- Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com> Signed-off-by: Abhishek Kumar Singh <sabhis@qti.qualcomm.com> Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com> Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com> Signed-off-by: Mamta Singh <mamtsing@qti.qualcomm.com> Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com> Co-authored-by: Abhishek Kumar Singh <sabhis@qti.qualcomm.com> Co-authored-by: Dipankar Sarkar <dipankar@qti.qualcomm.com> Co-authored-by: Mamta Singh <mamtsing@qti.qualcomm.com> Co-authored-by: vtirumal <vtirumal@qti.qualcomm.com>

1. Added Support for Qwen3-VL 2. Added Support for Qwen3-VL-Moe 3. Added Support for Disaggregated VL MOE Model. (Three QPC ) 4. CB support --------- Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com> Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com> Signed-off-by: Onkar Chougule <ochougul@qti.qualcomm.com> Signed-off-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com> Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com> Co-authored-by: Onkar Chougule <168134249+ochougul@users.noreply.github.com> Co-authored-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com> Co-authored-by: vtirumal <vtirumal@qti.qualcomm.com>

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

Updating wav2vec2 for batched_input and adding args for parsing batch_size, ctx_len and num_devices model.compile(batch_size=args.batch_size, num_devices=args.num_devices, seq_len=args.seq_len, num_cores=args.num_cores) --------- Signed-off-by: Tanisha Chawada <tchawada@qti.qualcomm.com>

to allow authenticated access to gated HuggingFace models - Wrap model/tokenizer loading in try/except blocks in quickcheck tests to gracefully skip tests when model fetch fails (e.g., auth errors) - Fix HybridChunkedCache backward compatibility for transformers>=4.57 which no longer exposes key_cache/value_cache attributes - Fix Gemma3 config compatibility by mapping _sliding_window_pattern to sliding_window_pattern when the public attribute is missing Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

quic-rishinr requested a review from quic-amitraj as a code owner March 18, 2026 09:18

quic-rishinr force-pushed the unit_test_workflow branch 3 times, most recently from f29d4bf to 078b0b9 Compare March 27, 2026 12:24

quic-rishinr force-pushed the unit_test_workflow branch from f1577e6 to e36bf6c Compare April 2, 2026 09:53

quic-rishinr added 8 commits April 2, 2026 15:27

added git workflow for unittest

21d24f0

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

lint fix

e8dc5d1

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

lint fix

2f06ae4

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

removed cloud infer test

cccf63d

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

Updated tests

3c990c8

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

Moved quickcheck.yml file to unittest

297ebef

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

test rebased

000155e

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

lint fix

941fd3f

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

quic-rishinr force-pushed the unit_test_workflow branch from e36bf6c to 941fd3f Compare April 2, 2026 09:57

quic-rishinr and others added 7 commits April 3, 2026 10:19

Revert "feat: Named graph specializations in specializations.json (Pr…

c36329c

…efill/Decode/Vision/Encoder/Embedding)" (#902) Reverts #894 Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

Merge branch 'main' into unit_test_workflow

d775a79

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

Merge branch 'main' into unit_test_workflow

2a6ee77

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added git workflow for unittest#23

added git workflow for unittest#23
quic-rishinr wants to merge 15 commits into
quic-rishinr:mainfrom
quic:unit_test_workflow

quic-rishinr commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

quic-rishinr commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants