Switch batch inference to Feature Store offline store by shlbatra · Pull Request #28 · shlbatra/Machine-learning-Ops-Deployment-Inference

shlbatra · 2026-06-16T13:04:14Z

Summary

Read from canonical feature table (iris_features) instead of raw BQ tables (iris / iris_pubsub_data)
Filter to source = 'batch_input' server-side in SQL — only scores unlabeled inference data
Remove the conditional column rename hack (old lines 36-48) — canonical table has consistent names regardless of data source
Use canonical feature column names (sepal_length_cm, etc.) matching the retrained model
Use BigQueryClient(project=project_id) for query job permissions (same pattern as Step 6)

What was removed

The old inference component had a brittle if bq_table == "iris_pubsub_data" branch that renamed snake_case Pub/Sub columns to CamelCase. With the feature store, all data flows through ingest.py into iris_features with canonical names — no conditional renaming needed.

Prerequisites

ingest.py must be run to populate iris_features with both training and batch_input rows
bq_dataloader.py --generate-random N must be run first to create batch_input data
Model must be retrained on canonical column names (Step 6 PR)

Test plan

Run bq_dataloader.py --generate-random 20 then ingest.py to populate feature table
Submit inference pipeline and confirm it reads only batch_input rows
Verify predictions are written to iris_predictions table
Confirm no column rename errors — canonical names used end-to-end

🤖 Generated with Claude Code

Read from canonical feature table (iris_features) with server-side source='batch_input' filter instead of raw BQ tables. Remove the conditional column rename hack — canonical table has consistent names regardless of data source. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Append -training and -inference to PIPELINE_NAME in each pipeline file so they show as distinct pipelines in Vertex AI (e.g. pipeline-iris-staging-training, pipeline-iris-staging-inference). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

list_models returned models in creation order and [0] grabbed the first (oldest) version — trained with CamelCase columns before the feature store migration. Sort by create_time descending so [0] is the most recently registered model. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Drop CamelCase aliases and ConfigDict — field names match the feature platform directly. No backward compat needed since the model is retrained on canonical names. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Also fix sepal_width_cm type from integer to number to match the other feature fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Loads the latest registered model from GCS and checks that feature_names_in_ matches the canonical names from the feature store. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Prevents stale bytecache or non-editable installs from causing KFP to serialize old component code into pipeline YAML. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The upper bound <3.11 excluded the local Python 3.11.0, blocking editable installs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

list_models returns parent model entries, not versions. Use list_model_versions to get all versions of the model, then sort by create_time to pick the latest one. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

@blessed

register.py already sets version_aliases=['blessed'] on each uploaded model. Use get_model(name + '@blessed') to directly fetch the blessed version instead of listing all versions and sorting. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

shlbatra · 2026-06-16T15:57:35Z

Successful run -

https://console.cloud.google.com/agent-platform/pipelines/locations/us-central1/runs/pipeline-iris-staging-inference-20260616115218?project=deeplearning-sahil

Data loaded (50 rows) -

/gcs/sb-vertex/staging/pipeline_root/57434141298/pipeline-iris-staging-inference-20260616115218/get-model_4592835321065897984/latest_model

shlbatra and others added 14 commits June 16, 2026 09:03

Simplify Instance model to use canonical feature names only

bfa47bd

Drop CamelCase aliases and ConfigDict — field names match the feature platform directly. No backward compat needed since the model is retrained on canonical names. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update instance schema to canonical feature names

d6c9f6a

Also fix sepal_width_cm type from integer to number to match the other feature fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add notebook to verify model feature names match feature platform

1e35bf9

Loads the latest registered model from GCS and checks that feature_names_in_ matches the canonical names from the feature store. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add script to clear pycache and reinstall package in editable mode

65c959e

Prevents stale bytecache or non-editable installs from causing KFP to serialize old component code into pipeline YAML. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Extend Python version constraint to include 3.11

eaf7375

The upper bound <3.11 excluded the local Python 3.11.0, blocking editable installs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cleanup

19e1311

Use uv for faster editable install in clean_reinstall.sh

40d8f56

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Fix get_model to fetch latest version, not just parent model

449fa7a

list_models returns parent model entries, not versions. Use list_model_versions to get all versions of the model, then sort by create_time to pick the latest one. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update notebook to use blessed alias for model lookup

20b36b0

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

update verify feature notebook

1867085

shlbatra merged commit f159bac into main Jun 16, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch batch inference to Feature Store offline store#28

Switch batch inference to Feature Store offline store#28
shlbatra merged 14 commits into
mainfrom
feature/fp-step8-batch-inference

shlbatra commented Jun 16, 2026

Uh oh!

shlbatra commented Jun 16, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

shlbatra commented Jun 16, 2026

Summary

What was removed

Prerequisites

Test plan

Uh oh!

shlbatra commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

shlbatra commented Jun 16, 2026 •

edited

Loading