"BaseEstimator - Pipeline ..." track

BaseEstimator and Pipeline were introduced. The main question is how to assemble them correctly. It turned out that when one of the features has only a few occurrences of some value, all of them may land in either train or test, and OHE (one-hot encoding) then produces a different number of new features for the two splits. The columns can be lined up with the align command, but at which stage of the pipeline should it be applied?
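One possible answer, sketched under assumptions: the toy data, the column names `color` and `x`, and the choice of `LogisticRegression` are made up for illustration. `DataFrame.align` works on frames that were encoded separately, outside the pipeline. Inside a pipeline, a `OneHotEncoder` with `handle_unknown="ignore"` learns its categories in `fit()` on train only, so the column count stays fixed and a separate align step may not be needed.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Toy data (made up): the category "c" occurs only in the test split.
train = pd.DataFrame({"color": ["a", "b", "a", "b"], "x": [0.1, 0.9, 0.2, 0.8]})
test = pd.DataFrame({"color": ["a", "c"], "x": [0.15, 0.5]})
y_train = [0, 1, 0, 1]

# Option 1, outside a pipeline: encode each split separately, then align.
train_d = pd.get_dummies(train)  # columns: x, color_a, color_b
test_d = pd.get_dummies(test)    # columns: x, color_a, color_c
# join="left" keeps the train columns; columns missing in test are filled with 0.
train_a, test_a = train_d.align(test_d, join="left", axis=1, fill_value=0)

# Option 2, inside a pipeline: categories are learned from train in fit();
# categories unseen at fit time are encoded as all zeros at transform time.
preprocess = ColumnTransformer(
    [("ohe", OneHotEncoder(handle_unknown="ignore"), ["color"])],
    remainder="passthrough",  # keep the numeric column "x" unchanged
)
model = Pipeline([("prep", preprocess), ("clf", LogisticRegression())])
model.fit(train, y_train)
pred = model.predict(test)

# The fitted preprocessing step yields the same width for both splits.
n_train_cols = model.named_steps["prep"].transform(train).shape[1]
n_test_cols = model.named_steps["prep"].transform(test).shape[1]
```

With the second option the encoder lives in the pipeline, so cross-validation refits it on each training fold and the test fold never influences the set of columns.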

[germs] Interactive Pipeline and Composite Estimators for your end-to-end ML model
[germs2] Pipeline, ColumnTransformer and FeatureUnion explained
[germs3] Difference between fit() , transform() and fit_transform() method in Scikit-learn
[germs4] get_params, set_params
[germs5] Metrics in machine learning tasks

Daily notes

[2023-02-12] Python Decorators to Take Your Code to the Next Level

[2023-02-17]

[2023-02-21]

[2023-02-22] df = pd.DataFrame(np.random.randn(100, 4), columns=list('ABCD'))

[2023-02-24] DummyClassifier(strategy='stratified')
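The two snippets above can be combined into a small baseline check. This is only a sketch: the target `y` is a hypothetical label derived from column `A`, the seed is arbitrary, and `np.random.default_rng` is used in place of `np.random.randn` for reproducibility.

```python
import numpy as np
import pandas as pd
from sklearn.dummy import DummyClassifier

# Random feature frame, as in the 2023-02-22 note (seeded here for repeatability).
rng = np.random.default_rng(0)
df = pd.DataFrame(rng.standard_normal((100, 4)), columns=list("ABCD"))

# Hypothetical binary target, made up for this example.
y = (df["A"] > 0).astype(int)

# The "stratified" strategy ignores the features and draws predictions
# at random according to the class frequencies seen in fit().
baseline = DummyClassifier(strategy="stratified", random_state=0)
baseline.fit(df, y)
acc = baseline.score(df, y)  # accuracy of the random baseline
```

A real model placed at the end of a Pipeline should score clearly above this baseline; if it does not, the features or the evaluation setup deserve a second look.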
