WIP: Implement LTU model by The-Mats · Pull Request #19 · DBD-research-group/BioFoundation

The-Mats · 2024-11-10T16:21:30Z

We want to experiment with LTU as it is a promising model that combines natural language and audio classification to create a more capable and diverse model. Gong et al. created a newer version of the model named LTU-AS, which uses whisper features and performs better especially for speech and music. I think we should first try LTU and then maybe LTU-AS!

For inference they provide shell scripts that locally load a website for easy interaction, which we don't really need.

The following steps are needed to integrate the model into the BS pipeline:

Started implementing LTU

3f155b2

The-Mats marked this pull request as draft November 10, 2024 16:21

The-Mats added 5 commits November 21, 2024 16:53

Added peft package

95888ad

ltu changes

b2f2407

Added missing package sentencepiece for LLM

8f9315f

LTU fixes, added download script (Doesn't download correctly)

9d7ff9f

Script path fixes

d90b8f6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Implement LTU model#19

WIP: Implement LTU model#19
The-Mats wants to merge 6 commits intomainfrom
ltu-model

The-Mats commented Nov 10, 2024 •

edited by raphaelschwinger

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

The-Mats commented Nov 10, 2024 • edited by raphaelschwinger Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

The-Mats commented Nov 10, 2024 •

edited by raphaelschwinger

Loading