Skip to content

Adding gmb dataset#39

Open
tejasvaidhyadev wants to merge 16 commits intoJuliaText:masterfrom
tejasvaidhyadev:adding_GMB_Dataset
Open

Adding gmb dataset#39
tejasvaidhyadev wants to merge 16 commits intoJuliaText:masterfrom
tejasvaidhyadev:adding_GMB_Dataset

Conversation

@tejasvaidhyadev
Copy link
Copy Markdown
Member

Adding GMB Dataset.
The dataset an extract from GMB corpus which is tagged, annotated and built specifically to train the classifier to predict named entities such as name, location, etc.

Copy link
Copy Markdown
Member

@oxinabox oxinabox left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cool, good idea.
A few comments.
Also needs docs and tests

tejasvaidhyadev and others added 3 commits March 11, 2020 20:52
Co-Authored-By: Lyndon White <oxinabox@ucc.asn.au>
Co-Authored-By: Lyndon White <oxinabox@ucc.asn.au>
Co-Authored-By: Lyndon White <oxinabox@ucc.asn.au>
@tejasvaidhyadev
Copy link
Copy Markdown
Member Author

Thankyou I will implement suggested changes(including Docs and tests ) soon

Co-Authored-By: Lyndon White <oxinabox@ucc.asn.au>
@tejasvaidhyadev
Copy link
Copy Markdown
Member Author

Hi @oxinabox added some testsets by taking examples from other datasets.I don't know much about tests and i am still learning.
let me know what else tests can be added.

@tejasvaidhyadev
Copy link
Copy Markdown
Member Author

tejasvaidhyadev commented Mar 14, 2020

Hi @oxinabox
For now I added only POS tagged of GMB
As my project only need POS tags and i will also implement NER tags soon
Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants