When running cogent, most of sequences ended up being in the tucked category

Hi! I have four populations I have been trying to run Cogent for. Three seemed to work really well and I didn't have an issue. The other one, I have been struggling with. First I  tried the instructions for family finding for a small dataset.  8969 of the sequences ended up going into one partition (and didn't seem to have much sequence similarity). Then within this bin there were many folders titled split with a number.  I thought the problem might be because I had greater than 20,000 sequences as input so I switched to the instructions for the large dataset. I'm not sure what happened here but I had only 3,432  bins created in the precluster_out directory and a large number of sequences were in the tucked category (15,174). What are the tucked sequences and what do you think I am doing incorrectly? (For my other populations, I had around 9000 bins created and there wasn't this same issue of so many sequences going into one partition). Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When running cogent, most of sequences ended up being in the tucked category #92

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

When running cogent, most of sequences ended up being in the tucked category #92

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions