This repository was archived by the owner on Feb 25, 2026. It is now read-only.

When running Cogent, most of the sequences ended up in the tucked category #92

@mprotas69


Hi! I have been trying to run Cogent on four populations. Three worked well without any issues, but I have been struggling with the fourth.

First I followed the family-finding instructions for a small dataset. 8,969 of the sequences ended up in a single partition, even though they did not seem to share much sequence similarity, and within that bin there were many folders named "split" followed by a number.

I thought the problem might be that I had more than 20,000 input sequences, so I switched to the instructions for a large dataset. I'm not sure what happened there, but only 3,432 bins were created in the precluster_out directory, and a large number of sequences (15,174) ended up in the tucked category.

What are the tucked sequences, and what do you think I am doing incorrectly? For my other populations, around 9,000 bins were created, and there was no similar problem of so many sequences going into one partition. Thank you!
