Cross-platform Accessible Reproducible-NGS data analysis (CAR-NGS)

"CAR-NGS: Buckle Up, We're Taking Your DNA for a Ride!"

This repository contains a collection of automated data analysis pipelines for various genomic techniques. Each pipeline is designed to simplify the process of analyzing high-throughput sequencing data using Docker, ensuring reproducibility and ease of use.

Repository Structure

The repository is organized into directories, each corresponding to a specific genomic technique. Below is a brief description of each pipeline:

1. `16S`

This pipeline is designed for the analysis of 16S rRNA gene sequencing data, typically used for microbial community profiling. The pipeline includes steps for quality control, taxonomic classification, and diversity analysis.

2. `Atac-seq`

The ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) pipeline facilitates the identification of open chromatin regions across the genome. The pipeline includes steps for quality control, alignment, peak calling, and visualization.

3. `Bulk_rnaseq`

This pipeline processes bulk RNA-Seq data to analyze gene expression levels in a population of cells. The pipeline includes steps for read alignment, quantification, differential expression analysis, and data visualization.

4. `Detect-seq`

Detect-seq is a pipeline for detecting sequence variants from high-throughput sequencing data. The pipeline includes steps for read alignment, variant calling, and annotation.

5. `HTGTS`

High-Throughput Genome-Wide Translocation Sequencing (HTGTS) is a technique used to map chromosomal translocations. This pipeline includes steps for alignment, translocation detection, and visualization.

6. `SCI`

The SCI (Single Cell Indexing) pipeline processes single-cell sequencing data to analyze gene expression at the individual cell level. The pipeline includes steps for cell demultiplexing, alignment, gene quantification, and clustering analysis.

7. `Single_cell_rnaseq`

This pipeline is tailored for single-cell RNA-Seq data, enabling the analysis of gene expression at the single-cell level. The pipeline includes steps for quality control, normalization, clustering, and differential expression analysis.

8. `WholeGenomeSequencing`

The Whole Genome Sequencing (WGS) pipeline processes data from WGS experiments, enabling comprehensive analysis of genomic variants. The pipeline includes steps for alignment, variant calling, annotation, and visualization.

How to Use These Pipelines

Each directory contains a dedicated pipeline for a specific genomic analysis technique. To run a pipeline:

Navigate to the appropriate directory:

  cd Bulk_rnaseq

Read the specific README.md: Each directory contains its own README.md file with detailed instructions on how to run the pipeline, including dependencies, input data formats, and output file descriptions.
Prepare your environment: Ensure Docker is installed on your system. The pipelines are designed to run inside Docker containers, ensuring consistency and reproducibility across different computing environments.
Execute the pipeline: Follow the instructions in the respective directory to run the analysis pipeline using Docker. Most pipelines can be started with a simple command, such as:

./run_pipeline.sh

Prerequisites

Docker: Ensure that Docker is installed and running on your system. Docker containers are used to encapsulate the environment needed for each pipeline, making the setup process easier and ensuring reproducibility.
Sequencing Data: Input data formats vary depending on the pipeline, but generally include FASTQ files, BAM files, or VCF files.

Contributing

Contributions to this repository are welcome. If you have improvements, bug fixes, or new pipelines to add, please submit a pull request. Ensure that all pipelines are well-documented and tested.

License

This repository is licensed under the MIT License. See the LICENSE file for more information.

Contact

For any questions or issues related to these pipelines, please open an issue on the GitHub repository or contact the repository owner.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cross-platform Accessible Reproducible-NGS data analysis (CAR-NGS)

"CAR-NGS: Buckle Up, We're Taking Your DNA for a Ride!"

Repository Structure

1. `16S`

2. `Atac-seq`

3. `Bulk_rnaseq`

4. `Detect-seq`

5. `HTGTS`

6. `SCI`

7. `Single_cell_rnaseq`

8. `WholeGenomeSequencing`

How to Use These Pipelines

Prerequisites

Contributing

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
16S		16S
Atac-seq		Atac-seq
Bulk_rnaseq		Bulk_rnaseq
Detect-seq		Detect-seq
HTGTS		HTGTS
SCI		SCI
Single_cell_rnaseq		Single_cell_rnaseq
WholeGenomeSequencing		WholeGenomeSequencing
Logo.png		Logo.png
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Cross-platform Accessible Reproducible-NGS data analysis (CAR-NGS)

"CAR-NGS: Buckle Up, We're Taking Your DNA for a Ride!"

Repository Structure

1. 16S

2. Atac-seq

3. Bulk_rnaseq

4. Detect-seq

5. HTGTS

6. SCI

7. Single_cell_rnaseq

8. WholeGenomeSequencing

How to Use These Pipelines

Prerequisites

Contributing

License

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. `16S`

2. `Atac-seq`

3. `Bulk_rnaseq`

4. `Detect-seq`

5. `HTGTS`

6. `SCI`

7. `Single_cell_rnaseq`

8. `WholeGenomeSequencing`

Packages