Fairflow-BioinformaticsFramework/CAR-NGS_Backend

Cross-platform Accessible Reproducible-NGS data analysis (CAR-NGS)

"CAR-NGS: Buckle Up, We're Taking Your DNA for a Ride!"

This repository contains a collection of automated data analysis pipelines for various genomic techniques. Each pipeline runs in Docker to simplify the analysis of high-throughput sequencing data while ensuring reproducibility and ease of use.

Repository Structure

The repository is organized into directories, each corresponding to a specific genomic technique. Below is a brief description of each pipeline:

1. 16S

This pipeline is designed for the analysis of 16S rRNA gene sequencing data, typically used for microbial community profiling. The pipeline includes steps for quality control, taxonomic classification, and diversity analysis.

2. Atac-seq

The ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) pipeline facilitates the identification of open chromatin regions across the genome. The pipeline includes steps for quality control, alignment, peak calling, and visualization.

3. Bulk_rnaseq

This pipeline processes bulk RNA-Seq data to analyze gene expression levels in a population of cells. The pipeline includes steps for read alignment, quantification, differential expression analysis, and data visualization.

4. Detect-seq

Detect-seq is a pipeline for detecting sequence variants from high-throughput sequencing data. The pipeline includes steps for read alignment, variant calling, and annotation.

5. HTGTS

High-Throughput Genome-Wide Translocation Sequencing (HTGTS) is a technique used to map chromosomal translocations. This pipeline includes steps for alignment, translocation detection, and visualization.

6. SCI

The SCI (Single Cell Indexing) pipeline processes single-cell sequencing data to analyze gene expression at the individual cell level. The pipeline includes steps for cell demultiplexing, alignment, gene quantification, and clustering analysis.

7. Single_cell_rnaseq

This pipeline is tailored for single-cell RNA-Seq data, enabling the analysis of gene expression at the single-cell level. The pipeline includes steps for quality control, normalization, clustering, and differential expression analysis.

8. WholeGenomeSequencing

The Whole Genome Sequencing (WGS) pipeline processes data from WGS experiments, enabling comprehensive analysis of genomic variants. The pipeline includes steps for alignment, variant calling, annotation, and visualization.

How to Use These Pipelines

Each directory contains a dedicated pipeline for a specific genomic analysis technique. To run a pipeline:

  1. Navigate to the appropriate directory:

     cd Bulk_rnaseq

  2. Read the specific README.md: Each directory contains its own README.md file with detailed instructions on how to run the pipeline, including dependencies, input data formats, and output file descriptions.

  3. Prepare your environment: Ensure Docker is installed on your system. The pipelines are designed to run inside Docker containers, ensuring consistency and reproducibility across different computing environments.

  4. Execute the pipeline: Follow the instructions in the respective directory to run the analysis pipeline using Docker. Most pipelines can be started with a simple command, such as:

     ./run_pipeline.sh
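
The steps above could be wrapped in a small helper. The following is a minimal sketch, not part of the repository: the function names `is_known_pipeline` and `run_pipeline` are hypothetical, the directory names are taken from this README, and `run_pipeline.sh` is only the example entry point mentioned above, so an individual pipeline may use a different command.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper (not shipped with the repository).
# Assumption: each pipeline directory has an executable run_pipeline.sh;
# check the directory's own README.md for the real entry point.

# Pipeline directories listed in this README.
PIPELINES="16S Atac-seq Bulk_rnaseq Detect-seq HTGTS SCI Single_cell_rnaseq WholeGenomeSequencing"

# Succeeds if the given name is one of the directories above.
is_known_pipeline() {
    local name
    for name in $PIPELINES; do
        [ "$name" = "$1" ] && return 0
    done
    return 1
}

# Runs the pipeline script from inside its own directory.
# $1 = path to the repository checkout, $2 = pipeline directory name.
run_pipeline() {
    local root="$1" name="$2"
    is_known_pipeline "$name" || { echo "unknown pipeline: $name" >&2; return 2; }
    # Remind the user to read the per-pipeline instructions if they are missing.
    [ -f "$root/$name/README.md" ] || echo "note: no README.md in $root/$name" >&2
    [ -x "$root/$name/run_pipeline.sh" ] || { echo "no executable run_pipeline.sh in $root/$name" >&2; return 3; }
    # Subshell keeps the caller's working directory unchanged.
    (cd "$root/$name" && ./run_pipeline.sh)
}
```

After sourcing the file, a call such as `run_pipeline /path/to/CAR-NGS_Backend Bulk_rnaseq` would change into the directory and start the script, returning a non-zero status if the name is not listed or the script is absent.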

Prerequisites

  • Docker: Ensure that Docker is installed and running on your system. Docker containers are used to encapsulate the environment needed for each pipeline, making the setup process easier and ensuring reproducibility.
  • Sequencing Data: Input data formats vary depending on the pipeline, but generally include FASTQ files, BAM files, or VCF files.
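
Both prerequisites can be checked before starting a run. This is a rough sketch under stated assumptions: `docker_ready` and `input_kind` are hypothetical helper names, and the file extensions are common conventions for FASTQ, BAM, and VCF files rather than a list taken from the pipelines themselves. Each pipeline's README.md remains the authority on accepted inputs.

```shell
#!/usr/bin/env bash
# Hypothetical preflight helpers (not shipped with the repository).

# Succeeds only if the docker CLI is on PATH and the daemon responds.
docker_ready() {
    command -v docker >/dev/null 2>&1 || { echo "docker not found on PATH" >&2; return 1; }
    docker info >/dev/null 2>&1 || { echo "docker daemon not reachable" >&2; return 2; }
}

# Prints fastq, bam, vcf, or unknown based on the file name.
# Assumption: conventional extensions; pipelines may accept others.
input_kind() {
    case "$1" in
        *.fastq|*.fq|*.fastq.gz|*.fq.gz) echo fastq ;;
        *.bam) echo bam ;;
        *.vcf|*.vcf.gz) echo vcf ;;
        *) echo unknown ;;
    esac
}
```

For example, `docker_ready && input_kind sample_R1.fastq.gz` prints `fastq` on a machine where Docker is installed and running.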

Contributing

Contributions to this repository are welcome. If you have improvements, bug fixes, or new pipelines to add, please submit a pull request. Ensure that all pipelines are well-documented and tested.

License

This repository is licensed under the MIT License. See the LICENSE file for more information.

Contact

For any questions or issues related to these pipelines, please open an issue on the GitHub repository or contact the repository owner.
