# PangenomeAlignmentGenerator **Repository Path**: battle_ball/PangenomeAlignmentGenerator ## Basic Information - **Project Name**: PangenomeAlignmentGenerator - **Description**: 开个源,方便下载!!!!!!!11 - **Primary Language**: Go - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2023-05-28 - **Last Updated**: 2023-05-28 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # PangenomeAlignmentGenerator This repository contains what you need to align SRAs to a pangenome multi-FASTA file generated by Roary via reference-guided alignment. ### Input: Whole genome assemblies from NCBI ### Outputs: * XMFA file for whole genome sequences which have been aligned to a pangenome reference generated by Roary. * Pangenome analysis statistics ### Installation Start by cloning this repository and installing the different parts of the package via pip: * `pip install ~/go/src/github.com/kussell-lab/PangenomeAlignmentGenerator` You will also need to install the following dependencies: * [Roary](https://sanger-pathogens.github.io/Roary/) * [Prokka](https://github.com/tseemann/prokka) * [GNU parallel](https://www.gnu.org/software/parallel/) If you want to use SplitGenome (to split the final alignment into XMFA files for core and accessory genes) this can be installed via `go get`: * `go get -u github.com/kussell-lab/PangenomeAlignmentGenerator/SplitGenome` ## Usage `PangenomeAlignmentGenerate ` * `` can be download from ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/assembly_summary_refseq.txt * `` tsv of assembly accessions; search for assemblies in NCBI, select the Send option when viewing search results, then select "File" for "Choose Destination" and "ID Table" for "Format". * `` list of sra files which you want to align * `` is the working space and output directory * `` is the output_prefix for the final pangenome alignment Output is an XMFA file (`/_pangenome.xmfa`) containing the alignments of each sequence to the pangenome reference. You can then use `SplitGenome` to split this into XMFA files for core and accessory genes. It may be preferable to run each of the steps of this program separately (due to download issues, etc). You can view each step the program takes by looking at the script `bin/PangenomeAlignmentGenerate` or `PangenomeAlignmentGenerate.sh`.