├── .gitignore └── README.md /.gitignore: -------------------------------------------------------------------------------- 1 | .idea/ -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Awesome phages 2 | 3 | A curated list of awesome phage related software and other resources to facilitate the life of phage scientists, 4 | bioinformaticians and enthusiasts. 5 | 6 | 7 | ## How to contribute 8 | 9 | If you want to add a link, please try to follow these soft rules: 10 | 11 | - Position your resource in alphabetical order. 12 | - Add section if needed. 13 | - Add section title to Table of contents. 14 | - Search previous suggestions before making a new one, as yours may be a duplicate. 15 | - Add your links using another entry as inspiration 16 | - Check your spelling and grammar. 17 | - Make sure your text editor is set to remove trailing whitespace. 18 | - Send a Pull Request. 19 | - Add short description in your Pull Request of why the project/resource/tool is awesome. 20 | 21 | ## Table of Contents 22 | 23 | - [Annotation](#annotation) 24 | - [Genome prediction](#genome-prediction) 25 | - [Host prediction](#host-prediction) 26 | - [Prophage prediction](#prophage-prediction) 27 | - [Gene prediction annotation](#gene-prediction-annotation) 28 | - [Web tools](#web-tools) 29 | - [Miscellaneous](#miscellaneous) 30 | 31 | ## Annotation 32 | 33 | ### Genome prediction 34 | 35 | - **[Seeker](https://github.com/gussow/seeker)** - Seeker is a python library for discriminating between bacterial and 36 | phage genomes. Seeker is based on an LSTM deep-learning models and does not rely on a reference genome, genomic 37 | alignment or any direct genome comparison. 38 | [ [web](http://seeker.pythonanywhere.com/predict/) ] 39 | - **[VIRify](https://github.com/EBI-Metagenomics/emg-viral-pipeline)** - VIRify: detection of phages and eukaryotic 40 | viruses from metagenomic and metatranscriptomic assemblies. 41 | 42 | ### Host prediction 43 | 44 | - **[iphop](https://bitbucket.org/srouxjgi/iphop/src/main/)** - **iPHoP** stands for **i**ntegrated **P**hage **H**ost **P**rediction. It is an automated command-line pipeline for predicting host genus of novel bacteriophages and archaeoviruses based on their genome sequences. 45 | 46 | - **[phirbo](https://github.com/aziele/phirbo)** - A tool to predict prokaryotic hosts for phage (meta)genomic 47 | sequences. The tool uses information on sequence similarity between phage and bacteria genomes as well as sequence 48 | relatedness among bacteria. 49 | 50 | ### Prophage prediction 51 | 52 | - **[DEPhT](https://github.com/chg60/DEPhT)** - A new tool for identifying prophages in bacteria scanning hundreds to thousands of genomes and accurately extract complete (likely active) prophages from them. [Publicaiton](https://academic.oup.com/nar/advance-article/doi/10.1093/nar/gkac273/6572362) 53 | 54 | - **[DBSCAN-SWA](https://github.com/HIT-ImmunologyLab/DBSCAN-SWA)** - An integrated tool for rapid prophage detection 55 | and annotation. 56 | - **[hafeZ](https://github.com/Chrisjrt/hafeZ)** - A tool for identifying inducible prophage elements through read 57 | mapping. 58 | - **[Phigaro](https://github.com/bobeobibo/phigaro)** - Phigaro is a standalone command-line application that is able 59 | to detect prophage regions taking raw genome and metagenome assemblies as an input. 60 | - **[PhageBoost](https://github.com/ku-cbd/PhageBoost)** - Rapid discovery of novel prophages using biological feature 61 | engineering and machine learning. 62 | - **[PhiSpy](https://github.com/linsalrob/phispy)** - PhiSpy identifies prophages in Bacterial genomes. Given an 63 | annotated genome it will use several approaches to identify the most likely prophage regions. 64 | - **[Vibrant](https://github.com/AnantharamanLab/VIBRANT)** - VIBRANT is a tool for automated recovery and annotation 65 | of bacterial and archaeal viruses, determination of genome completeness, and characterization of viral community 66 | function from metagenomic assemblies. 67 | - **[VirSorter2](https://github.com/jiarong/VirSorter2)** - VirSorter2 applies a multi-classifier, expert-guided 68 | approach to detect diverse DNA and RNA virus genomes. 69 | - **[WTP](https://github.com/replikation/What_the_Phage)** - What the Phage is a scalable workflow that combines multiple prophage prediction programs, including several of the aforementioned. It starts out of the raw fasta or fastq files and generates an integrated summary of the results obtained from each of the selected tools. 70 | 71 | ### Gene prediction-annotation 72 | 73 | - **[multiPhATE2](https://github.com/carolzhou/multiPhATE2)** - MultiPhATE is a command-line program that runs gene 74 | finding and the PhATE annotation code over user-specified phage genomes, then performs gene-by-gene comparisons among 75 | the genomes. 76 | - **[Phanns](https://github.com/Adrian-Cantu/PhANNs)** - PhANNs is a tool to classify any phage ORF as one of 10 77 | structural protein class, or as "others". It uses an ensemble of Artificial Neural Networks. [[web](https://edwards.sdsu.edu/phanns)] 78 | - **[PHANOTATE](https://github.com/deprekate/PHANOTATE)** - PHANOTATE is a tool to annotate phage genomes. It uses the 79 | assumption that non-coding bases in a phage genome is disadvantageous, and then populates a weighted graph to find the 80 | optimal path through the six frames of the DNA where open reading frames are beneficial paths, while gaps and overlaps 81 | are penalized paths. 82 | - **[pharokka](https://github.com/gbouras13/pharokka)** Pharokka is a rapid standardised annotation program for bacteriophages. It is inspired on [Prokka](https://github.com/tseemann/prokka). Gene prediction is performed with **[PHANOTATE](https://github.com/deprekate/PHANOTATE) and functional annotation is based on the [PHROGs database](https://phrogs.lmge.uca.fr) 83 | 84 | - **[Prokka](https://github.com/tseemann/prokka)** - Prokka is not phage specific but is often used to annotate phages. 85 | 86 | ## Web tools 87 | 88 | - **[Patric](https://www.patricbrc.org/)** PATRIC, the Pathosystems Resource Integration Center, provides integrated 89 | data and analysis tools to support biomedical research on bacterial infectious diseases. 90 | - **[Phamerator](https://phamerator.org/)** - Comparative genomics and genome exploration tool for bacteriophages. 91 | [ [source](https://github.com/scresawn/phamerator) ] 92 | - **[Phaster](https://phaster.ca/)** - Rapid identification and annotation of prophage sequences within bacterial 93 | genomes and plasmids. 94 | - **[Prophage Hunter](https://pro-hunter.genomics.cn/index.php/Home)** - Prophage Hunter is a web server tool that employs several criteria to score how likely a prophage is beeing active within a bacterial genome. See also the paper [here](https://academic.oup.com/nar/article/47/W1/W74/5494712). 95 | - **[RAST](https://rast.nmpdr.org/)** - RAST (Rapid Annotation using Subsystem Technology) is a fully-automated service 96 | for annotating complete or nearly complete bacterial and archaeal genomes. It provides high quality genome annotations 97 | for these genomes across the whole phylogenetic tree. 98 | 99 | ## Miscellaneous 100 | 101 | - **[bacteriophage.news](https://www.bacteriophage.news/)** - Bacteriophage.news is a virtual space that provides 102 | general news, research articles & patient stories on phage therapy. 103 | - **[inphared](https://github.com/RyanCook94/inphared)** - Providing up-to-date phage genome databases, metrics and 104 | useful input files for a number of bioinformatic pipelines. 105 | - **[millardlab](http://millardlab.org/)** - Lab from the University of Leicester focused on bacteriophages with plenty 106 | of resources for phage bioinformatics. 107 | - **[phage.directory](https://phage.directory/)** - Phage Directory curates a database of phage labs, phages, and host 108 | strains to advance research and phage therapy. 109 | - **[phagesdb](https://phagesdb.org/)** - The Actinobacteriophage database, more commonly known as PhagesDB, is a 110 | database-backed website that gathers and shares information related to the discovery, characterization and genomics of 111 | viruses that prefer to infect Actinobacterial hosts 112 | - **[PHAGES2050](https://github.com/ptynecki/PHAGES2050)** - a comprehensive AI-based Framework for Phage Research & Therapy 113 | - **[phage.ai](https://phage.ai/)** - PhageAI is an AI-driven software platform using advanced Machine Learning and 114 | Natural Language Processing techniques for deeper understanding of the bacteriophages genomics. 115 | - **[seaphages](https://seaphages.org/)** - SEA-PHAGES is a two-semester, discovery-based undergraduate research course 116 | that begins with simple digging in the soil to find new viruses, but progresses through a variety of microbiology 117 | techniques and eventually to complex genome annotation and bioinformatic analyses. 118 | 119 | 120 | ## Kwnologe base 121 | 122 | - **[Viral taxonomy](https://talk.ictvonline.org/ictv-reports/ictv_online_report/)** 123 | --------------------------------------------------------------------------------