crAssphage poops up again!

Yet again, analysis of a metagenomic sample shows that crAssphage is the most abundant phage anywhere. It also shows what a disservice NCBI did to science by deleting the crAssphage record. We used metaSPAdes to reconstruct the entire crAssphage genome from someone else’s data set, whereas in their paper the largest contig was ~3 kb. This analysis suggests that crAssphage is present in ulcerative colitis samples but that its abundance goes down after treatment!

Phage Identification

We are interested in phages, viruses that infect bacteria. For years, the Edwards lab has been looking for new, undiscovered phages.

Recently, we identified the crAssphage, a new type of virus that has never been seen before. By looking at the sequences in metagenomes we were able to identify a set of contigs that were common among many different metagenomes. When we assembled them, they looked like a phage. We could compare them to other known phages in our database of sequences.

Working with folks in the biology department, we used PCR to prove that this is a circular virus. However, we have so far been unable to culture the virus in vivo. We’re working on it, and hopefully others are too, but until then we don’t have an image of the virus or an idea of what it does.

What is the relative biomass of crAssphage in the intestine?

Following up on the crAssphage press and comments, Dan asked me the following question:

It was interesting to hear that there are 10 times as many viruses as bacteria in the body. If you have time to answer a question, I’ve always wondered about the relative biomass of bacteria compared to human cells, and now the relative biomass of viruses compared to human cells.

Inspired by XKCD’s what-if, we can use some Fermi estimation to answer this. A typical virus is about 10⁻¹⁹ kg (e.g. adenovirus, which is about 50 kb, is 2.5 × 10⁻¹⁹ kg [1]). A typical bacterium, like E. coli, is about 10⁻¹⁵ kg, and a typical human cell is about 10⁻¹² kg.

Scientists like to say that we have ~10x more bacteria than human cells and ~10x more viruses than bacteria. In the human body there are about 37 trillion cells [2] (37 × 10¹², but since we are estimating we’ll round that to 10¹⁴). Based on these estimates, the average human carries about 100 kg of human cells (10¹⁴ cells × 10⁻¹² kg), 1 kg of bacteria (10¹⁵ cells × 10⁻¹⁵ kg), and 0.001 kg of viruses (10¹⁶ viruses × 10⁻¹⁹ kg).
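The arithmetic above is easy to check in a few lines of Python. This is only a sketch of the same Fermi estimate: the counts and masses are the order-of-magnitude figures from the text, and the names (`ESTIMATES`, `biomass_kg`) are made up for illustration. Working with powers of ten keeps the numbers in the spirit of the estimate.

```python
# Order-of-magnitude figures from the text, stored as powers of ten:
# (log10 of the number of particles, log10 of the mass per particle in kg)
ESTIMATES = {
    "human cells": (14, -12),
    "bacteria": (15, -15),
    "viruses": (16, -19),
}

def biomass_kg(count_exponent, mass_exponent):
    """Total mass in kg: 10^count * 10^mass = 10^(count + mass)."""
    return 10.0 ** (count_exponent + mass_exponent)

# Total estimated mass of each category
totals = {name: biomass_kg(c, m) for name, (c, m) in ESTIMATES.items()}

for name, kg in totals.items():
    print(f"{name}: about {kg:g} kg")

# Ratio of viral biomass to human-cell biomass
virus_to_human = totals["viruses"] / totals["human cells"]
print(f"viruses / human cells: about {virus_to_human:g}")
```

On these rough numbers, viruses outnumber human cells a hundredfold but account for only about a hundred-thousandth of the mass.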

Phage tRNA genes

We have just published a paper describing how phages affect translation of proteins very specifically. In our case, the phage expressed a peptide deformylase with increased specificity for proteins involved in photosynthesis. That led me to wonder how else phages affect protein synthesis, and whether they are merely trying to increase the amount of protein being made. One way to do that might be to increase the number of tRNA-Met initiator tRNAs. To test this hypothesis, I counted all the tRNAs in all the phages to see which is the most abundant. It wasn’t tRNA-Met, and after the read more I will tell you what it was.



I used tRNAScan-SE to identify all the tRNAs in all the phages in the PhAnToMe database, and from the output from tRNAScan-SE I counted all the different types. Here is a table listing all the tRNAs and their frequency in the phages:




tRNA    Count   tRNA    Count   tRNA    Count
Arg     194     Pro     103     Ser     53
Leu     177     Gln     92      Ile     52
Met     168     Trp     86      His     48
Pseudo  166     Glu     78      Ala     48
Thr     137     Val     77      Tyr     41
Asn     124     Asp     72      Undet   30
Gly     120     Phe     62      Sup     19
Lys     112     Cys     55


This is all of the tRNAs, and clearly there are some really big differences. A question to answer is: why are some tRNAs more abundant than others?
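For anyone who wants to repeat the count, here is a minimal sketch of the tallying step. It is not the script used for the table above. It assumes the default tab-separated tRNAscan-SE output, where the second column is the tRNA number and the fifth column is the tRNA type. The function name and the sample rows are made up for illustration.

```python
from collections import Counter

def count_trna_types(lines):
    """Tally the tRNA type column from tRNAscan-SE tabular output lines."""
    counts = Counter()
    for line in lines:
        fields = [field.strip() for field in line.split("\t")]
        # Data rows have an integer tRNA number in the second column;
        # header, separator, and blank lines do not, so they are skipped.
        if len(fields) < 5 or not fields[1].isdigit():
            continue
        counts[fields[4]] += 1  # fifth column: tRNA type (Arg, Met, Pseudo, ...)
    return counts

# Made-up rows for illustration only (not real PhAnToMe results).
sample_output = [
    "Sequence\t\ttRNA\tBounds\ttRNA\tAnti\tIntron Bounds\tCove\n",
    "Name\ttRNA #\tBegin\tEnd\tType\tCodon\tBegin\tEnd\tScore\n",
    "--------\t------\t-----\t---\t----\t-----\t-----\t---\t-----\n",
    "phage_A\t1\t1200\t1276\tArg\tTCT\t0\t0\t71.3\n",
    "phage_A\t2\t1301\t1373\tMet\tCAT\t0\t0\t68.9\n",
    "phage_B\t1\t540\t615\tArg\tACG\t0\t0\t75.0\n",
]

counts = count_trna_types(sample_output)
for trna_type, n in counts.most_common():
    print(trna_type, n)
```

The function accepts any iterable of lines, so an open file handle on a real tRNAscan-SE output file can be passed in place of the sample list.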