American Society for Microbiology, Journal of Virology, 7(90), p. 3627-3639, 2016
DOI: 10.1128/jvi.02988-15
Full text: Download
ABSTRACT Middle East respiratory syndrome-related coronavirus (MERS-CoV) spreads to humans via zoonotic transmission from camels. MERS-CoV belongs to lineage C of betacoronaviruses (betaCoVs), which also includes viruses isolated from bats and hedgehogs. A large portion of the betaCoV genome consists of two open reading frames (ORF1a and ORF1b) that are translated into polyproteins. These are cleaved by viral proteases to generate 16 nonstructural proteins (nsp1 to nsp16) which compose the viral replication-transcription complex. We investigated the evolution of ORF1a and ORF1b in lineage C betaCoVs. Results indicated widespread positive selection, acting mostly on ORF1a. The proportion of positively selected sites in ORF1a was much higher than that previously reported for the surface-exposed spike protein. Selected sites were unevenly distributed, with nsp3 representing the preferential target. Several pairs of coevolving sites were also detected, possibly indicating epistatic interactions; most of these were located in nsp3. Adaptive evolution at nsp3 is ongoing in MERS-CoV strains, and two selected sites (G720 and R911) were detected in the protease domain. While position 720 is variable in camel-derived viruses, suggesting that the selective event does not represent a specific adaptation to humans, the R911C substitution was observed only in human-derived MERS-CoV isolates, including the viral strain responsible for the recent South Korean outbreak. It will be extremely important to assess whether these changes affect host range or other viral phenotypes. More generally, data herein indicate that CoV nsp3 represents a major selection target and that nsp3 sequencing should be envisaged in monitoring programs and field surveys. IMPORTANCE Both severe acute respiratory syndrome coronavirus (SARS-CoV) and MERS-CoV originated in bats and spread to humans via an intermediate host. This clearly highlights the potential for coronavirus host shifting and the relevance of understanding the molecular events underlying the adaptation to new host species. We investigated the evolution of ORF1a and ORF1b in lineage C betaCoVs and in 87 sequenced MERS-CoV isolates. Results indicated widespread positive selection, stronger in ORF1a than in ORF1b. Several selected sites were found to be located in functionally relevant protein regions, and some of them corresponded to functional mutations in other coronaviruses. The proportion of selected sites we identified in ORF1a is much higher than that for the surface-exposed spike protein. This observation suggests that adaptive evolution in ORF1a might contribute to host shifts or immune evasion. Data herein also indicate that genetic diversity at nonstructural proteins should be taken into account when antiviral compounds are developed.