Automated identification of sequence-tailored Cas9 proteins using massive metagenomic data

Ciciani, Matteo; Demozzi, Michele; Pedrazzoli, Eleonora; Visentin, Elisabetta; Pezzè, Laura; Signorini, Lorenzo Federico; Blanco-Miguez, Aitor; Zolfo, Moreno; Asnicar, Francesco; Casini, Antonio; Cereseto, Anna; Segata, Nicola

Published in

Nature Research, Nature Communications, 1(13), 2022

DOI: 10.1038/s41467-022-34213-9

Tools

Export citation

Search in Google Scholar

Automated identification of sequence-tailored Cas9 proteins using massive metagenomic data

Journal article published in 2022 by Matteo Ciciani, Michele Demozzi

, Eleonora Pedrazzoli

, Elisabetta Visentin

, Laura Pezzè, Lorenzo Federico Signorini

, Aitor Blanco-Miguez

, Moreno Zolfo

, Francesco Asnicar

, Antonio Casini, Anna Cereseto

, Nicola Segata

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractThe identification of the protospacer adjacent motif (PAM) sequences of Cas9 nucleases is crucial for their exploitation in genome editing. Here we develop a computational pipeline that was used to interrogate a massively expanded dataset of metagenome and virome assemblies for accurate and comprehensive PAM predictions. This procedure allows the identification and isolation of sequence-tailored Cas9 nucleases by using the target sequence as bait. As proof of concept, starting from the disease-causing mutation P23H in the RHO gene, we find, isolate and experimentally validate a Cas9 which uses the mutated sequence as PAM. Our PAM prediction pipeline will be instrumental to generate a Cas9 nuclease repertoire responding to any PAM requirement.

Published in

Links

Tools

Automated identification of sequence-tailored Cas9 proteins using massive metagenomic data

Abstract