Choosing a Target Sequence for Gene Editing – on sgRNA design

Michał Chromiak, Michał Lupa


The CRISPR-Cas9 system is being widely used for genome engineering in many different biological applications. As a prokaryotic adaptive immunity system, that was originally adapted from the bacterial Type II CRISPR, CRISPR-Cas9 uses a non-coding RNAs. Those RNAs guides Cas9 nuclease, which in turn induce site-specific DNA cleavage at a specific locations in genome. Such mechanism gives an opportunity to create a programmable method for genome editing. The first step in a CRISPR/Cas9 gene engineering experiment is to design a custom single guide RNA (sgRNA). This paper discusses a possible way of organizing data for designing sgRNA using a fast and general-purpose cluster computing system based on MapReduce paradigm.


crispr; gene editing; mapreduce; gRNA; sgRNA; PAM

Full Text:



Anderson E.M., Haupt A., Schiel J.A., Chou E., Machado H.B., Strezoska Z., Lenger S., McClelland S., Birmingham A., Vermeulen A., van Brabant Smith A.: Systematic analysis of crispr-cas9 mismatch tolerance reveals low levels of off-target activity. Journal of Biotechnology, Vol. 211, 2015, p. 56÷65.

Barrangou R., Fremaux C., et al.: Crispr provides acquired resistance against viruses in prokaryotes. Science, Vol. 315(5819), 2007, p. 1709÷12.

Carlson D.F., Fahrenkrug S.C., Hackett P.B.: Targeting DNA with fingers and talens. Molecular Therapy – Nucleic Acids 1, 2012.

Carroll D.: Genome engineering with zinc-finger nucleases. Genetics, Vol. 188(4), 2011, p. 773÷782,

Dean J., Ghemawat S.: MapReduce: Simplified data processing on large clusters. Proceedings of the 6th Conference on Symposium on Opearting Systems Design & Implementation, Vol. 6, OSDI'04, USENIX Association, Berkeley, CA, USA 2004, p. 10.

Fuchs G., Voichek Y., Benjamin S., Gilad S., Amit I., Oren M.: 4sUDRB-seq: measuring genome wide transcriptional elongation rates and initiation frequencies within cells. Genome Biology, Vol. 15(5), R69, 2014.

Moore J.K., Haber J.E.: Cell cycle and genetic requirements of two pathways of nonhomologous end-joining repair of double-strand breaks in saccharomyces cerevisiae. Molecular and Cellular Biology, Vol. 16(5), 1996, p. 2164÷73.

Gagnon J.A., Valen E., et al.: Efficient mutagenesis by Cas9 protein-mediated oligonucleotide insertion and large-scale assessment of single-guide RNAs. Plos ONE, 2014.

Tebas P., Stein D., et al.: Gene editing of CCR5 in autologous CD4 T cells of persons infected with HIV. New England Journal of Medicine, Vol. 370(10), 2014, p. 901÷910.

Pardo B., Gomez-Gonzales B., Aguilera A.: DNA repair in mammalian cells: DNA double-strand break repair: how to fix a broken relationship. Cellular and Molecular Life Sciences, Vol. 66(6), 2009, p. 1039÷1056.

Strachan T., Read A.: Human Molecular Genetics 4. Garland Science/Taylor & Francis Group, 2011.

Thyme S., Akhmetova L., et al.: Internal guide RNA interactions interfere with Cas9-mediated cleavage. Nature Communications, Vol. 7, 2016.

Lu Y.: PD-1 knockout engineered T cells for metastatic non-small cell lung cancer,