Zebrafish Genome Literacy Workshop 2023
Exercise 3 - CRISPR design
Imagine you are planning an experiment to knock out the tfap2a gene using CRISPR/Cas9.
Find the Gene page for tfap2a. How many different transcripts are there?
6.
We want to target exons that appear in all transcripts. Find the stable IDs of exons which are present in all transcripts of tfap2a.
- ENSDARE00000597910
- ENSDARE00000597893
- ENSDARE00000597878
- ENSDARE00000597872
- ENSDARE00000597861
Exons may look the same on the graphical view on the Gene page, but may have slightly different start and end sites. You can use the Exons page for each transcript to compare the exons. Exons with identical start and end sites will have the same exon ID.
Use Biomart to download the sequence for these exons. Use the exon IDs you collected above to download the exons sequences with 10 bp of intron sequence on either side. The 10 bp allows you to find CRISPR sites that overlap a small portion of the intron, but still have their cut-site within the exon.
Use the "Input external references ID list" filter and select "Exon ID" from the dropdown menu. You can use the Upstream flank/Downstream flank attributes to do this. Some tools don't like non-alphanumeric characters in FASTA files, so just have the exon id in the sequence headers
Attributes
- Exon stable ID
- Exon sequences
- Upstream flank [10]
- Downstream flank [10]
Use these sequences to select sgRNA binding sites. Use any CRISPR design tool you want (e.g. https://eu.idtdna.com/site/order/designtool/index/CRISPR_CUSTOM or https://www.crisprscan.org/sequence/).
Note: For the IDT tool you will need to edit the FASTA file to remove characters from the sequence names that the tool doesn't accept. The easiest thing to do is to only select the exon ID for the sequence header.
Select several guide RNA sequences. Visualise the gRNA sequences using Ensembl BLAST. Choose BLASTN with "Normal" search sensitivity. Check that the cut-site will be inside the exon and not too close to the splice acceptor/donor. The cut-site is 6 bp from the end of the sgRNA sequence including PAM (e.g. NNNNNNNNNNNNNNNNN|NNNNGG).
Check for off-targets in other tfap2 genes. If your CRISPR design tool of choice doesn't do off-target prediction, you could use BLASTN and change the search sensitiivty to "Short sequences" to find sequences similar to the guide sequence.