; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002409 (gene) of Snake gourd v1 genome

Gene IDTan0002409
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRestriction endonuclease type II-like protein
Genome locationLG02:92272800..92277666
RNA-Seq ExpressionTan0002409
SyntenyTan0002409
Gene Ontology termsGO:0032774 - RNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011659114.2 uncharacterized protein LOC101215512 [Cucumis sativus]1.3e-12289.69Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RR QLWLEKLGAI+QFCGNLATCWSN+KEEEALERYKLITGN+VLFPEFQVYGK+NSEDDWLAASPDG IDKM+YGLPS+G+LEIKCPFF+GDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
         ASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTP GSSLFRLYRDVEYW VLK+ALSDFWWKHVQPARE+CSKY +TNPLIELKSLRPSP+HELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CES+RVV++SKLLLREFDGRLQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

XP_022132090.1 uncharacterized protein LOC111005048 [Momordica charantia]5.8e-12392.38Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRA+LWLEKLGA EQFCGNLAT WSN KE EALERYKLITGNTVLFPEFQVYGK+NSEDDWLAASPDG IDK+VYGLPSRG+LEIKCPFFDGDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPW RVPLY IPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSP+HELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKR+VD+SKLLLREFDG+LQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

XP_022150402.1 uncharacterized protein LOC111018568 [Momordica charantia]1.3e-12291.93Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRAQLWLEKLGAIEQF G LATCWSN KEEEALERYKLITGNTVLFPEFQVYGK NSEDDWLAASPDG IDK+V+GLPSRG+LEIKCPFFDGDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPWSRVPLY IPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYW+VLKMALSDFWWKHVQPARE CSKYAITNPLIELKSLRPSP+HELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKR+VD+SKL+LREFDG+LQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

XP_022981568.1 uncharacterized protein LOC111480647 [Cucurbita maxima]2.9e-12291.03Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRAQLWLEKLGAI+QF GNLATCWSN+KEEEALERYKLITGN+VLFPEFQVYGK NSE DWLAASPDG IDKMVYGLPSRG+LEIKCPFFDGDM 
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPWSRVPLYCIPQ QGLMEIMDRDWMDFYVWTPKGSSLFRLYRD EYW+VLK+ALSDFWWKHVQPARE+CSKY+ITNPLIELKSLRPSPKHELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKRVVD+S+LLLREF+GRLQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

XP_038900094.1 uncharacterized protein LOC120087241 [Benincasa hispida]4.7e-12592.79Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRAQLWLEKLGAI+QFCGNLATCWSN+KEEEALERYKLITGN+VLFPEFQVYGK+NS DDWLAASPDG IDKM+YGLPSRG+LEIKCPFFDGDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPWSR+PLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYW VLK+ALSDFWWKHVQPARE+CSKYAI NPLIELKSLRPSPKHELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQ
        CESKRVVD+SKLLLREFDGRLQ
Subjt:  CESKRVVDSSKLLLREFDGRLQ

TrEMBL top hitse value%identityAlignment
A0A0A0K4T5 YqaJ domain-containing protein7.4e-12491.03Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RR QLWLEKLGAI+QFCGNLATCWSN+KEEEALERYKLITGN+VLFPEFQVYGK+NSEDDWLAASPDG IDKMVYGLPSRG+LEIKCPFF+GDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
         ASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTP GSSLFRLYRDVEYW VLK+ALSDFWWKHVQPARE+CSKY +TNPLIELKSLRPSP+HELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKRVV++SKLLLREFDGRLQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

A0A6J1BRH1 uncharacterized protein LOC1110050482.8e-12392.38Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRA+LWLEKLGA EQFCGNLAT WSN KE EALERYKLITGNTVLFPEFQVYGK+NSEDDWLAASPDG IDK+VYGLPSRG+LEIKCPFFDGDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPW RVPLY IPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSP+HELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKR+VD+SKLLLREFDG+LQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

A0A6J1D9D0 uncharacterized protein LOC1110185686.2e-12391.93Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRAQLWLEKLGAIEQF G LATCWSN KEEEALERYKLITGNTVLFPEFQVYGK NSEDDWLAASPDG IDK+V+GLPSRG+LEIKCPFFDGDMR
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPWSRVPLY IPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYW+VLKMALSDFWWKHVQPARE CSKYAITNPLIELKSLRPSP+HELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKR+VD+SKL+LREFDG+LQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

A0A6J1EKA9 uncharacterized protein LOC1114333775.3e-12291.03Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRAQLWLEKLGAI+QF GNLATCWSN+KEEEALERYKLITGN+VLFPEFQVY K NSE DWLAASPDG IDKMVYGLPSRG+LEIKCPFFDGDM 
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRD EYW+VLK+ALSDFWWKHVQPARE+CSKY+ITNPLIELKSLRPSPKHELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKRVVD+S+LLLREF+GRLQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

A0A6J1J2F9 uncharacterized protein LOC1114806471.4e-12291.03Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR
        GFWP RRAQLWLEKLGAI+QF GNLATCWSN+KEEEALERYKLITGN+VLFPEFQVYGK NSE DWLAASPDG IDKMVYGLPSRG+LEIKCPFFDGDM 
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMR

Query:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV
        KASPWSRVPLYCIPQ QGLMEIMDRDWMDFYVWTPKGSSLFRLYRD EYW+VLK+ALSDFWWKHVQPARE+CSKY+ITNPLIELKSLRPSPKHELC YIV
Subjt:  KASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIV

Query:  CESKRVVDSSKLLLREFDGRLQT
        CESKRVVD+S+LLLREF+GRLQT
Subjt:  CESKRVVDSSKLLLREFDGRLQT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein7.2e-6349.1Show/hide
Query:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNS-EDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDM
        GF P  R  LWLEK+GA + F GN AT W    E EALERY  +TGN +L PEF VY    S E++WL ASPDG I+ +  G+ S G+LE+KCPF + D 
Subjt:  GFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNS-EDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDM

Query:  RKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYI
         K  PW +VP  C+PQ QGLMEI+D DW+D Y WT  GSSLFR++RD  +W+ +K AL DFW  HV PARE+ + + I +P ++L+  +P   HE C+ I
Subjt:  RKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYI

Query:  VCESKRVVDSSKLLLREFDGRL
        +  ++R+  ++  L  E DG L
Subjt:  VCESKRVVDSSKLLLREFDGRL

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein7.7e-4139.37Show/hide
Query:  GFWP-CRRAQLWLEKL----GAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFF
        GFW   RRA+LW EK+      + +     A  W    E  A+ERYK I G  V    F ++  SN E  WL ASPDG +D         GILE+KCP+ 
Subjt:  GFWP-CRRAQLWLEKL----GAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFF

Query:  DGDMRKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHEL
         G      PW +VP Y +PQ QG MEIMDR+W++ Y WT  GS++FR+ RD  YW+++   L +FWW+ V PARE      +     E+K   P+  H+ 
Subjt:  DGDMRKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHEL

Query:  CRYIVCESKRVVDSSKLLLRE
         +  + +S  +   SKL+ RE
Subjt:  CRYIVCESKRVVDSSKLLLRE

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein7.7e-4139.37Show/hide
Query:  GFWP-CRRAQLWLEKL----GAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFF
        GFW   RRA+LW EK+      + +     A  W    E  A+ERYK I G  V    F ++  SN E  WL ASPDG +D         GILE+KCP+ 
Subjt:  GFWP-CRRAQLWLEKL----GAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFF

Query:  DGDMRKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHEL
         G      PW +VP Y +PQ QG MEIMDR+W++ Y WT  GS++FR+ RD  YW+++   L +FWW+ V PARE      +     E+K   P+  H+ 
Subjt:  DGDMRKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHEL

Query:  CRYIVCESKRVVDSSKLLLRE
         +  + +S  +   SKL+ RE
Subjt:  CRYIVCESKRVVDSSKLLLRE

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein7.7e-4139.37Show/hide
Query:  GFWP-CRRAQLWLEKL----GAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFF
        GFW   RRA+LW EK+      + +     A  W    E  A+ERYK I G  V    F ++  SN E  WL ASPDG +D         GILE+KCP+ 
Subjt:  GFWP-CRRAQLWLEKL----GAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFF

Query:  DGDMRKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHEL
         G      PW +VP Y +PQ QG MEIMDR+W++ Y WT  GS++FR+ RD  YW+++   L +FWW+ V PARE      +     E+K   P+  H+ 
Subjt:  DGDMRKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHEL

Query:  CRYIVCESKRVVDSSKLLLRE
         +  + +S  +   SKL+ RE
Subjt:  CRYIVCESKRVVDSSKLLLRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGTTTTGGCCTTGTCGGAGGGCTCAATTATGGTTAGAGAAGCTTGGAGCAATTGAGCAGTTTTGTGGTAATTTAGCAACTTGTTGGAGCAACATCAAAGAAGA
AGAGGCACTTGAAAGATACAAGCTTATAACAGGAAACACTGTCTTGTTTCCTGAATTTCAAGTCTATGGTAAATCAAATTCTGAAGATGATTGGTTAGCTGCTTCACCTG
ATGGTACAATTGACAAAATGGTCTATGGATTGCCTTCACGAGGAATTTTGGAGATTAAGTGTCCATTTTTTGATGGTGATATGAGAAAGGCTTCACCATGGTCTCGAGTT
CCTCTTTACTGTATTCCACAAGCTCAAGGTTTGATGGAAATAATGGACAGAGATTGGATGGATTTTTATGTTTGGACTCCAAAAGGTAGTAGTCTATTTAGATTGTATCG
AGATGTAGAATACTGGAAGGTCTTGAAAATGGCTTTGTCTGATTTTTGGTGGAAGCATGTTCAACCAGCAAGGGAGTTGTGTAGTAAATATGCGATTACAAATCCCCTCA
TTGAGCTAAAATCACTCAGGCCATCTCCTAAGCATGAATTATGCAGGTATATAGTTTGTGAAAGCAAACGCGTTGTCGATAGTTCTAAGTTGCTCTTGCGTGAATTTGAT
GGGAGACTTCAAACTTGA
mRNA sequenceShow/hide mRNA sequence
CCCTGCCCCACTAATTCTCAGCCCAGCGCATTCCCCTTACCATTTTAATTTATTTTCGGATTAATGCTATCCTATTTCTTCTATTTGCAAAATGCCCCCTCTATCGTCTC
CCCGTAACCGACCATTCCCTCGCCGTCGCCGTTCTTCATCTTCTTCCTCCTCTCTCGGGCTATCACTCAGCACCCCTCGAATTTTGCAGTCGCTGCTATGTCATTAAATT
CCATTTTAGTTCAAACCTGCTTTATGGAACAACGAACCGTTGCAATTTTTTGTCCATTTTAATGGAAGGGTTTTGGCCTTGTCGGAGGGCTCAATTATGGTTAGAGAAGC
TTGGAGCAATTGAGCAGTTTTGTGGTAATTTAGCAACTTGTTGGAGCAACATCAAAGAAGAAGAGGCACTTGAAAGATACAAGCTTATAACAGGAAACACTGTCTTGTTT
CCTGAATTTCAAGTCTATGGTAAATCAAATTCTGAAGATGATTGGTTAGCTGCTTCACCTGATGGTACAATTGACAAAATGGTCTATGGATTGCCTTCACGAGGAATTTT
GGAGATTAAGTGTCCATTTTTTGATGGTGATATGAGAAAGGCTTCACCATGGTCTCGAGTTCCTCTTTACTGTATTCCACAAGCTCAAGGTTTGATGGAAATAATGGACA
GAGATTGGATGGATTTTTATGTTTGGACTCCAAAAGGTAGTAGTCTATTTAGATTGTATCGAGATGTAGAATACTGGAAGGTCTTGAAAATGGCTTTGTCTGATTTTTGG
TGGAAGCATGTTCAACCAGCAAGGGAGTTGTGTAGTAAATATGCGATTACAAATCCCCTCATTGAGCTAAAATCACTCAGGCCATCTCCTAAGCATGAATTATGCAGGTA
TATAGTTTGTGAAAGCAAACGCGTTGTCGATAGTTCTAAGTTGCTCTTGCGTGAATTTGATGGGAGACTTCAAACTTGATGCTCTGCATCTACTTGAGCAATGTCTTCAT
TAGCTTAAATTGCATTGGGATATAAGTTGTATTCATACTATGTTGCAGTATTTTAGAGCTTGGAGAGTTCTTATAAACATGAAAACTGAGGGCCAAACAATCAGCAAACA
TTGTCAAAAAACAAGTATGCCCTGAGTTCAACTTGACATCCTACTTGCTCCCTTGCTCGTGTGGATCTATCAAAAGGGTTAGACATGAGCCCCACTAGGAAAGTTGCTCG
TATGCGAGAATGTCCTAAACCAACACCGAAAGCGTAGCACATGTCACATTCGCAACCCTGTCTTCCTTGGGGTTGAAGGACATGGTAGCCGCAGTTAGCTCCACCACACA
GCATGAAGGCCAAAGCGCCAAAGGGAAGTAATTAACTTGGAGGAGTTCTTTTCGAAATCTATCGAACTTTTAAAAAAATTTCGAGGTCATCCTTGTTAAGTGTTAGCTGG
CAACAGAGTTGAGAAAATCTTGTTGTTCGGTTGGAAGGTTCTTAATTCTTATTCTAAAGACCTCTATTTCTTGAATCTTATAGATATGTCTGTTAGTGTAATGGAAGTGT
TTTTGGTTCTCATCATCACACTGCGAGGGCAATCCACTTACAGGACGAGCTGAAGCAAAGCAGATGGAAAATCTCGAAGCATGGTGGCACATGGAGATTGCATTAGATGC
ATGATGTGCACTCAACAAATTATCCAGGTAGCGATGGTCAAGTTGTAAGACAGTTGATTCAAGTGAGCAAATTTGATGATGATTACCGAAAGAGAAGTACAAAAAGATGG
AGATGAGGTATTCCACCAAGCCAAGGGATTACAAAAAGGCTTCCGATTGGCAAAGACATAAGAGAGAGTACAATTACAATAACCATCTGAAGGGAACATGAAGTAGGAGG
AAACTCATCGCAACTATGAGCCTGCCCTTGAAAAGTCCTCTTGTTCCTCTCGAACCAAAGTTTCCAAAGAAGCGTGGTTGCCATGTTCGTCCGAAAGACTCTTGTTTAAC
CTGTCTCCAAGATATATATGTTGGGTAATAACCTTTCAGATGCACTGCATATTCTCCTGCAACTCAGAGGTCCTGCCATTTGTATTTCATCGAGATTAAAAGCTTCTGAG
GAAGATGAGAATGCCTTGGAGTTTGAAGCTCAGCCACGGCGGCCAGTGGCTTGCTGTAGTGCGCCGCACGGCTGTTATGATTGTGATTTCTAATGTTTGTTTTTGCTTTT
CCTGTTCCAATTCTTTTGATATATTAGTCACAATCAGAATCAGGTCTCTTTAAAGTTTGTCCCTTATCATCAATGGCTTCCAGCAAGAATCTTCATGATTTGGTATGGTT
TTCTTACATTCTTTACTTTTGTTTTTGGTTAAATTACAAATGTAGTACCTATAGATCGGATCTAGTTTCAATTTGGTTCCTTTGGTTTTAAAGTTTTAGTTTAACTACCT
ATAGTCTAGGTTTAAGAAGGAAAGATATGGTGACATACTGTTTTTAAGAAATCTAGAACAGTACATGCACGACAATGACACGTTGAAAATATATATCACTTTTATATTAG
AAGGAAATTCAAAATCAATAAGGGTTATTTGAGGCGTTTGGTAGGTTATAATAGTTTTTGTTTGAGAG
Protein sequenceShow/hide protein sequence
MEGFWPCRRAQLWLEKLGAIEQFCGNLATCWSNIKEEEALERYKLITGNTVLFPEFQVYGKSNSEDDWLAASPDGTIDKMVYGLPSRGILEIKCPFFDGDMRKASPWSRV
PLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLFRLYRDVEYWKVLKMALSDFWWKHVQPARELCSKYAITNPLIELKSLRPSPKHELCRYIVCESKRVVDSSKLLLREFD
GRLQT