CuGenDBv2

Tan0015609 (gene) of Snake gourd v1 genome

Gene ID: Tan0015609
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Genome location: LG11:9274833..9280044
RNA-Seq Expression: Tan0015609
Synteny: Tan0015609
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
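
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span of the gene model can be computed as follows. The helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG11:9274833..9280044")
# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene model spans 5212 bp on LG11.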


Homology
GenBank top hits (columns: hit | e value | % identity | alignment)
TXG50019.1 hypothetical protein EZV62_025894 [Acer yangbiense] | e value: 1.3e-39 | % identity: 34.15
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        M  EE  N   +L+LKE E G ++ +     +   +R    +  ++++ K +N E F S++P+IW    + +IE    N+F  +F+ ++++  +++GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV
        +FD+ LLV EE  G  +++ ++F    FW+  H +P +C T +  + LG+ IG  + ++   +G C G+ +RVRV +DV +PLRR  ++ +   G+E  +
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV

Query:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA
         +RYE+LPD C+ CG IGH+V+DC      ++    NL FG WL+A
Subjt:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | e value: 3.0e-41 | % identity: 33.57
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        ME ++ + +   L+L +++ G +  I+ +  E  E+     ++ + IT K IN E FKS +  IW  + ++ +E  G N+F   F++  ++ RI++GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV
         FD+ LLV  E  G+  +  L+FRY  FW+  H+LP  C  R+    LG  +G  + +++ E+G+C GQ +R+RV +DV+ PL+R  ++ +G   +   V
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV

Query:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGI--KTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQ
         + YE+LP+FC+ CG IGHLV+DC      I   ++ +FG W++A  R         G +K  P G   G  +  L+
Subjt:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGI--KTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQ

XP_006485824.1 uncharacterized protein LOC102613298 [Citrus sinensis] | e value: 5.7e-40 | % identity: 34.96
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        M+ EE   +  ++ L+ EE   ++    N     E+     +V +I+ T+++  E  ++ + + W    + K+E+ G N+F+  F SK+EK RI+ GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV
         FDR L+V  E  G  ++K  +F +ALFWV  H++P +C  ++  + +G  IG  + VE+D+ G+C G  +RVR+ ++V RPL +   LK+   G ++ +
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV

Query:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRG
        ++ YEKLPDFCF CG+IGH  ++C    G  K +L +G W++A  R     QA+      K HG G
Subjt:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] | e value: 2.2e-39 | % identity: 34.85
Query:  VVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPWTFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFT
        VV ++ T+K I+AE  +S++  +W +    + E  G N+++  F+S +EK R++  GPWTF++ LLV            + F +  FW+  H++P  C +
Subjt:  VVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPWTFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFT

Query:  RKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNL--QFGD
         + A  LG  +G  E +E D     +G  +RVRV++DV +PLRR  KLK    G+++W  +RYEKLPDFC+ CG IGH  ++C      + TN   Q+GD
Subjt:  RKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNL--QFGD

Query:  WLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPEDSGRR
        WL+A +     +      E+    G   GRG Q +      RGD+   ++     +GPE S RR
Subjt:  WLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPEDSGRR

XP_024953751.1 uncharacterized protein LOC112498094 [Citrus sinensis] | e value: 5.1e-41 | % identity: 26.93
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        ME EE   R  ++ L +EE GG V  +    +  EK     ++ ++I T+ ++ E  K  + ++W    ++KIE+ G N+FI  F S+A+K  I+ GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKL-KIGLMGEEVW
         FDR L+V  E  G  ++K  +F +  FWV  HD+P +C T++   ALG +IG  E VE+D  G+C GQ LR+R+ +D+ RPL++  +L + G   E++ 
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKL-KIGLMGEEVW

Query:  VQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPED
        +QV YE+LPDFCF CG IGH  ++C       K  L +G WLKA+      +Q++        H +       P + R  +  +  E+N    P    ++
Subjt:  VQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPED

Query:  SGRRYALKMGPNDQKSPEMRVGEVAGEQGKDSTGKIWKLESMDGTLQTKVSESGQCMHDEEDTCGGGTHSDEKGGITEDFDSNVGKASLGL---NSGSTT
           R+ L  G ++ +    +   V G           +LE+     Q K    G    D     G G     KGG  ++ ++N     L           
Subjt:  SGRRYALKMGPNDQKSPEMRVGEVAGEQGKDSTGKIWKLESMDGTLQTKVSESGQCMHDEEDTCGGGTHSDEKGGITEDFDSNVGKASLGL---NSGSTT

Query:  GGLDYGPKDKLG--LSTGAK----------KEKPEKGLNSTLISNKQVTRAVGKGERPSTPSTLSPVTRDDAEYRKPVYGEPLNDPRSQNDRGRDNGSLA
        G L  G KD+ G  ++ G +          + + EK +N   + N +  +   K +  +  + +  +     + ++P        P+++  +        
Subjt:  GGLDYGPKDKLG--LSTGAK----------KEKPEKGLNSTLISNKQVTRAVGKGERPSTPSTLSPVTRDDAEYRKPVYGEPLNDPRSQNDRGRDNGSLA

Query:  TNALLSKTEGVEIKAPRKWKRIAREMTHNPKIGETKDSLSGVKHGLEVMEVEIPNKK
           ++S T+       RK K IA ++    + GE ++ L  V   L  + V+ P ++
Subjt:  TNALLSKTEGVEIKAPRKWKRIAREMTHNPKIGETKDSLSGVKHGLEVMEVEIPNKK

TrEMBL top hits (columns: hit | e value | % identity | alignment)
A0A1S8AC25 CCHC-type domain-containing protein (Fragment) | e value: 3.6e-40 | % identity: 36.4
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        M+ EE   R  ++ L +EE GG V  ++    + EK     +V +++ T+ ++ E  K  + ++W    ++KIE  G NVF+  F S+ +K  I+ GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLM-GEEVW
         FDR L+   E  G  ++K  +F +  FWV  HD+P +C ++  A  LG  IG  E VE+D  G+C GQ LR+R+ +D+ +PL++  +L+      +++ 
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLM-GEEVW

Query:  VQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGT
        ++V YE+LPDFCF CG IGH  ++C       K  L +G WLKAN            +EK K  GRGR R T
Subjt:  VQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGT

A0A5C7GZQ4 CCHC-type domain-containing protein | e value: 6.1e-40 | % identity: 34.15
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        M  EE  N   +L+LKE E G ++ +     +   +R    +  ++++ K +N E F S++P+IW    + +IE    N+F  +F+ ++++  +++GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV
        +FD+ LLV EE  G  +++ ++F    FW+  H +P +C T +  + LG+ IG  + ++   +G C G+ +RVRV +DV +PLRR  ++ +   G+E  +
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV

Query:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA
         +RYE+LPD C+ CG IGH+V+DC      ++    NL FG WL+A
Subjt:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA

A0A5C7H9Y2 CCHC-type domain-containing protein | e value: 1.5e-41 | % identity: 33.57
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        ME ++ + +   L+L +++ G +  I+ +  E  E+     ++ + IT K IN E FKS +  IW  + ++ +E  G N+F   F++  ++ RI++GGPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV
         FD+ LLV  E  G+  +  L+FRY  FW+  H+LP  C  R+    LG  +G  + +++ E+G+C GQ +R+RV +DV+ PL+R  ++ +G   +   V
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV

Query:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGI--KTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQ
         + YE+LP+FC+ CG IGHLV+DC      I   ++ +FG W++A  R         G +K  P G   G  +  L+
Subjt:  QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGI--KTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQ

A0A5C7IU01 CCHC-type domain-containing protein | e value: 2.3e-39 | % identity: 37.28
Query:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW
        M   E      S++L +E+ G V+E+ +  + D ++     +V +++T K IN E FK ++ +IW+  G++++E  G NVF+  F ++A++ R+ + GPW
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPW

Query:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV
         F   L+  E+  G  N+  +EF  A FW+  HD+P +C  R+TA+ L   IG    + S E+ +C G  LRV+VR+D+ +PL+R  +LK+G   E + V
Subjt:  TFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWV

Query:  QVRYEKLPDFCFGCGIIGHLVKDCGTEV
         ++YE+LP+FC+ CG IGH +K+CG EV
Subjt:  QVRYEKLPDFCFGCGIIGHLVKDCGTEV

A0A6J1D765 uncharacterized protein LOC111017902 | e value: 1.0e-39 | % identity: 34.85
Query:  VVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPWTFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFT
        VV ++ T+K I+AE  +S++  +W +    + E  G N+++  F+S +EK R++  GPWTF++ LLV            + F +  FW+  H++P  C +
Subjt:  VVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPWTFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFT

Query:  RKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNL--QFGD
         + A  LG  +G  E +E D     +G  +RVRV++DV +PLRR  KLK    G+++W  +RYEKLPDFC+ CG IGH  ++C      + TN   Q+GD
Subjt:  RKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNL--QFGD

Query:  WLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPEDSGRR
        WL+A +     +      E+    G   GRG Q +      RGD+   ++     +GPE S RR
Subjt:  WLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPEDSGRR

SwissProt top hits (columns: hit | e value | % identity | alignment)
No hits found

Arabidopsis top hits (columns: hit | e value | % identity | alignment)
AT3G42140.1 zinc ion binding;nucleic acid binding | e value: 6.3e-05 | % identity: 22.54
Query:  FRSKAEKCRIVQGGPWTFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLR
        F+S+     I++ GPW+F+  + V +     L+  A EF+   FW+    +P    T +   ++G  +G F                             
Subjt:  FRSKAEKCRIVQGGPWTFDRGLLVFEEIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLR

Query:  RAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHLVKDCGT
            L+  L  +   ++ +YEKL +FC  CG++ H   +C T
Subjt:  RAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHLVKDCGT


Sequences
CDS sequence
ATGGAAGACGAAGAGTTCAACAACCGATTAGCGAGTCTGAATCTCAAAGAGGAGGAACTAGGAGGAGTTGTGGAGATCGAAGACAATGAACTCGAGGACTTTGAAAAACG
AAATCAAGACGAAGTGGTTTGCAGAATTATAACCACAAAAACCATAAACGCAGAAGTCTTCAAGAGCATAGTTCCAAAGATATGGAATATGGAAGGAAAAATCAAGATAG
AGGCTGCCGGAAGAAATGTCTTTATATGCTCATTCAGAAGCAAGGCTGAAAAATGCAGAATCGTTCAAGGAGGTCCATGGACCTTCGACAGGGGCTTGTTGGTATTTGAA
GAAATCAAAGGAGCTCTCAACCTAAAGGCACTAGAATTCAGGTATGCTTTATTTTGGGTGCATTTTCATGACCTTCCTAGAGTGTGCTTCACCAGGAAAACAGCGAAGGC
TCTAGGGAATTCCATCGGGAGTTTCGAAGGAGTTGAATCTGATGAAACAGGCAAGTGCAGCGGTCAAACCCTTAGAGTGAGAGTCCGGATGGATGTTCTGAGACCGCTGA
GAAGGGCTGCTAAGCTGAAGATTGGGTTGATGGGAGAAGAGGTGTGGGTTCAAGTCAGATATGAAAAGCTCCCAGACTTTTGCTTTGGTTGTGGTATAATAGGGCATCTG
GTCAAGGACTGCGGGACAGAGGTAGGAGGAATTAAAACAAATCTGCAGTTTGGTGACTGGCTAAAAGCAAATGTTCGAATGGGTGGGGACGAGCAAGCTAAGGCAGGCAG
TGAGAAATTCAAACCTCATGGAAGGGGTCGGGGAAGGGGCACTCAGCCGCTCCAGAGGAGAGAACGAACGAGAGGGGATTATCTTGAGGAGAATGACGGAATGCCGCCAA
GGGAGGGCCCGGAGGATAGTGGTAGGCGATATGCCTTAAAAATGGGGCCTAATGACCAGAAGTCGCCGGAAATGAGGGTCGGAGAGGTTGCCGGAGAGCAAGGAAAGGAC
TCAACGGGAAAAATTTGGAAACTGGAATCGATGGATGGGACATTACAGACCAAAGTCAGTGAGTCAGGACAGTGCATGCATGATGAAGAAGACACGTGTGGAGGGGGGAC
CCACTCTGACGAAAAGGGAGGGATCACTGAGGACTTCGATTCAAACGTTGGAAAAGCAAGTTTGGGCCTGAATAGTGGGTCGACAACAGGCGGGCTAGACTATGGGCCGA
AAGATAAGTTGGGTCTGTCCACTGGGGCAAAGAAGGAAAAACCCGAAAAGGGCTTAAATAGTACCCTCATTTCTAACAAGCAAGTCACACGGGCTGTTGGGAAAGGAGAA
AGGCCTTCGACCCCTTCTACCTTAAGCCCAGTAACGAGGGACGACGCTGAATACAGAAAGCCAGTTTATGGTGAACCATTAAACGATCCAAGGAGCCAAAACGACAGGGG
GAGAGACAATGGGAGCCTGGCCACTAATGCTCTTCTAAGCAAAACAGAGGGAGTAGAAATCAAAGCTCCCAGGAAATGGAAAAGGATTGCTAGAGAGATGACTCACAATC
CGAAAATCGGTGAAACGAAAGACAGTCTGAGTGGAGTCAAACACGGCCTGGAAGTGATGGAAGTGGAAATTCCTAATAAAAAACAGTGTGGAGAAGAACAAGATGAAAAT
TTATGGAGATCGGCGGGAGCTGGTTTTAGGGGCAACAAATATACGTGGAGGAAAAGCCGACATCACAATGCTACCAAGGAACGCCTGGACAGGCCTATTCTTGCCCACAT
TACATTTGATACAACCAGCTCTAACAATAGAAGATTGAGGGCTAATGCTCGTTTTAAGGAAAATTGGGTTGCCCGTGAGGAGGGTAGACGTGTTTGCGACTTGATTGATA
CGAATGGTAAGTGGAATGAAGATGAGGTTAGAAATAATTTTCCCCCGCAAGACTTTAATGATATTATGAATACTCCTCTGGGCCCGAAAGGGGCAAAGGATGTGATCATT
TGGGGAGAGGATAAAAAAGGGATCTTTTCGGTCAAAAGTGCGTATCATTTGGCAAAAAGATACCGCATAGGCCCTTCTAGCTCCACGGTGATTGAATGGAGCCCCAAAAG
TCTTGGACCAGTTTGTGGAAAGCCAAGTCCGTTCCCAGAGCAAAGATCTGGTACACCGGGAGGACACAATCCATATCATGTGGAGATGCAAAATCACCAAGAGGATTTGG
ATCAACTTTATCCCCAGAATGGAGACTTTGTTTCATATGTGTTTAAGAGACTGGAATCCTATGGATTGTTGGGATTGGATGATTTCAAACCTCAATATGGAGGAGATAGA
TTTGGCCATCATTATTATTTGGAAAATCTGGAATGCCAGGAATCTTATAAACGCTACTAA
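
As a rough consistency check between the CDS and the protein sequence listed on this page, the first and last codons of the CDS can be translated with the standard genetic code. This is a minimal sketch that assumes the standard nuclear code (NCBI translation table 1) applies; the `translate` helper and the table construction are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS shown above.
print(translate("ATGGAAGACGAAGAGTTCAACAACCGATTA"))  # MEDEEFNNRL
print(translate("AAACGCTACTAA"))                    # KRY*
```

Both fragments agree with the start (MEDEEFNNRL) and end (KRY, followed by a stop codon) of the protein sequence on this page.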
mRNA sequence
ATGGAAGACGAAGAGTTCAACAACCGATTAGCGAGTCTGAATCTCAAAGAGGAGGAACTAGGAGGAGTTGTGGAGATCGAAGACAATGAACTCGAGGACTTTGAAAAACG
AAATCAAGACGAAGTGGTTTGCAGAATTATAACCACAAAAACCATAAACGCAGAAGTCTTCAAGAGCATAGTTCCAAAGATATGGAATATGGAAGGAAAAATCAAGATAG
AGGCTGCCGGAAGAAATGTCTTTATATGCTCATTCAGAAGCAAGGCTGAAAAATGCAGAATCGTTCAAGGAGGTCCATGGACCTTCGACAGGGGCTTGTTGGTATTTGAA
GAAATCAAAGGAGCTCTCAACCTAAAGGCACTAGAATTCAGGTATGCTTTATTTTGGGTGCATTTTCATGACCTTCCTAGAGTGTGCTTCACCAGGAAAACAGCGAAGGC
TCTAGGGAATTCCATCGGGAGTTTCGAAGGAGTTGAATCTGATGAAACAGGCAAGTGCAGCGGTCAAACCCTTAGAGTGAGAGTCCGGATGGATGTTCTGAGACCGCTGA
GAAGGGCTGCTAAGCTGAAGATTGGGTTGATGGGAGAAGAGGTGTGGGTTCAAGTCAGATATGAAAAGCTCCCAGACTTTTGCTTTGGTTGTGGTATAATAGGGCATCTG
GTCAAGGACTGCGGGACAGAGGTAGGAGGAATTAAAACAAATCTGCAGTTTGGTGACTGGCTAAAAGCAAATGTTCGAATGGGTGGGGACGAGCAAGCTAAGGCAGGCAG
TGAGAAATTCAAACCTCATGGAAGGGGTCGGGGAAGGGGCACTCAGCCGCTCCAGAGGAGAGAACGAACGAGAGGGGATTATCTTGAGGAGAATGACGGAATGCCGCCAA
GGGAGGGCCCGGAGGATAGTGGTAGGCGATATGCCTTAAAAATGGGGCCTAATGACCAGAAGTCGCCGGAAATGAGGGTCGGAGAGGTTGCCGGAGAGCAAGGAAAGGAC
TCAACGGGAAAAATTTGGAAACTGGAATCGATGGATGGGACATTACAGACCAAAGTCAGTGAGTCAGGACAGTGCATGCATGATGAAGAAGACACGTGTGGAGGGGGGAC
CCACTCTGACGAAAAGGGAGGGATCACTGAGGACTTCGATTCAAACGTTGGAAAAGCAAGTTTGGGCCTGAATAGTGGGTCGACAACAGGCGGGCTAGACTATGGGCCGA
AAGATAAGTTGGGTCTGTCCACTGGGGCAAAGAAGGAAAAACCCGAAAAGGGCTTAAATAGTACCCTCATTTCTAACAAGCAAGTCACACGGGCTGTTGGGAAAGGAGAA
AGGCCTTCGACCCCTTCTACCTTAAGCCCAGTAACGAGGGACGACGCTGAATACAGAAAGCCAGTTTATGGTGAACCATTAAACGATCCAAGGAGCCAAAACGACAGGGG
GAGAGACAATGGGAGCCTGGCCACTAATGCTCTTCTAAGCAAAACAGAGGGAGTAGAAATCAAAGCTCCCAGGAAATGGAAAAGGATTGCTAGAGAGATGACTCACAATC
CGAAAATCGGTGAAACGAAAGACAGTCTGAGTGGAGTCAAACACGGCCTGGAAGTGATGGAAGTGGAAATTCCTAATAAAAAACAGTGTGGAGAAGAACAAGATGAAAAT
TTATGGAGATCGGCGGGAGCTGGTTTTAGGGGCAACAAATATACGTGGAGGAAAAGCCGACATCACAATGCTACCAAGGAACGCCTGGACAGGCCTATTCTTGCCCACAT
TACATTTGATACAACCAGCTCTAACAATAGAAGATTGAGGGCTAATGCTCGTTTTAAGGAAAATTGGGTTGCCCGTGAGGAGGGTAGACGTGTTTGCGACTTGATTGATA
CGAATGGTAAGTGGAATGAAGATGAGGTTAGAAATAATTTTCCCCCGCAAGACTTTAATGATATTATGAATACTCCTCTGGGCCCGAAAGGGGCAAAGGATGTGATCATT
TGGGGAGAGGATAAAAAAGGGATCTTTTCGGTCAAAAGTGCGTATCATTTGGCAAAAAGATACCGCATAGGCCCTTCTAGCTCCACGGTGATTGAATGGAGCCCCAAAAG
TCTTGGACCAGTTTGTGGAAAGCCAAGTCCGTTCCCAGAGCAAAGATCTGGTACACCGGGAGGACACAATCCATATCATGTGGAGATGCAAAATCACCAAGAGGATTTGG
ATCAACTTTATCCCCAGAATGGAGACTTTGTTTCATATGTGTTTAAGAGACTGGAATCCTATGGATTGTTGGGATTGGATGATTTCAAACCTCAATATGGAGGAGATAGA
TTTGGCCATCATTATTATTTGGAAAATCTGGAATGCCAGGAATCTTATAAACGCTACTAA
Protein sequence
MEDEEFNNRLASLNLKEEELGGVVEIEDNELEDFEKRNQDEVVCRIITTKTINAEVFKSIVPKIWNMEGKIKIEAAGRNVFICSFRSKAEKCRIVQGGPWTFDRGLLVFE
EIKGALNLKALEFRYALFWVHFHDLPRVCFTRKTAKALGNSIGSFEGVESDETGKCSGQTLRVRVRMDVLRPLRRAAKLKIGLMGEEVWVQVRYEKLPDFCFGCGIIGHL
VKDCGTEVGGIKTNLQFGDWLKANVRMGGDEQAKAGSEKFKPHGRGRGRGTQPLQRRERTRGDYLEENDGMPPREGPEDSGRRYALKMGPNDQKSPEMRVGEVAGEQGKD
STGKIWKLESMDGTLQTKVSESGQCMHDEEDTCGGGTHSDEKGGITEDFDSNVGKASLGLNSGSTTGGLDYGPKDKLGLSTGAKKEKPEKGLNSTLISNKQVTRAVGKGE
RPSTPSTLSPVTRDDAEYRKPVYGEPLNDPRSQNDRGRDNGSLATNALLSKTEGVEIKAPRKWKRIAREMTHNPKIGETKDSLSGVKHGLEVMEVEIPNKKQCGEEQDEN
LWRSAGAGFRGNKYTWRKSRHHNATKERLDRPILAHITFDTTSSNNRRLRANARFKENWVAREEGRRVCDLIDTNGKWNEDEVRNNFPPQDFNDIMNTPLGPKGAKDVII
WGEDKKGIFSVKSAYHLAKRYRIGPSSSTVIEWSPKSLGPVCGKPSPFPEQRSGTPGGHNPYHVEMQNHQEDLDQLYPQNGDFVSYVFKRLESYGLLGLDDFKPQYGGDR
FGHHYYLENLECQESYKRY
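
The InterPro annotation IPR025836 names the zinc knuckle pattern CX2CX4HX4C. A minimal sketch of how that spacing pattern could be located in the protein sequence with a regular expression is shown below. It is a plain pattern match on a fragment of the protein above, not the profile-based method InterPro itself uses, and the name `find_zinc_knuckles` is hypothetical.

```python
import re

# CX2CX4HX4C: Cys, any 2 residues, Cys, any 4, His, any 4, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start, matched residues) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment of the Tan0015609 protein sequence shown above.
fragment = "QVRYEKLPDFCFGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKA"
print(find_zinc_knuckles(fragment))  # [(11, 'CFGCGIIGHLVKDC')]
```

The reported position is relative to the fragment, not to the full-length protein.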