; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020652 (gene) of Snake gourd v1 genome

Gene IDTan0020652
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWIM-type domain-containing protein
Genome locationLG11:12242849..12251865
RNA-Seq ExpressionTan0020652
SyntenyTan0020652
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_014495097.1 uncharacterized protein LOC106757029 [Vigna radiata var. radiata]7.3e-7248.16Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K+I   AYNWL+AIP   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR Y+M RF   ++K     G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L+   +KSG+WL  W GG +FEV+ G   E ++VD+   +CSC++W+L+ +PC HA+++I YK +  EDYV   Y    Y+ CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK
        NG+ LW R + + +LPPI +  P RPKKLRR+E DE    +K+S++NT M CS C       +   KG+  K
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK

XP_014499551.1 uncharacterized protein LOC106760651 [Vigna radiata var. radiata]3.3e-7247.79Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K+I   AYNWL+A+P   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR Y+M RF   ++K     G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L+   +KSG+WL  W+GG +FEV+ G   + ++VD+   +CSC++W+L+G+PC HA+++I YK +  EDYV   Y    Y+ CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK
        NG+ LW R + + +LPPI +  PGRPKKLRR+E DE    +K+S++N  M CS C       +   KG+  K
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK

XP_014522672.1 uncharacterized protein LOC106779131 [Vigna radiata var. radiata]6.2e-7149.41Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K++   A+NWL+ IP   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR YLM+RF   ++K+ +  G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L++  +KSG+WL  W+GG +FEV+ G   + ++VDL   TC+C+ W+L+G+PC H +++I YKS+  E+YV   Y    Y  CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC
        NG+ LW   D   +LPPI +  PGRPKKLRR+E DE     K+S+ NT++ CS C
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC

XP_017419721.1 PREDICTED: uncharacterized protein LOC108329871 [Vigna angularis]1.6e-7149.8Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT + +W K M  +  +   AYNWL+AIP   WCKHAFSSYP+CD  LNN+ E+FNSTIL ARDKPII M+EWIR Y+MRRF   ++K K   G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK + +L+   +KSG+W+ TW+G  +FEV+ G   + ++VDL   +CSC++W+L+G+PC HA+++I+YK +  EDYV   Y    YE CY P I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC
        NG+ LW   +  ++LPPI +  PGRPKKLRR+E DE     K+S++N  M CS C
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC

XP_027905892.1 uncharacterized protein LOC114165477 [Vigna unguiculata]9.6e-7250.2Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT    W K M  +++I   AYNWL+AIP   WCKHAFS YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR Y M RF   ++K+    G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK QK+L+   +KSG+WL  W+GG +FEV++G   E ++VDL K TC+C++W+L+G+PC HA+++I YK +  E YV  CY    Y  CY+P I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC
        NG+ LW R +   + PPI +  PGRPKKLRR+E DE    +K+S++N  M CS C
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC

TrEMBL top hitse value%identityAlignment
A0A1S3TMW4 uncharacterized protein LOC1067570293.6e-7248.16Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K+I   AYNWL+AIP   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR Y+M RF   ++K     G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L+   +KSG+WL  W GG +FEV+ G   E ++VD+   +CSC++W+L+ +PC HA+++I YK +  EDYV   Y    Y+ CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK
        NG+ LW R + + +LPPI +  P RPKKLRR+E DE    +K+S++NT M CS C       +   KG+  K
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK

A0A1S3U0L5 uncharacterized protein LOC1067606511.6e-7247.79Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K+I   AYNWL+A+P   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR Y+M RF   ++K     G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L+   +KSG+WL  W+GG +FEV+ G   + ++VD+   +CSC++W+L+G+PC HA+++I YK +  EDYV   Y    Y+ CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK
        NG+ LW R + + +LPPI +  PGRPKKLRR+E DE    +K+S++N  M CS C       +   KG+  K
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVK

A0A1S3UJQ2 uncharacterized protein LOC1067660182.5e-7049.41Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K++   A+NWL+ IP   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR YLM RF   ++K+ +  G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L+   +KSG+WL  W+GG +FEV+ G   + ++VDL   TC+C+ W+L+G+PC H +++I YKS+  E+YV   Y    Y  CYAPTI PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC
        NG+ LW   D   +LPPI +  PGRP KLRR+E DE     K+S+ NT++ CS C
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC

A0A1S3VGC9 uncharacterized protein LOC1067747482.0e-7049.41Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K++   A+NWL+ IP   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR YLM+RF   ++K+ +  G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L++  +KSG+WL  W+GG +FE + G   + ++VDL   TC+C  W+L+G+PC HA ++I YKS+  E+YV   Y    Y  CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC
        NG+ LW   D   +LPPI +  PGRPKKLRR+E DE     K+S+ NT++ CS C
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC

A0A1S3VWV3 uncharacterized protein LOC1067791313.0e-7149.41Show/hide
Query:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG
        M+ A +AT   EW K M  +K++   A+NWL+ IP   WCKHAFS+YP+CD  +NN+ E+FNSTIL ARDKPII M+EWIR YLM+RF   ++K+ +  G
Subjt:  MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKG

Query:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI
         + PK +K+L++  +KSG+WL  W+GG +FEV+ G   + ++VDL   TC+C+ W+L+G+PC H +++I YKS+  E+YV   Y    Y  CYAP I PI
Subjt:  KICPKIQKKLEENKKKSGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPI

Query:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC
        NG+ LW   D   +LPPI +  PGRPKKLRR+E DE     K+S+ NT++ CS C
Subjt:  NGENLWLRVDYDTILPPITRRQPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase8.5e-1022.82Show/hide
Query:  KATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWC-----KHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKN
        +A  ++ + E+   M+ IK     A+ WL   PP  W         +         L  +C+ F    +      + G    ++      F+ ++  +K+
Subjt:  KATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWC-----KHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKN

Query:  VKGKICPK-IQKKLEENKKKSGSWLSTWSGGER--FEVS------------RGNSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDC
          G +  + + +KLEE +  S +W+ T +  ER  ++VS              +S + IV L+  TC+C  ++    PC HA++           YVDDC
Subjt:  VKGKICPK-IQKKLEENKKKSGSWLSTWSGGER--FEVS------------RGNSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDC

Query:  YSIAMYEKCYAPTIHPINGENLWLRV-DYDTILPPITRRQP
        Y++  Y K Y+    P+   + W       T++PP+    P
Subjt:  YSIAMYEKCYAPTIHPINGENLWLRV-DYDTILPPITRRQP

AT1G64255.1 MuDR family transposase1.4e-0421.55Show/hide
Query:  KATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWC-----KHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQEN---KQK
        +A   + + E+V  M  IK     A  WL   P   W         +         L  +C AF     E     + G +  +   L  +F ++    + 
Subjt:  KATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWC-----KHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQEN---KQK

Query:  MKNVKGKICPKIQKKLEENKKK--SGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKC
          N        +  KLEE +    + S++ T      F+V+   +    IV L   +C+C  ++    PC HA++           YVDDCY++   ++ 
Subjt:  MKNVKGKICPKIQKKLEENKKK--SGSWLSTWSGGERFEVSRG-NSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKC

Query:  YAPTIHPINGENLWLRVD-YDTILPPITRRQP
        YA     +   + W        +LPP+    P
Subjt:  YAPTIHPINGENLWLRVD-YDTILPPITRRQP

AT1G64260.1 MuDR family transposase2.1e-0831.03Show/hide
Query:  KLEENKKKSGSWLSTWSGGERFEVSRGN-SEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPINGENLWLR
        KLEE    S  ++ T    + F+VS  +  E +IV L+  TC+C  ++    PC HA++           YVD+CY++  Y K YA T  P+     W  
Subjt:  KLEENKKKSGSWLSTWSGGERFEVSRGN-SEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPINGENLWLR

Query:  -VDYDTILPPITRRQP
             T+ PP  +  P
Subjt:  -VDYDTILPPITRRQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAAAGCTACCAGGGCGACAACTCAACCAGAATGGGTTAAAGTGATGGAGACAATCAAAAGTATTGAAGAGGGGGCTTATAATTGGCTGATTGCAATCCCCCCTCC
ATTGTGGTGTAAGCATGCGTTTAGTAGTTATCCTAAATGTGATGCCACCCTAAACAACATGTGTGAAGCTTTTAATTCTACCATACTGGAAGCCAGGGATAAACCAATTA
TTGGGATGATTGAATGGATTAGACTTTATCTCATGAGAAGATTTCAAGAAAATAAACAGAAGATGAAGAATGTTAAGGGAAAAATATGTCCAAAGATTCAGAAAAAATTG
GAAGAGAATAAGAAAAAAAGTGGGAGTTGGCTTTCTACTTGGTCCGGAGGTGAAAGGTTTGAGGTTAGTAGGGGTAATAGTGAGGCATACATAGTGGACTTGGATAAGAG
AACTTGTTCATGTTGGTATTGGGAGTTGATTGGACTCCCATGTGAACATGCTATTAGTTCCATATTCTACAAGTCTGATACAGTTGAAGATTATGTGGATGATTGCTACT
CTATAGCTATGTACGAAAAATGTTATGCTCCCACGATTCATCCTATAAATGGTGAAAACTTATGGTTACGTGTGGATTACGACACAATTCTCCCTCCAATCACAAGAAGA
CAACCTGGACGGCCGAAGAAATTGCGACGAAAAGAACCAGATGAGAAGAGGCCACCTACTAAAATGAGTAGGCAAAACACTTCCATGTCATGCTCTGTTTGTGTCATCAA
ATCTGACATAACAAAAGAAGTTGTAAAGGGAAGGTTAGTAAAATGTTAA
mRNA sequenceShow/hide mRNA sequence
AATCAAATCACCACAAATAATACCATTGGATTTTTTTTTTTATCTCTCTGATTTCATTTCTTCACTAAATCTATCTTTCTTTCTTTGTACCTTTAGGTTAATCTCCTATG
TTCCGGGTTTCTTTGTTCTTTGTTCTAGTTTCTAATCTTTAATTACAACATCTCAATACCTTTGAGTTAATCTCATCGGTTTCGGCCCTCTCTTAATTTTTTTCCATTTA
ACCATTCCGAAAAGGGTATGATAAAAATGATTTTGATCATTTTAAAATTACTCTCAAATATGCTCGCATATCTCTATATGATCAATGATAGGATGCACACAGATGAGTGC
AAAACCACCCGAGTTTGATGAATCTGAACTTTTCATGAGAAATAGAAAAAACTTCTATCAATAGTTTTTGAAACAGTTTTTTAATATAACAGGAACCAAGTGATCTCTTC
CTTCTTCGCTCTATCCTAATAATTATAGAGGTGTGATTTCCTTGTATAAATATATACGTCGAGTCTGCAGATTTTTCAAAGATTCAGTGAGGTTGAGGGATGGAGGAAGC
GGAGGGGACGATGTCGTGGAAGGTACTCAGCGACGAGATCAAGAAAGTTGGTGGTTTTGCAGCGCCGATGGCGGTGGCGACAGCGCTGCAATATATCTTCTGCAAGTGAT
AGGGATGATGATGGTGGGTTATTTGGGTGAACTCTACCTTTCGAGTGTCGCCATTGCTATTGCCCTCACCAACGTTACTGGCTTCGTAGTTCTTTGAGCATGGAGGAATT
TAGTGTAATTGTTAACCATGGTGGTAGGGTAGAAAAGAGTCCGGACAAATATGTAGGGGGATTGAAATCATCTTTTGAAAAAATTGACATTGACTTGTGGTCTTTGTTTG
AATTGTTTGACTTGGTTAAGTCCTTAGGCTATAACAAATGGGAAAATTTGAGGTATGAAGGTGGTAATGGTTGCTTTAAGGTAATAGACTCAGATAAAGTTGTGATGAAT
TTGGCTTCCTTCGCTATACTAAATGAAAATGCATCTATAAATGTATATGTGGAGCATGTAGCAAATGATCAGAGCCTTGCACATGAGGACAATGGAGTTGGACTTAACCA
AATATTTGACGAAGATGATAATGTGGAGGATATTCATTTTTCTGATAGTGAAGAGAACTTAAATGATGATCCCCATGGAGTTGGACTTGACCAAATATATGATGAAAATG
ATAATGTGGATGATATTCATTTTTCTGATAGTGAGGAGGACTTAAATGATGAGTTAGAACATGAGAAAGAAAAAGGTAAAGATAAAGAAGTTGATGAGGAAAATTGTGAA
GAGAACGATGGTTTAGGGCATGCCAATGAAGTTAATGTCGAGGAAGATAATGAGTGTAATTTAGATTTAGTTAACTTTTCTGACTCGGATTTAGAGGATGACATCGTAAG
GTCTCCATCATTTAAAAATGTGATTGATCCTTCTATTTTGAAGTTGACATGTTGTTTAGCTCTTTAGAAGAGTTTAAGAGCGCTGTATCTGAGTACGCCATAAATGGAGG
ATGGTCGATTCGTTATAAGAAGGCAGATAAGACTAGAGTTAGAGCTATATGCTCAAAAACATGTAAGTGGGTTACGTATGTGACTAAAATGAAGGGTGAGAAGACTTTTC
AATTGCGAACTTTGGTTGATAAGCATACTTGCACCCGTTCATTTAAGAATAGTAGAGTAACTTCTAAATGGTTGGCCAATAAGTTAGTCAATAGAGTTAAAGAGCAACCT
AGCATAAAGGTGAACACAATTCAAGAGAAGATCCAACGTAAGTATGTTGTCAAGATATCGAGGAGTAAGGCATATAGAGCGAAGAGGAAAGCTGTGGACATGGTTGAAGG
GAATCACGCTGAGCAATATGGAAAATTGTGGGACTATTGTGGTGAGCTACTTCGGGGTAATAATGGTAGTAGTGTTAAATTAAGCGTTGGTGCATATGATTGTTTGGTAA
AGGGTGAAGAACATCCAAGGTCATCTCTTGTGTTTGAGAGATTATATATATGTTTTGATGGGATGAAGAAAGGCTTCTTGGCAAGGTGTAGGCCATTTATTGGGTTGGAT
GGGTGCCACCTAAAAGGGCCCTATAGAGGCCAACTTTTAAGTGTCGTGGGTAGGGATCCAAACGATCAAATGTATCCTTTAGCATTTGCAGTGGTTGAGGTAGAATGCAA
GGCTTCATGAACATGGTTTCTAAGTATTTATTGATTGATATCGGTAGTATAAAGGAGTGTCGATGGATATTCATAAGTGACCAACAAAAGGGGTTGGTCCAAACATTTGA
GGAATTATTGCCAGGTGTAGAACATCGATTTTGTGTCCGACATCTATACAACAATATGTGAAAAAAGTACCCGGGCAAACAACTTAAGGATCTAATGTTGAAAGCTACCA
GGGCGACAACTCAACCAGAATGGGTTAAAGTGATGGAGACAATCAAAAGTATTGAAGAGGGGGCTTATAATTGGCTGATTGCAATCCCCCCTCCATTGTGGTGTAAGCAT
GCGTTTAGTAGTTATCCTAAATGTGATGCCACCCTAAACAACATGTGTGAAGCTTTTAATTCTACCATACTGGAAGCCAGGGATAAACCAATTATTGGGATGATTGAATG
GATTAGACTTTATCTCATGAGAAGATTTCAAGAAAATAAACAGAAGATGAAGAATGTTAAGGGAAAAATATGTCCAAAGATTCAGAAAAAATTGGAAGAGAATAAGAAAA
AAAGTGGGAGTTGGCTTTCTACTTGGTCCGGAGGTGAAAGGTTTGAGGTTAGTAGGGGTAATAGTGAGGCATACATAGTGGACTTGGATAAGAGAACTTGTTCATGTTGG
TATTGGGAGTTGATTGGACTCCCATGTGAACATGCTATTAGTTCCATATTCTACAAGTCTGATACAGTTGAAGATTATGTGGATGATTGCTACTCTATAGCTATGTACGA
AAAATGTTATGCTCCCACGATTCATCCTATAAATGGTGAAAACTTATGGTTACGTGTGGATTACGACACAATTCTCCCTCCAATCACAAGAAGACAACCTGGACGGCCGA
AGAAATTGCGACGAAAAGAACCAGATGAGAAGAGGCCACCTACTAAAATGAGTAGGCAAAACACTTCCATGTCATGCTCTGTTTGTGTCATCAAATCTGACATAACAAAA
GAAGTTGTAAAGGGAAGGTTAGTAAAATGTTAATTTTTTGTGTTTAGCAATGGTAGCTTATTGATGAGGTAACTTATTACTAAATGTATGTGATAGTTATTACTAAATGT
ATGTTATAGCTATAAAGTAGGGAACTTTAGTTCAAAGTAGTGGAGGACAAACACCAAATCGAGGTCGAAGTCAAAGTGGGCATTTGGGAAGTGCCCCTTCTCCATTATTG
TTTAATTCGTGCCAAAGTATAGAAATTCAAGCTTCACACGAATCGAGAAGTGGACCAAACCCCTTGTTGGTTGGGAAAACTACTTGTCTTCCTCCAAGAGCTTCAACTGG
AATTTGGATGAACAATACATCAAATGGAACGAGAGAAAAACTACCCATAAGAGGAAAATTGAAGAAGAAAGTCAGTAGCCTTCCTTTGTTGCCAGTTTGGGATAACAATT
GGATAACACGTTCAATCTTTACTGATCCAGAAGCCTTGAGGAAGCTGTCAAAGAAATGATAAGATTATGTTGTTGCTCACTTTTGTAAAGTTTAGGCCTTATATAAAATT
TTGTGTAGAAGATTAGCTTAGAAAGTGTACATGAACTAAAAGGATCATTTTTTGTGTACATGTACATTGTCGTAGGTGATAGCAATTTTTTGTATCTAATCTTTTAGGAA
AGGCCACAACTTAATGACAATGTTGCTATATAACTTTGATGACAAATTGAAGTTTGATTGCTATTGATATTTGAG
Protein sequenceShow/hide protein sequence
MLKATRATTQPEWVKVMETIKSIEEGAYNWLIAIPPPLWCKHAFSSYPKCDATLNNMCEAFNSTILEARDKPIIGMIEWIRLYLMRRFQENKQKMKNVKGKICPKIQKKL
EENKKKSGSWLSTWSGGERFEVSRGNSEAYIVDLDKRTCSCWYWELIGLPCEHAISSIFYKSDTVEDYVDDCYSIAMYEKCYAPTIHPINGENLWLRVDYDTILPPITRR
QPGRPKKLRRKEPDEKRPPTKMSRQNTSMSCSVCVIKSDITKEVVKGRLVKC