; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032839 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032839
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCACTA en-spm transposon protein
Genome locationscaffold11:14434859..14436415
RNA-Seq ExpressionSpg032839
SyntenySpg032839
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]2.8e-7943.2Show/hide
Query:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +      RG  RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ NTF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSE
        P+RITD  DWN+LC+RWETP+WKKK + NK SRS++PY HR+G KSFV                     Q+HF EK GW+N+ AK+ YLEM  L++AS +
Subjt:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSE

Query:  QGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQR
        +    +S  +V KQVLG RSG+IKGLG +PK  SS SV+S    +KELE+K+E M+ EM  +K+  E M+    AL S              +  +WE R
Subjt:  QGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQR

Query:  WAQIQHLFGEQSG-GGSSN
        WA+IQ++ G   G  G SN
Subjt:  WAQIQHLFGEQSG-GGSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]1.1e-8346.17Show/hide
Query:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +      RG  RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKTWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKAD
        TIPL  K W +V ++VRD V D+LL+ +D D+ K H+ KYV +++ NTF+E+RS+L++HY+ F+DPK AR  PP+RITD  DWN+LC+RWETP+WKKK +
Subjt:  TIPLHYKTWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKAD

Query:  QNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLG
         NK SRS++PY HR+G KSFV                     Q+HF EK GW+N+ AK+ YLEM  L++AS ++    +S  +V KQVLG RSG+IKGLG
Subjt:  QNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLG

Query:  WDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQRWAQIQHLFGEQSG-GGSSN
         +PK  SS SV+S    +KELE+K+E M+ EM  +K+  E M+    AL S              +  +WE RWA+IQ++ G   G  G SN
Subjt:  WDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQRWAQIQHLFGEQSG-GGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida]8.9e-6539.14Show/hide
Query:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +      RG  RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ NTF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSE
        P+RITD  DWN+LC+RWETP+WKKK + NK SRS++PY HR+G KSFV                     Q+HF EK GW+N+ AK+ YLEM  L++AS +
Subjt:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSE

Query:  QGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQR
        +                           DP   SSV        +KELE+K+E M+ EM  +K+  E M+    AL S              +  +WE R
Subjt:  QGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQR

Query:  WAQIQHLFGEQSG-GGSSN
        WA+IQ++ G   G  G SN
Subjt:  WAQIQHLFGEQSG-GGSSN

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida]2.3e-5743.75Show/hide
Query:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +      RG  RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ NTF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVY
        P+RITD  DWN+LC+RWETP+WKKK + NK SRS++PY HR+G KSFV                     Q+HF EK GW+N+ AK+ Y
Subjt:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVY

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]2.8e-7943.2Show/hide
Query:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +      RG  RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNAL-----RG--RRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ NTF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKTWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSE
        P+RITD  DWN+LC+RWETP+WKKK + NK SRS++PY HR+G KSFV                     Q+HF EK GW+N+ AK+ YLEM  L++AS +
Subjt:  PERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFV---------------------QNHFTEKKGWINDKAKEVYLEMDGLLKASSE

Query:  QGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQR
        +    +S  +V KQVLG RSG+IKGLG +PK  SS SV+S    +KELE+K+E M+ EM  +K+  E M+    AL S              +  +WE R
Subjt:  QGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQR

Query:  WAQIQHLFGEQSG-GGSSN
        WA+IQ++ G   G  G SN
Subjt:  WAQIQHLFGEQSG-GGSSN

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ3 Transposase4.0e-4737.17Show/hide
Query:  RGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVERQIMN
        R R  RG+ RNI++D++V  H ++ I I+E  GKP   FA + +  IGT  R+TI L  + W  +P  V++ + DR  T ++ D +   + KY++ ++ N
Subjt:  RGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVERQIMN

Query:  TFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQ---------------------NHFT
         FREFR+ LH++Y  F+D   AR NPP++IT + DWNM+CDRWET  WKKK + NK SRS + +NH  G KSF+Q                      HF 
Subjt:  TFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQ---------------------NHFT

Query:  EKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMD
        EK+GWINDKAK+ Y     ++  S+E G + IS     K VLG+ S  I  L    KSG S+  + S+ REKE        ++EM  LK   E +  E+ 
Subjt:  EKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMD

Query:  ALKSREEVRQSEINAFKGKEDIWEQRWAQIQHLFGEQSG
                              WEQRW  I+   G + G
Subjt:  ALKSREEVRQSEINAFKGKEDIWEQRWAQIQHLFGEQSG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class1.4e-4741.14Show/hide
Query:  STEPSSAQASG-SNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVD
        S +PS+   S      RGR  RG+ RNI++D++V  H +I I I+E  GKP   FA + +  IGT  R+TIPL  + W  VP  VR+ V DRL T ++ D
Subjt:  STEPSSAQASG-SNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVD

Query:  LSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGC-----KSFVQNHFT
         +   + KY+E ++ N FREFR++LH++Y  F+D   AR NPP RIT   DWNM+CDRWET  WKKK                 GC     + F + HF 
Subjt:  LSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGC-----KSFVQNHFT

Query:  EKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEM
        EK+GWINDKAK+ YLEM  ++  S+E G + IS     + VLG+RS        +P+SG S+  + S+ REKE        ++EM  LK   E +  E+
Subjt:  EKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEM

A0A5A7TRX4 DUF4216 domain-containing protein7.9e-5141.03Show/hide
Query:  EPSSAQASGSNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVDLSK
        E    Q+S S+   GR  RG+ RNI++D++V  H +I I I+E  GKP   FA + +  IGT  R+TIPL  + W  VP  VR+ V D L T ++ D + 
Subjt:  EPSSAQASGSNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVDLSK

Query:  PHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQ------------
          + KY+E ++ NTFREFR++LH++Y  F+D   AR NP  RIT + DWNM+CDRWET  WKKK + NK S S + +NH +G KSF+Q            
Subjt:  PHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQ------------

Query:  ---------NHFTEKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNA
                  HF EK+GW NDKAK+ YLEM  +++ S+E G + IS     + VLG+RS        +P+SG S+  + S+ REKE        ++EM  
Subjt:  ---------NHFTEKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNA

Query:  LKSREEMMQSEM
        LK   E +  E+
Subjt:  LKSREEMMQSEM

A0A5A7US78 Uncharacterized protein1.2e-4640.47Show/hide
Query:  STEPSSAQASG-SNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVD
        S +PS+   S      RGR  RG+ RNI++D++V  H +I I I+E  GKP   F  + +  IGT  R+TIPL  + W  VP  VR+ V D L T ++ D
Subjt:  STEPSSAQASG-SNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVRDNVKDRLLTKWDVD

Query:  LSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGC-----KSFVQNHFT
         +   + KY+E ++ NTFREFR+ LH++Y  F+D   AR NPP RIT + DWNM+CDRWET  WKKK               + GC     + F + HF 
Subjt:  LSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGC-----KSFVQNHFT

Query:  EKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEM
        +K+GWINDKAK+ YLEM  ++  S+E G + IS     K VLG+RS        +P+SG S+  + S+ REK+        ++EM  LK   E +  E+
Subjt:  EKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEM

A0A6J1DUH3 uncharacterized protein LOC1110232122.4e-4740.07Show/hide
Query:  VDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQ-------
        VDLSK  +NK++E+Q+  +F+++RS+LH++Y  FEDP  AR NPPER+T+  DWN LCDRWETP+WK+   +NK +R++LP+NHR+G KSF+Q       
Subjt:  VDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQ-------

Query:  --------------NHFTEKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDP-----KSGSSVSVSSSNPREKELEQK
                      +H+ EK G +ND A++ Y  M  L+KA +++G E ++QP+  ++VLG R  H+KGLG+ P     K GSS +V+SS   EKELE+K
Subjt:  --------------NHFTEKKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDP-----KSGSSVSVSSSNPREKELEQK

Query:  VEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQRWAQIQHLFGEQSGGGSSN
        VE M+ EM  +K+  + ++  +                       WE RW +I      + G G SN
Subjt:  VEMMQSEMNALKSREEMMQSEMDALKSREEVRQSEINAFKGKEDIWEQRWAQIQHLFGEQSGGGSSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGTCAAAGTCGAGAGCACGATGCATCGACAGTGTATGGAGAGGATGTGGACCTCGCAGTGGTTGGTTCCTCGACTGAGCCTTCATCTGCTCAAGCTTCAGGATC
TAACGCATTGCGAGGAAGACGAAAGAGAGGGCATAGTCGAAACATCAAAATTGATCGGTATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCA
AACCAACGTGTAAGTTTGCTACACAATTCAGTGGGACGATTGGTACCCTCACACGAGATACAATTCCGTTACATTACAAGACTTGGCCTGAGGTCCCCCAACAAGTCCGA
GACAACGTAAAAGATCGACTCTTGACGAAATGGGATGTGGATTTGTCAAAGCCGCATATTAACAAGTATGTGGAACGACAAATAATGAACACGTTTAGGGAGTTTAGGAG
CGAGTTGCATAGGCACTACAAGGGATTCGAGGACCCTAAAGTTGCTCGAGAAAATCCACCAGAAAGGATTACCGACCAGAACGATTGGAATATGCTATGCGACAGATGGG
AGACTCCCAAATGGAAAAAAAAAGCGGATCAAAATAAGAATAGTCGCTCACAACTCCCCTACAACCATCGAAGTGGATGTAAGTCTTTTGTTCAGAATCATTTTACTGAG
AAGAAGGGATGGATCAACGACAAAGCAAAAGAAGTATACTTGGAAATGGATGGGCTACTAAAAGCATCGTCAGAACAAGGGTCCGAACAGATCTCACAACCTGATGTTCT
GAAACAGGTTTTGGGAACTCGATCAGGCCATATCAAAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGTCTTCAAACCCTCGTGAAAAAGAGCTAG
AGCAGAAAGTCGAGATGATGCAATCTGAGATGAATGCACTCAAAAGTAGGGAGGAGATGATGCAATCTGAGATGGATGCACTCAAGAGCAGGGAGGAGGTGAGGCAATCT
GAAATAAATGCGTTCAAGGGCAAGGAGGACATATGGGAACAAAGATGGGCTCAAATCCAACATTTGTTTGGCGAACAATCGGGAGGAGGGTCTTCGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGTCAAAGTCGAGAGCACGATGCATCGACAGTGTATGGAGAGGATGTGGACCTCGCAGTGGTTGGTTCCTCGACTGAGCCTTCATCTGCTCAAGCTTCAGGATC
TAACGCATTGCGAGGAAGACGAAAGAGAGGGCATAGTCGAAACATCAAAATTGATCGGTATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCA
AACCAACGTGTAAGTTTGCTACACAATTCAGTGGGACGATTGGTACCCTCACACGAGATACAATTCCGTTACATTACAAGACTTGGCCTGAGGTCCCCCAACAAGTCCGA
GACAACGTAAAAGATCGACTCTTGACGAAATGGGATGTGGATTTGTCAAAGCCGCATATTAACAAGTATGTGGAACGACAAATAATGAACACGTTTAGGGAGTTTAGGAG
CGAGTTGCATAGGCACTACAAGGGATTCGAGGACCCTAAAGTTGCTCGAGAAAATCCACCAGAAAGGATTACCGACCAGAACGATTGGAATATGCTATGCGACAGATGGG
AGACTCCCAAATGGAAAAAAAAAGCGGATCAAAATAAGAATAGTCGCTCACAACTCCCCTACAACCATCGAAGTGGATGTAAGTCTTTTGTTCAGAATCATTTTACTGAG
AAGAAGGGATGGATCAACGACAAAGCAAAAGAAGTATACTTGGAAATGGATGGGCTACTAAAAGCATCGTCAGAACAAGGGTCCGAACAGATCTCACAACCTGATGTTCT
GAAACAGGTTTTGGGAACTCGATCAGGCCATATCAAAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGTCTTCAAACCCTCGTGAAAAAGAGCTAG
AGCAGAAAGTCGAGATGATGCAATCTGAGATGAATGCACTCAAAAGTAGGGAGGAGATGATGCAATCTGAGATGGATGCACTCAAGAGCAGGGAGGAGGTGAGGCAATCT
GAAATAAATGCGTTCAAGGGCAAGGAGGACATATGGGAACAAAGATGGGCTCAAATCCAACATTTGTTTGGCGAACAATCGGGAGGAGGGTCTTCGAACTGA
Protein sequenceShow/hide protein sequence
MLGQSREHDASTVYGEDVDLAVVGSSTEPSSAQASGSNALRGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKTWPEVPQQVR
DNVKDRLLTKWDVDLSKPHINKYVERQIMNTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPKWKKKADQNKNSRSQLPYNHRSGCKSFVQNHFTE
KKGWINDKAKEVYLEMDGLLKASSEQGSEQISQPDVLKQVLGTRSGHIKGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMNALKSREEMMQSEMDALKSREEVRQS
EINAFKGKEDIWEQRWAQIQHLFGEQSGGGSSN