; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019556 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019556
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCACTA en-spm transposon protein
Genome locationscaffold5:34134873..34138383
RNA-Seq ExpressionSpg019556
SyntenySpg019556
Gene Ontology termsNA
InterPro domainsIPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]3.6e-5736.79Show/hide
Query:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                                     H+ KYV +++ +TF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY------------
        P+RITD  DWN+LC+RWETPEWKK+             LH                  GRD+  VDLF  SHF EK GW+N+ AK+ Y            
Subjt:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY------------

Query:  --------------VLETRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVR
                      VL  RSG+I+GLG +PK  SS SV+S    +KELE+K+E M+ E+  +K+  E M+    AL  +  M +   +E+  + GR + +
Subjt:  --------------VLETRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVR

Query:  QSEMN
            N
Subjt:  QSEMN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]2.6e-6039.42Show/hide
Query:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL---------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKE--
        TIPL  K W +V ++VRD V D+LL          H+ KYV +++ +TF+E+RS+L++HY+ F+DPK AR  PP+RITD  DWN+LC+RWETPEWKK+  
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL---------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKE--

Query:  -----------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY--------------------------VLETRSGHIRGLG
                   LH                  GRD+  VDLF  SHF EK GW+N+ AK+ Y                          VL  RSG+I+GLG
Subjt:  -----------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY--------------------------VLETRSGHIRGLG

Query:  WDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVRQSEMN
         +PK  SS SV+S    +KELE+K+E M+ E+  +K+  E M+    AL  +  M +   +E+  + GR + +    N
Subjt:  WDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVRQSEMN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida]4.5e-5236.94Show/hide
Query:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                                     H+ KYV +++ +TF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVYVLETRSGHIRGL
        P+RITD  DWN+LC+RWETPEWKK+             LH                  GRD+  VDLF  SHF EK GW+N+ AK+ Y LE +       
Subjt:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVYVLETRSGHIRGL

Query:  GWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVRQSEMN
          DP   SSV        +KELE+K+E M+ E+  +K+  E M+    AL  +  M +   +E+  + GR + +    N
Subjt:  GWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVRQSEMN

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida]1.6e-4939.93Show/hide
Query:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                                     H+ KYV +++ +TF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY
        P+RITD  DWN+LC+RWETPEWKK+             LH                  GRD+  VDLF  SHF EK GW+N+ AK+ Y
Subjt:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]3.6e-5736.79Show/hide
Query:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++DR+V  H RI I IDE VGKP C  AT+FS  IGT+ R+
Subjt:  MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNAL-------QGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                                     H+ KYV +++ +TF+E+RS+L++HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL------------------------------------PHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENP

Query:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY------------
        P+RITD  DWN+LC+RWETPEWKK+             LH                  GRD+  VDLF  SHF EK GW+N+ AK+ Y            
Subjt:  PERITDQNDWNMLCDRWETPEWKKE-------------LH------------------GRDIGPVDLFHDSHFTEKKGWINDRAKEVY------------

Query:  --------------VLETRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVR
                      VL  RSG+I+GLG +PK  SS SV+S    +KELE+K+E M+ E+  +K+  E M+    AL  +  M +   +E+  + GR + +
Subjt:  --------------VLETRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQ---SELDALKGREEVR

Query:  QSEMN
            N
Subjt:  QSEMN

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ3 Transposase6.6e-3331.91Show/hide
Query:  RASGSNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH---------INK
        R+      + R  RG+ RNI++D++V  H ++ I I+E  GKP   FA + +  IGT  R+TI L  + W  +P  V++ + DR   H         + K
Subjt:  RASGSNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH---------INK

Query:  YVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELH-------------------------------GRDIGPVD
        Y++ K+ + FREFR+ LH++Y  F+D   AR NPP++IT + DWNM+CDRWET  WKK+                                 G D+  V+
Subjt:  YVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELH-------------------------------GRDIGPVD

Query:  LFHDSHFTEKKGWINDRAKEVY----------------------VLETRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQ
        +F ++HF EK+GWIND+AK+ Y                      VL + S  I  L    KSG S+  + S+ REKE        ++E+  LK   E + 
Subjt:  LFHDSHFTEKKGWINDRAKEVY----------------------VLETRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQ

Query:  SEMDALKGREEMMQSELDALKGREEVRQS
         E+   + R   ++ E+ +  GR   R S
Subjt:  SEMDALKGREEMMQSELDALKGREEVRQS

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class4.6e-4238.21Show/hide
Query:  STEPSSARASG-SNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH----
        S +PS+   S      +GR  RG+ RNI++D++V  H +I I I+E  GKP   FA + +  IGT  R+TIPL  + W  VP  VR+ V DRL  H    
Subjt:  STEPSSARASG-SNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH----

Query:  -----INKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELHGRDIGPVDLFHDSHFTEKKGWINDRAKEVY-
             + KY+E K+ + FREFR++LH++Y  F+D   AR NPP RIT   DWNM+CDRWET  WKK+  G D+  +++FH++HF EK+GWIND+AK+ Y 
Subjt:  -----INKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELHGRDIGPVDLFHDSHFTEKKGWINDRAKEVY-

Query:  -----VLETRSGHIRGLGW----------------DPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEM
             + E+    ++ +                  +P+SG S+  + S+ REKE        ++E+  LK   E +  E+
Subjt:  -----VLETRSGHIRGLGW----------------DPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEM

A0A5A7TRX4 DUF4216 domain-containing protein1.7e-3634.11Show/hide
Query:  SNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH---------INKYVER
        S+   GR  RG+ RNI++D++V  H +I I I+E  GKP   FA + +  IGT  R+TIPL  + W  VP  VR+ V D L  H         + KY+E 
Subjt:  SNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH---------INKYVER

Query:  KIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELH-------------------------------GRDIGPVDLFHD
        K+ +TFREFR++LH++Y  F+D   AR NP  RIT + DWNM+CDRWET  WKK+                                 G D+  +++FH+
Subjt:  KIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELH-------------------------------GRDIGPVDLFHD

Query:  SHFTEKKGWINDRAKEVY------VLETRSGHIRGLGW----------------DPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEM
        +HF EK+GW ND+AK+ Y      + E+    ++ +                  +P+SG S+  + S+ REKE        ++E+  LK   E +  E+
Subjt:  SHFTEKKGWINDRAKEVY------VLETRSGHIRGLGW----------------DPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEM

A0A5A7US78 Uncharacterized protein1.5e-4038.08Show/hide
Query:  STEPSSARASG-SNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH----
        S +PS+   S      +GR  RG+ RNI++D++V  H +I I I+E  GKP   F  + +  IGT  R+TIPL  + W  VP  VR+ V D L  H    
Subjt:  STEPSSARASG-SNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVRDNVKDRLLPH----

Query:  -----INKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELHGRDIGPVDLFHDSHFTEKKGWINDRAKEVY-
             + KY+E K+ +TFREFR+ LH++Y  F+D   AR NPP RIT + DWNM+CDRWET  WKK+  G D+  +++FH++HF +K+GWIND+AK+ Y 
Subjt:  -----INKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELHGRDIGPVDLFHDSHFTEKKGWINDRAKEVY-

Query:  -----VLETRSGHIRGLGWDPKSG---SSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQSEL
             + E+    ++ +           S S+ + NPR        E ++S +    S  E  ++EM  LK   E +  EL
Subjt:  -----VLETRSGHIRGLGWDPKSG---SSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQSEL

A0A6J1DUH3 uncharacterized protein LOC1110232124.4e-2936.65Show/hide
Query:  INKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWK-------------------------------KELHGRDIG
        +NK++E+++  +F+++RS+LH++Y  FEDP  AR NPPER+T+  DWN LCDRWETPEWK                               K   G DIG
Subjt:  INKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWK-------------------------------KELHGRDIG

Query:  PVDLFHDSHFTEKKGWINDRAKEVY--------------------------VLETRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELEQKVEMMQSE
        PVDLF +SH+ EK G +ND A++ Y                          VL  R  H++GLG+ P     K GSS +V+SS   EKELE+KVE M+ E
Subjt:  PVDLFHDSHFTEKKGWINDRAKEVY--------------------------VLETRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELEQKVEMMQSE

Query:  IDALKSREEMMQSEMDALKGR
        +  +K+  + ++  +   + R
Subjt:  IDALKSREEMMQSEMDALKGR

SwissProt top hitse value%identityAlignment
F4JGB7 Chromatin remodeling protein At4g042601.6e-0447.83Show/hide
Query:  HPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD
        H +C+G+TIEEAKKL+HF+C +C S+ D  +  Q   G  S+ T+D
Subjt:  HPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD

F4JL28 Chromatin remodeling protein EBS1.6e-0745.28Show/hide
Query:  KGREEMFHPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD
        +G ++ +HP+C+G+TIEEAKKLDHF+C++C S++D  +      G  S+P  D
Subjt:  KGREEMFHPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD

Arabidopsis top hitse value%identityAlignment
AT4G04260.1 Bromo-adjacent homology (BAH) domain-containing protein1.2e-0547.83Show/hide
Query:  HPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD
        H +C+G+TIEEAKKL+HF+C +C S+ D  +  Q   G  S+ T+D
Subjt:  HPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD

AT4G22140.1 PHD finger family protein / bromo-adjacent homology (BAH) domain-containing protein1.1e-0845.28Show/hide
Query:  KGREEMFHPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD
        +G ++ +HP+C+G+TIEEAKKLDHF+C++C S++D  +      G  S+P  D
Subjt:  KGREEMFHPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD

AT4G22140.2 PHD finger family protein / bromo-adjacent homology (BAH) domain-containing protein1.1e-0845.28Show/hide
Query:  KGREEMFHPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD
        +G ++ +HP+C+G+TIEEAKKLDHF+C++C S++D  +      G  S+P  D
Subjt:  KGREEMFHPSCMGITIEEAKKLDHFLCSDCLSENDANRPTQVCVGILSTPTHD

AT4G39100.1 PHD finger family protein / bromo-adjacent homology (BAH) domain-containing protein1.3e-0469.23Show/hide
Query:  EMFHPSCMGITIEEAKKLDHFLCSDC
        E FHPSC+G TIEEAKK D+F C +C
Subjt:  EMFHPSCMGITIEEAKKLDHFLCSDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGTCAAAGTCGAGAGCACGATGCATCGACAGTGTATGGAGAGGATGTGGATCCCGTGGTGGTTGGTTCCTCGACTGAGCCTTCATCTGCTCGAGCTTCAGGATC
TAACGCATTACAAGGAAGACGAAAGAGAGGGCATAGTCGAAATATCAAAATTGATCGGTATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCA
AACCAACGTGTAAGTTTGCTACACAATTCAGTGGGACGATTGGTACCCTCACACGAGATACAATTCCGTTGCATTACAAGGCTTGGCCTGAGGTCCCCCAACAAGTTCGA
GACAACGTAAAAGATCGACTCTTGCCGCATATTAACAAGTATGTGGAACGTAAAATAATGGACACTTTTAGGGAGTTTAGGAGCGAGTTGCATAGGCACTACAAGGGATT
CGAGGACCCTAAAGTTGCTCGAGAAAATCCACCAGAAAGGATTACCGACCAGAACGATTGGAATATGCTATGCGACAGATGGGAGACTCCCGAATGGAAAAAAGAACTAC
ATGGTCGTGACATTGGGCCAGTGGATTTGTTCCACGATAGTCATTTTACTGAGAAGAAAGGATGGATCAACGACAGAGCAAAAGAAGTATACGTTTTGGAAACTCGATCA
GGCCACATCAGAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGTCTTCAAACCCTCGTGAAAAAGAGCTAGAGCAGAAAGTCGAGATGATGCAATC
TGAGATTGATGCACTCAAGAGCAGGGAGGAGATGATGCAATCTGAGATGGATGCACTCAAGGGCAGGGAGGAGATGATGCAATCTGAGTTGGATGCACTCAAGGGCAGGG
AGGAGGTGAGGCAATCTGAAATGAATGCGCTCAAGGGCAGGGAGGAGATGTTTCATCCATCATGCATGGGAATAACCATTGAAGAAGCAAAGAAGTTGGATCACTTTTTG
TGTTCAGATTGTTTATCAGAAAATGATGCGAATAGGCCAACGCAAGTTTGCGTCGGCATATTGTCTACGCCGACACACGATTGCGTCGGCGTAGGTCATTCAAACCAGCA
ATTTCAGATTATCAAGGTCGAGGCAATGTGCGTTGGCGTATACATCTACGCCGACGTGTCTACTCCGACGTCCGTGCCGACGCAATCTTCGCGTCGGGGAGTTGTCTACC
CCGACGCGGGAGGGGTCTACGCCGACGCAACGATGCGTCGGCGTAGACCGAAAATCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGTCAAAGTCGAGAGCACGATGCATCGACAGTGTATGGAGAGGATGTGGATCCCGTGGTGGTTGGTTCCTCGACTGAGCCTTCATCTGCTCGAGCTTCAGGATC
TAACGCATTACAAGGAAGACGAAAGAGAGGGCATAGTCGAAATATCAAAATTGATCGGTATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCA
AACCAACGTGTAAGTTTGCTACACAATTCAGTGGGACGATTGGTACCCTCACACGAGATACAATTCCGTTGCATTACAAGGCTTGGCCTGAGGTCCCCCAACAAGTTCGA
GACAACGTAAAAGATCGACTCTTGCCGCATATTAACAAGTATGTGGAACGTAAAATAATGGACACTTTTAGGGAGTTTAGGAGCGAGTTGCATAGGCACTACAAGGGATT
CGAGGACCCTAAAGTTGCTCGAGAAAATCCACCAGAAAGGATTACCGACCAGAACGATTGGAATATGCTATGCGACAGATGGGAGACTCCCGAATGGAAAAAAGAACTAC
ATGGTCGTGACATTGGGCCAGTGGATTTGTTCCACGATAGTCATTTTACTGAGAAGAAAGGATGGATCAACGACAGAGCAAAAGAAGTATACGTTTTGGAAACTCGATCA
GGCCACATCAGAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGTCTTCAAACCCTCGTGAAAAAGAGCTAGAGCAGAAAGTCGAGATGATGCAATC
TGAGATTGATGCACTCAAGAGCAGGGAGGAGATGATGCAATCTGAGATGGATGCACTCAAGGGCAGGGAGGAGATGATGCAATCTGAGTTGGATGCACTCAAGGGCAGGG
AGGAGGTGAGGCAATCTGAAATGAATGCGCTCAAGGGCAGGGAGGAGATGTTTCATCCATCATGCATGGGAATAACCATTGAAGAAGCAAAGAAGTTGGATCACTTTTTG
TGTTCAGATTGTTTATCAGAAAATGATGCGAATAGGCCAACGCAAGTTTGCGTCGGCATATTGTCTACGCCGACACACGATTGCGTCGGCGTAGGTCATTCAAACCAGCA
ATTTCAGATTATCAAGGTCGAGGCAATGTGCGTTGGCGTATACATCTACGCCGACGTGTCTACTCCGACGTCCGTGCCGACGCAATCTTCGCGTCGGGGAGTTGTCTACC
CCGACGCGGGAGGGGTCTACGCCGACGCAACGATGCGTCGGCGTAGACCGAAAATCTTCTAG
Protein sequenceShow/hide protein sequence
MLGQSREHDASTVYGEDVDPVVVGSSTEPSSARASGSNALQGRRKRGHSRNIKIDRYVATHNRIPIHIDESVGKPTCKFATQFSGTIGTLTRDTIPLHYKAWPEVPQQVR
DNVKDRLLPHINKYVERKIMDTFREFRSELHRHYKGFEDPKVARENPPERITDQNDWNMLCDRWETPEWKKELHGRDIGPVDLFHDSHFTEKKGWINDRAKEVYVLETRS
GHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEIDALKSREEMMQSEMDALKGREEMMQSELDALKGREEVRQSEMNALKGREEMFHPSCMGITIEEAKKLDHFL
CSDCLSENDANRPTQVCVGILSTPTHDCVGVGHSNQQFQIIKVEAMCVGVYIYADVSTPTSVPTQSSRRGVVYPDAGGVYADATMRRRRPKIF