; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008030 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008030
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCACTA en-spm transposon protein
Genome locationchr9:10266774..10268314
RNA-Seq ExpressionLag0008030
SyntenyLag0008030
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]7.5e-7240.76Show/hide
Query:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++D++V  H RI I IDE VGKP C  AT+FS  IG + R+
Subjt:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ +TF+E+RS+L+ HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP

Query:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKAS
        P+RITD  DW +LC+RWETPEW            KI +L              + IKE  GRD+  VDLF  SHF +K GW+N+ AK+ YLEM  +++AS
Subjt:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKAS

Query:  SEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMW
         +E    +S   V KQVLG RSG+I+GLG +PK  SS SV+S    +KELE+K+E M+ EM  +K+  E M+               E N AL  +  MW
Subjt:  SEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMW

Query:  EQRWAQIQHLFGEQSG-GGSSN
        E RWA+IQ++ G   G  G SN
Subjt:  EQRWAQIQHLFGEQSG-GGSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]2.9e-7643.54Show/hide
Query:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++D++V  H RI I IDE VGKP C  AT+FS  IG + R+
Subjt:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEW-----
        TIPL  K W +V ++VRD V D+LL+ +D D+ K H+ KYV +++ +TF+E+RS+L+ HY+ F+DPK AR  PP+RITD  DW +LC+RWETPEW     
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEW-----

Query:  -------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRG
               KI +L              + IKE  GRD+  VDLF  SHF +K GW+N+ AK+ YLEM  +++AS +E    +S   V KQVLG RSG+I+G
Subjt:  -------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRG

Query:  LGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMWEQRWAQIQHLFGEQSG-GGSSN
        LG +PK  SS SV+S    +KELE+K+E M+ EM  +K+  E M+               E N AL  +  MWE RWA+IQ++ G   G  G SN
Subjt:  LGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMWEQRWAQIQHLFGEQSG-GGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida]8.0e-5836.97Show/hide
Query:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++D++V  H RI I IDE VGKP C  AT+FS  IG + R+
Subjt:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ +TF+E+RS+L+ HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP

Query:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKAS
        P+RITD  DW +LC+RWETPEW            KI +L              + IKE  GRD+  VDLF  SHF +K GW+N+ AK+ YLEM  +++AS
Subjt:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKAS

Query:  SEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMW
         +E                           DP   SSV        +KELE+K+E M+ EM  +K+  E M+               E N AL  +  MW
Subjt:  SEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMW

Query:  EQRWAQIQHLFGEQSG-GGSSN
        E RWA+IQ++ G   G  G SN
Subjt:  EQRWAQIQHLFGEQSG-GGSSN

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida]6.2e-5040Show/hide
Query:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++D++V  H RI I IDE VGKP C  AT+FS  IG + R+
Subjt:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ +TF+E+RS+L+ HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP

Query:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVY
        P+RITD  DW +LC+RWETPEW            KI +L              + IKE  GRD+  VDLF  SHF +K GW+N+ AK+ Y
Subjt:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVY

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]7.5e-7240.76Show/hide
Query:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD
        +L  S +   + +  +  D   +GSSTE  +  ASGS +        + RR RGHSRN+++D++V  H RI I IDE VGKP C  AT+FS  IG + R+
Subjt:  MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTF-------QGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRD

Query:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP
        TIPL  K W +V ++VRD V D+LL                           + +D D+ K H+ KYV +++ +TF+E+RS+L+ HY+ F+DPK AR  P
Subjt:  TIPLHYKAWPEVPQQVRDNVKDRLL---------------------------TKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENP

Query:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKAS
        P+RITD  DW +LC+RWETPEW            KI +L              + IKE  GRD+  VDLF  SHF +K GW+N+ AK+ YLEM  +++AS
Subjt:  PERITDQNDWKMLCDRWETPEW------------KICFL-------------VLLIKELHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKAS

Query:  SEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMW
         +E    +S   V KQVLG RSG+I+GLG +PK  SS SV+S    +KELE+K+E M+ EM  +K+  E M+               E N AL  +  MW
Subjt:  SEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMN-ALKGREEMW

Query:  EQRWAQIQHLFGEQSG-GGSSN
        E RWA+IQ++ G   G  G SN
Subjt:  EQRWAQIQHLFGEQSG-GGSSN

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ3 Transposase3.6e-4033.91Show/hide
Query:  RASGSNTFQGRR-KRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVDLSKPHIN
        R S S T +  R  RG+ RNI++D++V  H ++ I I+E  GKP   FA + +  IG   R+TI L  + W  +P  V++ + DR  T ++ D +   + 
Subjt:  RASGSNTFQGRR-KRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVDLSKPHIN

Query:  KYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWK---------------------ICFLVLL--IKELHGRDIGPV
        KY++ K+ + FREFR+ LH +Y  F+D   AR NPP++IT + DW M+CDRWET  WK                       FL +   +++  G D+  V
Subjt:  KYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWK---------------------ICFLVLL--IKELHGRDIGPV

Query:  DLFHDSHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSR
        ++F ++HF +K+GWIND+AK+ Y     I+  S+E G + IS     K VLG+ S  I  L    KSG S+  + S+ REKE        ++EM  LK  
Subjt:  DLFHDSHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSR

Query:  EEMMQSEMDALKGREELRQSEMNALKGREEMWEQRWAQIQHLFGEQSG
         E +  E+                       WEQRW  I+   G + G
Subjt:  EEMMQSEMDALKGREELRQSEMNALKGREEMWEQRWAQIQHLFGEQSG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class1.5e-4940.75Show/hide
Query:  STEPSSARASGSNT-FQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVD
        S +PS+   S   T  +GR  RG+ RNI++D++V  H +I I I+E  GKP   FA + +  IG   R+TIPL  + W  VP  VR+ V DRL T ++ D
Subjt:  STEPSSARASGSNT-FQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVD

Query:  LSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWKICFLVLLIKELHGRDIGPVDLFHDSHFTKKKGWIN
         +   + KY+E K+ + FREFR++LH +Y  F+D   AR NPP RIT   DW M+CDRWET  WK            G D+  +++FH++HF +K+GWIN
Subjt:  LSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWKICFLVLLIKELHGRDIGPVDLFHDSHFTKKKGWIN

Query:  DRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEM
        D+AK+ YLEM  I+  S+E G + IS     + VLG+RS        +P+SG S+  + S+ REKE        ++EM  LK   E +  E+
Subjt:  DRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEM

A0A5A7TRX4 DUF4216 domain-containing protein2.6e-4638.28Show/hide
Query:  SNTFQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVER
        S+   GR  RG+ RNI++D++V  H +I I I+E  GKP   FA + +  IG   R+TIPL  + W  VP  VR+ V D L T ++ D +   + KY+E 
Subjt:  SNTFQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVDLSKPHINKYVER

Query:  KIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWK---------------------ICFLVLL--IKELHGRDIGPVDLFHD
        K+ +TFREFR++LH +Y  F+D   AR NP  RIT + DW M+CDRWET  WK                       FL +   +K+  G D+  +++FH+
Subjt:  KIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWK---------------------ICFLVLL--IKELHGRDIGPVDLFHD

Query:  SHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQ
        +HF +K+GW ND+AK+ YLEM  I++ S+E G + IS     + VLG+RS        +P+SG S+  + S+ REKE        ++EM  LK   E + 
Subjt:  SHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQ

Query:  SEM
         E+
Subjt:  SEM

A0A5A7US78 Uncharacterized protein3.0e-5041.1Show/hide
Query:  STEPSSARASGSNT-FQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVD
        S +PS+   S   T  +GR  RG+ RNI++D++V  H +I I I+E  GKP   F  + +  IG   R+TIPL  + W  VP  VR+ V D L T ++ D
Subjt:  STEPSSARASGSNT-FQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVRDNVKDRLLTKWDVD

Query:  LSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWKICFLVLLIKELHGRDIGPVDLFHDSHFTKKKGWIN
         +   + KY+E K+ +TFREFR+ LH +Y  F+D   AR NPP RIT + DW M+CDRWET  WK        K+  G D+  +++FH++HF KK+GWIN
Subjt:  LSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWKICFLVLLIKELHGRDIGPVDLFHDSHFTKKKGWIN

Query:  DRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEM
        D+AK+ YLEM  I+  S+E G + IS     K VLG+RS        +P+SG S+  + S+ REK+        ++EM  LK   E +  E+
Subjt:  DRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEM

A0A6J1DUH3 uncharacterized protein LOC1110232123.2e-4440.45Show/hide
Query:  VDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWK---------------------ICFLVLL--IKE
        VDLSK  +NK++E+++  +F+++RS+LH +Y  FEDP  AR NPPER+T+  DW  LCDRWETPEWK                       FL L   +K 
Subjt:  VDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWK---------------------ICFLVLL--IKE

Query:  LHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELEQK
          G DIGPVDLF +SH+ +K G +ND A++ Y  M  ++KA ++EG E ++QP   ++VLG R  H++GLG+ P     K GSS +V+SS   EKELE+K
Subjt:  LHGRDIGPVDLFHDSHFTKKKGWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELEQK

Query:  VEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMNALKGREEMWEQRWAQIQHLFGEQSGGGSSN
        VE M+ EM  +K                     +E   LK     WE RW +I      + G G SN
Subjt:  VEMMQSEMDALKSREEMMQSEMDALKGREELRQSEMNALKGREEMWEQRWAQIQHLFGEQSGGGSSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGTCAAAGTCGAGAGCACGATGCATCGACAGTGTATGGAGAGGATGTGGATCCCGCAGTGGTTGGTTCCTCGACTGAGCCTTCATCTGCTCGAGCTTCAGGATC
TAACACATTTCAAGGAAGACGAAAGAGAGGGCATAGTCGAAACATCAAAATTGATCAATATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCA
AACCAACATGTAAGTTTGCTACACAATTCAGTGGGACGATTGGTAACCTCACACGAGACACAATTCCGTTGCATTACAAGGCTTGGCCTGAGGTCCCCCAACAAGTTCGA
GACAACGTAAAAGATCGACTCTTGACGAAATGGGATGTGGATTTGTCAAAGCCGCATATTAACAAGTATGTGGAACGTAAAATAATGGACACGTTTAGGGAGTTTAGGAG
TGAGTTGCATAGTCACTACAAGGGATTCGAGGACCCTAAAGTTGCTCGAGAAAATCCACCAGAAAGGATTACCGACCAGAACGATTGGAAGATGTTATGCGACAGATGGG
AGACTCCCGAATGGAAAATATGTTTTCTTGTTTTACTTATAAAAGAACTACATGGTCGTGACATTGGGCCAGTGGATTTGTTCCACGATAGTCATTTTACTAAGAAGAAG
GGATGGATCAACGACAGAGCAAAAGAAGTATACTTGGAAATGGATGGGATACTGAAAGCATCGTCAGAAGAAGGGTCCGAACAGATCTCGCAACCTAATGTTCTGAAACA
GGTTTTGGGAACTCGATCAGGCCACATCAGAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGTCTTCAAACCCTCGTGAAAAAGAGCTAGAGCAGA
AAGTTGAGATGATGCAATCTGAGATGGATGCACTCAAGAGCAGGGAGGAGATGATGCAATCTGAGATGGATGCACTCAAGGGCAGGGAGGAGTTGAGGCAATCTGAAATG
AATGCGCTCAAGGGCAGGGAGGAGATGTGGGAACAAAGATGGGCTCAAATCCAACATTTGTTTGGCGAACAATCGGGAGGAGGGTCTTCGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGTCAAAGTCGAGAGCACGATGCATCGACAGTGTATGGAGAGGATGTGGATCCCGCAGTGGTTGGTTCCTCGACTGAGCCTTCATCTGCTCGAGCTTCAGGATC
TAACACATTTCAAGGAAGACGAAAGAGAGGGCATAGTCGAAACATCAAAATTGATCAATATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCA
AACCAACATGTAAGTTTGCTACACAATTCAGTGGGACGATTGGTAACCTCACACGAGACACAATTCCGTTGCATTACAAGGCTTGGCCTGAGGTCCCCCAACAAGTTCGA
GACAACGTAAAAGATCGACTCTTGACGAAATGGGATGTGGATTTGTCAAAGCCGCATATTAACAAGTATGTGGAACGTAAAATAATGGACACGTTTAGGGAGTTTAGGAG
TGAGTTGCATAGTCACTACAAGGGATTCGAGGACCCTAAAGTTGCTCGAGAAAATCCACCAGAAAGGATTACCGACCAGAACGATTGGAAGATGTTATGCGACAGATGGG
AGACTCCCGAATGGAAAATATGTTTTCTTGTTTTACTTATAAAAGAACTACATGGTCGTGACATTGGGCCAGTGGATTTGTTCCACGATAGTCATTTTACTAAGAAGAAG
GGATGGATCAACGACAGAGCAAAAGAAGTATACTTGGAAATGGATGGGATACTGAAAGCATCGTCAGAAGAAGGGTCCGAACAGATCTCGCAACCTAATGTTCTGAAACA
GGTTTTGGGAACTCGATCAGGCCACATCAGAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGTCTTCAAACCCTCGTGAAAAAGAGCTAGAGCAGA
AAGTTGAGATGATGCAATCTGAGATGGATGCACTCAAGAGCAGGGAGGAGATGATGCAATCTGAGATGGATGCACTCAAGGGCAGGGAGGAGTTGAGGCAATCTGAAATG
AATGCGCTCAAGGGCAGGGAGGAGATGTGGGAACAAAGATGGGCTCAAATCCAACATTTGTTTGGCGAACAATCGGGAGGAGGGTCTTCGAACTGA
Protein sequenceShow/hide protein sequence
MLGQSREHDASTVYGEDVDPAVVGSSTEPSSARASGSNTFQGRRKRGHSRNIKIDQYVATHNRIPIHIDESVGKPTCKFATQFSGTIGNLTRDTIPLHYKAWPEVPQQVR
DNVKDRLLTKWDVDLSKPHINKYVERKIMDTFREFRSELHSHYKGFEDPKVARENPPERITDQNDWKMLCDRWETPEWKICFLVLLIKELHGRDIGPVDLFHDSHFTKKK
GWINDRAKEVYLEMDGILKASSEEGSEQISQPNVLKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALKSREEMMQSEMDALKGREELRQSEM
NALKGREEMWEQRWAQIQHLFGEQSGGGSSN