CuGenDBv2

Lag0024699 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024699
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CACTA en-spm transposon protein
Genome location: chr10:5061628..5063744
RNA-Seq Expression: Lag0024699
Synteny: Lag0024699
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant
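
The genome location above is given as a `chromosome:start..end` string. As a minimal sketch, the snippet below splits that string into its parts and computes the genomic span. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the usual GFF convention), which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr10:5061628..5063744")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 2,117 bp on chr10, which is longer than the 1,077-nt CDS listed in this record and so suggests the gene model contains intronic sequence.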


Homology
GenBank top hits
XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia] (e value: 2.2e-55; % identity: 46.27)
Query:  VDLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEEL
        VDLSK  ++NK++E+++  +F+++R  LH ++Y  FEDP  AR NPP+R+T+  DWN LCDRWETPEWK+   +NK +R++LP+NHR+G K F+Q Q EL
Subjt:  VDLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEEL

Query:  KELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELE
        K   G DIGPVDLF +SH+ EK G +ND A++ Y  M  ++KA +++G E ++QP+  ++VLG R  H++GLG+ P     K GSS +V+SS   EKELE
Subjt:  KELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELE

Query:  QKVEMMQSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSGGGSSN
        +KVE M+ EM  +       ++E   LK     WE RW +I      + G G SN
Subjt:  QKVEMMQSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSGGGSSN

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] (e value: 2.8e-79; % identity: 46.86)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLL---------------------------TKWDV
        ++ +DR+V  H RI I IDE VGKP    AT+FS  IGT+ R+TIPL  K+W +V ++VRD + D+LL                           + +D 
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLL---------------------------TKWDV

Query:  DLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELK
        D+ K H + KYV +R+ +TF+E+R  L+ +HY+ F+DPK AR  PPKRITD  DWN+LC+RWETPEWKKK + NK SRS++PY HR+G K FVQ Q E+K
Subjt:  DLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELK

Query:  ELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMM
           G D+  VDLF  SHF EK GW+N+ AK+ YLEM  +++AS ++    +S  +V KQVLG RSG+I+GLG +PK  SS SV+S    +KELE+K+E M
Subjt:  ELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMM

Query:  QSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSG-GGSSN
        + EM  + +  E  +   +AL S+  MWE RWA+IQ++ G   G  G SN
Subjt:  QSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSG-GGSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] (e value: 1.1e-83; % identity: 50.77)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL
        ++ +DR+V  H RI I IDE VGKP    AT+FS  IGT+ R+TIPL  K+W +V ++VRD + D+LL+ +D D+ K H + KYV +R+ +TF+E+R  L
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL

Query:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND
        + +HY+ F+DPK AR  PPKRITD  DWN+LC+RWETPEWKKK + NK SRS++PY HR+G K FVQ Q E+K   G D+  VDLF  SHF EK GW+N+
Subjt:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND

Query:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM
         AK+ YLEM  +++AS ++    +S  +V KQVLG RSG+I+GLG +PK  SS SV+S    +KELE+K+E M+ EM  + +  E  +   +AL S+  M
Subjt:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM

Query:  WEQRWAKIQHLFGEQSG-GGSSN
        WE RWA+IQ++ G   G  G SN
Subjt:  WEQRWAKIQHLFGEQSG-GGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] (e value: 3.9e-65; % identity: 42.29)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLL---------------------------TKWDV
        ++ +DR+V  H RI I IDE VGKP    AT+FS  IGT+ R+TIPL  K+W +V ++VRD + D+LL                           + +D 
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLL---------------------------TKWDV

Query:  DLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELK
        D+ K H + KYV +R+ +TF+E+R  L+ +HY+ F+DPK AR  PPKRITD  DWN+LC+RWETPEWKKK + NK SRS++PY HR+G K FVQ Q E+K
Subjt:  DLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELK

Query:  ELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMM
           G D+  VDLF  SHF EK GW+N+ AK+ YLEM  +++AS ++                           DP   SSV        +KELE+K+E M
Subjt:  ELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMM

Query:  QSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSG-GGSSN
        + EM  + +  E  +   +AL S+  MWE RWA+IQ++ G   G  G SN
Subjt:  QSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSG-GGSSN

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] (e value: 2.8e-79; % identity: 46.86)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLL---------------------------TKWDV
        ++ +DR+V  H RI I IDE VGKP    AT+FS  IGT+ R+TIPL  K+W +V ++VRD + D+LL                           + +D 
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLL---------------------------TKWDV

Query:  DLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELK
        D+ K H + KYV +R+ +TF+E+R  L+ +HY+ F+DPK AR  PPKRITD  DWN+LC+RWETPEWKKK + NK SRS++PY HR+G K FVQ Q E+K
Subjt:  DLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELK

Query:  ELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMM
           G D+  VDLF  SHF EK GW+N+ AK+ YLEM  +++AS ++    +S  +V KQVLG RSG+I+GLG +PK  SS SV+S    +KELE+K+E M
Subjt:  ELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMM

Query:  QSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSG-GGSSN
        + EM  + +  E  +   +AL S+  MWE RWA+IQ++ G   G  G SN
Subjt:  QSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSG-GGSSN

TrEMBL top hits
A0A5A7SPZ3 Transposase (e value: 6.6e-50; % identity: 39.43)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL
        +I +D++V  H ++ I I+E  GKP   FA + +  IGT  R+TI L  + W  +P  V++ + DR  T ++ D     I+ KY++ ++ + FREFR  L
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL

Query:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND
        H ++Y  F+D   AR NPP +IT + DWNM+CDRWET  WKKK + NK SRS + +NH  G K F+Q + EL++  G D+  V++F ++HF EK+GWIND
Subjt:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND

Query:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM
        KAK+ Y     I+  S+E G + IS     K VLG+ S  I  L    KSG S+  + S+ REKE        ++EM  L    E    E+         
Subjt:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM

Query:  WEQRWAKIQHLFGEQSG
        WEQRW  I+   G + G
Subjt:  WEQRWAKIQHLFGEQSG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class (e value: 1.2e-43; % identity: 36.83)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL
        +I +D++V  H +I I I+E  GKP   FA + +  IGT  R+TIPL  + W  VP  VR+ + DRL T ++ D +   ++ KY+E ++ + FREFR  L
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL

Query:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND
        H ++Y  F+D   AR NPP RIT   DWNM+CDRWET  WKKK                 GC                D+  +++FH++HF EK+GWIND
Subjt:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND

Query:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM
        KAK+ YLEM  I+  S+E G + IS     + VLG+RS        +P+SG S+  + S+ REKE                      ++EM  LK  +E 
Subjt:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM

Query:  WEQRWAKIQHLFGEQ
             AK +  +G Q
Subjt:  WEQRWAKIQHLFGEQ

A0A5A7TRX4 DUF4216 domain-containing protein (e value: 1.5e-54; % identity: 40)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL
        +I +D++V  H +I I I+E  GKP   FA + +  IGT  R+TIPL  + W  VP  VR+ + D L T ++ D +   ++ KY+E ++ +TFREFR  L
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL

Query:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND
        H ++Y  F+D   AR NP  RIT + DWNM+CDRWET  WKKK + NK S S + +NH +G K F+Q + ELK+  G+D+  +++FH++HF EK+GW ND
Subjt:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND

Query:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM
        KAK+ YLEM  I++ S+E G + IS     + VLG+RS        +P+SG S+  + S+ REKE                      ++EM  LK  +E 
Subjt:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM

Query:  WEQRWAKIQHLFGEQ
             AK +  +G Q
Subjt:  WEQRWAKIQHLFGEQ

A0A5A7US78 Uncharacterized protein (e value: 2.1e-43; % identity: 36.19)
Query:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL
        +I +D++V  H +I I I+E  GKP   F  + +  IGT  R+TIPL  + W  VP  VR+ + D L T ++ D +   ++ KY+E ++ +TFREFR  L
Subjt:  DIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVDLSKPHIINKYVERRIMDTFREFRIKL

Query:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND
        H ++Y  F+D   AR NPP RIT + DWNM+CDRWET  WKKK               + GC                D+  +++FH++HF +K+GWIND
Subjt:  HIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVDLFHDSHFTEKKGWIND

Query:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM
        KAK+ YLEM  I+  S+E G + IS     K VLG+RS        +P+SG S+  + S+ REK+                      ++EM  LK  +E 
Subjt:  KAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIALKSKDEM

Query:  WEQRWAKIQHLFGEQ
             AK +  +G Q
Subjt:  WEQRWAKIQHLFGEQ

A0A6J1DUH3 uncharacterized protein LOC111023212 (e value: 1.0e-55; % identity: 46.27)
Query:  VDLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEEL
        VDLSK  ++NK++E+++  +F+++R  LH ++Y  FEDP  AR NPP+R+T+  DWN LCDRWETPEWK+   +NK +R++LP+NHR+G K F+Q Q EL
Subjt:  VDLSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEEL

Query:  KELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELE
        K   G DIGPVDLF +SH+ EK G +ND A++ Y  M  ++KA +++G E ++QP+  ++VLG R  H++GLG+ P     K GSS +V+SS   EKELE
Subjt:  KELHGHDIGPVDLFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDP-----KSGSSVSVSSSNPREKELE

Query:  QKVEMMQSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSGGGSSN
        +KVE M+ EM  +       ++E   LK     WE RW +I      + G G SN
Subjt:  QKVEMMQSEMDALTSREEVRQSEMIALKSKDEMWEQRWAKIQHLFGEQSGGGSSN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTCTATGCCGACGTAAATTCGTCGGCGTACATGAGTTCAACAGCTGCTCACTTCCGTCAATCTCTCTTCCCTTCACCAATCATCACGCTCGCCATCCCTATCCCAGA
TATCAGAATTGATCGGTATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCAAACCAACGTATAAGTTTGCTACACAATTCAGTGGGACGATTG
GTACCCTCACACGAGATACAATTCCATTGCATTACAAGGAGTGGCCTGAGGTCCCCCAACAAGTCCGAGACAACATAAAAGATCGACTCTTGACGAAATGGGATGTGGAT
TTGTCGAAGCCGCATATTATTAACAAGTATGTGGAACGTAGAATAATGGACACGTTTAGGGAGTTTAGGATCAAGTTGCATATTAGGCACTACAAGGGATTCGAGGACCC
TAAAGTTGCTCGAGAAAATCCACCAAAAAGGATTACCGACCAGAACGATTGGAATATGCTTTGCGACAGATGGGAGACTCCCGAATGGAAAAAAAAAGCGGATCAAAATA
AGAATAGTCGCTCACAACTCCCCTACAACCATCGAAGTGGATGTAAGTATTTTGTTCAGAAGCAAGAAGAACTGAAAGAACTACATGGTCATGACATTGGGCCAGTGGAT
TTGTTCCACGATAGTCATTTTACTGAGAAAAAGGGATGGATCAACGACAAAGCAAAAGAAGTATACTTGGAAATGGATGGGATACTAAAAGCATCGTCTGAACAAGGGTC
TGAACAGATCTCGCAACCTGATGTTATGAAACAGGTTTTGGGAACTCGATCAGGCCACATCAGAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGT
CTTCAAATCCTCGTGAAAAAGAGCTAGAGCAGAAAGTCGAGATGATGCAATCTGAGATGGATGCACTCACGAGCAGGGAGGAGGTGAGGCAATCTGAAATGATTGCGCTC
AAGAGCAAGGATGAGATGTGGGAACAAAGATGGGCGAAAATCCAACATTTGTTTGGCGAACAATCGGGAGGAGGGTCTTCAAACTGA
mRNA sequence
ATGGTCTATGCCGACGTAAATTCGTCGGCGTACATGAGTTCAACAGCTGCTCACTTCCGTCAATCTCTCTTCCCTTCACCAATCATCACGCTCGCCATCCCTATCCCAGA
TATCAGAATTGATCGGTATGTGGCTACACACAATAGGATACCCATTCACATTGATGAGTCCGTCGGCAAACCAACGTATAAGTTTGCTACACAATTCAGTGGGACGATTG
GTACCCTCACACGAGATACAATTCCATTGCATTACAAGGAGTGGCCTGAGGTCCCCCAACAAGTCCGAGACAACATAAAAGATCGACTCTTGACGAAATGGGATGTGGAT
TTGTCGAAGCCGCATATTATTAACAAGTATGTGGAACGTAGAATAATGGACACGTTTAGGGAGTTTAGGATCAAGTTGCATATTAGGCACTACAAGGGATTCGAGGACCC
TAAAGTTGCTCGAGAAAATCCACCAAAAAGGATTACCGACCAGAACGATTGGAATATGCTTTGCGACAGATGGGAGACTCCCGAATGGAAAAAAAAAGCGGATCAAAATA
AGAATAGTCGCTCACAACTCCCCTACAACCATCGAAGTGGATGTAAGTATTTTGTTCAGAAGCAAGAAGAACTGAAAGAACTACATGGTCATGACATTGGGCCAGTGGAT
TTGTTCCACGATAGTCATTTTACTGAGAAAAAGGGATGGATCAACGACAAAGCAAAAGAAGTATACTTGGAAATGGATGGGATACTAAAAGCATCGTCTGAACAAGGGTC
TGAACAGATCTCGCAACCTGATGTTATGAAACAGGTTTTGGGAACTCGATCAGGCCACATCAGAGGTCTTGGTTGGGATCCAAAATCTGGCTCATCTGTCAGTGTCTCGT
CTTCAAATCCTCGTGAAAAAGAGCTAGAGCAGAAAGTCGAGATGATGCAATCTGAGATGGATGCACTCACGAGCAGGGAGGAGGTGAGGCAATCTGAAATGATTGCGCTC
AAGAGCAAGGATGAGATGTGGGAACAAAGATGGGCGAAAATCCAACATTTGTTTGGCGAACAATCGGGAGGAGGGTCTTCAAACTGA
Protein sequence
MVYADVNSSAYMSSTAAHFRQSLFPSPIITLAIPIPDIRIDRYVATHNRIPIHIDESVGKPTYKFATQFSGTIGTLTRDTIPLHYKEWPEVPQQVRDNIKDRLLTKWDVD
LSKPHIINKYVERRIMDTFREFRIKLHIRHYKGFEDPKVARENPPKRITDQNDWNMLCDRWETPEWKKKADQNKNSRSQLPYNHRSGCKYFVQKQEELKELHGHDIGPVD
LFHDSHFTEKKGWINDKAKEVYLEMDGILKASSEQGSEQISQPDVMKQVLGTRSGHIRGLGWDPKSGSSVSVSSSNPREKELEQKVEMMQSEMDALTSREEVRQSEMIAL
KSKDEMWEQRWAKIQHLFGEQSGGGSSN
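
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code. The `translate` helper and the `CODON_TABLE` construction are illustrative and not part of the database record. Only the first 24 and last 12 nucleotides of the CDS are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from pasted text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGTCTATGCCGACGTAAATTCG"  # first 24 nt of the CDS above
cds_end = "TCTTCAAACTGA"                # last 12 nt of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVYADVNS` and `SSN*`, matching the start and end of the listed protein. A codon containing an ambiguous base such as `N` would raise a `KeyError` in this sketch.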