; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G010030 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G010030
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionAWPM-19-like family protein
Genome locationGy14Chr2:9767928..9773086
RNA-Seq ExpressionCsGy2G010030
SyntenyCsGy2G010030
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008390 - AWPM-19-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147191.1 membrane protein PM19L [Cucumis sativus]1.27e-126100Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

XP_008460698.1 PREDICTED: uncharacterized protein LOC103499466 [Cucumis melo]7.37e-12699.44Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

XP_022959288.1 uncharacterized protein LOC111460317 [Cucurbita moschata]9.66e-12295Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMY ILLGFASWC+NR+INGTTYHPSMGGNGATPFFLTFAMLTAV+G+ASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQI+IGGHRGWRLRVVEAFIIILT TQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTA PGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

XP_023549469.1 membrane protein PM19L [Cucurbita pepo subsp. pepo]3.93e-12194.44Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMY ILLGFASWC+NR+INGTTYHPSMGGNGATPFFLTFAMLTAV+G+ASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQI+IGGHRGWRLRVVEAFIIILT TQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGG+A PGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

XP_038874443.1 membrane protein PM19L [Benincasa hispida]1.22e-12497.22Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLF+NLIMYLIL+GFASWCLNR+INGTTYHPSMGGNGATPFFLTFAMLTAVLG+ASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

TrEMBL top hitse value%identityAlignment
A0A0A0LI45 Uncharacterized protein6.17e-127100Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

A0A1S3CDH8 uncharacterized protein LOC1034994663.57e-12699.44Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

A0A6J1CDE2 uncharacterized protein LOC1110097053.67e-11290Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMY IL+GFASWCLNR+INGTTYHPSMGGNGATPFFLTFAMLTAVLG+ASKLAGLYHIRAWRSDSLAGAGS S++TWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACK+I+IGGHRGWRLRVVEAFIIIL  TQLLY+LLLHAGIFSR YGPGY +TDYGMGGGTAA GE PKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

A0A6J1H5I7 uncharacterized protein LOC1114603174.68e-12295Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMY ILLGFASWC+NR+INGTTYHPSMGGNGATPFFLTFAMLTAV+G+ASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQI+IGGHRGWRLRVVEAFIIILT TQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTA PGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

A0A6J1KYX7 uncharacterized protein LOC1114988588.40e-11893.33Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MAQTMGRNMAAPLLFLNLIMY ILLGFASWC+NR+INGTTYHPSMGG+  TPFFLTFAMLTAV+G+ASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV
        AFGLACKQI+IGGHRGWRLRVVEAFIIILT TQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTA PGEPPKGT GTRVV
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV

SwissProt top hitse value%identityAlignment
Q6L4D2 Membrane protein PM19L7.1e-5061.54Show/hide
Query:  MGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVLAFGL
        +GR M APLL LNLIMYLI++GFASW LN +ING T HP + GNGAT +FL FA+L  V+G ASKLAG++H+R+W + SLA   +++L+ WA+T LAFGL
Subjt:  MGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVLAFGL

Query:  ACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRY--GPGYWDTDY
        ACK+I+IGG+RGWRLRV+EAF+IIL FTQLLY+ +LH G+FS  +  G G +  DY
Subjt:  ACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRY--GPGYWDTDY

Arabidopsis top hitse value%identityAlignment
AT1G04560.1 AWPM-19-like family protein8.6e-6773.91Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL
        MA T+GRN+AAPLLFLNL+MYLI+LGFASWCLN++ING T HPS GGNGATPFFLTF++L AV+G+ASKLAG  HIR WR+DSLA AG++S++ WA+T L
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVL

Query:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMG
        A GLACKQINIGG RGWRL+++EAFIIILTFTQLLYL+L+HAG  S +YGPGY D DY  G
Subjt:  AFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMG

AT1G29520.1 AWPM-19-like family protein2.6e-2341.56Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFI-NGTTYHPSMG------------GNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGA
        MA    + +A+ LL LN  MY+I+LG   W +NR I +G    P++             GN AT FF+ FA+L  V+G AS ++GL HIR+W   SL  A
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFI-NGTTYHPSMG------------GNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGA

Query:  GSTSLLTWAVTVLAFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLH
         + + + W +TVLA G A K+I + G R  +LR +EAF+IIL+ TQL+Y+  +H
Subjt:  GSTSLLTWAVTVLAFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLH

AT5G18970.1 AWPM-19-like family protein1.0e-1937.58Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSM-------------GGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGA
        MA    ++ A  LL LNL +Y ++   ASW +N  I  T    S               GN AT FF+ F ++  V+G+A+ L G+ ++  W S +L  A
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSM-------------GGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGA

Query:  GSTSLLTWAVTVLAFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGI
         ++SL++W++T+LA GLACK+INIG      LR +E   II++ TQLL    +HAG+
Subjt:  GSTSLLTWAVTVLAFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGI

AT5G46530.1 AWPM-19-like family protein2.2e-2240Show/hide
Query:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFIN-----GTTY---------HPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAG
        M +   + +A+ LL LN  MY I+LG  +W +N+ IN     G  Y         H  M GN AT FF+ FA++  V G AS ++G+ H+++W + SL  
Subjt:  MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFIN-----GTTY---------HPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAG

Query:  AGSTSLLTWAVTVLAFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLH
        A S + + W++T+LA G  CK+I + G R  RLR +EAF+IIL+ TQLLY+  ++
Subjt:  AGSTSLLTWAVTVLAFGLACKQINIGGHRGWRLRVVEAFIIILTFTQLLYLLLLH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCAAACAATGGGAAGGAACATGGCAGCTCCATTGTTGTTTCTTAACTTGATTATGTATTTGATTCTCTTGGGTTTTGCTAGTTGGTGTCTTAATAGATTCATCAA
TGGCACTACCTACCATCCAAGTATGGGAGGCAACGGAGCGACGCCGTTTTTCCTGACATTTGCAATGCTGACGGCTGTTCTTGGAATAGCATCAAAACTGGCCGGACTCT
ACCATATCAGAGCATGGAGGAGCGACAGCCTCGCCGGTGCAGGATCAACTTCCTTGCTCACTTGGGCTGTAACTGTTCTAGCTTTTGGGTTGGCCTGTAAGCAAATCAAC
ATAGGAGGCCACAGAGGGTGGAGGCTGAGAGTGGTGGAAGCTTTTATAATTATCTTGACTTTCACTCAGCTTCTGTATCTGCTTTTGCTCCATGCTGGAATCTTCAGCCG
CCGCTATGGCCCTGGTTATTGGGACACCGACTATGGCATGGGCGGAGGAACAGCAGCTCCCGGCGAGCCTCCCAAAGGAACAACCGGAACTAGGGTGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAAAAAAAGGATAGTAGTTGTAGAATGAATATTTTTATTACACTTTCCCAACATAAAAATAATTCTAATCATAAACCTTACAAGTAGTTCTGTTGTTGGGTAAAAA
GAAAAAAATAACTCAGCTCCCACAGTAATGCATAATACTTATTCCCATTAAAACAAGCAGCAGCTAGAAACAGACATAATTCTCACATTCCTTCATTCATTCTTCTGTTG
GCCATATCAAAGAAACACAAAACAAAAGCACTTTTTGATTGAGACCTTACTGGATCAGAAACCTGCACCGTTTGATCAACATAAGAGAAGGTTATTGACAAGTGTCCCAA
CACGTGTACCTATGAGAAAGCATGCAAGACATTTTGTTCACACGTGTATGTAAGGGACTAAGCTTGGTTCTACTATATATGTGTATTTAAATACAAACTTTGGTTGTTAC
AGGGAATGTATTAAAAGCGCTTTGGTTTGCCTTATCTGCTTTTCTTTGTTGTGGATTTGTGATAATATCTATCCATCTTCATCTTTGAAAGTGTTTTTTGTTGTGGTTAG
AGAGAAAAGATGGCTCAAACAATGGGAAGGAACATGGCAGCTCCATTGTTGTTTCTTAACTTGATTATGTATTTGATTCTCTTGGGTTTTGCTAGTTGGTGTCTTAATAG
ATTCATCAATGGCACTACCTACCATCCAAGTATGGGAGGCAACGGAGCGACGCCGTTTTTCCTGACATTTGCAATGCTGACGGCTGTTCTTGGAATAGCATCAAAACTGG
CCGGACTCTACCATATCAGAGCATGGAGGAGCGACAGCCTCGCCGGTGCAGGATCAACTTCCTTGCTCACTTGGGCTGTAACTGTTCTAGCTTTTGGGTTGGCCTGTAAG
CAAATCAACATAGGAGGCCACAGAGGGTGGAGGCTGAGAGTGGTGGAAGCTTTTATAATTATCTTGACTTTCACTCAGCTTCTGTATCTGCTTTTGCTCCATGCTGGAAT
CTTCAGCCGCCGCTATGGCCCTGGTTATTGGGACACCGACTATGGCATGGGCGGAGGAACAGCAGCTCCCGGCGAGCCTCCCAAAGGAACAACCGGAACTAGGGTGGTTT
AGTAAGGATGGTTGTGTATGATATCAGTTGGGTTTGTTTTTTTCTTGTTTTGTTTTTATGTATAACATAGTTTGTGGTTCTGTCGATCTCTTTCAAAACCATCAATGTAG
GAATGTTTAATAAGCCATTGGGAACTTTCACATAGAAACAAGGATGTGCGGTTTCATATTTTTGTTATGCTTAGACAAATTCATTGTGTTTAGATGTGAAAAGGGCCGAT
CCACTAAACAAAACATATGAAAACAACTAGTAATGTAATGAACCAACTGCTATTGTCATAACAAAAAGACAAATGTACTGAATACTTTAAATGAGAAGAGAAGCCATGGT
AATTAATAAAATACAAATCGCCATTAAACACAATTAACATCTGGAGCTAAATGCATCAGACCATGGGTGACAACCAACATGAAGCATTTGATTTGTAAAGAATAACTTGA
ACAGATTGCATTAATTTGATCTCAACGTTTGTATCACGTTATGAATCTGATTAGTTTATGGACTAAATTCTTCTCGTCGGGCAACTGCTGAATTGCTTACTATAGTCAAA
ACAAACAAGGGTAGAAAACTGATTTGACTGTTGAATTTGAATCCATACTTTGATGTTAAAACAACTTCCCTAAACATACATGAGATGAAGTTATGAAGGGTGTGAATAGA
TTAACCAATTTAAATTCAAACCTCAAAACTGGGGGAGAAACTGCGAAGGTTACACTCACTTCTTTCCACTTCATGCGCTGCTTCCATTAATGAAACCATCCTTGCGACCT
TCGAAGCCAGGGCGCATTAACTTTTTCTCCCTCTTTTCAATCCGTTTCATCTTCTTGTCATGGATTTTCTGAGCTATGTTTTCAGATCTCTTTTGTTGTCTTTCTGCTTT
CATCTTCTGGGTGGTTTCAATTCTACCTTTCCATTTCTCGACACTCTTCAGCTGTTGTTTCTTTTCCTTCCGTAAACTCTTCATCAGACGCTGTGGATCATCATGAACTT
TAAACCCAGCAGCTCTACTTGTTGCTGCTTGCCAAGAATGCTTCTTCAATACTATCTCGCCCTTTCCGGGGTCTTTCTTTGCTTCTTCCAACTTTTTTGCCTTTTCAAGT
TCCTTCAACTTGGAGAGTTTTTTTTTCTTATTCTTACCTTGCTCTTCTTCAGTTCCCAGTTTAACATGACCAAACGCAAGTTCTTTGGAGGCCTCAACTACATTCCTCTC
CATTTCAATTTCTGAGGGCATGGTGACTGATTTCTTCTCGTCAGAGTCATTTTCTCTCTTGCGTTTCTTCTGGATAGCTTCCCTTCTCTCGTTTCTTTCATTTCTCTTCT
TTTCCCTATTTGAACATCCTGTATTTCTATTCACCCGGAACTCCTCAATTTTTCGATGAAGGCGTTGTCTCAGTTCTTCATATGTCACTGACTGGTCATCACTGTCCCAC
CCAGTTGCCACTGGCTTAACATCATCTCCATCATCGTCATTTTTACTTTTCAACTTTTCATTTTCCAAGCTTTCTTTAAGCAAATGAACAGTTGATTTAGAAGACTTTTC
TGGATCCATACGATCTCTTCGTGCTTTCTTAAGGTTTTCCTTTGATTCCTTCTTAGCCATTGCCTTTTCTTTCTTGCTAAGGCCCTGAAACCAAGGTTTTCCTTTTTCAT
CACTAGATAAATAGAACCTTACCGGGATAAGCTCCACTAACTTATCAAAGAACAACGCATGGTTATGGATAATGGATTTCAGATCTACGCCTGAATTAGAGACTGCAGCG
GTCTTCTGCTTTTTCTTCTTCACCTGAGACAATTTAATAAATAAGAAAGCAGGAAACGATAAATAAAAAGAAAAACAAAGAGTATCTAAAAACAATTATAATTCACTTTC
ATATCCCTTCATTTGTATAAAAACACAACTAGAGAAATGCAATTTTTTCTTTTCTTGAGAACAAAAGTTAAAACAAAGTTGATTTCCACTCGCTTTTCTTTTTCTTATAG
AAAACATACTCATTACTCACAAATCACAATCTTCGAACAGAAGATTCTGCACATTCCTCCAAATCCATGAAGTTCCAAGCTTTTGCGATTAATCAACAAGCACTTAATGA
ATTTATCTATTCCAAAAAGAGAGACTTTCCATCCTGGTTCTAGACAAGGGAGTAATCCCTTTGGTTTGGGTTTAGTTTTGATACTGAACTGATGATCTAGAAAAGAAAAA
CGGAAGAAGTGATGGAGAGAACCAAAGTGCTCCGATCAAGAGAAGATGCTTGCCTGCTGACCTACTATGTCTATTGTTTCTTCTGTCAAAACCACCGCCCCTTGGGAGGC
CCCATTGTAGGGTGTCATTAACGATAGGTGTTTAAATCACGAACAGAAACTAGCAAAACTCAACCAGTCTTATGATCCCAAGTTGAAATAAAAGACGAACAGCTGAATGG
AAAGTGGAAACGATCAGGCGAGGTGTGAAATGAAATACATGAGCTTCTCTAATACTTCTTCAACAACTTCGAGAGAGAGCTGATCGTTGGTCTACGGGAGAGAGGAAGGG
GAAAGTACAGGCAGTTTGCCATTTTGAATAAAAGATGCTAACTTGTATTGGAAAAGCACAGGCACTTTGCCATTTTGAAGAAAAGATGCTAACTTACTTTGGGTTGCTTA
TGTAACTCACTAAGATCGAGTTCGAGGAAAGGTCTCAGTAGATGAGAGCAACAATGTCTCATTTGAGCAAGCAATTTACCACTTCCTCACCATAGAAACACTAGAAACTA
AAATGAGCACCCCAACTTCAAGCTAGAGTAAGAATGAATCCGAAATACGCACAAAATTAATATCTTCAACGGGCTATTGCCAGTTACTCCCGCAGCCGTGTGAAAAGAAA
AAAAATCAAGATTTATTGAATTATAGAAGAATCACAATCTATTTAAGACGTATACCAAACAACAACATTTCTTAAACTGAGTAAGAAATTGAGCATTGTATGTCTTTCCA
ATGATTATGAACTGCAATCTGAGAGATACCAAAATTCAAATAAATCAAACGCCTTAACACTACTGAATTAAACTACTTGTATCGTAGGAATATAACAAGGCGCCAATAAA
TTCGACAAGACCGAAAGAGTTGAGAAGGCGGCTTTATGCCATTAAAGAGAAAGGAAGAGTAGATTATACCTTCACTTGGGAAGAAGAAGGGACTACCGGTTCAGTATCCG
GCGCAGTCATCTTCAACCTCGTTTTGGACTTTGATGGAAACAACCCAACTCCACAGCACGAAAATTAGGGCAGAAAATTAAAAAGAGGATGGGTAAATTGCTAAAAGTAC
GAAATATTTCG
Protein sequenceShow/hide protein sequence
MAQTMGRNMAAPLLFLNLIMYLILLGFASWCLNRFINGTTYHPSMGGNGATPFFLTFAMLTAVLGIASKLAGLYHIRAWRSDSLAGAGSTSLLTWAVTVLAFGLACKQIN
IGGHRGWRLRVVEAFIIILTFTQLLYLLLLHAGIFSRRYGPGYWDTDYGMGGGTAAPGEPPKGTTGTRVV