CuGenDBv2

Sgr027749 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027749
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF620)
Genome location: tig00153055:2244607..2246379
RNA-Seq Expression: Sgr027749
Synteny: Sgr027749
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620
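
The genome location above follows a `contig:start..end` pattern. The following is a minimal Python sketch for splitting such a string into its parts. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_location(location: str):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

contig, start, end = parse_location("tig00153055:2244607..2246379")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene span covers 1773 bp of the contig.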


Homology
GenBank top hits (e value, % identity, alignment)
XP_008452621.1 PREDICTED: uncharacterized protein LOC103493588 [Cucumis melo] | e value: 3.9e-240 | % identity: 90.34
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS
        MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMS LLRRRK+HQM+QP+L+IERSGSLRP EALSPLKEGPDGNE R+SKEGKWGNWMRGQLCRAPS
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS

Query:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
        AVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
Subjt:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD

Query:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
        AESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
Subjt:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK

Query:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
        ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQN+G DAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
Subjt:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA

Query:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        FNVPGLSIDCFIPPAELRYASMSEACDF RG G KN MAAAAYRAKVAALEK+HD+KVKVIWKPDI
Subjt:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

XP_022139891.1 uncharacterized protein LOC111010692 [Momordica charantia] | e value: 1.8e-245 | % identity: 92.27
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS
        MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMS LLRRRKSHQM  PDLLIERSGSLRPAEALSPLKEGPDGN+GRDSKEGKWGNWMRGQLCRAPS
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS

Query:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
        AVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
Subjt:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD

Query:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
        AESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
Subjt:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK

Query:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
        ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
Subjt:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA

Query:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        FNVPGLS+DCFIPPAELRYASMSEACDFPRGHGFKNAMAAAA+RAKVAALEK+HDSKVKVIWKPDI
Subjt:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

XP_022940354.1 uncharacterized protein LOC111445995 isoform X1 [Cucurbita moschata] | e value: 4.1e-242 | % identity: 91.43
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP
        MERKQGFFSALKGEIVRGLSPGRSR KSPARSASPMSSLLRRRKSHQM  QP+LLIERSGSLRPAEALSPLKEGPD N+GR+SKEGKWGNWMRGQLCRAP
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP

Query:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
        SAVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
Subjt:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK

Query:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
        DAESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
Subjt:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL

Query:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
        KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
Subjt:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV

Query:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        AFNVPGLSIDCFIPPAELRYASMSEACD PRGHGFKNAMAAAA+RAKVAALEK+HD KVKVIWKPDI
Subjt:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

XP_022982239.1 uncharacterized protein LOC111481127 [Cucurbita maxima] | e value: 1.2e-241 | % identity: 91.22
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP
        MERKQGFFSALKGEIVRGLSPGRSR KSPARSASPMSSLLRRRKSHQM  QP+LLIERSGSLRPAEALSPLKEGPD N+GR+SKEGKWGNWMRGQLCRAP
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP

Query:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
        SAVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
Subjt:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK

Query:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
        DAESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
Subjt:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL

Query:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
        KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
Subjt:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV

Query:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        AFNVPGLSIDCFIPPAELRYASMSEACD PRGHGFKNAMAAA +RAKVAALEK+HD KVKVIWKPDI
Subjt:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

XP_038899776.1 uncharacterized protein LOC120087006 [Benincasa hispida] | e value: 2.4e-242 | % identity: 90.99
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS
        MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMS LLRRRK+HQM+QPDL+IERSGSLRP EALSPLKEGPDGNE R+SKEGKWGNWMRGQLCRAPS
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS

Query:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
        AVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
Subjt:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD

Query:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
        AESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVN EDCFILKLCTDPSTLK
Subjt:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK

Query:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
        ARSEGPAEIIRHTMFGYFSQKSGLL+HLEDSHLTRIQNNG DAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
Subjt:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA

Query:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        FNVPGLSIDCFIPPAELRYASMSEACDF RGHGFK+AMAAAAYRAKVAALEK+HDSKVKVIWKPDI
Subjt:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BU81 uncharacterized protein LOC103493588 | e value: 1.9e-240 | % identity: 90.34
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS
        MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMS LLRRRK+HQM+QP+L+IERSGSLRP EALSPLKEGPDGNE R+SKEGKWGNWMRGQLCRAPS
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS

Query:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
        AVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
Subjt:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD

Query:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
        AESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
Subjt:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK

Query:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
        ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQN+G DAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
Subjt:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA

Query:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        FNVPGLSIDCFIPPAELRYASMSEACDF RG G KN MAAAAYRAKVAALEK+HD+KVKVIWKPDI
Subjt:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

A0A5A7VFX6 Uncharacterized protein | e value: 1.9e-240 | % identity: 90.34
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS
        MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMS LLRRRK+HQM+QP+L+IERSGSLRP EALSPLKEGPDGNE R+SKEGKWGNWMRGQLCRAPS
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS

Query:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
        AVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
Subjt:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD

Query:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
        AESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
Subjt:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK

Query:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
        ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQN+G DAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
Subjt:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA

Query:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        FNVPGLSIDCFIPPAELRYASMSEACDF RG G KN MAAAAYRAKVAALEK+HD+KVKVIWKPDI
Subjt:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

A0A6J1CE19 uncharacterized protein LOC111010692 | e value: 8.7e-246 | % identity: 92.27
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS
        MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMS LLRRRKSHQM  PDLLIERSGSLRPAEALSPLKEGPDGN+GRDSKEGKWGNWMRGQLCRAPS
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPS

Query:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
        AVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD
Subjt:  AVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKD

Query:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
        AESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK
Subjt:  AESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLK

Query:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
        ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA
Subjt:  ARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVA

Query:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        FNVPGLS+DCFIPPAELRYASMSEACDFPRGHGFKNAMAAAA+RAKVAALEK+HDSKVKVIWKPDI
Subjt:  FNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

A0A6J1FI84 uncharacterized protein LOC111445995 isoform X1 | e value: 2.0e-242 | % identity: 91.43
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP
        MERKQGFFSALKGEIVRGLSPGRSR KSPARSASPMSSLLRRRKSHQM  QP+LLIERSGSLRPAEALSPLKEGPD N+GR+SKEGKWGNWMRGQLCRAP
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP

Query:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
        SAVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
Subjt:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK

Query:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
        DAESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
Subjt:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL

Query:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
        KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
Subjt:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV

Query:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        AFNVPGLSIDCFIPPAELRYASMSEACD PRGHGFKNAMAAAA+RAKVAALEK+HD KVKVIWKPDI
Subjt:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

A0A6J1J4C8 uncharacterized protein LOC111481127 | e value: 5.8e-242 | % identity: 91.22
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP
        MERKQGFFSALKGEIVRGLSPGRSR KSPARSASPMSSLLRRRKSHQM  QP+LLIERSGSLRPAEALSPLKEGPD N+GR+SKEGKWGNWMRGQLCRAP
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQM-AQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAP

Query:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
        SAVSCSAQKRSDLRLLLG                           E SSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK
Subjt:  SAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSK

Query:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
        DAESGGFVLWQM+PDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL
Subjt:  DAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTL

Query:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
        KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV
Subjt:  KARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEV

Query:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
        AFNVPGLSIDCFIPPAELRYASMSEACD PRGHGFKNAMAAA +RAKVAALEK+HD KVKVIWKPDI
Subjt:  AFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G27690.1 Protein of unknown function (DUF620) | e value: 5.0e-145 | % identity: 60.84
Query:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPD------GNEGRDSK-EGKWGNWMRG
        M++K G FS             R R+KSP RS SP+  ++RRRK   + QPD           +E L+P+ EGPD      G+ G  S+ E +W NWM+ 
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPD------GNEGRDSK-EGKWGNWMRG

Query:  QLCRAPSAVSCSAQ-KRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVV
        QL  AP +VS S+  KR+DLRLLLG                           ETSSAQYILQQYTAASGGQKL +SV N Y MG+++ +A EFET +K  
Subjt:  QLCRAPSAVSCSAQ-KRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVV

Query:  RTR-NSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILK
        +++ NSSK  ESGGFVLW M PDMWY+EL LGGSKV AGC+GKLVWRHTPWLG HAAKGPVRPLRRALQGLDP+TTA MFANARCIGEK ++GEDCFILK
Subjt:  RTR-NSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILK

Query:  LCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSH-TKTRM
        LC DP+TLKARSEG +E IRHT+FGYFSQK+GLL+HLEDS LTRIQNNGG+AVYWETTINS+L+DY+PVEGIMIAHSGRSV TL RFG+ +  H TKT M
Subjt:  LCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSH-TKTRM

Query:  EEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDFPRG------HGFKN
        +EAW I+E++FNVPGLSIDCFIPP+ELR+ S  E  D  +G      HG KN
Subjt:  EEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDFPRG------HGFKN

AT1G49840.1 Protein of unknown function (DUF620) | e value: 8.8e-158 | % identity: 61.07
Query:  MERKQGFFSALKGEIVRGLSPGRSRAK----SPARSASP-MSSLLRRRK--------------------SHQMAQPDLLIERSGSLRPAEALSPLKEGPD
        ME+KQGFFS+L+ E+VRGLSP RSR +    SP+RS +P M +LL  RK                     + ++QP+  I RS SLR      P+ EGPD
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAK----SPARSASP-MSSLLRRRK--------------------SHQMAQPDLLIERSGSLRPAEALSPLKEGPD

Query:  GNEGR----DSKEGKWG--NWMRGQLCRAPSAVSCS-AQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNS
         + G     DSK    G  +W++GQ  RAPS  S + A ++SDLRLLLG                           ETSSAQYILQQYTAA GG KL N+
Subjt:  GNEGR----DSKEGKWG--NWMRGQLCRAPSAVSCS-AQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNS

Query:  VSNAYAMGKVKMIACEFETANKVVRTRNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTA
        + NAYAMGK+KMI  E ET    VR RNS+K +E+GGFVLWQM PDMWYVEL++GGSKV AGCNGKLVWRHTPWLG+H AKGPVRPLRRALQGLDP+TTA
Subjt:  VSNAYAMGKVKMIACEFETANKVVRTRNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTA

Query:  SMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHS
        +MFA ++C+GE+ VNGEDCFILKLCTDP TL+ARSEGPAEI+RH +FGYFSQ++GLL  +EDS LTRIQ+N GDAVYWETTINS LDDY+ VEGIMIAHS
Subjt:  SMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHS

Query:  GRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDF----PRGHGFKNAMAAAAYRAKVAALEK
        GRSVVTLFRFGE AMSHT+T+MEE WTIEEVAFNVPGLS+DCFIPPA+LR  S++EAC++     +G       +  A+RAKVAALEK
Subjt:  GRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDF----PRGHGFKNAMAAAAYRAKVAALEK

AT1G79420.1 Protein of unknown function (DUF620) | e value: 2.8e-95 | % identity: 47.66
Query:  PAEALSPLKEGPDGN--EGRDSKEGKW---GNWMR------GQLCRAPSAVSCSA----QKRSDLRLLLG------------------------------
        P +AL+PL EGPD +  + R  KE  W     W +      G +        C++     K  DLRLLLG                              
Subjt:  PAEALSPLKEGPDGN--EGRDSKEGKW---GNWMR------GQLCRAPSAVSCSA----QKRSDLRLLLG------------------------------

Query:  ---ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFE-TANKVVRT---RNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVW
           ETS+A YI+QQY AA+G  K   +  N YA G +KM  CE E  A K V+T     + +  +SG FVLWQMQP MW +EL LGG+K+ +G +GK VW
Subjt:  ---ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFE-TANKVVRT---RNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVW

Query:  RHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSE--GPAEIIRHTMFGYFSQKSGLLIHLEDSHLTR
        RHTPWLG HAAKGP RPLRR +QGLDPKTTAS+FA A+C+GE+ +  +DCF+LK+  D  +L  R++   PAE+IRH ++GYF QKSGLL++LEDSHLTR
Subjt:  RHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSE--GPAEIIRHTMFGYFSQKSGLLIHLEDSHLTR

Query:  IQ--NNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAEL
        +   +   +AVYWETTI + + DYR V+G+ +AH GR+V T+FRFGET++ +++TRMEE W I++V F+VPGLS+D FIPPA++
Subjt:  IQ--NNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAEL

AT3G19540.1 Protein of unknown function (DUF620) | e value: 8.8e-174 | % identity: 65.23
Query:  MERKQGFFSALKGEIVRGLSPGRSRAK----SPARSASPMSSLLRRRKS------------HQMAQPDLLIERSGSLRPA-EALSPLKEGPDGNEGRDSK
        ME+KQGFFSAL+ E+VRGLSP RSRA+    SPARS+SPMS+L   RK+            + +AQP+ LI RSGSLRP  E   P + G  GN G   +
Subjt:  MERKQGFFSALKGEIVRGLSPGRSRAK----SPARSASPMSSLLRRRKS------------HQMAQPDLLIERSGSLRPA-EALSPLKEGPDGNEGRDSK

Query:  EGK-WGNWMRGQLCRAPSAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIA
         G   G+W++GQL RAPS  + +A +R+DLRLLLG                           ETSSAQYILQQYTAASGGQKLQNS+ NAYAMGK+KMI 
Subjt:  EGK-WGNWMRGQLCRAPSAVSCSAQKRSDLRLLLG---------------------------ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIA

Query:  CEFETANKVVRTRNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSV
         E ETA + VR RN SK AE+GGFVLWQM PDMWYVELA+GGSKV AGCNGKLVWRHTPWLG+H AKGPVRPLRR LQGLDP+TTA+MFA A+CIGEK V
Subjt:  CEFETANKVVRTRNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSV

Query:  NGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETA
        NGEDCFILKLCTDP TLKARSEGPAEIIRH +FGYFSQK+GLL+H+EDSHLTRIQ+NGG+ V+WETT NS LDDYR VEGIMIAHSG SVVTLFRFGE A
Subjt:  NGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETA

Query:  MSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAA--AAYRAKVAALEKHHDSKVKVIWKPDI
         SHT+T+MEE+WTIEEVAFNVPGLS+DCFIPPA+L+  S++E+C++P+    KN   A  AA+RAKVAALE       + +W  D+
Subjt:  MSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAA--AAYRAKVAALEKHHDSKVKVIWKPDI

AT5G05840.1 Protein of unknown function (DUF620) | e value: 9.3e-91 | % identity: 55.36
Query:  ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETA-----NKVVRTRN-SSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWR
        E S AQYI++QY AA GG +  N+V + YAMGKV+M A EF T      +K+V+ R+  S   E GGFVLWQ   ++W +EL + G K+ AG + K+ WR
Subjt:  ETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETA-----NKVVRTRN-SSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWR

Query:  HTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQN
         TPW  +HA++GP RPLRR LQGLDPK+TA++FA + C+GEK +N EDCFILKL  +PS LKARS    EIIRHT++G FSQ++GLLI LEDSHL RI+ 
Subjt:  HTPWLGAHAAKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQN

Query:  NGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELR
           ++++WETT+ S + DYR V+GI++AH+G+S V+LFRFGE + +H++TRMEE W IEE+ FN+ GLS+DCF+PP++L+
Subjt:  NGGDAVYWETTINSFLDDYRPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELR
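
The % identity values reported for the hits above appear consistent with counting identical columns (letters on the middle line of each alignment block) and dividing by the total number of aligned columns, gap columns included. For the first GenBank hit, counting this way gives 421 identical columns out of 466, which matches the reported 90.34. The Python sketch below illustrates that calculation. `percent_identity` is a hypothetical helper name, and the rule that `+` and blank midline characters mark non-identical columns is an assumption based on the usual BLAST-style layout, not something this page states.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of aligned columns whose midline character is a letter.

    ASSUMPTION: in the midline, a letter marks an identical residue, while
    '+' (similar residue) and ' ' (mismatch or gap) mark non-identical columns.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

# Toy alignment of 8 columns with 5 identical positions.
toy_midline = "ME+K  GF"
print(percent_identity(toy_midline, len(toy_midline)))
```

To apply it to a full hit, the midline segments of all blocks would be concatenated and the aligned length taken from the Query lines, gaps included.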


Sequences
CDS sequence
ATGGAGAGAAAACAAGGTTTCTTCTCGGCGCTAAAGGGGGAGATAGTGAGAGGACTATCACCTGGAAGGTCAAGAGCGAAGAGTCCGGCGAGAAGTGCGTCTCCCATGTC
GAGTTTACTGCGGCGGAGGAAGAGCCACCAAATGGCGCAACCTGACTTGTTGATTGAGAGATCCGGGAGCTTGAGGCCGGCGGAGGCATTGTCGCCGTTGAAGGAAGGAC
CCGACGGGAACGAAGGCAGAGACTCGAAGGAGGGGAAGTGGGGAAATTGGATGAGGGGCCAGCTTTGTCGGGCACCTTCAGCTGTTTCCTGCTCTGCCCAGAAACGATCT
GATCTGAGACTGCTGCTTGGGGAGACTTCATCTGCTCAGTATATATTGCAGCAGTACACAGCAGCCTCGGGAGGGCAGAAGCTTCAAAACTCTGTAAGCAATGCCTATGC
CATGGGAAAGGTGAAAATGATAGCATGTGAGTTTGAAACAGCAAACAAGGTGGTGAGGACCCGGAATTCCTCCAAAGATGCAGAGTCGGGCGGGTTCGTTTTGTGGCAGA
TGCAGCCAGATATGTGGTATGTTGAGCTGGCTCTGGGAGGTAGCAAGGTTCATGCTGGTTGCAATGGGAAGCTGGTGTGGAGGCACACACCTTGGCTAGGTGCTCATGCT
GCCAAAGGGCCTGTTAGACCCCTCCGCCGTGCACTGCAGGGACTTGATCCTAAAACAACAGCAAGCATGTTTGCTAATGCAAGATGCATCGGAGAGAAGAGTGTTAATGG
TGAAGATTGTTTCATCCTCAAGCTCTGTACAGATCCTTCAACGTTGAAGGCGAGAAGTGAAGGACCTGCAGAGATAATCAGACACACCATGTTTGGATACTTCAGCCAGA
AATCAGGACTCCTGATACATTTGGAAGATTCGCATTTAACTCGCATCCAAAACAATGGAGGCGATGCTGTCTATTGGGAAACCACGATCAATTCGTTTCTTGATGATTAC
CGACCAGTAGAGGGGATTATGATTGCTCACTCGGGTCGTTCTGTAGTTACCCTTTTCAGATTTGGGGAAACAGCCATGAGCCACACAAAAACCAGGATGGAAGAAGCTTG
GACTATTGAGGAAGTAGCATTTAATGTCCCTGGCCTATCCATTGATTGTTTCATTCCCCCTGCTGAACTTAGATACGCTTCCATGAGCGAGGCCTGCGATTTCCCTCGAG
GTCATGGCTTTAAGAACGCCATGGCAGCAGCAGCTTATCGGGCCAAAGTCGCAGCTCTAGAGAAGCATCACGACAGCAAGGTCAAAGTTATCTGGAAACCAGATATCTGA
mRNA sequence
ATGGAGAGAAAACAAGGTTTCTTCTCGGCGCTAAAGGGGGAGATAGTGAGAGGACTATCACCTGGAAGGTCAAGAGCGAAGAGTCCGGCGAGAAGTGCGTCTCCCATGTC
GAGTTTACTGCGGCGGAGGAAGAGCCACCAAATGGCGCAACCTGACTTGTTGATTGAGAGATCCGGGAGCTTGAGGCCGGCGGAGGCATTGTCGCCGTTGAAGGAAGGAC
CCGACGGGAACGAAGGCAGAGACTCGAAGGAGGGGAAGTGGGGAAATTGGATGAGGGGCCAGCTTTGTCGGGCACCTTCAGCTGTTTCCTGCTCTGCCCAGAAACGATCT
GATCTGAGACTGCTGCTTGGGGAGACTTCATCTGCTCAGTATATATTGCAGCAGTACACAGCAGCCTCGGGAGGGCAGAAGCTTCAAAACTCTGTAAGCAATGCCTATGC
CATGGGAAAGGTGAAAATGATAGCATGTGAGTTTGAAACAGCAAACAAGGTGGTGAGGACCCGGAATTCCTCCAAAGATGCAGAGTCGGGCGGGTTCGTTTTGTGGCAGA
TGCAGCCAGATATGTGGTATGTTGAGCTGGCTCTGGGAGGTAGCAAGGTTCATGCTGGTTGCAATGGGAAGCTGGTGTGGAGGCACACACCTTGGCTAGGTGCTCATGCT
GCCAAAGGGCCTGTTAGACCCCTCCGCCGTGCACTGCAGGGACTTGATCCTAAAACAACAGCAAGCATGTTTGCTAATGCAAGATGCATCGGAGAGAAGAGTGTTAATGG
TGAAGATTGTTTCATCCTCAAGCTCTGTACAGATCCTTCAACGTTGAAGGCGAGAAGTGAAGGACCTGCAGAGATAATCAGACACACCATGTTTGGATACTTCAGCCAGA
AATCAGGACTCCTGATACATTTGGAAGATTCGCATTTAACTCGCATCCAAAACAATGGAGGCGATGCTGTCTATTGGGAAACCACGATCAATTCGTTTCTTGATGATTAC
CGACCAGTAGAGGGGATTATGATTGCTCACTCGGGTCGTTCTGTAGTTACCCTTTTCAGATTTGGGGAAACAGCCATGAGCCACACAAAAACCAGGATGGAAGAAGCTTG
GACTATTGAGGAAGTAGCATTTAATGTCCCTGGCCTATCCATTGATTGTTTCATTCCCCCTGCTGAACTTAGATACGCTTCCATGAGCGAGGCCTGCGATTTCCCTCGAG
GTCATGGCTTTAAGAACGCCATGGCAGCAGCAGCTTATCGGGCCAAAGTCGCAGCTCTAGAGAAGCATCACGACAGCAAGGTCAAAGTTATCTGGAAACCAGATATCTGA
Protein sequence
MERKQGFFSALKGEIVRGLSPGRSRAKSPARSASPMSSLLRRRKSHQMAQPDLLIERSGSLRPAEALSPLKEGPDGNEGRDSKEGKWGNWMRGQLCRAPSAVSCSAQKRS
DLRLLLGETSSAQYILQQYTAASGGQKLQNSVSNAYAMGKVKMIACEFETANKVVRTRNSSKDAESGGFVLWQMQPDMWYVELALGGSKVHAGCNGKLVWRHTPWLGAHA
AKGPVRPLRRALQGLDPKTTASMFANARCIGEKSVNGEDCFILKLCTDPSTLKARSEGPAEIIRHTMFGYFSQKSGLLIHLEDSHLTRIQNNGGDAVYWETTINSFLDDY
RPVEGIMIAHSGRSVVTLFRFGETAMSHTKTRMEEAWTIEEVAFNVPGLSIDCFIPPAELRYASMSEACDFPRGHGFKNAMAAAAYRAKVAALEKHHDSKVKVIWKPDI
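
The protein sequence above can be related to the CDS by codon-wise translation. The following is a minimal Python sketch, assuming the standard genetic code applies to this nuclear gene. `translate` is a hypothetical helper written for illustration and is not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 24 nucleotides of the CDS listed above.
print(translate("ATGGAGAGAAAACAAGGTTTCTTC"))
```

Joining the CDS lines into one string and passing it to `translate` should reproduce the protein sequence listed above if the assumption about the genetic code holds.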