; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020218 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020218
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr09:13915789..13938501
RNA-Seq ExpressionPay0020218
SyntenyPay0020218
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648436.1 hypothetical protein Csa_008851 [Cucumis sativus]5.4e-29591.61Show/hide
Query:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE
        EGARLNIENLT EN MPPE LDWVRVESISSL GTLADGVDN GSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRH EGGFGV SGE
Subjt:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE

Query:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE
         LQCFLKKR+KSMFASEE MEIENVLHSRTGSHAP PC PSEVCSP LTLTGSYCSGN CVNKSTES DDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE

Query:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK
        NLLS T VKDEPYDHVDDSNIYGKDMNNVFS+TVSIKSEAT PDEHYENKVDNMRLQDRMKFFSS+KDFGFTP++YEHPKPSDPGCSILVSEPASL NIK
Subjt:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK

Query:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESF ED F EL+DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS
        IEQTRYL FR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            VL S
Subjt:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_008460540.1 PREDICTED: uncharacterized protein LOC103499334 isoform X1 [Cucumis melo]0.0e+0097.57Show/hide
Query:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
        MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
Subjt:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP

Query:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
        SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
Subjt:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV

Query:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
        PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
Subjt:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT

Query:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
        NIKRRCKRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
Subjt:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL

Query:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V
        VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            V
Subjt:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V

Query:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_008460542.1 PREDICTED: uncharacterized protein LOC103499334 isoform X2 [Cucumis melo]4.3e-30092.52Show/hide
Query:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
        MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
Subjt:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP

Query:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
        SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
Subjt:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV

Query:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
        PKENLLSSTTVKDEPYDHVDDSNIY                             DNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
Subjt:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT

Query:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
        NIKRRCKRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
Subjt:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL

Query:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V
        VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            V
Subjt:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V

Query:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_011655236.1 uncharacterized protein LOC101212787 isoform X1 [Cucumis sativus]5.4e-29591.61Show/hide
Query:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE
        EGARLNIENLT EN MPPE LDWVRVESISSL GTLADGVDN GSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRH EGGFGV SGE
Subjt:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE

Query:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE
         LQCFLKKR+KSMFASEE MEIENVLHSRTGSHAP PC PSEVCSP LTLTGSYCSGN CVNKSTES DDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE

Query:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK
        NLLS T VKDEPYDHVDDSNIYGKDMNNVFS+TVSIKSEAT PDEHYENKVDNMRLQDRMKFFSS+KDFGFTP++YEHPKPSDPGCSILVSEPASL NIK
Subjt:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK

Query:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESF ED F EL+DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS
        IEQTRYL FR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            VL S
Subjt:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_031741378.1 uncharacterized protein LOC101212787 isoform X2 [Cucumis sativus]6.7e-26985.49Show/hide
Query:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE
        EGARLNIENLT EN MPPE LDWVRVESISSL GTLADGVDN GSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRH EGGFGV SGE
Subjt:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE

Query:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE
         LQCFLKKR+KSMFASEE MEIENVLHSRTGSHAP PC PSEVCSP LTLTGSYCSGN CVNKSTES DDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE

Query:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK
        NLLS T VKDEPYDHVDDSNIYGKDMNNVFS+TVSIKSEAT PDEHYENKVDNMRLQDRMKFFSS+KDFGFTP++YEHPKPSDPGCSILVSEPASL NIK
Subjt:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK

Query:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESF ED F EL+DVISR                                
Subjt:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS
           TRYL FR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            VL S
Subjt:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

TrEMBL top hitse value%identityAlignment
A0A0A0KNC1 Uncharacterized protein2.6e-29591.61Show/hide
Query:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE
        EGARLNIENLT EN MPPE LDWVRVESISSL GTLADGVDN GSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRH EGGFGV SGE
Subjt:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE

Query:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE
         LQCFLKKR+KSMFASEE MEIENVLHSRTGSHAP PC PSEVCSP LTLTGSYCSGN CVNKSTES DDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE

Query:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK
        NLLS T VKDEPYDHVDDSNIYGKDMNNVFS+TVSIKSEAT PDEHYENKVDNMRLQDRMKFFSS+KDFGFTP++YEHPKPSDPGCSILVSEPASL NIK
Subjt:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK

Query:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESF ED F EL+DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS
        IEQTRYL FR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            VL S
Subjt:  IEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL-----------WVLLS

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A1S3CCP8 uncharacterized protein LOC103499334 isoform X22.1e-30092.52Show/hide
Query:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
        MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
Subjt:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP

Query:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
        SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
Subjt:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV

Query:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
        PKENLLSSTTVKDEPYDHVDDSNIY                             DNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
Subjt:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT

Query:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
        NIKRRCKRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
Subjt:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL

Query:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V
        VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            V
Subjt:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V

Query:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A1S3CCT1 uncharacterized protein LOC103499334 isoform X10.0e+0097.57Show/hide
Query:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
        MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP
Subjt:  MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVP

Query:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
        SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV
Subjt:  SGELLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHV

Query:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
        PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT
Subjt:  PKENLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLT

Query:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
        NIKRRCKRKKTVTNSVETALEEDAPGLLQ  VDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL
Subjt:  NIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACL

Query:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V
        VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPL            V
Subjt:  VSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------V

Query:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  LLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X19.4e-26180.98Show/hide
Query:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE
        EGARLN EN T EN  PPE LD VRVES S LSGTL  GVDN   AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGL N+HVEGG GVPSG+
Subjt:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE

Query:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE
        LLQCFLK++ KSMFASEE MEI NVLH ++GS+APR CSPS VCSP+ TL+GSY S NH +NKSTESG+DMELKEDKICS+EKVATEL SR LT+HVP+E
Subjt:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE

Query:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK
        NLLSST VKDEPYDH +  +IYGKDMNNV+SNT+SIKSE T PDE YENKVD+M LQDRMKFFSSRKD GFT +DYEHPKPSDPGCS+LVSEP +  N K
Subjt:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK

Query:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPS-IRCMKSSRVSYCLACLVS
        RR K+KKT TNS+ETALEEDAPGLLQ  V+KG+ VDEIKLYGETESD+DLDES  ED F ELEDVI+RLF QRHSF+KFPS IRCMK+SR SYCLACLVS
Subjt:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPS-IRCMKSSRVSYCLACLVS

Query:  LIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------VLL
        LIEQTRYLHFR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVI+MKLT+CSRISLLEN PL            VLL
Subjt:  LIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------VLL

Query:  SYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        SYGWM NSGLGTMLNYRGRVVHDR+NEDISEW+SKIGKLLMDGYNGGAL+LENT  KVAEYSSSQ TQVKLEL
Subjt:  SYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A6J1FMV3 uncharacterized protein LOC111445382 isoform X29.4e-26180.98Show/hide
Query:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE
        EGARLN EN T EN  PPE LD VRVES S LSGTL  GVDN   AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGL N+HVEGG GVPSG+
Subjt:  EGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGE

Query:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE
        LLQCFLK++ KSMFASEE MEI NVLH ++GS+APR CSPS VCSP+ TL+GSY S NH +NKSTESG+DMELKEDKICS+EKVATEL SR LT+HVP+E
Subjt:  LLQCFLKKRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKE

Query:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK
        NLLSST VKDEPYDH +  +IYGKDMNNV+SNT+SIKSE T PDE YENKVD+M LQDRMKFFSSRKD GFT +DYEHPKPSDPGCS+LVSEP +  N K
Subjt:  NLLSSTTVKDEPYDHVDDSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIK

Query:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPS-IRCMKSSRVSYCLACLVS
        RR K+KKT TNS+ETALEEDAPGLLQ  V+KG+ VDEIKLYGETESD+DLDES  ED F ELEDVI+RLF QRHSF+KFPS IRCMK+SR SYCLACLVS
Subjt:  RRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPS-IRCMKSSRVSYCLACLVS

Query:  LIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------VLL
        LIEQTRYLHFR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVI+MKLT+CSRISLLEN PL            VLL
Subjt:  LIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLW-----------VLL

Query:  SYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        SYGWM NSGLGTMLNYRGRVVHDR+NEDISEW+SKIGKLLMDGYNGGAL+LENT  KVAEYSSSQ TQVKLEL
Subjt:  SYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16610.1 unknown protein1.7e-8443.04Show/hide
Query:  EGGFGVPSGELLQCFLKKRDKSMFASEEL-MEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTE--SGDDMELKEDKICSTEKVATE
        E G    SG     FL+K D  +  +  +  E  + L     S  P     S   SP  +L     S N    K  +    + + L E++I ST+     
Subjt:  EGGFGVPSGELLQCFLKKRDKSMFASEEL-MEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTE--SGDDMELKEDKICSTEKVATE

Query:  LDSRPLTDHVPKENLLS-STTVKDEPYDH---VDDSNIYGKDMNNVFSNTVS-------IKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDY
        L    + D+  K  + S    VK E   H   +D++ +    ++   +   S       +K+EA    E  E+ +D+M+L DR+K  S         L+ 
Subjt:  LDSRPLTDHVPKENLLS-STTVKDEPYDH---VDDSNIYGKDMNNVFSNTVS-------IKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDY

Query:  EHPKPSDPGCSILVSEPASLTNIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSF
            PS         E    + + R  KRKKT T+S+ETALEEDAPGLLQ  + +GV VDE++LYG    D   D+S   + F ELEDVIS+LF +R + 
Subjt:  EHPKPSDPGCSILVSEPASLTNIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSF

Query:  MKFPSIRCMKSSRVSYCLACLVSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRI
         K  +    K+SR SYCL CL SLIEQ RYL FRKWPVEWGWCRDLQSFIFVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV++MKL SC R 
Subjt:  MKFPSIRCMKSSRVSYCLACLVSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRI

Query:  SLLENTPLW-----------VLLSYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL
         L+EN PL            VL+ YGW+ N+GLGTMLNYR RV HDR  +   SEW+SKI +LL+DGYN G ++
Subjt:  SLLENTPLW-----------VLLSYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL

AT5G16610.2 unknown protein3.3e-8838.9Show/hide
Query:  IENLTSENLMPPEFLDWVRVESISSL--SGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHV--------------
        + N  +   +  E L+ + +     L  SG +    + +G    AV         +  +DL+H+ L ER +MLL R A+ L   +V              
Subjt:  IENLTSENLMPPEFLDWVRVESISSL--SGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHV--------------

Query:  -------EGGFGVPSGELLQCFLKKRDKSMFASEEL-MEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTE--SGDDMELKEDKICS
               E G    SG     FL+K D  +  +  +  E  + L     S  P     S   SP  +L     S N    K  +    + + L E++I S
Subjt:  -------EGGFGVPSGELLQCFLKKRDKSMFASEEL-MEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTE--SGDDMELKEDKICS

Query:  TEKVATELDSRPLTDHVPKENLLS-STTVKDEPYDH---VDDSNIYGKDMNNVFSNTVS-------IKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDF
        T+     L    + D+  K  + S    VK E   H   +D++ +    ++   +   S       +K+EA    E  E+ +D+M+L DR+K  S     
Subjt:  TEKVATELDSRPLTDHVPKENLLS-STTVKDEPYDH---VDDSNIYGKDMNNVFSNTVS-------IKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDF

Query:  GFTPLDYEHPKPSDPGCSILVSEPASLTNIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRL
            L+     PS         E    + + R  KRKKT T+S+ETALEEDAPGLLQ  + +GV VDE++LYG    D   D+S   + F ELEDVIS+L
Subjt:  GFTPLDYEHPKPSDPGCSILVSEPASLTNIKRRCKRKKTVTNSVETALEEDAPGLLQ-YVDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRL

Query:  FSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMK
        F +R +  K  +    K+SR SYCL CL SLIEQ RYL FRKWPVEWGWCRDLQSFIFVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV++MK
Subjt:  FSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMK

Query:  LTSCSRISLLENTPLW-----------VLLSYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL
        L SC R  L+EN PL            VL+ YGW+ N+GLGTMLNYR RV HDR  +   SEW+SKI +LL+DGYN G ++
Subjt:  LTSCSRISLLENTPLW-----------VLLSYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTACGAGGGGGCTCGATTAAATATAGAGAACCTGACTTCTGAGAATCTAATGCCACCTGAATTTCTTGATTGGGTAAGGGTGGAATCGATAAGCAGCTTATCAGG
TACCTTGGCAGATGGTGTAGATAACATTGGTTCTGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGATGTTTGATGACTTTGATGAAGATCTTGATCATGTTTTATTGA
TAGAGCGACTAAGGATGCTCCTATCAAGGCGAGCATTGGGTTTGACAAATCGACATGTGGAGGGTGGTTTTGGTGTGCCTTCGGGAGAACTTCTCCAATGCTTCTTGAAA
AAGAGAGATAAATCCATGTTTGCTAGTGAAGAACTGATGGAAATTGAAAATGTGTTGCATTCTAGAACTGGAAGTCATGCTCCTCGTCCTTGCAGCCCTTCAGAAGTTTG
TTCACCTAGTCTAACACTTACAGGATCATATTGCTCAGGCAATCATTGTGTGAACAAGTCAACTGAATCAGGCGATGATATGGAACTGAAAGAAGATAAGATCTGCTCAA
CAGAGAAGGTAGCTACAGAATTAGATTCACGGCCTTTGACTGATCATGTTCCTAAAGAAAATTTATTGAGTTCCACAACAGTGAAGGATGAACCTTATGATCATGTAGAT
GACAGCAACATATATGGTAAGGATATGAATAATGTTTTCAGCAACACTGTGTCGATAAAGAGTGAAGCAACCTTTCCTGATGAACATTATGAAAACAAGGTAGACAATAT
GCGATTGCAAGATCGAATGAAGTTTTTCTCTTCTCGGAAGGATTTTGGTTTTACACCTCTGGATTATGAGCATCCAAAACCTTCTGACCCTGGATGCAGCATTCTTGTTT
CAGAACCTGCTAGTTTAACGAACATTAAACGAAGATGCAAACGGAAAAAGACTGTCACGAATTCAGTTGAAACAGCACTAGAGGAAGATGCTCCTGGCCTTCTCCAGTAT
GTCGACAAAGGTGTACTAGTTGATGAAATCAAGCTTTATGGGGAGACAGAAAGCGATGAAGATCTAGATGAGTCTTTTGGTGAAGACATCTTTGGTGAGCTTGAAGATGT
GATATCGAGGCTTTTTTCTCAACGCCATTCCTTTATGAAGTTTCCCTCCATAAGATGCATGAAAAGTTCAAGAGTAAGCTATTGTTTAGCTTGTCTAGTTTCACTTATTG
AGCAGACAAGATATCTTCATTTCCGGAAATGGCCTGTCGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATATTTGTATTTGAGAGACATAAAAGAATAGTAATGGAA
CGTCCTGAGTATGGCTATGCGACATATTTTTTTGAGCTTGTGGATTCCTTACCCATCAATTGGCAGATAAAGCGGTTGGTGATTTCCATGAAGCTTACGAGTTGTAGCAG
AATTTCATTACTTGAGAACACACCATTATGGGTTTTATTGAGCTATGGGTGGATGCCAAATAGTGGCTTGGGTACAATGCTGAACTACCGTGGGAGAGTTGTTCATGACC
GGAATAATGAGGACATCTCCGAATGGAAATCGAAAATAGGGAAGCTATTGATGGATGGTTATAATGGCGGAGCTCTTTTGCTAGAAAATACTTCAATGAAGGTTGCAGAA
TACAGCAGTTCCCAAACCACACAAGTTAAGCTGGAACTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTACGAGGGGGCTCGATTAAATATAGAGAACCTGACTTCTGAGAATCTAATGCCACCTGAATTTCTTGATTGGGTAAGGGTGGAATCGATAAGCAGCTTATCAGG
TACCTTGGCAGATGGTGTAGATAACATTGGTTCTGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGATGTTTGATGACTTTGATGAAGATCTTGATCATGTTTTATTGA
TAGAGCGACTAAGGATGCTCCTATCAAGGCGAGCATTGGGTTTGACAAATCGACATGTGGAGGGTGGTTTTGGTGTGCCTTCGGGAGAACTTCTCCAATGCTTCTTGAAA
AAGAGAGATAAATCCATGTTTGCTAGTGAAGAACTGATGGAAATTGAAAATGTGTTGCATTCTAGAACTGGAAGTCATGCTCCTCGTCCTTGCAGCCCTTCAGAAGTTTG
TTCACCTAGTCTAACACTTACAGGATCATATTGCTCAGGCAATCATTGTGTGAACAAGTCAACTGAATCAGGCGATGATATGGAACTGAAAGAAGATAAGATCTGCTCAA
CAGAGAAGGTAGCTACAGAATTAGATTCACGGCCTTTGACTGATCATGTTCCTAAAGAAAATTTATTGAGTTCCACAACAGTGAAGGATGAACCTTATGATCATGTAGAT
GACAGCAACATATATGGTAAGGATATGAATAATGTTTTCAGCAACACTGTGTCGATAAAGAGTGAAGCAACCTTTCCTGATGAACATTATGAAAACAAGGTAGACAATAT
GCGATTGCAAGATCGAATGAAGTTTTTCTCTTCTCGGAAGGATTTTGGTTTTACACCTCTGGATTATGAGCATCCAAAACCTTCTGACCCTGGATGCAGCATTCTTGTTT
CAGAACCTGCTAGTTTAACGAACATTAAACGAAGATGCAAACGGAAAAAGACTGTCACGAATTCAGTTGAAACAGCACTAGAGGAAGATGCTCCTGGCCTTCTCCAGTAT
GTCGACAAAGGTGTACTAGTTGATGAAATCAAGCTTTATGGGGAGACAGAAAGCGATGAAGATCTAGATGAGTCTTTTGGTGAAGACATCTTTGGTGAGCTTGAAGATGT
GATATCGAGGCTTTTTTCTCAACGCCATTCCTTTATGAAGTTTCCCTCCATAAGATGCATGAAAAGTTCAAGAGTAAGCTATTGTTTAGCTTGTCTAGTTTCACTTATTG
AGCAGACAAGATATCTTCATTTCCGGAAATGGCCTGTCGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATATTTGTATTTGAGAGACATAAAAGAATAGTAATGGAA
CGTCCTGAGTATGGCTATGCGACATATTTTTTTGAGCTTGTGGATTCCTTACCCATCAATTGGCAGATAAAGCGGTTGGTGATTTCCATGAAGCTTACGAGTTGTAGCAG
AATTTCATTACTTGAGAACACACCATTATGGGTTTTATTGAGCTATGGGTGGATGCCAAATAGTGGCTTGGGTACAATGCTGAACTACCGTGGGAGAGTTGTTCATGACC
GGAATAATGAGGACATCTCCGAATGGAAATCGAAAATAGGGAAGCTATTGATGGATGGTTATAATGGCGGAGCTCTTTTGCTAGAAAATACTTCAATGAAGGTTGCAGAA
TACAGCAGTTCCCAAACCACACAAGTTAAGCTGGAACTCTAA
Protein sequenceShow/hide protein sequence
MDYEGARLNIENLTSENLMPPEFLDWVRVESISSLSGTLADGVDNIGSAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGFGVPSGELLQCFLK
KRDKSMFASEELMEIENVLHSRTGSHAPRPCSPSEVCSPSLTLTGSYCSGNHCVNKSTESGDDMELKEDKICSTEKVATELDSRPLTDHVPKENLLSSTTVKDEPYDHVD
DSNIYGKDMNNVFSNTVSIKSEATFPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPLDYEHPKPSDPGCSILVSEPASLTNIKRRCKRKKTVTNSVETALEEDAPGLLQY
VDKGVLVDEIKLYGETESDEDLDESFGEDIFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQTRYLHFRKWPVEWGWCRDLQSFIFVFERHKRIVME
RPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENTPLWVLLSYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAE
YSSSQTTQVKLEL