; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000419 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000419
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationscaffold44:1584578..1587448
RNA-Seq ExpressionMS000419
SyntenyMS000419
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]1.6e-25892.18Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TP KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q+ENK  SREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]5.2e-25791.78Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TP KRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNGMRQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q+ENK SSREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]4.7e-25891.98Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TP KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q+ENK  SREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia]3.6e-28299.8Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWG CWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
        YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]4.0e-25792.38Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWGSCWSLYWCFGI SQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQDG LLD+QISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM S RT SE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
        YQ+ENK SSREAETCE FDIKTSTAPEK+   +DDQCYQNQRA+TLGSFKEFNFDQTKGEI+NTASIGAEWWANEKVAVKEANPGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840982.3e-25891.98Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TP KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q+ENK  SREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 12.3e-25891.98Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TP KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q+ENK  SREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 17.8e-25992.18Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TP KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q+ENK  SREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

A0A6J1CJS5 uncharacterized protein LOC1110116541.7e-28299.8Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWG CWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
        YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

A0A6J1E856 uncharacterized protein LOC1114307671.9e-25290.18Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWGSCWSLYWCFG GSQKNNKRIGHAVLVPEP V G VAP VEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P SNAQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SDPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK
        SRLGSGS+TPNG+RQDSRL SGT+TPDGLG+ALQDG LLD+QISEVASLANSE+GCQNDV NHRVSFELTGEDVARCLANKS            +QTS+ 
Subjt:  SRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSK

Query:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         Q++NK SS+EAE+CEFFDIKTSTAPEK+ A EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKVAVKEA+PGNNWTFFPMLQPGVS
Subjt:  YQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.2e-3246.98Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRT------PSTTMVLPFIAPPSSPASFLQSDPSSNAQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE        P   H+        +  + L  +APPSSPASF  S   S  QSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRT------PSTTMVLPFIAPPSSPASFLQSDPSSNAQSPAGLLSLTALSVNNYS

Query:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
          GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)1.7e-5151.33Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPTP--SKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPV-VPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CF  GS +  KRIG++VLVPEPV +  + +        S    LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPPTP--SKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPV-VPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQS

Query:  DPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG
        +P S  QSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS + TEPS+AP TPP +   +    TTPSSPEVPFA+L  S  +H   S+G
Subjt:  DPSSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG

Query:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.6e-3346.98Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRT------PSTTMVLPFIAPPSSPASFLQSDPSSNAQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE        P   H+        +  + L  +APPSSPASF  S   S  QSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRT------PSTTMVLPFIAPPSSPASFLQSDPSSNAQSPAGLLSLTALSVNNYS

Query:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
          GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein9.2e-11955.8Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPG-TVAPVVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP +  K+R GS WSLYWCF  GS+KNNKRIGHAVLVPEP   G  VAPV    + ST++ +PFIAPPSSPASF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPG-TVAPVVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSDPSSNAQSP-AGLL-SLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S P S + +P  GLL SLT         N P S F IGPYA+ETQ V+PPVFSAF TEPSTAPFTPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSDPSSNAQSP-AGLL-SLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGMGMGSRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMASIRTE
        D    GS+L SG +TPNG     R+  G LTP        +GSLLD+QISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S    
Subjt:  DGMGMGSRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMASIRTE

Query:  SESSEQQTSSKYQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK-EANPGNNW
           S ++ S ++   N         C     KTS   E   +       Q  R+ + GS KEF FD T  E+     I +EWWANEKVA K + +P N+W
Subjt:  SESSEQQTSSKYQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK-EANPGNNW

Query:  TFFPMLQPG
        TFFP+L+ G
Subjt:  TFFPMLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.4e-11953.4Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQSDP
        +NNSV+TVNAAATAIV+AE+RVQP +  K RWG CWSLY CF  G+QKNNKRIG+AVLVPEPV  G     V++   STT+VLPFIAPPSSPASFLQSDP
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQSDP

Query:  SSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
        SS + SP G LSLT+   N +S   P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  SSNAQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Subjt:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSS
        GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K                + 
Subjt:  GSRLGSGSMTPNGMRQDSRLDSGTLTPDGLGNALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSS

Query:  KYQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS
         +   N     E E     DI+ +          +    Q   + ++GS KEF FD TK E              EKVA      GN+W+FFP L+ GVS
Subjt:  KYQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCATGAACAACAGCGTGGACACGGTTAATGCGGCCGCTACTGCGATCGTCTCCGCGGAGGCTCGAGTCCAGCCTCCGACACCTTCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAATAAGCGTATTGGTCATGCTGTACTGGTTCCAGAACCTGTGGTACCAGGAACTGTTGCCCCTG
TTGTTGAACATCGGACACCTTCAACCACAATGGTATTACCTTTTATTGCGCCTCCGTCTTCTCCTGCGTCTTTCCTCCAGTCCGATCCTTCATCAAACGCTCAATCTCCG
GCTGGATTACTATCTTTAACTGCCCTCTCAGTCAATAACTACTCCCAAAATGGACCTGCGTCCATTTTTGCCATAGGCCCTTATGCATATGAGACCCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCCCAACTGAACCATCTACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACACCCTCATCTCCTGAAGTGCCATTTGCTAAAT
TGCTGACATCTTCTCTGAGTCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGC
GCCCATCTTATATCACCTGGATCGGTAATTTCGAACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCTCCGAAGCTCTT
GGGTCTCGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACGCCAGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACCC
CTGATGGTATGGGTATGGGTTCGAGATTGGGATCTGGATCCATGACCCCGAATGGCATGAGGCAAGATTCAAGATTGGATTCTGGTACCTTGACGCCCGATGGTTTGGGG
AATGCCTTGCAAGATGGTTCACTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGTGAAGATGTTGCACGTTGTCTTGCAAACAAGTCAATGGCATCCATTAGAACTGAATCAGAGTCTTCAGAGCAACAAACGAGCTCAAAGTACCAAAGCG
AAAACAAAGGATCCTCGAGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACTTCCACAGCACCCGAAAAAAGTCCAGCAGGAGAGGATGATCAATGCTACCAAAAT
CAGCGAGCCGTAACTCTTGGTTCATTCAAAGAGTTCAATTTTGACCAAACTAAAGGTGAAATACACAATACAGCCTCCATTGGTGCAGAGTGGTGGGCCAATGAAAAGGT
GGCTGTGAAGGAAGCTAATCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAACCTGGAGTCAGC
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCATGAACAACAGCGTGGACACGGTTAATGCGGCCGCTACTGCGATCGTCTCCGCGGAGGCTCGAGTCCAGCCTCCGACACCTTCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAATAAGCGTATTGGTCATGCTGTACTGGTTCCAGAACCTGTGGTACCAGGAACTGTTGCCCCTG
TTGTTGAACATCGGACACCTTCAACCACAATGGTATTACCTTTTATTGCGCCTCCGTCTTCTCCTGCGTCTTTCCTCCAGTCCGATCCTTCATCAAACGCTCAATCTCCG
GCTGGATTACTATCTTTAACTGCCCTCTCAGTCAATAACTACTCCCAAAATGGACCTGCGTCCATTTTTGCCATAGGCCCTTATGCATATGAGACCCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCCCAACTGAACCATCTACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACACCCTCATCTCCTGAAGTGCCATTTGCTAAAT
TGCTGACATCTTCTCTGAGTCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGC
GCCCATCTTATATCACCTGGATCGGTAATTTCGAACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCTCCGAAGCTCTT
GGGTCTCGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACGCCAGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACCC
CTGATGGTATGGGTATGGGTTCGAGATTGGGATCTGGATCCATGACCCCGAATGGCATGAGGCAAGATTCAAGATTGGATTCTGGTACCTTGACGCCCGATGGTTTGGGG
AATGCCTTGCAAGATGGTTCACTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGTGAAGATGTTGCACGTTGTCTTGCAAACAAGTCAATGGCATCCATTAGAACTGAATCAGAGTCTTCAGAGCAACAAACGAGCTCAAAGTACCAAAGCG
AAAACAAAGGATCCTCGAGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACTTCCACAGCACCCGAAAAAAGTCCAGCAGGAGAGGATGATCAATGCTACCAAAAT
CAGCGAGCCGTAACTCTTGGTTCATTCAAAGAGTTCAATTTTGACCAAACTAAAGGTGAAATACACAATACAGCCTCCATTGGTGCAGAGTGGTGGGCCAATGAAAAGGT
GGCTGTGAAGGAAGCTAATCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAACCTGGAGTCAGC
Protein sequenceShow/hide protein sequence
MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQSDPSSNAQSP
AGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLGSGSMTPNGMRQDSRLDSGTLTPDGLG
NALQDGSLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSKYQSENKGSSREAETCEFFDIKTSTAPEKSPAGEDDQCYQN
QRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEANPGNNWTFFPMLQPGVS