; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030116 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030116
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationtig00153554:3080564..3083531
RNA-Seq ExpressionSgr030116
SyntenySgr030116
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]3.0e-26593.56Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S+P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQ SEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESP+QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QN+ K  SREAETCEFFDIKTS APEKT GEDDQCYQNQRA+TLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV  KEASPGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]6.3e-26392.76Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S+P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGH LQD  LLDNQ SEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESP+QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QN+ K SSREAETCEFFDIKTS APEKT GEDDQCYQNQRA+TLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV  KEASPGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]8.8e-26593.36Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQ SEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESP+QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QN+ K  SREAETCEFFDIKTS APEKT GEDDQCYQNQRA+TLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV  KEASPGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia]5.2e-26593.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKNNKRIGHAVLVPE V+PG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        SDP SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPE-QTSTK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ALQDG LLDNQ SEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM SIRTESES E QTS+K
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPE-QTSTK

Query:  YQNKTKGSSREAETCEFFDIKTSTAPEKT-SGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        YQ++ KGSSREAETCEFFDIKTSTAPEK+ +GEDDQCYQNQRA+TLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVA KEA+PGNNWTFFPMLQPGVS
Subjt:  YQNKTKGSSREAETCEFFDIKTSTAPEKT-SGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]8.8e-26593.96Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGI SQKNNKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S+PPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQDG LLD+Q SEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTS RT SESP+QTST Y
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        Q + K SSREAETCE FDIKTSTAPEKTS +DDQCYQNQRA+TLGSFKEFNFDQTKGEI+NTASIGAEWWANEKVA KEA+PGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840984.3e-26593.36Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQ SEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESP+QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QN+ K  SREAETCEFFDIKTS APEKT GEDDQCYQNQRA+TLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV  KEASPGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 14.3e-26593.36Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQ SEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESP+QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QN+ K  SREAETCEFFDIKTS APEKT GEDDQCYQNQRA+TLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV  KEASPGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 11.5e-26593.56Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPE  +PGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S+P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQ SEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESP+QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QN+ K  SREAETCEFFDIKTS APEKT GEDDQCYQNQRA+TLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV  KEASPGNNWTFFP+LQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

A0A6J1CJS5 uncharacterized protein LOC1110116542.5e-26593.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKNNKRIGHAVLVPE V+PG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        SDP SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPE-QTSTK
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ALQDG LLDNQ SEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM SIRTESES E QTS+K
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPE-QTSTK

Query:  YQNKTKGSSREAETCEFFDIKTSTAPEKT-SGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        YQ++ KGSSREAETCEFFDIKTSTAPEK+ +GEDDQCYQNQRA+TLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVA KEA+PGNNWTFFPMLQPGVS
Subjt:  YQNKTKGSSREAETCEFFDIKTSTAPEKT-SGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

A0A6J1E856 uncharacterized protein LOC1114307673.5e-25991.75Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFG GSQKNNKRIGHAVLVPE  + GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ
        S+PPSN QSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL+HTNKSFGTNQ
Subjt:  SDPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSG+LTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY
        SRLGSGSVTPNGVRQDSRLGSGT+TPDGLGHALQDGLLLD+Q SEVASLANSE+GCQNDV NHRVSFELTGEDVARCLANKS           +QTST  
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKY

Query:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS
        QNK K SS+EAE+CEFFDIKTSTAPEKTS EDDQCYQNQRA+ LGSFKEFNFDQTKGE+H+TASIGAEWWANEKVA KEASPGNNWTFFPMLQPGVS
Subjt:  QNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766607.2e-3648.84Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSDPPSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSDPPSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS++  N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)3.9e-5352.09Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETV-LPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CF  GS +  KRIG++VLVPE V +  + +        S    LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETV-LPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS

Query:  DPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLNHTNKSFG
        +PPS TQSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS + TEPS+AP TPP +   +    TTPSSPEVPFA+L  S  NH   S+G
Subjt:  DPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLNHTNKSFG

Query:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown5.1e-3748.84Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSDPPSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSDPPSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS++  N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein1.3e-12055.91Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP +  K+R GS WSLYWCF  GS+KNNKRIGHAVLVPE    G AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSDPP--SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHT--NK
        L S PP  S+T  P  L SLT         N P S F IGPYA+ETQ V+PPVFSAF TEPSTAPFTPPPES     PSSPEVPFA+LLTSSL     N 
Subjt:  LQSDPP--SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHT--NK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG+LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTP

Query:  DGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSIRTE
        D    GS+L SG VTPNG     R+  G LTP        +G LLD+Q SEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S   E
Subjt:  DGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSIRTE

Query:  SESPEQTSTKYQNKTKGSSREAETCEFFDIKTSTAPEKTSGE-DDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAK-EASPGNNWT
                     K  G       C            KTSGE + +  Q  R+ + GS KEF FD T  E+     I +EWWANEKVA K + SP N+WT
Subjt:  SESPEQTSTKYQNKTKGSSREAETCEFFDIKTSTAPEKTSGE-DDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAK-EASPGNNWT

Query:  FFPMLQPG
        FFP+L+ G
Subjt:  FFPMLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.6e-11853.49Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSDP
        +NNSV+TVNAAATAIV+AE+RVQP +  K RWG CWSLY CF  G+QKNNKRIG+AVLVPE V  G     V++   STT+VLPFIAPPSSPASFLQSDP
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSDP

Query:  PSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLNHTNK--SFGTNQ
         S + SP G LSLT+   N +SP  P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  PSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLNHTNK--SFGTNQ

Query:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Subjt:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTK
        GS L SG++TPNG      + SG LTP+     LQ      NQ SEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K               +  
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTK

Query:  YQNKTKGSSREAETCEFFDIKTSTAPEKTSGE---DDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGV
        +         E E     DI+ +   EK SG+   +    Q   + ++GS KEF FD TK E              EKVA      GN+W+FFP L+ GV
Subjt:  YQNKTKGSSREAETCEFFDIKTSTAPEKTSGE---DDQCYQNQRALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGV

Query:  S
        S
Subjt:  S


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTATGAACAACAGCGTGGACACGGTTAATGCTGCCGCTACTGCGATCGTCTCCGCGGAGGCTCGAGTCCAGCCTCCGACACCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAATAAGCGTATTGGTCATGCCGTACTTGTTCCGGAAACTGTGCTACCAGGAGCTGTTGCCCCTG
CTGTTGAACATCGAACACCTTCAACCACAATGGTATTACCTTTCATTGCCCCTCCGTCTTCTCCTGCATCTTTCCTCCAGTCCGATCCTCCATCAAATACTCAATCTCCG
GCTGGATTACTATCTTTAACTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCGTCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTAGTCTCACC
TCCAGTTTTTTCTGCCTTCCCCACTGAACCATCTACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACACCCTCATCTCCTGAAGTGCCATTTGCTAAAT
TGCTGACATCTTCTCTGAACCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCCGGC
GCCCATCTCATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCTATTCTTGAGTTCCGCATGGCAGATGCTCCCAAGCTCTT
GGGTCTCGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACGCCAGATGGTACTGGTTTAGGTTCTAGGCTAGGTTCGGGATCTTTGACCC
CCGATGGTATGGGTATGGGTTCAAGATTGGGATCTGGATCTGTGACCCCGAATGGTGTCAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTTTGGGG
CATGCCTTGCAAGATGGTTTACTGTTGGACAACCAAAAATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACAAATCATAGGGTGTCTTT
TGAGTTAACTGGTGAAGATGTTGCACGTTGTCTTGCAAATAAGTCAATGACATCTATTAGGACTGAATCAGAGTCCCCAGAGCAAACGAGCACGAAATATCAAAACAAAA
CCAAAGGATCCTCGAGAGAAGCTGAAACTTGTGAGTTTTTTGACATCAAGACTTCCACAGCACCCGAAAAAACTTCAGGAGAGGATGATCAATGCTACCAAAACCAGCGA
GCCCTAACTCTTGGTTCGTTCAAAGAGTTCAATTTTGACCAAACAAAAGGAGAAATTCATAATACAGCCTCCATTGGTGCTGAGTGGTGGGCCAATGAAAAGGTGGCTGC
GAAGGAAGCTAGTCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAACCTGGGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGTATGAACAACAGCGTGGACACGGTTAATGCTGCCGCTACTGCGATCGTCTCCGCGGAGGCTCGAGTCCAGCCTCCGACACCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAATAAGCGTATTGGTCATGCCGTACTTGTTCCGGAAACTGTGCTACCAGGAGCTGTTGCCCCTG
CTGTTGAACATCGAACACCTTCAACCACAATGGTATTACCTTTCATTGCCCCTCCGTCTTCTCCTGCATCTTTCCTCCAGTCCGATCCTCCATCAAATACTCAATCTCCG
GCTGGATTACTATCTTTAACTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCGTCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTAGTCTCACC
TCCAGTTTTTTCTGCCTTCCCCACTGAACCATCTACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACACCCTCATCTCCTGAAGTGCCATTTGCTAAAT
TGCTGACATCTTCTCTGAACCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCCGGC
GCCCATCTCATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCTATTCTTGAGTTCCGCATGGCAGATGCTCCCAAGCTCTT
GGGTCTCGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACGCCAGATGGTACTGGTTTAGGTTCTAGGCTAGGTTCGGGATCTTTGACCC
CCGATGGTATGGGTATGGGTTCAAGATTGGGATCTGGATCTGTGACCCCGAATGGTGTCAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTTTGGGG
CATGCCTTGCAAGATGGTTTACTGTTGGACAACCAAAAATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACAAATCATAGGGTGTCTTT
TGAGTTAACTGGTGAAGATGTTGCACGTTGTCTTGCAAATAAGTCAATGACATCTATTAGGACTGAATCAGAGTCCCCAGAGCAAACGAGCACGAAATATCAAAACAAAA
CCAAAGGATCCTCGAGAGAAGCTGAAACTTGTGAGTTTTTTGACATCAAGACTTCCACAGCACCCGAAAAAACTTCAGGAGAGGATGATCAATGCTACCAAAACCAGCGA
GCCCTAACTCTTGGTTCGTTCAAAGAGTTCAATTTTGACCAAACAAAAGGAGAAATTCATAATACAGCCTCCATTGGTGCTGAGTGGTGGGCCAATGAAAAGGTGGCTGC
GAAGGAAGCTAGTCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAACCTGGGGTCAGCTGA
Protein sequenceShow/hide protein sequence
MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPETVLPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSDPPSNTQSP
AGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLNHTNKSFGTNQKFALSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGSLTPDGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLG
HALQDGLLLDNQKSEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPEQTSTKYQNKTKGSSREAETCEFFDIKTSTAPEKTSGEDDQCYQNQR
ALTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAAKEASPGNNWTFFPMLQPGVS