; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009157 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009157
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr06:7338657..7342120
RNA-Seq ExpressionPI0009157
SyntenyPI0009157
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]5.3e-27898.39Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLD+QISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESES KQTSTSN
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]1.0e-27697.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTY+TQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLD+QISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESES KQTSTSN
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QNENKESSREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]4.5e-27797.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLD+QISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESES KQTSTSN
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia]6.5e-26092.59Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPY YETQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESL-KQTSTS
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLD+QISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESES  +QTS+ 
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESL-KQTSTS

Query:  NQNENKESSREAETCEFFDIKTSTAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
         Q+ENK SSREAETCEFFDIKTSTAPEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEA+PGNNWTFFP+LQPGVS
Subjt:  NQNENKESSREAETCEFFDIKTSTAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]2.5e-26795.57Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGI SQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY YETQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDG+GMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQD PLLD QISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RT SES KQTSTS 
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        Q ENKESSREAETCE FDIKTSTAPEKT  +DDQCYQNQRA+TLGSFKEFNFDQTKGEI+NTASIGAEWWANEKVAVKEA+PGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840982.2e-27797.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLD+QISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESES KQTSTSN
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 12.2e-27797.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLD+QISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESES KQTSTSN
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 12.6e-27898.39Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLD+QISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESES KQTSTSN
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

A0A6J1CJS5 uncharacterized protein LOC1110116543.2e-26092.59Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQKNNKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPY YETQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESL-KQTSTS
        SRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ LQD  LLD+QISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESES  +QTS+ 
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESL-KQTSTS

Query:  NQNENKESSREAETCEFFDIKTSTAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
         Q+ENK SSREAETCEFFDIKTSTAPEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEA+PGNNWTFFP+LQPGVS
Subjt:  NQNENKESSREAETCEFFDIKTSTAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

A0A6J1E856 uncharacterized protein LOC1114307671.0e-25892.15Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQKNNKRIGHAVLVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SN QSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN
        SRLGSGSVTPNGVRQDSRLGSGT+TPDGLGH LQD  LLDSQISEVASLANSETGCQNDV NHRVSFELTGEDVARCLANKS           KQTST++
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        QN+NKESS+EAE+CEFFDIKTSTAPEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKVAVKEASPGNNWTFFP+LQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766601.4e-3447.91Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S   S TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPY +ETQLVSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)5.1e-5352.09Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+  ++P   KR+W + WSL  CF  GS +  KRIG++VLVPEP ++  + +        S    LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS

Query:  EPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG
        EP S TQSP G+LS + L  NN       SIFAIGPY +ETQLVSPPVFS +TTEPS+APITPP +   +    TTPSSPEVPFA+L  S  +H   S+G
Subjt:  EPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG

Query:  TNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown9.7e-3647.91Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S   S TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPY +ETQLVSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein3.1e-11955.51Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP++  K+R GS WSLYWCF  GS+KNNKRIGHAVLVPEPA  G AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEP--TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S P   S+T  P  L SLT         N P S F IGPY +ETQ V+PPVFSAFTTEPSTAP TPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSEP--TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTE
        D    GS+L SG VTPNG     R+  G LTP        +  LLDSQISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K        
Subjt:  DGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTE

Query:  SESLKQTSTSNQNENKESSREAETCEFFDIKTSTAPEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK-EASPGNNWT
               + S  +E          C            KT GE + +  Q  R+ + GS KEF FD T  E+     I +EWWANEKVA K + SP N+WT
Subjt:  SESLKQTSTSNQNENKESSREAETCEFFDIKTSTAPEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK-EASPGNNWT

Query:  FFPLLQPG
        FFP+L+ G
Subjt:  FFPLLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein2.7e-11853.31Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP++  K RWG CWSLY CF  G+QKNNKRIG+AVLVPEP   G     V++   STT+VLPFIAPPSSPASFLQS+P
Subjt:  MNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP

Query:  TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
        +S + SP G LSLT+   N +SP  P S+F +GPY  ETQ V+PPVFSAF TEPSTAP TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Subjt:  KFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTS
        GS L SG++TPNG      + SG LTP+     LQ      +QISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K               + S
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTS

Query:  NQNENKESSREAETCEFFDIKTSTAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS
        +   N     E E     DI+ +        E++Q   Q   + ++GS KEF FD TK E              EKVA      GN+W+FFP L+ GVS
Subjt:  NQNENKESSREAETCEFFDIKTSTAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCATGAACAACAGCGTGGATACCGTTAATGCTGCCGCTACTGCTATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAACAATAAACGTATAGGTCATGCTGTACTCGTTCCAGAACCTGCAGTACCAGGAGCTGTTGCCCCTG
CTGTTGAGCATCGAACGCCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCT
GCTGGATTACTATCTTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGAGACTCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCACCACTGAACCATCGACCGCTCCTATTACTCCTCCTCCTGAGTCTGTGCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAAT
TGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTTACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGT
GCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTT
GGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACTC
CTGATGGTATGGGTATGGGTTCGAGATTGGGATCTGGATCTGTTACCCCAAATGGTGTGAGGCAAGATTCAAGACTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGG
CATGGCTTGCAAGATAGTCCATTGTTGGACAGCCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCACTGACATCCATTAGAACTGAATCCGAGTCTCTGAAGCAAACAAGCACAAGCAATCAAAACGAAA
ACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCACAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGA
GCTGTAACTCTCGGTTCATTCAAAGAGTTCAACTTTGACCAAACAAAAGGAGAAATACACAACACAGCCTCCATTGGTGCAGAATGGTGGGCCAATGAGAAAGTGGCTGT
GAAGGAAGCTAGTCCAGGTAACAACTGGACTTTCTTCCCATTATTGCAACCTGGGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
CTATTTTTTTATTTTCATTTTATTATATTTCCTAATTTATAATTTTATTTATTATTTCTTTCTTTATTTTGGAAATATATAAAATCAGAAAATTCAATCTCTCTCTTTCT
TCGTCTGAGTTTTTCATTTCTTAGATCAAATCCAAAAGTCTTCTCTCTTCCTTTTTTTGGCTGTCAAATCTCACTCCAGAGTAATTGATTTTGCCGGCGACTTAGCGGCC
TCGATTGATCGGAAAATACGGCGACCTTCTTCAAAAGTCCGCCGCCGGAGCTCTCTGGATTGGAAGAAAGTGGGGATGGGAAGCATGAACAACAGCGTGGATACCGTTAA
TGCTGCCGCTACTGCTATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTT
CACAGAAAAACAATAAACGTATAGGTCATGCTGTACTCGTTCCAGAACCTGCAGTACCAGGAGCTGTTGCCCCTGCTGTTGAGCATCGAACGCCTTCAACCACCATGGTA
TTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCTTTAACTGCTCTTTCAGTCAA
TAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGAGACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCGA
CCGCTCCTATTACTCCTCCTCCTGAGTCTGTGCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGT
TTTGGGACTAACCAAAAGTTTACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAA
CTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCT
CAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGTATGGGTTCGAGATTGGGATCT
GGATCTGTTACCCCAAATGGTGTGAGGCAAGATTCAAGACTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGGCATGGCTTGCAAGATAGTCCATTGTTGGACAGCCA
AATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTG
CAAATAAGTCACTGACATCCATTAGAACTGAATCCGAGTCTCTGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAG
TTCTTTGACATCAAGACTTCCACAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTCGGTTCATTCAAAGAGTTCAACTT
TGACCAAACAAAAGGAGAAATACACAACACAGCCTCCATTGGTGCAGAATGGTGGGCCAATGAGAAAGTGGCTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACTTTCT
TCCCATTATTGCAACCTGGGGTCAGCTGATTTTGACAAAGGATATCAACACTAAAATGAACAAAACAAAGAAGGAGAAGAAGAAACAACAACCAGCCTTTTGAATGTACA
TTTGAATGTAATCCTCCTTTGGAGACGATGCAATGATTTGGAGGAAAAAGAATGTTTTTTCAAAGTTTGTCTTGTGAAAAACAGTATTCCCAAATAGACATCAGAAAAGA
AAGTAGTTATTAGGGATAACTTGTGTGGGTACTGAAGGAGCATTCTTTGTCATAGCTCAAGAAGTAGATCATTCATAATCATAGGATCTTTGGAAGTGCTTTATTCTTTC
TTGTCTTTGTATAATAATAAGAAATTTCATTCTTCCCCAACAATCAAAACTTCCTTTCTTGAAAA
Protein sequenceShow/hide protein sequence
MGSMNNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSP
AGLLSLTALSVNNYSPNGPASIFAIGPYTYETQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLG
HGLQDSPLLDSQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESLKQTSTSNQNENKESSREAETCEFFDIKTSTAPEKTPGEDDQCYQNQR
AVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPLLQPGVS