CuGenDBv2

Cucsat.G15285 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G15285
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: ctg2009:202105..205681
RNA-Seq Expression: Cucsat.G15285
Synteny: Cucsat.G15285
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
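
The genome location field packs a contig name and a coordinate pair into one string. A minimal sketch of how it could be split and the locus span computed follows; the helper names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based inclusive coordinates (the usual GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1

contig, start, end = parse_location("ctg2009:202105..205681")
print(contig, start, end, span_length(start, end))
```

Under that assumption the locus spans 3577 bp on ctg2009, which is longer than the 1494 nt CDS listed further down and would be consistent with the gene containing introns.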


Homology
GenBank top hits | e value | %identity
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa] | 0.0 | 98.39
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLS TALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus] | 0.0 | 99.8
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLS TALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo] | 0.0 | 97.99
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLS TALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia] | 0.0 | 91.38
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQK+NKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLS TALSVNNYS NGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQ-TSTS
        SRLGSGS+TPNGMRQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESES +Q TS+ 
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQ-TSTS

Query:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
         Q+ENK SSREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida] | 0.0 | 93.96
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGI SQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLS TALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDG+GMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQD PLLD+QISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RT SESPKQTSTS 
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        Q ENKESSREAETCE FDIKTS APEKT  +DDQCYQNQRA+TLGSFKEFNFDQTKGEI+NTASIGAEWWANEKV VKEA+PGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
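
The %identity column can be cross-checked against the alignment text itself. The sketch below assumes the usual BLAST midline convention (the residue letter for an identity, `+` for a conservative substitution, a space otherwise); the function name `percent_identity` is hypothetical, and the midline is padded because trailing spaces may have been lost when the page was captured.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue.

    ASSUMPTION: BLAST-style midline (letter = identity, '+' = conservative
    substitution, space = mismatch or gap). Gap columns ('-') never count.
    """
    # Restore trailing spaces that may have been stripped from the midline.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(
        1 for q, m in zip(query, midline) if q.isalpha() and q == m
    )
    return 100.0 * identical / len(query)

# First ten columns of the TYK22975.1 alignment shown above.
query = "MASINNSVDT"
midline = "M SINNSVDT"
print(round(percent_identity(query, midline), 2))  # 9 of 10 columns match
```

To reproduce a whole-hit figure, concatenate all `Query:` segments and all midline segments of that hit before calling the function, so that the denominator is the full alignment length.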

TrEMBL top hits | e value | %identity
A0A1S3AYC5 uncharacterized protein LOC103484098 | 0.0 | 97.99
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLS TALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 1 | 0.0 | 97.99
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLS TALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 1 | 0.0 | 98.39
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLS TALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A6J1CJS5 uncharacterized protein LOC111011654 | 0.0 | 91.38
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQK+NKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLS TALSVNNYS NGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQ-TSTS
        SRLGSGS+TPNGMRQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESES +Q TS+ 
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQ-TSTS

Query:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
         Q+ENK SSREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A6J1E856 uncharacterized protein LOC111430767 | 0.0 | 90.54
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQK+NKRIGHAVLVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SN QSPAGLLS TALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGM MG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGT+TPDGLGH LQD  LLD+QISEVASLANSETGCQNDV NHRVSFELTGEDVARCLANKS           KQTST++
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QN+NKESS+EAE+CEFFDIKTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKV VKEASPGNNWTFFP+LQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

SwissProt top hits | e value | %identity
Q9SRE5 Uncharacterized protein At1g76660 | 1.2e-33 | 46.98
Query:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S   S TQSP   LS  A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSVNNYS

Query:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPY ++TQLVSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hits | e value | %identity
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | 5.1e-53 | 51.88
Query:  ASINNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASF
        A+ NN  DT+NAAA+AI S++ R+  ++P   KR+W + WSL  CF  GS +  KRIG++VLVPEP ++  + +        S    LPFIAPPSSPASF
Subjt:  ASINNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNK
         QSEP S TQSP G+LSF+ L  NN       SIFAIGPY ++TQLVSPPVFS +TTEPS+APITPP +   +    TTPSSPEVPFA+L  S  +H   
Subjt:  LQSEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNK

Query:  SFGTNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
        S+G   KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  SFGTNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | 8.2e-35 | 46.98
Query:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S   S TQSP   LS  A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSVNNYS

Query:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPY ++TQLVSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | 2.9e-117 | 54.6
Query:  MASINN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP++  K+R GS WSLYWCF  GS+K+NKRIGHAVLVPEPA  G AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MASINN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEPTSNTQSP-AGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNKS
        L S P S + +P  GLL   +L+VN      P S F IGPY ++TQ V+PPVFSAFTTEPSTAP TPPPES     PSSPEVPFA+LLTSSL  +  N  
Subjt:  LQSEPTSNTQSP-AGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNKS

Query:  FGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPD
         G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP G G  SRLGSG LTPD
Subjt:  FGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPD

Query:  GMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTES
            GS+L SG VTPNG     R+  G LTP        +  LLD+QISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S   E 
Subjt:  GMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTES

Query:  ES-----PKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV-GVKEASPGN
         S     P    TS + E+++S                             Q  R+ + GS KEF FD T  E+     I +EWWANEKV G  + SP N
Subjt:  ES-----PKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV-GVKEASPGN

Query:  NWTFFPLLQPG
        +WTFFP+L+ G
Subjt:  NWTFFPLLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | 3.8e-117 | 52.71
Query:  INNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP++  K RWG CWSLY CF  G+QK+NKRIG+AVLVPEP   G     V++   STT+VLPFIAPPSSPASFLQS+P
Subjt:  INNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP

Query:  TSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
        +S + SP G LS T+   N +SP  P S+F +GPY  +TQ V+PPVFSAF TEPSTAP TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  TSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Subjt:  KFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTS
        GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K               + S
Subjt:  GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTS

Query:  NQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        +   N     E E     DI+ +        E++Q   Q   + ++GS KEF FD TK E              EKV       GN+W+FFP L+ GVS
Subjt:  NQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS


Sequences
CDS sequence
ATGGCAAGTATCAACAACAGCGTGGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTCTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTG
CTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCT
GCTGGATTACTATCTTTTACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAAT
TGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGT
GCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTT
GGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTC
CTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGC
CATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAA
ACAAAGAATCGTCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGA
GCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGT
GAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA
mRNA sequence
ATGGCAAGTATCAACAACAGCGTGGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTCTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTG
CTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCT
GCTGGATTACTATCTTTTACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAAT
TGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGT
GCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTT
GGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTC
CTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGC
CATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAA
ACAAAGAATCGTCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGA
GCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGT
GAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA
Protein sequence
MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSP
AGLLSFTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLG
HGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQR
AVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
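
The protein sequence above should be the conceptual translation of the CDS. A minimal sketch for checking that is given below; it assumes the standard nuclear genetic code, and `translate` is a hypothetical helper rather than anything provided by the database. Unrecognized codons are reported as `X`, and translation stops at the first stop codon.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1 until the first stop codon."""
    # Remove line breaks and other whitespace from a wrapped sequence.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt of the CDS listed above.
print(translate("ATGGCAAGTATCAACAACAGCGTGGATACG"))  # MASINNSVDT
```

Passing the full CDS (line breaks included) and comparing the result with the concatenated protein lines gives a quick consistency check on the record.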