CuGenDBv2

CSPI03G17110 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G17110
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: Chr3:12838461..12841963
RNA-Seq Expression: CSPI03G17110
Synteny: CSPI03G17110
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like


Homology
GenBank top hits (e value, % identity, alignment)
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]; e value 6.3e-279; % identity 98.59
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]; e value 1.9e-283; % identity 100
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]; e value 5.3e-278; % identity 98.19
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia]; e value 3.0e-257; % identity 91.58
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQK+NKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESE-SPKQTSTS
        SRLGSGS+TPNGMRQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESE-SPKQTSTS

Query:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
         Q+ENK SSREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]; e value 2.0e-264; % identity 94.16
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGI SQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDG+GMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQD PLLD+QISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RT SESPKQTSTS 
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        Q ENKESSREAETCE FDIKTS APEKT  +DDQCYQNQRA+TLGSFKEFNFDQTKGEI+NTASIGAEWWANEKV VKEA+PGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AYC5 uncharacterized protein LOC103484098; e value 2.6e-278; % identity 98.19
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 1; e value 2.6e-278; % identity 98.19
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S PTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 1; e value 3.0e-279; % identity 98.59
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QNENKE SREAETCEFFDIKTS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A6J1CJS5 uncharacterized protein LOC111011654; e value 1.5e-257; % identity 91.58
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQK+NKRIGHAVLVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P+SN QSPAGLLSLTALSVNNYS NGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESE-SPKQTSTS
        SRLGSGS+TPNGMRQDSRL SGTLTPDGLG+ LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+ SIRTESE S +QTS+ 
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESE-SPKQTSTS

Query:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
         Q+ENK SSREAETCEFFDIKTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEA+PGNNWTFFP+LQPGVS
Subjt:  NQNENKESSREAETCEFFDIKTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

A0A6J1E856 uncharacterized protein LOC111430767; e value 6.8e-255; % identity 90.74
Query:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQK+NKRIGHAVLVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SN QSPAGLLSLTALSVNNYSPNGPASIFAIGPY YDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGM MG
Subjt:  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN
        SRLGSGSVTPNG+RQDSRLGSGT+TPDGLGH LQD  LLD+QISEVASLANSETGCQNDV NHRVSFELTGEDVARCLANKS           KQTST++
Subjt:  SRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSN

Query:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        QN+NKESS+EAE+CEFFDIKTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKV VKEASPGNNWTFFP+LQPGVS
Subjt:  QNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660; e value 4.0e-34; % identity 47.44
Query:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S   S TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPY ++TQLVSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); e value 2.5e-52; % identity 51.5
Query:  ASINNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASF
        A+ NN  DT+NAAA+AI S++ R+  ++P   KR+W + WSL  CF  GS +  KRIG++VLVPEP ++  + +        S    LPFIAPPSSPASF
Subjt:  ASINNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNK
         QSEP S TQSP G+LS + L  NN       SIFAIGPY ++TQLVSPPVFS +TTEPS+APITPP +   +    TTPSSPEVPFA+L  S  +H   
Subjt:  LQSEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNK

Query:  SFGTNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
        S+G   KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  SFGTNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown; e value 2.8e-35; % identity 47.44
Query:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S   S TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPY ++TQLVSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein; e value 1.0e-117; % identity 54.49
Query:  MASINN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP++  K+R GS WSLYWCF  GS+K+NKRIGHAVLVPEPA  G AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MASINN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEP--TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S P   S+T  P  L SLT         N P S F IGPY ++TQ V+PPVFSAFTTEPSTAP TPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSEP--TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP G G  SRLGSG LTP
Subjt:  SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTP

Query:  DGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTE
        D    GS+L SG VTPNG     R+  G LTP        +  LLD+QISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S   E
Subjt:  DGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTE

Query:  SES-----PKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV-GVKEASPG
          S     P    TS + E+++S                             Q  R+ + GS KEF FD T  E+     I +EWWANEKV G  + SP 
Subjt:  SES-----PKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV-GVKEASPG

Query:  NNWTFFPLLQPG
        N+WTFFP+L+ G
Subjt:  NNWTFFPLLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein; e value 1.0e-117; % identity 52.91
Query:  INNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP++  K RWG CWSLY CF  G+QK+NKRIG+AVLVPEP   G     V++   STT+VLPFIAPPSSPASFLQS+P
Subjt:  INNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP

Query:  TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
        +S + SP G LSLT+   N +SP  P S+F +GPY  +TQ V+PPVFSAF TEPSTAP TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  TSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Subjt:  KFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTS
        GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K               + S
Subjt:  GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTS

Query:  NQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
        +   N     E E     DI+ +        E++Q   Q   + ++GS KEF FD TK E              EKV       GN+W+FFP L+ GVS
Subjt:  NQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS


Sequences
CDS sequence
ATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTG
CTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCT
GCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAAT
TGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGT
GCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTT
GGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTC
CTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGC
CATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAA
ACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGA
GCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGT
GAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA
mRNA sequence
TCTTCATTTTTATTACATTTCCTAATTTATAATTTCATTTTGTTTATTTATTTGGAAATATATAAAATGAGAAAATTCAATCTCTCTCTTTCTTCGTCTGAGTTTTTCAT
TTCTTAGATCAAATCCAAAAGTCTTCTCTCTTCCTTCTTTTGCCTGTCAAATCTCACTCCATACTAATTGATTTTCCCGGCCACTTACCGCCCTCGATTGATCCCAAATC
ACGGCGACCTTGTTAAGAACTCCGCCGGCCGAGCTCTCTGGATTCGAAGAAACTGGGGATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCA
TCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAA
CGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCC
TCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATG
GACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCT
CCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAA
GTTCACACTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTC
CTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGA
TCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAA
TGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTT
CTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACA
TCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGAC
TTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAG
AAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCT
GGCGTCAGCTGACTTTGACAAAGGATATCAACACTAAAAAGAACAAAACAAAGATGAAGAAGAAACAACAGCCAGCCCTTTTGAATGTACATTTGAATGTAATCCTCCTT
TGGAGGTGATGCAATGATTTGGAGGAAAAAGAATGTTTTTTCAAAGTTTGTTTTGTGAAAAACAGTATTCCCAAATAGACATCAGAAAAGAAAGTAGTTATTAGGGATAA
CTTGTGTGGGTACCGAAGGAGCATTCTTTGTCATAGCTCAAAAGTAGATCATTCATAATCATAGGATCTTTTGGAAGTGCTTTATTCTTTCTTGTCTTTGTATAATAATA
AGAAATTTCATTCTTCCCCAACAATCAAAACATCATTTCTTGAAAACTTAGTCTTGAATTTAGTTTTTTCATGTCAA
Protein sequence
MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSP
AGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLG
HGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQR
AVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS