; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002686 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002686
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationscaffold6:1391937..1397787
RNA-Seq ExpressionSpg002686
SyntenySpg002686
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]1.8e-26493.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]3.8e-26293.19Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQK+NKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNG+RQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKESSREAETCEFFDIKTS  PEKT GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]1.5e-26393.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia]9.6e-25892.42Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKNNKRI +AVLVPEPVVPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESE-SPKQTST
        GSRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ALQDG LLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM S RTESE S +QTS+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESE-SPKQTST

Query:  NCQNENKESSREAETCEFFDIKTSTTPEKT-SGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGV
          Q+ENK SSREAETCEFFDIKTST PEK+ +GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEA+PG NNWTFFPMLQPGV
Subjt:  NCQNENKESSREAETCEFFDIKTSTTPEKT-SGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGV

Query:  S
        S
Subjt:  S

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]6.9e-26494.39Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGI SQKNNKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD G+GM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQDG LLD+QISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTS RT SESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         Q ENKESSREAETCE FDIKTST PEKTS +DDQCYQNQR ITLGSFKEFNFDQTKGE++NTASIG+EWWANEKV VKEA+PG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840987.4e-26493.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 17.4e-26493.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 18.8e-26593.99Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP VPGAVAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

A0A6J1CJS5 uncharacterized protein LOC1110116544.7e-25892.42Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKNNKRI +AVLVPEPVVPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GMGM
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESE-SPKQTST
        GSRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ALQDG LLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM S RTESE S +QTS+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESE-SPKQTST

Query:  NCQNENKESSREAETCEFFDIKTSTTPEKT-SGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGV
          Q+ENK SSREAETCEFFDIKTST PEK+ +GEDDQCYQNQR +TLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEA+PG NNWTFFPMLQPGV
Subjt:  NCQNENKESSREAETCEFFDIKTSTTPEKT-SGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGV

Query:  S
        S
Subjt:  S

A0A6J1E856 uncharacterized protein LOC1114307676.1e-25891.78Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFG GSQKNNKRI +AVLVPEP V GAVAP VEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSN QSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPD GM M
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGT+TPDGLGHALQDGLLLD+QISEVASLANSE+GCQNDV NHRVSFELTGEDVARCLANKS           KQTSTN
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QN+NKESS+EAE+CEFFDIKTST PEKTS EDDQCYQNQR + LGSFKEFNFDQTKGE+H+TASIG+EWWANEKV VKEASPG NNWTFFPMLQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766601.0e-3649.3Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)6.6e-5553.23Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPV-VPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CF  GS +  KRI N+VLVPEPV +  + + T      S    LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPV-VPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQS

Query:  EPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG
        EPPS TQSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS +TTEPS+AP TPP +   +    TTPSSPEVPFA+L  S  +H   S+G
Subjt:  EPPSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG

Query:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown7.3e-3849.3Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein1.8e-12155.8Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPG-AVAPTVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP +  K+R GS WSLYWCF  GS+KNNKRI +AVLVPEP   G AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPG-AVAPTVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEPP--SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S PP  S+T  P  L SLT         N P S F IGPYA+ETQ V+PPVFSAFTTEPSTAPFTPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSEPP--SNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSNRT
        D     GS+L SG VTPNG     R+  G LTP        +G LLD+QISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S   
Subjt:  DGGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSNRT

Query:  ESESPKQTSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGE-DDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNW
        E  S +    NC             C            KTSGE + +  Q  R  + GS KEF FD T  E+     I SEWWANEKV  K      N+W
Subjt:  ESESPKQTSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGE-DDQCYQNQRVITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNW

Query:  TFFPMLQPG
        TFFP+L+ G
Subjt:  TFFPMLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein3.8e-11954.08Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP +  K RWG CWSLY CF  G+QKNNKRI NAVLVPEPV  G    TV++   STT+VLPFIAPPSSPASFLQS+P
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQSEP

Query:  PSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
         S + SP G LSLT+   N +SP  P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  PSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMG
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                   +G
Subjt:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMG

Query:  MGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTST
         GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K        + S  + + 
Subjt:  MGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTST

Query:  NCQNENKESSREAETCEFFDIKTSTTPEKTSGE-DDQCYQNQRV--ITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQP
        N + E +ESS         DI+ +   EK SG+ +++ ++ Q++   ++GS KEF FD TK E              EKV         N+W+FFP L+ 
Subjt:  NCQNENKESSREAETCEFFDIKTSTTPEKTSGE-DDQCYQNQRV--ITLGSFKEFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQP

Query:  GVS
        GVS
Subjt:  GVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTGCAACCGAAGCCGGATCGCTAAAGAAGAAGAGTTGGATCGCCAAAAGCCCAATCCCCATCCTCGCCCTTGGCCTCAACCCGATGCCGAGGCCGACCCTGGACA
CAAGCCCGGGGTTTTGCCAAACCCCCTGATGCCCAGTTCTCTGAGGTCTAGGTCAAATTGGCAAGCATCACACCAGTATGCAGTTGTTTACTGGTTTTGCAAGTCACATC
TTCCCCTCCAAACAAATTCACTGTTTTTGTCACGTGAAGGTCAGGTCTGGCGGCGGAGATTTCGGGATTGGAAGAAAGTGGAGATGGGAAGCATGAACAACAGCGTGGAT
ACGGTTAATGCTGCAGCTACTGCGATCGTCTCCGCCGAGGCTCGAGTCCAGCCTCCGACGCCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGG
AATTGGTTCGCAGAAAAACAATAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCCGGAGCTGTCGCCCCTACTGTTGAACACCGAACACCTTCAACTA
CAATGGTATTGCCTTTCATTGCCCCTCCGTCTTCTCCAGCATCTTTCCTCCAGTCCGAACCTCCATCAAACACTCAATCTCCAGCTGGATTACTCTCTTTAACTGCCCTT
TCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTCTCACCACCAGTTTTTTCTGCCTTCACCACTGA
ACCATCAACTGCTCCTTTTACGCCTCCTCCTGAATCTGTGCAACTGACCACACCTTCATCTCCTGAAGTGCCATTTGCTAAATTGCTGACATCTTCTCTGAGCCATACTA
ATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCGTATCCAGGAAGCCCTGGTGCCCATCTTATATCACCTGGATCAGTA
ATTTCAAACTCTGGCACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTCGAGTTCCGGATGGCAGATGCTCCGAAGCTCTTGGGTCTTGAACATTTTACGACTCGCAA
ATGGATCTCAAGAATGGGTTCTGGATCTTTGACACCAGACGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACCCCTGATGGTGGTATGGGCATGGGTTCGA
GATTGGGATCTGGATCTGTGACCCCAAATGGCGTGAGGCAAGATTCAAGATTGGGATCTGGAACCTTGACGCCTGATGGTTTGGGGCATGCCTTGCAAGATGGTCTACTG
TTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACGAATCATAGGGTGTCATTTGAGTTAACTGGGGAAGATGTTGC
ACGTTGTCTTGCAAATAAGTCAATGACTTCCAATAGAACCGAATCAGAGTCTCCAAAGCAAACAAGCACGAACTGTCAAAACGAGAACAAAGAATCATCAAGAGAAGCTG
AAACTTGTGAGTTCTTTGACATCAAGACTTCCACAACCCCTGAGAAAACTTCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGTCATAACTCTCGGTTCGTTCAAA
GAGTTCAACTTCGACCAAACGAAAGGAGAACTACACAATACAGCCTCCATCGGTTCAGAGTGGTGGGCCAATGAAAAGGTGACTGTGAAGGAAGCTAGTCCAGGCAACAA
CAACTGGACTTTCTTCCCAATGTTGCAACCTGGGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTGCAACCGAAGCCGGATCGCTAAAGAAGAAGAGTTGGATCGCCAAAAGCCCAATCCCCATCCTCGCCCTTGGCCTCAACCCGATGCCGAGGCCGACCCTGGACA
CAAGCCCGGGGTTTTGCCAAACCCCCTGATGCCCAGTTCTCTGAGGTCTAGGTCAAATTGGCAAGCATCACACCAGTATGCAGTTGTTTACTGGTTTTGCAAGTCACATC
TTCCCCTCCAAACAAATTCACTGTTTTTGTCACGTGAAGGTCAGGTCTGGCGGCGGAGATTTCGGGATTGGAAGAAAGTGGAGATGGGAAGCATGAACAACAGCGTGGAT
ACGGTTAATGCTGCAGCTACTGCGATCGTCTCCGCCGAGGCTCGAGTCCAGCCTCCGACGCCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGG
AATTGGTTCGCAGAAAAACAATAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCCGGAGCTGTCGCCCCTACTGTTGAACACCGAACACCTTCAACTA
CAATGGTATTGCCTTTCATTGCCCCTCCGTCTTCTCCAGCATCTTTCCTCCAGTCCGAACCTCCATCAAACACTCAATCTCCAGCTGGATTACTCTCTTTAACTGCCCTT
TCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTCTCACCACCAGTTTTTTCTGCCTTCACCACTGA
ACCATCAACTGCTCCTTTTACGCCTCCTCCTGAATCTGTGCAACTGACCACACCTTCATCTCCTGAAGTGCCATTTGCTAAATTGCTGACATCTTCTCTGAGCCATACTA
ATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCGTATCCAGGAAGCCCTGGTGCCCATCTTATATCACCTGGATCAGTA
ATTTCAAACTCTGGCACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTCGAGTTCCGGATGGCAGATGCTCCGAAGCTCTTGGGTCTTGAACATTTTACGACTCGCAA
ATGGATCTCAAGAATGGGTTCTGGATCTTTGACACCAGACGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACCCCTGATGGTGGTATGGGCATGGGTTCGA
GATTGGGATCTGGATCTGTGACCCCAAATGGCGTGAGGCAAGATTCAAGATTGGGATCTGGAACCTTGACGCCTGATGGTTTGGGGCATGCCTTGCAAGATGGTCTACTG
TTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACGAATCATAGGGTGTCATTTGAGTTAACTGGGGAAGATGTTGC
ACGTTGTCTTGCAAATAAGTCAATGACTTCCAATAGAACCGAATCAGAGTCTCCAAAGCAAACAAGCACGAACTGTCAAAACGAGAACAAAGAATCATCAAGAGAAGCTG
AAACTTGTGAGTTCTTTGACATCAAGACTTCCACAACCCCTGAGAAAACTTCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGTCATAACTCTCGGTTCGTTCAAA
GAGTTCAACTTCGACCAAACGAAAGGAGAACTACACAATACAGCCTCCATCGGTTCAGAGTGGTGGGCCAATGAAAAGGTGACTGTGAAGGAAGCTAGTCCAGGCAACAA
CAACTGGACTTTCTTCCCAATGTTGCAACCTGGGGTCAGCTGA
Protein sequenceShow/hide protein sequence
MKCNRSRIAKEEELDRQKPNPHPRPWPQPDAEADPGHKPGVLPNPLMPSSLRSRSNWQASHQYAVVYWFCKSHLPLQTNSLFLSREGQVWRRRFRDWKKVEMGSMNNSVD
TVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVPGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTAL
SVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSV
ISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLL
LDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRVITLGSFK
EFNFDQTKGELHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS