CuGenDBv2

Sed0027620 (gene) of Chayote v1 genome

Gene ID: Sed0027620
Organism: Sechium edule (Chayote v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: LG07:7355075..7358463
RNA-Seq Expression: Sed0027620
Synteny: Sed0027620
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
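
The genome location field packs a sequence name and two coordinates into one string. A minimal sketch of splitting it and computing the span follows; the function names are hypothetical, and the 1-based, inclusive coordinate convention is an assumption that this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG07:7355075..7358463' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


name, start, end = parse_location("LG07:7355075..7358463")
print(name, start, end, span_length(start, end))
```

Under that assumption the locus spans 3389 bp on LG07, which covers the whole gene region rather than the spliced transcript alone.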


Homology
GenBank top hits (e value, % identity, alignment)
KAG7014759.1 hypothetical protein SDJN02_22388, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.6e-244; % identity: 87.62)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQPPTPPKRRWGSCWSLYWCFG GSQKN+KRI  A LVPEP V GAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEPPS+ QSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGT+TPDGLGHALQDGLLLD   SQISEVASLANSE+ CQ +V NHRVSFELTGE VARCLANKS           KQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        TN QN+NKESS+EAE+CE FDI+ TSTAPEKTS E DQCYQNQRAV+LGSFKEFNFDQTKGEIH+TASIG+EWW  EKVAV EASPGNNWTFFP+LQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa] (e value: 5.2e-249; % identity: 89.22)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEP S+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQD  LLDN   QISEVASLANSE+ CQ +VTNHRVSFELTGE VARCLANKS++S RT SESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ QNENKE SREAETCE FDI+ TS APEKT GE DQCYQNQRAV+LGSFKEFNFDQTKGEIHNTASIG+EWW  EKV V EASPGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus] (e value: 6.4e-247; % identity: 88.62)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAI SAEARVQP TPPKRRWGSCWSLYWCFGIGSQK++KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEP S+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQD  LLDN   QISEVASLANSE+ CQ +VTNHRVSFELTGE VARCLANKS++S RT SESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ QNENKESSREAETCE FDI+ TS APEKT GE DQCYQNQRAV+LGSFKEFNFDQTKGEIHNTASIG+EWW  EKV V EASPGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo] (e value: 4.4e-248; % identity: 88.82)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        S P S+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQD  LLDN   QISEVASLANSE+ CQ +VTNHRVSFELTGE VARCLANKS++S RT SESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ QNENKE SREAETCE FDI+ TS APEKT GE DQCYQNQRAV+LGSFKEFNFDQTKGE+HNTASIG+EWW  EKV V EASPGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida] (e value: 1.2e-250; % identity: 90.02)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQPPTPPKRRWGSCWSLYWCFGI SQKN+KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEPPS+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQDG LLD+   QISEVASLANSES CQ +VTNHRVSFELTGE VARCLANKSM+S RTGSESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ Q ENKESSREAETCELFDI+ TSTAPEKTS + DQCYQNQRA++LGSFKEFNFDQTKGEI+NTASIG+EWW  EKVAV EA+PGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AYC5 uncharacterized protein LOC103484098 (e value: 2.1e-248; % identity: 88.82)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        S P S+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQD  LLDN   QISEVASLANSE+ CQ +VTNHRVSFELTGE VARCLANKS++S RT SESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ QNENKE SREAETCE FDI+ TS APEKT GE DQCYQNQRAV+LGSFKEFNFDQTKGE+HNTASIG+EWW  EKV V EASPGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 1 (e value: 2.1e-248; % identity: 88.82)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        S P S+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQD  LLDN   QISEVASLANSE+ CQ +VTNHRVSFELTGE VARCLANKS++S RT SESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ QNENKE SREAETCE FDI+ TS APEKT GE DQCYQNQRAV+LGSFKEFNFDQTKGE+HNTASIG+EWW  EKV V EASPGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 1 (e value: 2.5e-249; % identity: 89.22)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI  A LVPEP VPGAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEP S+TQSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGH LQD  LLDN   QISEVASLANSE+ CQ +VTNHRVSFELTGE VARCLANKS++S RT SESPKQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        T+ QNENKE SREAETCE FDI+ TS APEKT GE DQCYQNQRAV+LGSFKEFNFDQTKGEIHNTASIG+EWW  EKV V EASPGNNWTFFPLLQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

A0A6J1E856 uncharacterized protein LOC111430767 (e value: 7.1e-244; % identity: 87.43)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQPPTPPKRRWGSCWSLYWCFG GSQKN+KRI  A LVPEP V GAVAPAVEHRTPSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEPPS+ QSPAGLLSL ALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTS+LSHTNKS GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPG +LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGT+TPDGLGHALQDGLLLD   SQISEVASLANSE+ CQ +V NHRVSFELTGE VARCLANKS           KQ S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV
        TN QN+NKESS+EAE+CE FDI+ TSTAPEKTS E DQCYQNQRAV+LGSFKEFNFDQTKGE+H+TASIG+EWW  EKVAV EASPGNNWTFFP+LQ GV
Subjt:  TNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGV

Query:  S
        S
Subjt:  S

A0A6J1HTJ2 uncharacterized protein LOC111467694 (e value: 1.2e-243; % identity: 88.29)
Query:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAI SAEARVQPPTP KRRWGSCWSLYWCFGIGSQKNSKRIS+A LVPEP VPGAVA AVEHR PSTT +LPFIAPPSSPASFLQ
Subjt:  MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP
        SEPPSH+QSPAGLLSL ALSVNNYSPNGPASIFAIGPY YETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTS+LSHTN+S GTN 
Subjt:  SEPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNP

Query:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPG YLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS
        SRLGSGSVTPNG RQDSRLGSGTLTPDGLGHALQDGLLLDN   QISEVASLANS S C  +VTNHRVSFELTGE VARCLA KSM+STRT SES    S
Subjt:  SRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQAS

Query:  TNCQNENKE-SSREAETCELFDIETT-STAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEA-SPGNNWTFFPLLQ
          CQNENKE SSREAET E FDI+TT STAPEK +GE ++CYQNQRAV+LGSFKEFNFD+TKGE+ NTAS+G+EWW  EKV V EA SPGNNWTFFP+LQ
Subjt:  TNCQNENKE-SSREAETCELFDIETT-STAPEKTSGEGDQCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEA-SPGNNWTFFPLLQ

Query:  SGVS
        SGVS
Subjt:  SGVS

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 (e value: 1.7e-37; % identity: 49.77)
Query:  KRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRT------PSTTTILPFIAPPSSPASFLQSEPPSHTQSPAGLLSLAALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A+ +PE G   A  P   H+        +    L  +APPSSPASF  S  PS TQSP   LSLAA      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRT------PSTTTILPFIAPPSSPASFLQSEPPSHTQSPAGLLSLAALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNPKFALSHCDFQ-PYQPYPGSPGGYL
        P GP +S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTS++   N   G        + D Q  Y  YPGSP   L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNPKFALSHCDFQ-PYQPYPGSPGGYL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) (e value: 6.7e-53; % identity: 52.47)
Query:  NNSVDTVNAAATAIFSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEP-GVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CF  GS +  KRI ++ LVPEP  +  + +        S  T LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIFSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEP-GVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQS

Query:  EPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSALSHTNKSLG
        EPPS TQSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS +TTEPS+AP TPP +   +    TTPSSPEVPFA+L  S  +H   S G
Subjt:  EPPSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSALSHTNKSLG

Query:  TNPKFALSHC-DFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP G LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNPKFALSHC-DFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown (e value: 1.2e-38; % identity: 49.77)
Query:  KRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRT------PSTTTILPFIAPPSSPASFLQSEPPSHTQSPAGLLSLAALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A+ +PE G   A  P   H+        +    L  +APPSSPASF  S  PS TQSP   LSLAA      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRT------PSTTTILPFIAPPSSPASFLQSEPPSHTQSPAGLLSLAALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNPKFALSHCDFQ-PYQPYPGSPGGYL
        P GP +S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTS++   N   G        + D Q  Y  YPGSP   L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNPKFALSHCDFQ-PYQPYPGSPGGYL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein (e value: 1.1e-114; % identity: 54.49)
Query:  MGSLNN-SVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPG-AVAPAVEHRTPSTTTILPFIAPPSSPASF
        M S+NN SVDTVNAAA+AI SAE+R QP +  K+R GS WSLYWCF  GS+KN+KRI  A LVPEP   G AVAP     + ST+  +PFIAPPSSPASF
Subjt:  MGSLNN-SVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPG-AVAPAVEHRTPSTTTILPFIAPPSSPASF

Query:  LQSEPP--SHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSAL--SHTNK
        L S PP  SHT  P  L SL          N P S F IGPYA+ETQ V+PPVFSAFTTEPSTAPFTPPPES     PSSPEVPFA+LLTS+L  +  N 
Subjt:  LQSEPP--SHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSAL--SHTNK

Query:  SLGTNPKFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G N KF+ +H +F+  Q YPGSPGG LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SLGTNPKFALSHCDFQPYQPYPGSPGGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGMGMGSRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTN-----VTNHRVSFELTGEGVARCLANKSMSST
        D    GS+L SG VTPNGA    R+  G LTP        +G LLD   SQISEVASLANS+     +     V  HRVSFELTGE VARCLA+K   S 
Subjt:  DGMGMGSRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTN-----VTNHRVSFELTGEGVARCLANKSMSST

Query:  RTGSESPKQASTNCQNENKESSREAETCELFDIETTSTAPEKTSGEGD-QCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVA-VDEASPG
             S +    NC             C             KTSGE + +  Q  R+ S GS KEF FD T  E+     I SEWW  EKVA   + SP 
Subjt:  RTGSESPKQASTNCQNENKESSREAETCELFDIETTSTAPEKTSGEGD-QCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVA-VDEASPG

Query:  NNWTFFPLLQSG
        N+WTFFP+L+SG
Subjt:  NNWTFFPLLQSG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein (e value: 2.6e-113; % identity: 52.28)
Query:  LNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAI +AE+RVQP +  K RWG CWSLY CF  G+QKN+KRI +A LVPEP   G     V++   STT +LPFIAPPSSPASFLQS+P
Subjt:  LNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQSEP

Query:  PSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSALSHTNK--SLGTNP
         S + SP G LSL +   N +SP  P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTS+L  T +  + G N 
Subjt:  PSHTQSPAGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSALSHTNK--SLGTNP

Query:  KFALSHCDFQPYQPYPGSP-GGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        KF+ SH +F+  Q  PGSP GG LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP                  +G 
Subjt:  KFALSHCDFQPYQPYPGSP-GGYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQA
        GS L SG++TPNG      + SG LTP+     LQ         +QISEVASLANS+   +  V +HRVSFELTGE VARCLA+K        + S  + 
Subjt:  GSRLGSGSVTPNGARQDSRLGSGTLTPDGLGHALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQA

Query:  STNCQNENKESSREAETCELFDIETTSTAPEKTSGEGD---QCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLL
        + N + E +ESS         DI       EK SG+ +      Q   + S+GS KEF FD TK E              EKVA      GN+W+FFP L
Subjt:  STNCQNENKESSREAETCELFDIETTSTAPEKTSGEGD---QCYQNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLL

Query:  QSGVS
        +SGVS
Subjt:  QSGVS
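
In the alignment blocks above, the unlabeled middle line follows the usual BLAST-style convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those classes for one aligned segment. The function name is hypothetical, and the input is only the first 28 columns of the KAG7014759.1 alignment, so its ratio is not expected to match the % identity reported for the full-length hit.

```python
def midline_stats(query: str, midline: str) -> dict[str, int]:
    """Count identical and positive columns from a BLAST-style midline."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = 0
    positive = 0
    for symbol in midline:
        if symbol == "+":
            positive += 1          # conservative substitution
        elif symbol != " ":
            identical += 1         # a letter on the midline means identity
            positive += 1          # identities also count as positives
    return {"length": len(midline), "identical": identical, "positive": positive}


# First 28 columns of the first GenBank alignment in this record.
query_segment = "MGSLNNSVDTVNAAATAIFSAEARVQPP"
midline_segment = "MGS+NNSVDTVNAAATAI SAEARVQPP"

stats = midline_stats(query_segment, midline_segment)
percent_identity = 100.0 * stats["identical"] / stats["length"]
print(stats, round(percent_identity, 2))
```

To approximate a reported % identity, the same counts would have to be accumulated over every block of a hit, including any gap columns.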


Sequences
CDS sequence
ATGGGAAGCTTGAACAACAGCGTGGATACGGTTAATGCTGCTGCGACGGCGATCTTCTCCGCCGAGGCTCGAGTTCAGCCTCCGACGCCTCCGAAACGAAGGTGGGGTAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAGTAAGCGTATTAGTAGTGCTGCACTTGTTCCGGAACCTGGGGTACCAGGAGCTGTTGCTCCTG
CTGTTGAACATCGAACACCTTCAACCACAACGATATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTTCAGTCCGAACCTCCATCGCATACTCAATCTCCT
GCTGGATTACTCTCTTTAGCTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATACGAGACCCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACACCCTCATCTCCTGAAGTACCATTTGCAAAAT
TGCTCACATCTGCTCTGAGCCATACTAATAAAAGTCTTGGAACTAACCCAAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCCTATCCAGGAAGCCCCGGT
GGCTATCTAATATCACCTGGCTCAGTAATTTCTAACTCCGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTTCGCATGGCAGATGCTCCAAAGCTCTT
AGGTCTTGAGCATTTCACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCTTTGACGCCGGATGGCACTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACGC
CGGATGGCATGGGCATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCGCGAGGCAAGATTCGAGATTGGGTTCTGGAACCTTGACACCTGATGGTTTGGGG
CATGCATTGCAAGATGGTCTACTGTTGGACAACCAAATATCTCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGTATGCCAAACCAATGTGACAAATCATAG
GGTGTCATTTGAGTTAACTGGCGAAGGTGTTGCACGTTGTCTTGCCAATAAGTCAATGTCATCCACTAGAACCGGATCAGAGTCTCCAAAGCAAGCAAGCACAAACTGTC
AAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGTGAGCTCTTCGACATCGAGACGACTTCCACAGCGCCTGAAAAAACCTCAGGAGAAGGCGACCAATGCTAC
CAAAATCAGCGAGCCGTAAGTCTCGGTTCGTTCAAAGAGTTCAACTTCGATCAAACGAAAGGAGAAATACACAATACAGCCTCTATTGGTTCTGAGTGGTGGACCAAGGA
AAAGGTAGCTGTGGATGAAGCTAGTCCAGGTAACAACTGGACTTTCTTTCCATTGTTGCAATCTGGGGTTAGCTAA
mRNA sequence
AGGTTATTCTATTTATTATATAAAAGAATTAGAGATATATAAAATTAGAAAATTCGATCTCTCTCTCTTTTCTGGTCTGTGTTCTTCATTTCTTGGATCAAATCCAAAAC
TCGATTCTGTTTCTTTCTTTTGGCTGTCGAATCTCACTGAAGAGTAATTGATTTTTCCGGCGAGTTGACGGCGGTGGATTTGTTGGAAAACACGGCGAGCTTGTTGAAAA
GTCCGGCGGCGGAGCTTTCTGGATTGGAAGAAAGTGGAGATGGGAAGCTTGAACAACAGCGTGGATACGGTTAATGCTGCTGCGACGGCGATCTTCTCCGCCGAGGCTCG
AGTTCAGCCTCCGACGCCTCCGAAACGAAGGTGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAGTAAGCGTATTAGTAGTGCTGCAC
TTGTTCCGGAACCTGGGGTACCAGGAGCTGTTGCTCCTGCTGTTGAACATCGAACACCTTCAACCACAACGATATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCT
TTCCTTCAGTCCGAACCTCCATCGCATACTCAATCTCCTGCTGGATTACTCTCTTTAGCTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGC
AATAGGCCCTTATGCATACGAGACCCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAAC
TGACCACACCCTCATCTCCTGAAGTACCATTTGCAAAATTGCTCACATCTGCTCTGAGCCATACTAATAAAAGTCTTGGAACTAACCCAAAGTTTGCACTATCCCATTGT
GATTTCCAGCCTTATCAACCCTATCCAGGAAGCCCCGGTGGCTATCTAATATCACCTGGCTCAGTAATTTCTAACTCCGGTACATCTTCTCCTTTTCCTGATAAACACCC
CATTCTTGAGTTTCGCATGGCAGATGCTCCAAAGCTCTTAGGTCTTGAGCATTTCACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCTTTGACGCCGGATGGCA
CTGGTTTAGGTTCTAGGTTAGGTTCGGGAACTTTGACGCCGGATGGCATGGGCATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCGCGAGGCAAGATTCG
AGATTGGGTTCTGGAACCTTGACACCTGATGGTTTGGGGCATGCATTGCAAGATGGTCTACTGTTGGACAACCAAATATCTCAAATATCTGAGGTGGCTTCCCTTGCCAA
CTCAGAAAGTGTATGCCAAACCAATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGCGAAGGTGTTGCACGTTGTCTTGCCAATAAGTCAATGTCATCCACTAGAA
CCGGATCAGAGTCTCCAAAGCAAGCAAGCACAAACTGTCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGTGAGCTCTTCGACATCGAGACGACTTCCACA
GCGCCTGAAAAAACCTCAGGAGAAGGCGACCAATGCTACCAAAATCAGCGAGCCGTAAGTCTCGGTTCGTTCAAAGAGTTCAACTTCGATCAAACGAAAGGAGAAATACA
CAATACAGCCTCTATTGGTTCTGAGTGGTGGACCAAGGAAAAGGTAGCTGTGGATGAAGCTAGTCCAGGTAACAACTGGACTTTCTTTCCATTGTTGCAATCTGGGGTTA
GCTAACTTTTACATGGATGCCAACAGTAAAAAGAAAACTAAAACAAAAAACACAAACAACAAACCTTTTGAATGTACATTTGAATGTAATATTCCTTTGGAGATGCAACA
TTTAGGACCTGTGATTTCAGATAACAGATGAAATGATTTGGAGGATAAGAATGTTTTTGAAAGAAAGGATAGTTTTATTAGGGATAACTAGTGTGGGTACTGAAGGGGGA
GCATTCTTTATCATAGTGCAAGAAGTAGATCATTCATAATCATAGGATATCTTTTAAGTGTTTTATTCTTTCTTTCTTGTCTTGTATAATAATAAGAAATTTTATTCTTC
CCAACAATGAAACTTCACTTCTTGAAAA
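
Because the mRNA sequence contains the CDS as a contiguous substring, the untranslated regions can be measured by locating one inside the other. This is a minimal sketch with a hypothetical helper name. The demonstration strings are toy inputs assembled from short fragments of this record (the bases flanking the start and stop codons, with the CDS interior abridged), not the full sequences.

```python
def locate_cds(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS found inside an mRNA."""
    mrna = "".join(mrna.split()).upper()   # tolerate wrapped, mixed-case input
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - (start + len(cds))


# Toy input: abridged CDS (start codon region + stop codon region only).
toy_cds = "ATGGGAAGC" + "GGGGTTAGCTAA"
toy_mrna = "GAAAGTGGAG" + toy_cds + "CTTTTACATGG"

print(locate_cds(toy_mrna, toy_cds))
```

Passing the complete mRNA and CDS strings from this record, with line breaks left in, would report the real UTR lengths.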
Protein sequence
MGSLNNSVDTVNAAATAIFSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISSAALVPEPGVPGAVAPAVEHRTPSTTTILPFIAPPSSPASFLQSEPPSHTQSP
AGLLSLAALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSALSHTNKSLGTNPKFALSHCDFQPYQPYPGSPG
GYLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLGSGSVTPNGARQDSRLGSGTLTPDGLG
HALQDGLLLDNQISQISEVASLANSESVCQTNVTNHRVSFELTGEGVARCLANKSMSSTRTGSESPKQASTNCQNENKESSREAETCELFDIETTSTAPEKTSGEGDQCY
QNQRAVSLGSFKEFNFDQTKGEIHNTASIGSEWWTKEKVAVDEASPGNNWTFFPLLQSGVS
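
The protein sequence is the conceptual translation of the CDS sequence. The sketch below shows one way to check that relationship, assuming the standard nuclear genetic code; the helper name is hypothetical. It translates the first ten codons and the last five codons of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()     # drop line breaks from wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":                      # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGGAAGCTTGAACAACAGCGTGGATACG"))  # first 10 codons of the CDS
print(translate("TCTGGGGTTAGCTAA"))                 # last 5 codons, including the stop
```

The two fragments translate to `MGSLNNSVDT` and `SGVS`, matching the start and end of the protein sequence above. Translating the full CDS the same way should reproduce the entire protein if the record is internally consistent.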