; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg15122 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg15122
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationCarg_Chr16:617042..619821
RNA-Seq ExpressionCarg15122
SyntenyCarg15122
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576714.1 hypothetical protein SDJN03_24288, partial [Cucurbita argyrosperma subsp. sororia]2.0e-274100Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEP
        MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEP
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEP

Query:  PSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFA
        PSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFA
Subjt:  PSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFA

Query:  LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMGSRL
        LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMGSRL
Subjt:  LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMGSRL

Query:  GSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEAESC
        GSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEAESC
Subjt:  GSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEAESC

Query:  EFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        EFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  EFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

KAG7014759.1 hypothetical protein SDJN02_22388, partial [Cucurbita argyrosperma subsp. argyrosperma]4.9e-276100Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
        SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA

Query:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

XP_022922938.1 uncharacterized protein LOC111430767 [Cucurbita moschata]1.1e-27599.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
        SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA

Query:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGE+HSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

XP_022984784.1 uncharacterized protein LOC111482967 [Cucurbita maxima]4.1e-27599.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
        SRL SGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA

Query:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

XP_023552418.1 uncharacterized protein LOC111810082 [Cucurbita pepo subsp. pepo]4.5e-27499.38Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNAQSPAGLLSLTALSVNNYSPN PASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
        SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTS NSQNKNKESSKEA
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA

Query:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        ESCEFFDIKTSTAP+KTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840984.6e-25691.35Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQKNNKRIGHAVLVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P SN QSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKS-----------KQTSTNS
        SRLGSGSVTPNGVRQDSRLGSGT+TPDGLGH LQD  LLD+QISEVASLANSETGCQNDV NHRVSFELTGEDVARCLANKS           KQTST++
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKS-----------KQTSTNS

Query:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        QN+NKE S+EAE+CEFFDIKTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKV VKEASPGNNWTFFP+LQPGVS
Subjt:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 14.6e-25691.35Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQKNNKRIGHAVLVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P SN QSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKS-----------KQTSTNS
        SRLGSGSVTPNGVRQDSRLGSGT+TPDGLGH LQD  LLD+QISEVASLANSETGCQNDV NHRVSFELTGEDVARCLANKS           KQTST++
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKS-----------KQTSTNS

Query:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        QN+NKE S+EAE+CEFFDIKTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKV VKEASPGNNWTFFP+LQPGVS
Subjt:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 15.5e-25791.75Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQKNNKRIGHAVLVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP SN QSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM MG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKS-----------KQTSTNS
        SRLGSGSVTPNGVRQDSRLGSGT+TPDGLGH LQD  LLD+QISEVASLANSETGCQNDV NHRVSFELTGEDVARCLANKS           KQTST++
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKS-----------KQTSTNS

Query:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        QN+NKE S+EAE+CEFFDIKTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGEIH+TASIGAEWWANEKV VKEASPGNNWTFFP+LQPGVS
Subjt:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

A0A6J1E856 uncharacterized protein LOC1114307675.2e-27699.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
        SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA

Query:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGE+HSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

A0A6J1JBI8 uncharacterized protein LOC1114829672.0e-27599.79Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQ

Query:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMG

Query:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
        SRL SGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA
Subjt:  SRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEA

Query:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
Subjt:  ESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766603.5e-3547.91Show/hide
Query:  KRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRT------PSTTVVLPFIAPPSSPASFLQSEPPSNAQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS  QSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRT------PSTTVVLPFIAPPSSPASFLQSEPPSNAQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA++TQLVSPPVFS F TEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)6.5e-5351.71Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEP-AVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CFG+  Q+  KRIG++VLVPEP ++S + +        S    LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEP-AVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQS

Query:  EPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG
        EPPS  QSP G+LS + L  NN       SIFAIGPYA++TQLVSPPVFS + TEPS+AP TPP +   +    TTPSSPEVPFA+L  S  +H   S+G
Subjt:  EPPSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG

Query:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown2.5e-3647.91Show/hide
Query:  KRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRT------PSTTVVLPFIAPPSSPASFLQSEPPSNAQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS  QSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRT------PSTTVVLPFIAPPSSPASFLQSEPPSNAQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA++TQLVSPPVFS F TEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein3.0e-12257.46Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSG-AVAPAVEHRTPSTTVVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP +  K+R GS WSLYWCF  GS+KNNKRIGHAVLVPEPA SG AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSG-AVAPAVEHRTPSTTVVLPFIAPPSSPASF

Query:  LQSEPPSNAQSP-AGLL-SLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S PPS + +P  GLL SLT         N P S F IGPYA++TQ V+PPVFSAF TEPSTAPFTPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSEPPSNAQSP-AGLL-SLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGMAMGSRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGC--QND---VANHRVSFELTGEDVARCLANKSKQTSTNS
        D    GS+L SG VTPNG     R+  G +TP        +G LLDSQISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K        
Subjt:  DGMAMGSRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGC--QND---VANHRVSFELTGEDVARCLANKSKQTSTNS

Query:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVK-EASPGNNWTFFPMLQPG
           N+  S E  S E           +T +E     Q  R+ + GS KEF FD T  E+     I +EWWANEKVA K + SP N+WTFFP+L+ G
Subjt:  QNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVK-EASPGNNWTFFPMLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein5.6e-12155.12Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP +  K RWG CWSLY CF  G+QKNNKRIG+AVLVPEP  SG     V++   STTVVLPFIAPPSSPASFLQS+P
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEP

Query:  PSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
         S + SP G LSLT+   N +SP  P S+F +GPYA +TQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  PSNAQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP   G GS L SG LTP+    
Subjt:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKE
        G  + SG++TPN           T  P            L +QISEVASLANS+ G +  VA+HRVSFELTGEDVARCLA+K  ++     N ++  ++E
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTVTPDGLGHALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKE

Query:  AESCEFFDIKTSTAPEKTSAEDDQ-CYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS
        + S    DI+ +        E++Q   Q   + ++GS KEF FD TK E              EKVA      GN+W+FFP L+ GVS
Subjt:  AESCEFFDIKTSTAPEKTSAEDDQ-CYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCATGAACAACAGCGTGGATACGGTTAATGCTGCTGCTACTGCGATCGTCTCCGCTGAGGCTCGAGTCCAGCCTCCGACACCTCCGAAACGAAGATGGGGGAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAAATGGTTCGCAGAAAAACAATAAGCGTATAGGTCATGCTGTGCTTGTTCCTGAACCTGCAGTATCTGGAGCTGTTGCCCCTG
CTGTTGAGCATCGTACACCTTCAACCACCGTGGTATTGCCTTTCATAGCCCCTCCGTCTTCTCCTGCGTCTTTCCTCCAGTCCGAACCTCCATCAAATGCTCAATCTCCT
GCTGGATTACTATCTTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCGTCCATTTTTGCAATAGGCCCTTACGCATATGATACCCAGTTGGTCTCACC
TCCAGTCTTTTCTGCCTTCCCCACTGAACCATCGACTGCCCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTGACCACACCCTCATCTCCTGAAGTACCATTTGCTAAAT
TGCTGACATCTTCTCTGAGCCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTTTCACATTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGT
GCCCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTTCGCATGGCAGATGCTCCAAAGCTCTT
GGGTCTCGAACATTTTACGACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACGCCAGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACCC
CTGATGGTATGGCTATGGGTTCGAGATTGGGATCTGGATCTGTGACGCCAAATGGTGTGAGGCAAGATTCAAGATTGGGTTCTGGAACCGTTACTCCTGATGGTTTGGGG
CATGCCTTGCAAGATGGTCTACTGTTGGACAGCCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAACTGGATGTCAAAATGATGTGGCAAATCATAGGGTGTCCTT
TGAGTTGACTGGTGAAGATGTTGCGCGTTGTCTTGCAAATAAGTCAAAGCAAACAAGCACAAACTCTCAAAACAAAAACAAAGAATCATCGAAAGAAGCTGAAAGTTGTG
AGTTCTTTGACATCAAGACTTCCACAGCACCGGAAAAAACTTCAGCAGAGGATGATCAATGCTACCAAAATCAACGAGCCGTAAATCTTGGTTCGTTCAAAGAGTTCAAC
TTTGACCAAACCAAAGGAGAAATACACAGCACAGCCTCCATTGGTGCAGAATGGTGGGCCAATGAAAAGGTGGCTGTGAAGGAAGCTAGTCCAGGCAACAACTGGACTTT
CTTCCCAATGTTGCAACCTGGGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCATGAACAACAGCGTGGATACGGTTAATGCTGCTGCTACTGCGATCGTCTCCGCTGAGGCTCGAGTCCAGCCTCCGACACCTCCGAAACGAAGATGGGGGAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAAATGGTTCGCAGAAAAACAATAAGCGTATAGGTCATGCTGTGCTTGTTCCTGAACCTGCAGTATCTGGAGCTGTTGCCCCTG
CTGTTGAGCATCGTACACCTTCAACCACCGTGGTATTGCCTTTCATAGCCCCTCCGTCTTCTCCTGCGTCTTTCCTCCAGTCCGAACCTCCATCAAATGCTCAATCTCCT
GCTGGATTACTATCTTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCGTCCATTTTTGCAATAGGCCCTTACGCATATGATACCCAGTTGGTCTCACC
TCCAGTCTTTTCTGCCTTCCCCACTGAACCATCGACTGCCCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTGACCACACCCTCATCTCCTGAAGTACCATTTGCTAAAT
TGCTGACATCTTCTCTGAGCCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTTTCACATTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGT
GCCCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTTCGCATGGCAGATGCTCCAAAGCTCTT
GGGTCTCGAACATTTTACGACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACGCCAGATGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACCC
CTGATGGTATGGCTATGGGTTCGAGATTGGGATCTGGATCTGTGACGCCAAATGGTGTGAGGCAAGATTCAAGATTGGGTTCTGGAACCGTTACTCCTGATGGTTTGGGG
CATGCCTTGCAAGATGGTCTACTGTTGGACAGCCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAACTGGATGTCAAAATGATGTGGCAAATCATAGGGTGTCCTT
TGAGTTGACTGGTGAAGATGTTGCGCGTTGTCTTGCAAATAAGTCAAAGCAAACAAGCACAAACTCTCAAAACAAAAACAAAGAATCATCGAAAGAAGCTGAAAGTTGTG
AGTTCTTTGACATCAAGACTTCCACAGCACCGGAAAAAACTTCAGCAGAGGATGATCAATGCTACCAAAATCAACGAGCCGTAAATCTTGGTTCGTTCAAAGAGTTCAAC
TTTGACCAAACCAAAGGAGAAATACACAGCACAGCCTCCATTGGTGCAGAATGGTGGGCCAATGAAAAGGTGGCTGTGAAGGAAGCTAGTCCAGGCAACAACTGGACTTT
CTTCCCAATGTTGCAACCTGGGGTCAGCTGA
Protein sequenceShow/hide protein sequence
MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAVLVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEPPSNAQSP
AGLLSLTALSVNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMAMGSRLGSGSVTPNGVRQDSRLGSGTVTPDGLG
HALQDGLLLDSQISEVASLANSETGCQNDVANHRVSFELTGEDVARCLANKSKQTSTNSQNKNKESSKEAESCEFFDIKTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFN
FDQTKGEIHSTASIGAEWWANEKVAVKEASPGNNWTFFPMLQPGVS