; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004319 (gene) of Snake gourd v1 genome

Gene IDTan0004319
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationLG01:1962694..1966417
RNA-Seq ExpressionTan0004319
SyntenyTan0004319
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]2.6e-26494.18Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP S+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNGVRQDS LGSGTLTPDGLGH L+D  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        QNENKE SREAETCEFFDIK TS APEKT GE+DQCYQNQRAVTL SFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]5.3e-26293.37Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQK++KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP S+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNG+RQDS LGSGTLTPDGLGH L+D  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        QNENKESSREAETCEFFDIK TS APEKT GE+DQCYQNQRAVTL SFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]2.2e-26393.78Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P S+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNGVRQDS LGSGTLTPDGLGH L+D  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        QNENKE SREAETCEFFDIK TS APEKT GE+DQCYQNQRAVTL SFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_022141198.1 uncharacterized protein LOC111011654 [Momordica charantia]2.3e-25792.6Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKN+KRI +AVLVPEPVVPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P S+ QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESE-SPKQTSTN
        SRLGSGS+TPNG+RQDS L SGTLTPDGLG+AL+DG LLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM SIRTESE S +QTS+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESE-SPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTTSTAPEKT-SGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
         Q+ENK SSREAETCEFFDIK TSTAPEK+ +GE+DQCYQNQRAVTL SFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEA+PGNNWTFFPMLQ GVS
Subjt:  CQNENKESSREAETCEFFDIKTTSTAPEKT-SGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]8.2e-26394.18Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGI SQKN+KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPS+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNGVRQDS LGSGTLTPDGLGH L+DG LLD+QISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTS RT SESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q ENKESSREAETCE FDIK TSTAPEKTS ++DQCYQNQRA+TL SFKEFNFDQTKGEI+NTASIGAEWWANEKVAVKEA+PGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840981.1e-26393.78Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P S+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNGVRQDS LGSGTLTPDGLGH L+D  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        QNENKE SREAETCEFFDIK TS APEKT GE+DQCYQNQRAVTL SFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 11.1e-26393.78Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S P S+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNGVRQDS LGSGTLTPDGLGH L+D  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        QNENKE SREAETCEFFDIK TS APEKT GE+DQCYQNQRAVTL SFKEFNFDQTKGE+HNTASIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 11.2e-26494.18Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKN+KRI +AVLVPEP VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEP S+TQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNGVRQDS LGSGTLTPDGLGH L+D  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TSIRTESESPKQTST+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        QNENKE SREAETCEFFDIK TS APEKT GE+DQCYQNQRAVTL SFKEFNFDQTKGEIHNTASIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A6J1CJS5 uncharacterized protein LOC1110116541.1e-25792.6Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKN+KRI +AVLVPEPVVPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        S+P S+ QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESE-SPKQTSTN
        SRLGSGS+TPNG+RQDS L SGTLTPDGLG+AL+DG LLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM SIRTESE S +QTS+ 
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESE-SPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTTSTAPEKT-SGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
         Q+ENK SSREAETCEFFDIK TSTAPEK+ +GE+DQCYQNQRAVTL SFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEA+PGNNWTFFPMLQ GVS
Subjt:  CQNENKESSREAETCEFFDIKTTSTAPEKT-SGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A6J1HTJ2 uncharacterized protein LOC1114676949.5e-25792.42Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ
        MGSMNNSVDTVNAAATAI+SAEARVQPPTP KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVA AVEHR PSTTMVLPFIAPPSSPASFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQ

Query:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ
        SEPPSH+QSPAGLLSLTALSVNNYSPNGPASIFAIGPY YETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTN+SFGTN 
Subjt:  SEPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
        KFALSHCDFQPYQPYPGSPGA+LISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG
Subjt:  KFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMG

Query:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC
        SRLGSGSVTPNG+RQDS LGSGTLTPDGLGHAL+DGLLLDNQISEVASLANS SGC NDVTNHRVSFELTGEDVARCLA KSMTS RTESES    S  C
Subjt:  SRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNC

Query:  QNENKE-SSREAETCEFFDIKTT-STAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEA-SPGNNWTFFPMLQSGV
        QNENKE SSREAET EFFDIKTT STAPEK +GE+++CYQNQRAVTL SFKEFNFD+TKGE+ NTAS+GAEWWANEKV VKEA SPGNNWTFFP+LQSGV
Subjt:  QNENKE-SSREAETCEFFDIKTT-STAPEKTSGENDQCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEA-SPGNNWTFFPMLQSGV

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.5e-3649.3Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSHTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSHTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)4.6e-5452.85Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPV-VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS
        NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CF  GS +  KRI N+VLVPEPV +  + +        S    LPFIAPPSSPASF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPV-VPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS

Query:  EPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG
        EPPS TQSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS +TTEPS+AP TPP +   +    TTPSSPEVPFA+L  S  +H   S+G
Subjt:  EPPSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTNKSFG

Query:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
           KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  TNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.8e-3749.3Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSHTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSHTQSPAGLLSLTALSVNNYS

Query:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL
        P GP +S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L
Subjt:  PNGP-ASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein5.7e-12155.99Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN SVDTVNAAA+AIVSAE+R QP +  K+R GS WSLYWCF  GS+KN+KRI +AVLVPEP   G AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEPP--SHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S PP  SHT  P  L SLT         N P S F IGPYA+ETQ V+PPVFSAFTTEPSTAPFTPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSEPP--SHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGMGMGSRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSIRTE
        D    GS+L SG VTPNG      +  G LTP        +G LLD+QISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S   E
Subjt:  DGMGMGSRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSIRTE

Query:  SESPKQTSTNCQNENKESSREAETCEFFDIKTTSTAPEKTSGEND-QCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK-EASPGNNW
          S +    NC             C             KTSGE + +  Q  R+ +  S KEF FD T  E+     I +EWWANEKVA K + SP N+W
Subjt:  SESPKQTSTNCQNENKESSREAETCEFFDIKTTSTAPEKTSGEND-QCYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK-EASPGNNW

Query:  TFFPMLQSG
        TFFP+L+SG
Subjt:  TFFPMLQSG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein9.2e-11954.98Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP +  K RWG CWSLY CF  G+QKN+KRI NAVLVPEPV  G     V++   STT+VLPFIAPPSSPASFLQS+P
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP

Query:  PSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ
         S + SP G LSLT+   N +SP  P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + G NQ
Subjt:  PSHTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SFGTNQ

Query:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        KF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP   G GS L SG LTP+    
Subjt:  KFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTN
        G  + SG++TPN           T  P            L NQISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K        + S  + + N
Subjt:  GSRLGSGSVTPNGVRQDSILGSGTLTPDGLGHALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTTSTAPEKTSG--ENDQ-CYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSG
         + E +ESS         DI+      EK SG  EN+Q   Q   + ++ S KEF FD TK E              EKVA      GN+W+FFP L+SG
Subjt:  CQNENKESSREAETCEFFDIKTTSTAPEKTSG--ENDQ-CYQNQRAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSG

Query:  VS
        VS
Subjt:  VS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCATGAACAACAGCGTGGATACGGTTAATGCTGCCGCTACTGCGATCGTCTCCGCTGAGGCTCGAGTTCAGCCTCCGACGCCTCCGAAACGAAGATGGGGTAG
CTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCTCAGAAAAACAGTAAGCGTATCAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCTGGAGCTGTTGCCCCTG
CTGTTGAACACCGAACACCTTCAACGACCATGGTATTACCTTTCATTGCCCCTCCGTCTTCTCCTGCATCTTTCCTCCAGTCTGAACCTCCATCACATACTCAATCTCCA
GCTGGATTACTCTCTTTAACTGCCCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTCTCACC
TCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACTCCCTCATCTCCTGAAGTACCATTTGCTAAAT
TGCTGACATCTTCTCTGAGCCATACTAATAAAAGTTTTGGAACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCGTATCCAGGAAGCCCTGGC
GCCCATCTTATATCACCTGGATCAGTAATCTCAAACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCTCCGAAGCTCTT
GGGTCTTGAACATTTTACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCTTTGACGCCAGACGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACCC
CCGATGGTATGGGCATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCGTTAGGCAAGATTCAATATTGGGTTCTGGAACGTTGACACCTGATGGTTTGGGG
CATGCCTTGCGAGATGGTCTACTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACAAATCATAGGGTGTCATT
TGAGTTAACTGGCGAAGATGTTGCACGCTGTCTTGCAAATAAGTCAATGACATCCATTAGAACTGAATCGGAGTCTCCAAAGCAAACAAGCACAAACTGTCAAAACGAAA
ACAAAGAATCATCCAGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACGACTTCCACAGCACCTGAAAAAACTTCAGGAGAGAACGATCAATGCTACCAAAATCAG
CGAGCCGTAACTCTCAGTTCGTTCAAAGAGTTCAACTTCGACCAAACGAAAGGAGAAATACACAATACAGCCTCCATTGGTGCAGAATGGTGGGCCAATGAAAAGGTGGC
TGTGAAGGAAGCTAGTCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAATCTGGGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
TCCAGATTTATATATTTTATTTTATTTTATTTTTAGCTTATTATTATTATTTATTATATAAAAAAAATCGGAAATATATAAAATGAGAAAATTCAATCTCTCTCTTCCTT
GGTCTGAGTTTTTCATTTCTTAGATCAAATCCAAAAGTCGTCTCTGTTTCTTTTTTTTGGCTGTCAAATCTCACTGGAGAGTAATTGATTTTGCCGGCGAGTTAGCGGGG
GTGGATTGGTCGGAAAACACGGCGAGCTTGTTGAAAAGTCCGGCGGCGGAGCTCTCTGGATTGGAAGAAAGTGGAGATGGGAAGCATGAACAACAGCGTGGATACGGTTA
ATGCTGCCGCTACTGCGATCGTCTCCGCTGAGGCTCGAGTTCAGCCTCCGACGCCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGT
TCTCAGAAAAACAGTAAGCGTATCAGTAATGCTGTACTTGTTCCGGAACCTGTGGTACCTGGAGCTGTTGCCCCTGCTGTTGAACACCGAACACCTTCAACGACCATGGT
ATTACCTTTCATTGCCCCTCCGTCTTCTCCTGCATCTTTCCTCCAGTCTGAACCTCCATCACATACTCAATCTCCAGCTGGATTACTCTCTTTAACTGCCCTTTCAGTCA
ATAACTACTCCCCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCA
ACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAACTGACCACTCCCTCATCTCCTGAAGTACCATTTGCTAAATTGCTGACATCTTCTCTGAGCCATACTAATAAAAG
TTTTGGAACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCGTATCCAGGAAGCCCTGGCGCCCATCTTATATCACCTGGATCAGTAATCTCAA
ACTCTGGTACATCTTCTCCTTTCCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCTCCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATC
TCGAGAATGGGTTCTGGATCTTTGACGCCAGACGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGACCCCCGATGGTATGGGCATGGGTTCGAGATTGGGATC
TGGATCTGTGACCCCAAATGGCGTTAGGCAAGATTCAATATTGGGTTCTGGAACGTTGACACCTGATGGTTTGGGGCATGCCTTGCGAGATGGTCTACTGTTGGACAACC
AAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGCGAAGATGTTGCACGCTGTCTT
GCAAATAAGTCAATGACATCCATTAGAACTGAATCGGAGTCTCCAAAGCAAACAAGCACAAACTGTCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGTGA
GTTCTTTGACATCAAGACGACTTCCACAGCACCTGAAAAAACTTCAGGAGAGAACGATCAATGCTACCAAAATCAGCGAGCCGTAACTCTCAGTTCGTTCAAAGAGTTCA
ACTTCGACCAAACGAAAGGAGAAATACACAATACAGCCTCCATTGGTGCAGAATGGTGGGCCAATGAAAAGGTGGCTGTGAAGGAAGCTAGTCCAGGCAACAACTGGACT
TTCTTCCCAATGTTGCAATCTGGGGTCAGCTGACTTTGACATGGATGTCAACAGTAAAAAGGAAAAAAAAAAAAGAAAAAAGAAAAAAGAAAAAAGAAGAAGTAGAAGAC
ACAACAAAACCAACCTTTTGAATGTACATTTGATTGTAATCTTCCTTTGGAGATGCAACTTTTAGGACCTGTGATCTCAGATAACAGATGGAATGATTTGGAGGAAAAGA
ATGTTTTTCAAAGTTGTCTTTTGAAAACACTGTTCCAATAGACAGCAGAAAGAAAGGTAGTTATTAGGGATAACTAGTGTGGGTACTGAAGGAGCATTCTTTATCATAGT
GCAAGAAGTAGTAGATCATTCATAATCATAGGATCTCTTTGAAGTTGTTTATTCTTTCTTGTCTTGTATAATAATAAGAAATTTCATTCTTCCCCAACAATGAAACTTCT
TCCCTTCTTGAAAACTCTCTGGTGATTTTAGTTTATTCCTCTGCAAGTAAATGTAAAAACTTTATTCTTCTTCTAGAGCAAAAACAGCTTCATATGTCCAAGGATGAAGA
TGATTCCAAAATGGATATTTTGGTTTAATATGTATCTTAATGTTTTAGAGTTCCAAGTCAATCTTCTCTGCCCATCTGCC
Protein sequenceShow/hide protein sequence
MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNSKRISNAVLVPEPVVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPPSHTQSP
AGLLSLTALSVNNYSPNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPG
AHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLGSGSVTPNGVRQDSILGSGTLTPDGLG
HALRDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSIRTESESPKQTSTNCQNENKESSREAETCEFFDIKTTSTAPEKTSGENDQCYQNQ
RAVTLSSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS