; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020033 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020033
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr5:47535927..47538790
RNA-Seq ExpressionLag0020033
SyntenyLag0020033
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014759.1 hypothetical protein SDJN02_22388, partial [Cucurbita argyrosperma subsp. argyrosperma]6.8e-25792.18Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGSM NNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFG GSQKNNKRI +AVLVPEP VSGAVAP VEHRTPSTT+VLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QSEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM M
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGT+TPDGLGHALQDGLLLD+QISEVASLANSE+GCQNDV NHRVSFELTGEDVARCLANKS           KQTSTN
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QN+NKESS+EAE+CEFFDIKTST PEKTS EDDQCYQNQRAV LGSFKEFNFDQTKGEIH+TASIG+EWWANEKV VKEASPG NNWTFFPMLQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]3.1e-26293.99Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGS+ NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QSEP SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

XP_004140832.3 uncharacterized protein LOC101210841 [Cucumis sativus]2.9e-26093.72Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQSEPP
        NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQK+NKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFLQSEP 
Subjt:  NNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQSEPP

Query:  SNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFAL
        SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPY Y+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKF L
Subjt:  SNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFAL

Query:  SHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLG
        SHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGL SRLGSGTLTPDGMGMGSRLG
Subjt:  SHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLG

Query:  SGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTNCQNEN
        SGSVTPNG+RQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+ QNEN
Subjt:  SGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTNCQNEN

Query:  KESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
        KESSREAETCEFFDIKTS  PEKT GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  KESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

XP_008439268.1 PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]2.7e-26193.59Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGS+ NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QS P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]2.0e-26193.99Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGSM NNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGI SQKNNKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QSEPPSNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDG+GM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQDG LLD+QISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTS RT SESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         Q ENKESSREAETCE FDIKTST PEKTS +DDQCYQNQRA+TLGSFKEFNFDQTKGEI+NTASIG+EWWANEKV VKEA+PG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840981.3e-26193.59Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGS+ NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QS P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 11.3e-26193.59Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGS+ NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QS P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 11.5e-26293.99Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGS+ NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFGIGSQKNNKRI +AVLVPEP V GAVAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QSEP SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFTTEPSTAP TPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGH LQD  LLDNQISEVASLANSE+GCQNDVTNHRVSFELTGEDVARCLANKS+TS RTESESPKQTST+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QNENKE SREAETCEFFDIKTS  PEKT GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIG+EWWANEKV VKEASPG NNWTFFP+LQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

A0A6J1CJS5 uncharacterized protein LOC1110116547.3e-25792.61Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGSM NNSVDTVNAAATAIVSAEARVQPPTP KRRWG CWSLYWCFGIGSQKNNKRI +AVLVPEPVV G VAP VEHRTPSTTMVLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QS+P SN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESE-SPKQTST
        GSRLGSGS+TPNG+RQDSRL SGTLTPDGLG+ALQDG LLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSM S RTESE S +QTS+
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESE-SPKQTST

Query:  NCQNENKESSREAETCEFFDIKTSTTPEKT-SGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGV
          Q+ENK SSREAETCEFFDIKTST PEK+ +GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIG+EWWANEKV VKEA+PG NNWTFFPMLQPGV
Subjt:  NCQNENKESSREAETCEFFDIKTSTTPEKT-SGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGV

Query:  S
        S
Subjt:  S

A0A6J1E856 uncharacterized protein LOC1114307677.3e-25791.98Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        MGSM NNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFG GSQKNNKRI +AVLVPEP VSGAVAP VEHRTPSTT+VLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
        QSEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTN

Query:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM
        QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFT RKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGM M
Subjt:  QKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGM

Query:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN
        GSRLGSGSVTPNGVRQDSRLGSGT+TPDGLGHALQDGLLLD+QISEVASLANSE+GCQNDV NHRVSFELTGEDVARCLANKS           KQTSTN
Subjt:  GSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTN

Query:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS
         QN+NKESS+EAE+CEFFDIKTST PEKTS EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIG+EWWANEKV VKEASPG NNWTFFPMLQPGVS
Subjt:  CQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766605.6e-3649.07Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A S    S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS

Query:  SNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHLI
        S    S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L 
Subjt:  SNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHLI

Query:  SPGSVISNSGTSSP
        SP S  S  G  SP
Subjt:  SPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)1.4e-5553.18Show/hide
Query:  GSMNNNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPV-VSGAVAPTVEHRTPSTTMVLPFIAPPSSPAS
        G+  NN  DT+NAAA+AI S++ R+   +P   KR+W + WSL  CF  GS +  KRI N+VLVPEPV +S + + T      S    LPFIAPPSSPAS
Subjt:  GSMNNNSVDTVNAAATAIVSAEARVQPPTP--PKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPV-VSGAVAPTVEHRTPSTTMVLPFIAPPSSPAS

Query:  FLQSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTN
        F QSEPPS TQSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS +TTEPS+AP TPP +   +    TTPSSPEVPFA+L  S  +H  
Subjt:  FLQSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQL----TTPSSPEVPFAKLLTSSLSHTN

Query:  KSFGTNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL
         S+G   KF +S   +FQ YQ  PGSP   LISP      SG +SPFPD    L   F+++D PKLL
Subjt:  KSFGTNQKFALSHC-DFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPIL--EFRMADAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown3.9e-3749.07Show/hide
Query:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS
        ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + L  +APPSSPASF  S  PS TQSP   LSL A S    S
Subjt:  KRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRT------PSTTMVLPFIAPPSSPASFLQSEPPSNTQSPAGLLSLTALSVNNYS

Query:  SNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHLI
        S    S++A GPYA+ETQLVSPPVFS FTTEPSTAPFTPPPE  +LT PSSP+VP+A+ LTSS+   N   G        + D Q  Y  YPGSP + L 
Subjt:  SNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPGAHLI

Query:  SPGSVISNSGTSSP
        SP S  S  G  SP
Subjt:  SPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein1.2e-12356.3Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSG-AVAPTVEHRTPSTTMVLPFIAPPSSPASF
        M S+NN+SVDTVNAAA+AIVSAE+R QP +  K+R GS WSLYWCF  GS+KNNKRI +AVLVPEP  SG AVAP     + ST++ +PFIAPPSSPASF
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSG-AVAPTVEHRTPSTTMVLPFIAPPSSPASF

Query:  LQSEPP--SNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK
        L S PP  S+T  P  L SLT         N P S F IGPYA+ETQ V+PPVFSAFTTEPSTAPFTPPPES     PSSPEVPFA+LLTSSL  +  N 
Subjt:  LQSEPP--SNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSL--SHTNK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSPG +LISPG     SGTSSP+P K  I+EFR+ + PK LG EHFTARKW SR GSGS+TP   G GSRLGSG LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTP

Query:  DGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSNRTE
        D    GS+L SG VTPNG     R+  G LTP        +G LLD+QISEVASLANS+ G    ND   V  HRVSFELTGEDVARCLA+K   S   E
Subjt:  DGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGC--QND---VTNHRVSFELTGEDVARCLANKSMTSNRTE

Query:  SESPKQTSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWT
          S +    NC             C            KTSGE + +  Q  R+ + GS KEF FD T  E+     I SEWWANEKV  K      N+WT
Subjt:  SESPKQTSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWT

Query:  FFPMLQPG
        FFP+L+ G
Subjt:  FFPMLQPG

AT5G52430.1 hydroxyproline-rich glycoprotein family protein5.4e-11954.15Show/hide
Query:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL
        M ++ NNSV+TVNAAATAIV+AE+RVQP +  K RWG CWSLY CF  G+QKNNKRI NAVLVPEPV SG    TV++   STT+VLPFIAPPSSPASFL
Subjt:  MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFL

Query:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SF
        QS+P S + SP G LSLT+   N +S   P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV +TTPSSPEVPFA+LLTSSL  T +  + 
Subjt:  QSEPPSNTQSPAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPE-SVQLTTPSSPEVPFAKLLTSSLSHTNK--SF

Query:  GTNQKFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPD
        G NQKF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+P K P++EFR+ + PK LG EHFTARKW SR GSGS+TP                 
Subjt:  GTNQKFALSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPD

Query:  GMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQ
         +G GS L SG++TPNG      + SG LTP+     LQ      NQISEVASLANS+ G +  V +HRVSFELTGEDVARCLA+K        + S  +
Subjt:  GMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGLGHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQ

Query:  TSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGE---DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPM
         + N + E +ESS         DI+ +   EK SG+   +    Q   + ++GS KEF FD TK E              EKV         N+W+FFP 
Subjt:  TSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGE---DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPM

Query:  LQPGVS
        L+ GVS
Subjt:  LQPGVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCATGAACAACAACAGCGTGGATACGGTTAATGCTGCCGCTACTGCGATCGTCTCCGCCGAGGCTCGAGTCCAGCCTCCGACGCCTCCGAAACGAAGATGGGG
TAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAATAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTATCCGGAGCTGTTGCAC
CTACTGTTGAACACCGAACACCTTCAACTACAATGGTATTGCCTTTCATTGCCCCTCCGTCTTCTCCAGCATCTTTCCTCCAGTCCGAACCTCCATCAAACACTCAATCT
CCAGCTGGATTACTCTCTTTAACTGCCCTTTCAGTCAATAACTACTCCTCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTCTC
ACCACCAGTTTTTTCTGCCTTCACCACTGAACCTTCAACTGCTCCTTTTACGCCTCCTCCTGAATCTGTGCAACTGACCACACCTTCATCTCCTGAAGTGCCATTTGCTA
AATTGCTGACATCTTCTCTGAGCCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCGTATCCAGGAAGCCCC
GGTGCCCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGCACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTCGAGTTCCGCATGGCAGATGCTCCGAAGCT
CCTGGGTCTCGAACATTTTACGGCTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACACCAGACGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGA
CCCCTGATGGTATGGGCATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCGTGAGGCAAGATTCAAGATTGGGATCTGGAACCTTGACGCCTGATGGTTTG
GGTCATGCCTTGCAAGATGGTCTACTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACGAATCATAGGGTGTC
ATTTGAGTTAACTGGGGAAGATGTTGCACGTTGTCTTGCAAATAAGTCAATGACTTCCAATAGAACCGAATCAGAGTCTCCAAAGCAAACAAGCACGAACTGTCAAAACG
AGAACAAAGAATCATCAAGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACTTCCACAACCCCTGAAAAAACTTCAGGAGAGGATGATCAATGCTACCAAAATCAG
CGAGCCGTAACTCTCGGTTCGTTCAAAGAGTTCAACTTCGACCAAACGAAAGGAGAAATACACAATACAGCCTCCATTGGTTCAGAGTGGTGGGCCAATGAAAAGGTGAC
TGTGAAGGAAGCTAGTCCAGGCAACAACAACTGGACTTTCTTCCCAATGTTGCAACCTGGGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCATGAACAACAACAGCGTGGATACGGTTAATGCTGCCGCTACTGCGATCGTCTCCGCCGAGGCTCGAGTCCAGCCTCCGACGCCTCCGAAACGAAGATGGGG
TAGCTGCTGGAGTCTGTACTGGTGTTTTGGAATTGGTTCGCAGAAAAACAATAAGCGTATTAGTAATGCTGTACTTGTTCCGGAACCTGTGGTATCCGGAGCTGTTGCAC
CTACTGTTGAACACCGAACACCTTCAACTACAATGGTATTGCCTTTCATTGCCCCTCCGTCTTCTCCAGCATCTTTCCTCCAGTCCGAACCTCCATCAAACACTCAATCT
CCAGCTGGATTACTCTCTTTAACTGCCCTTTCAGTCAATAACTACTCCTCAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTCTC
ACCACCAGTTTTTTCTGCCTTCACCACTGAACCTTCAACTGCTCCTTTTACGCCTCCTCCTGAATCTGTGCAACTGACCACACCTTCATCTCCTGAAGTGCCATTTGCTA
AATTGCTGACATCTTCTCTGAGCCATACTAATAAAAGTTTTGGGACTAACCAGAAGTTTGCACTATCCCATTGTGATTTCCAGCCTTATCAACCGTATCCAGGAAGCCCC
GGTGCCCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGCACATCTTCTCCTTTCCCTGATAAGCACCCCATTCTCGAGTTCCGCATGGCAGATGCTCCGAAGCT
CCTGGGTCTCGAACATTTTACGGCTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACACCAGACGGTACTGGTTTAGGTTCTAGGTTAGGTTCAGGAACTTTGA
CCCCTGATGGTATGGGCATGGGTTCGAGATTGGGATCTGGATCTGTGACCCCAAATGGCGTGAGGCAAGATTCAAGATTGGGATCTGGAACCTTGACGCCTGATGGTTTG
GGTCATGCCTTGCAAGATGGTCTACTGTTGGACAACCAAATATCTGAGGTGGCTTCCCTTGCCAACTCAGAAAGTGGATGTCAAAATGATGTGACGAATCATAGGGTGTC
ATTTGAGTTAACTGGGGAAGATGTTGCACGTTGTCTTGCAAATAAGTCAATGACTTCCAATAGAACCGAATCAGAGTCTCCAAAGCAAACAAGCACGAACTGTCAAAACG
AGAACAAAGAATCATCAAGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACTTCCACAACCCCTGAAAAAACTTCAGGAGAGGATGATCAATGCTACCAAAATCAG
CGAGCCGTAACTCTCGGTTCGTTCAAAGAGTTCAACTTCGACCAAACGAAAGGAGAAATACACAATACAGCCTCCATTGGTTCAGAGTGGTGGGCCAATGAAAAGGTGAC
TGTGAAGGAAGCTAGTCCAGGCAACAACAACTGGACTTTCTTCCCAATGTTGCAACCTGGGGTCAGCTGA
Protein sequenceShow/hide protein sequence
MGSMNNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGIGSQKNNKRISNAVLVPEPVVSGAVAPTVEHRTPSTTMVLPFIAPPSSPASFLQSEPPSNTQS
PAGLLSLTALSVNNYSSNGPASIFAIGPYAYETQLVSPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSP
GAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTARKWISRMGSGSLTPDGTGLGSRLGSGTLTPDGMGMGSRLGSGSVTPNGVRQDSRLGSGTLTPDGL
GHALQDGLLLDNQISEVASLANSESGCQNDVTNHRVSFELTGEDVARCLANKSMTSNRTESESPKQTSTNCQNENKESSREAETCEFFDIKTSTTPEKTSGEDDQCYQNQ
RAVTLGSFKEFNFDQTKGEIHNTASIGSEWWANEKVTVKEASPGNNNWTFFPMLQPGVS