; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024007 (gene) of Chayote v1 genome

Gene IDSed0024007
OrganismSechium edule (Chayote v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationLG13:890912..894866
RNA-Seq ExpressionSed0024007
SyntenySed0024007
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014759.1 hypothetical protein SDJN02_22388, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-22783.73Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP   GAVAPAVEHRTPS T+  PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KFALSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGT+TP        DGLLLD+Q         S++GCQND  NHRVSFELTGEDVARCLANKS             TSTN 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGE+H+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa]2.4e-22783.73Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP   GAVAPAVEHRTPS T  +PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KF LSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGTLTPDGL        LLDNQ         S++GCQND TNHRVSFELTGEDVARCLANKS+TS R  +ESP  TST+ 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q EN+E S EAETCEFFDIKT S A EKT  EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_022922938.1 uncharacterized protein LOC111430767 [Cucurbita moschata]3.7e-22883.94Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP   GAVAPAVEHRTPS T+  PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KFALSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGT+TP        DGLLLD+Q         S++GCQND  NHRVSFELTGEDVARCLANKS             TSTN 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGEMH+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_022984784.1 uncharacterized protein LOC111482967 [Cucurbita maxima]9.1e-22783.53Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP   GAVAPAVEHRTPS T+  PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KFALSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+ SGSVTPNGVRQDS+LGSGT+TP        DGLLLD+Q         S++GCQND  NHRVSFELTGEDVARCLANKS             TSTN 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGE+H+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida]3.4e-22984.74Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFG   QKNNKRI +A LVPEP   GAVAPAVEHRTPS T  +PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEPPSNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KF LSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDGIGMG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGTLTPDGL        LLD+Q         S+SGCQND TNHRVSFELTGEDVARCLANKSMTS R  +ESP  TST+ 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q EN+ESS EAETCE FDIKT STA EKTS +DD+CYQNQRA+ LGSFKEFNF+QTKGE++NT SIGAEWWANEKVAVKEA+PGNNWTFFP+LQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

TrEMBL top hitse value%identityAlignment
A0A1S3AYC5 uncharacterized protein LOC1034840987.6e-22783.53Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP   GAVAPAVEHRTPS T  +PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KF LSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGTLTPDGL        LLDNQ         S++GCQND TNHRVSFELTGEDVARCLANKS+TS R  +ESP  TST+ 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q EN+E S EAETCEFFDIKT S A EKT  EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 17.6e-22783.53Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP   GAVAPAVEHRTPS T  +PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        S P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KF LSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGTLTPDGL        LLDNQ         S++GCQND TNHRVSFELTGEDVARCLANKS+TS R  +ESP  TST+ 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q EN+E S EAETCEFFDIKT S A EKT  EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 11.2e-22783.73Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
        MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP   GAVAPAVEHRTPS T  +PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEP SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KF LSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGTLTPDGL        LLDNQ         S++GCQND TNHRVSFELTGEDVARCLANKS+TS R  +ESP  TST+ 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q EN+E S EAETCEFFDIKT S A EKT  EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A6J1E856 uncharacterized protein LOC1114307671.8e-22883.94Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP   GAVAPAVEHRTPS T+  PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KFALSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+GSGSVTPNGVRQDS+LGSGT+TP        DGLLLD+Q         S++GCQND  NHRVSFELTGEDVARCLANKS             TSTN 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGEMH+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

A0A6J1JBI8 uncharacterized protein LOC1114829674.4e-22783.53Show/hide
Query:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
        MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP   GAVAPAVEHRTPS T+  PFIAPPSSP SFLQ
Subjt:  MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ

Query:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
        SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt:  SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ

Query:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
        KFALSHCDFQPYQPYPGSP  +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt:  KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG

Query:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
        SR+ SGSVTPNGVRQDS+LGSGT+TP        DGLLLD+Q         S++GCQND  NHRVSFELTGEDVARCLANKS             TSTN 
Subjt:  SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC

Query:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
        Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGE+H+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt:  QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766603.2e-3347.44Show/hide
Query:  KRRWGSFWSLYWCFGNGLQKNNKRISNAALVPE--PVGAVAPAVEHR--------TPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLLSLTALSVNNYS
        ++RWG    ++ CF +  QK  KRI  A+ +PE   V A  P   H+           I +  +APPSSP SF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSFWSLYWCFGNGLQKNNKRISNAALVPE--PVGAVAPAVEHR--------TPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLLSLTALSVNNYS

Query:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPSGYL
          GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE  + T PSSP+VPYA+ LTSS+   N   G        + D Q  Y  YPGSP+  L
Subjt:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPSGYL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)6.4e-5351.71Show/hide
Query:  NNSVDTVNAAATAIVSAEARVQPPSP--SKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV-----GAVAPAVEHRTPSITIPFIAPPSSPTSFLQS
        NN  DT+NAAA+AI S++ R+   SP   KR+W + WSL  CFG+  Q+  KRI N+ LVPEPV      +      +R+   T+PFIAPPSSP SF QS
Subjt:  NNSVDTVNAAATAIVSAEARVQPPSP--SKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV-----GAVAPAVEHRTPSITIPFIAPPSSPTSFLQS

Query:  EPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPES----VQFTTPSSPEVPYAKLLTSSLSHTNKSFG
        EPPS TQSP G+LS + L  NN       SIFAIGPYA+ETQLVSPPVFS + TEPS+AP TPP +     +  TTPSSPEVP+A+L  S  +H   S+G
Subjt:  EPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPES----VQFTTPSSPEVPYAKLLTSSLSHTNKSFG

Query:  TNQKFALSHC-DFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPIL--EFRMAEAPKLL
           KF +S   +FQ YQ  PGSP G LISP      SG +SPFPD    L   F++++ PKLL
Subjt:  TNQKFALSHC-DFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPIL--EFRMAEAPKLL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown2.3e-3447.44Show/hide
Query:  KRRWGSFWSLYWCFGNGLQKNNKRISNAALVPE--PVGAVAPAVEHR--------TPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLLSLTALSVNNYS
        ++RWG    ++ CF +  QK  KRI  A+ +PE   V A  P   H+           I +  +APPSSP SF  S  PS TQSP   LSL A      S
Subjt:  KRRWGSFWSLYWCFGNGLQKNNKRISNAALVPE--PVGAVAPAVEHR--------TPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLLSLTALSVNNYS

Query:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPSGYL
          GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE  + T PSSP+VPYA+ LTSS+   N   G        + D Q  Y  YPGSP+  L
Subjt:  QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPSGYL

Query:  ISPGSVISNSGTSSP
         SP S  S  G  SP
Subjt:  ISPGSVISNSGTSSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein2.3e-11153.69Show/hide
Query:  MGSMNN-SVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV---GAVAPA--VEHRTPSITIPFIAPPSSPTSF
        M S+NN SVDTVNAAA+AIVSAE+R QP S  K+R GS+WSLYWCFG+  +KNNKRI +A LVPEP     AVAP       + SI +PFIAPPSSP SF
Subjt:  MGSMNN-SVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV---GAVAPA--VEHRTPSITIPFIAPPSSPTSF

Query:  LQSEPP--SNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSL--SHTNK
        L S PP  S+T  P  L SLT         N P S F IGPYA+ETQ V+PPVFSAF TEPSTAPFTPPPES     PSSPEVP+A+LLTSSL  +  N 
Subjt:  LQSEPP--SNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSL--SHTNK

Query:  SFGTNQKFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTP
          G NQKF+ +H +F+  Q YPGSP G LISPG     SGTSSP+P K  I+EFR+ E PK LG EHFT RKW SR GSGS+TP G   GSRLGSG LTP
Subjt:  SFGTNQKFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTP

Query:  DGIGMGSRMGSGSVTPNGVRQDSKLGSGTLTP-DGLLLDNQTSQ----------SGCQND---ATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPT
        D    GS++ SG VTPNG     ++  G LTP +G LLD+Q S+          S   ND      HRVSFELTGEDVARCLA+K               
Subjt:  DGIGMGSRMGSGSVTPNGVRQDSKLGSGTLTP-DGLLLDNQTSQ----------SGCQND---ATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPT

Query:  STNCQKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVK-EASPGNNWTFFPMLQS
               NR  S E  + E        T+ E  SE+     Q  R+ + GS KEF F+ T  EM     I +EWWANEKVA K + SP N+WTFFP+L+S
Subjt:  STNCQKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVK-EASPGNNWTFFPMLQS

Query:  G
        G
Subjt:  G

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.1e-10552.25Show/hide
Query:  MNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPVGAVAPAV----EHRTPSITIPFIAPPSSPTSFLQSEP
        +NNSV+TVNAAATAIV+AE+RVQP S  K RWG  WSLY CFG   QKNNKRI NA LVPEPV +  P V       + ++ +PFIAPPSSP SFLQS+P
Subjt:  MNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPVGAVAPAV----EHRTPSITIPFIAPPSSPTSFLQSEP

Query:  PSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQFTTPSSPEVPYAKLLTSSLSHTNK--SFGTNQ
         S + SP G LSLT+   N +S   P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV  TTPSSPEVP+A+LLTSSL  T +  + G NQ
Subjt:  PSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQFTTPSSPEVPYAKLLTSSLSHTNK--SFGTNQ

Query:  KFALSHCDFQPYQPYPGSP-SGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGM
        KF+ SH +F+  Q  PGSP  G LISPGSVISNSGTSSP+P K P++EFR+ E PK LG EHFT RKW SR GSGS+TP G+G G  L SG LTP+    
Subjt:  KFALSHCDFQPYQPYPGSP-SGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGM

Query:  GSRMGSGSVTPNGV-----RQDSKLGSGTLTPDGLLLDNQTSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNCQKENRESSGE
        G  + SG++TPN        Q S++ S         L N  S  G +    +HRVSFELTGEDVARCLA+K    NR    S +  + N + E  ESS  
Subjt:  GSRMGSGSVTPNGV-----RQDSKLGSGTLTPDGLLLDNQTSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNCQKENRESSGE

Query:  AETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
               DI+ +        E +    Q   + ++GS KEF F+ TK E              EKVA      GN+W+FFP L+SGVS
Subjt:  AETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCATGAACAACAGCGTGGATACAGTTAATGCCGCCGCTACTGCGATCGTCTCCGCAGAGGCTCGAGTTCAGCCTCCGTCGCCTTCGAAACGAAGATGGGGTAG
CTTCTGGAGTCTGTACTGGTGTTTCGGAAATGGCTTGCAAAAAAACAATAAGCGTATTAGTAATGCTGCACTTGTTCCAGAACCTGTAGGAGCTGTTGCCCCTGCTGTTG
AACACCGAACACCTTCGATCACGATACCTTTTATTGCCCCTCCGTCTTCTCCTACATCTTTCCTCCAGTCCGAACCTCCATCAAATACTCAATCTCCGGCTGGATTACTC
TCTTTAACTGCCCTTTCAGTCAATAACTACTCCCAAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTTTCACCTCCAGTTTTTTC
TGCCTTCCCCACTGAACCATCAACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTTACCACACCCTCATCTCCTGAAGTTCCCTATGCTAAATTGCTGACTTCTT
CTCTGAGCCACACTAACAAAAGTTTTGGAACCAATCAGAAGTTTGCATTGTCCCATTGTGATTTTCAGCCTTATCAACCTTATCCAGGAAGCCCCAGTGGCTATCTTATA
TCACCTGGTTCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGAAGCTCCGAAACTCTTGGGCCTTGAGCA
TTTTACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCGTTGACGCCGGATGGTAATGGTATAGGTTCTCGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATTG
GTATGGGTTCAAGAATGGGATCTGGATCTGTGACCCCAAATGGCGTGAGGCAAGATTCGAAATTGGGTTCTGGAACATTGACGCCTGATGGTCTACTGTTGGACAACCAA
ACGTCACAAAGTGGATGTCAAAATGATGCGACGAATCATAGGGTGTCATTCGAGTTAACTGGCGAAGATGTCGCACGGTGTCTTGCAAATAAGTCGATGACATCCAATAG
AATCACAGCAGAATCTCCAAACCCAACAAGCACAAACTGTCAAAAGGAAAACAGAGAGTCATCAGGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACAAGTTCCA
CGGCAGCTGAAAAAACTTCAGAAGAGGACGATCGATGTTACCAGAATCAGCGAGCCGTAAATCTTGGTTCGTTCAAAGAGTTCAACTTCGAGCAAACGAAAGGAGAAATG
CACAATACACCCTCCATTGGTGCAGAATGGTGGGCAAATGAAAAGGTGGCTGTGAAGGAAGCTAGTCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAATCTGGGGT
CAGCTGA
mRNA sequenceShow/hide mRNA sequence
GATTGCTAAAAATTGTGGAAACCAGAGGATAAATACTGGACGAGCCACCCAAAACTCGACTTTTATTTTATTTATTTTATATTTAGGTTGTTATTATTATTTATTATATA
AAATGAGAAAATTCGATCTCTCTCTTCCTTGATCTTCAGTTTTTCATTTCTTAAATCAAATCCGAAGGTCGTCGTCTCTGATTCGTTGTTTTGGCCGTCAAATCTCACCG
GAGAGTGATCGATTTTCAGGCGAGTTGGCGGCGGTGGATTCGTCGGAAAACACGGCGATCTTGTTGAAAACTCTGGCGGTGGAGCTCTTTGGATTGGAAGAAATTGGAGA
TGGGAAGCATGAACAACAGCGTGGATACAGTTAATGCCGCCGCTACTGCGATCGTCTCCGCAGAGGCTCGAGTTCAGCCTCCGTCGCCTTCGAAACGAAGATGGGGTAGC
TTCTGGAGTCTGTACTGGTGTTTCGGAAATGGCTTGCAAAAAAACAATAAGCGTATTAGTAATGCTGCACTTGTTCCAGAACCTGTAGGAGCTGTTGCCCCTGCTGTTGA
ACACCGAACACCTTCGATCACGATACCTTTTATTGCCCCTCCGTCTTCTCCTACATCTTTCCTCCAGTCCGAACCTCCATCAAATACTCAATCTCCGGCTGGATTACTCT
CTTTAACTGCCCTTTCAGTCAATAACTACTCCCAAAATGGACCTGCATCCATTTTTGCAATAGGCCCTTATGCATATGAGACCCAGTTGGTTTCACCTCCAGTTTTTTCT
GCCTTCCCCACTGAACCATCAACTGCTCCTTTTACTCCTCCTCCTGAGTCTGTGCAATTTACCACACCCTCATCTCCTGAAGTTCCCTATGCTAAATTGCTGACTTCTTC
TCTGAGCCACACTAACAAAAGTTTTGGAACCAATCAGAAGTTTGCATTGTCCCATTGTGATTTTCAGCCTTATCAACCTTATCCAGGAAGCCCCAGTGGCTATCTTATAT
CACCTGGTTCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGAAGCTCCGAAACTCTTGGGCCTTGAGCAT
TTTACAACTCGCAAATGGATCTCGAGAATGGGTTCTGGATCGTTGACGCCGGATGGTAATGGTATAGGTTCTCGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATTGG
TATGGGTTCAAGAATGGGATCTGGATCTGTGACCCCAAATGGCGTGAGGCAAGATTCGAAATTGGGTTCTGGAACATTGACGCCTGATGGTCTACTGTTGGACAACCAAA
CGTCACAAAGTGGATGTCAAAATGATGCGACGAATCATAGGGTGTCATTCGAGTTAACTGGCGAAGATGTCGCACGGTGTCTTGCAAATAAGTCGATGACATCCAATAGA
ATCACAGCAGAATCTCCAAACCCAACAAGCACAAACTGTCAAAAGGAAAACAGAGAGTCATCAGGAGAAGCTGAAACTTGTGAGTTCTTTGACATCAAGACAAGTTCCAC
GGCAGCTGAAAAAACTTCAGAAGAGGACGATCGATGTTACCAGAATCAGCGAGCCGTAAATCTTGGTTCGTTCAAAGAGTTCAACTTCGAGCAAACGAAAGGAGAAATGC
ACAATACACCCTCCATTGGTGCAGAATGGTGGGCAAATGAAAAGGTGGCTGTGAAGGAAGCTAGTCCAGGCAACAACTGGACTTTCTTCCCAATGTTGCAATCTGGGGTC
AGCTGACTTTGACATGAATTCCAACAGTAAAAAGGAAAACAAAAAAGAAGCAGAAAAGGAAGACACCAAAACCAACCTTTTGGATGTACATTTGATTGTAATCTTCCTTT
GGAGATAACAGGTGGAATGATTTGGAGGAAAAGAATGTTTTTCAAAGTTGTCTTGTGATAACTAGATCTTTTTATATCAGTGAGTATACGGGTCGGGTTGTGTACACTTC
GACTATTCTCATGGGATATACAGTCTGACCTTATAATTCGGATGTCAAAGAAATTTGTAGAAAATTATTTTCTAGATAGGTTATCATAGGTGCAAGAAGTAATTTATAAT
CACAGGTTAACTTTGAAGTGGTTTATTCTTTCTTTGTCTTGTATAATAATAAGAAATTTCATTCTTCATTAGCAATGAAACTCTTTTTTTATGGTGATTTTAGTTATTCC
TTTGCAGGTAAATGTAAAAACTTTATTCTTCAT
Protein sequenceShow/hide protein sequence
MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPVGAVAPAVEHRTPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLL
SLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPSGYLI
SPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMGSRMGSGSVTPNGVRQDSKLGSGTLTPDGLLLDNQ
TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNCQKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEM
HNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS