; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G18180 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G18180
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionlarge proline-rich protein bag6-B
Genome locationClcChr02:30770260..30778957
RNA-Seq ExpressionClc02G18180
SyntenyClc02G18180
Gene Ontology termsGO:0030433 - ubiquitin-dependent ERAD pathway (biological process)
GO:0071818 - BAT3 complex (cellular component)
GO:0031593 - polyubiquitin modification-dependent protein binding (molecular function)
GO:0051787 - misfolded protein binding (molecular function)
InterPro domainsIPR000626 - Ubiquitin-like domain
IPR019956 - Ubiquitin domain
IPR029071 - Ubiquitin-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139265.1 ubiquitin-like domain-containing protein CIP73 [Cucumis sativus]0.0e+0094.08Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFI E TSCGEADGSE TIEIKLKTLDSQ YTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TLPNRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDG+PPEINRIVSAVLSSIGLSNSVTG +GMDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSLTTLSQNL NMRRDFE+IGRVGGNNAQE NIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQLARQLENHRNVTDPTLRMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMG MQPGSALIHGLGSGFLPRRIDIQIRRGSP+TASNG+PEERSGAQQ SGQQEAARG GEN TNQATTR+VEGPSVGGESGVRVVPIRTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER
        NIEDASFQEISGSIPAHHSMASSS+ANVQESESRT+DEGMFLSNIFHQIMPFT+Q GNEPDM SVEANASERQN PDSSAQASN DAETSRRR D+EA  
Subjt:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER

Query:  LTQQNVTRD
         + +   +D
Subjt:  LTQQNVTRD

XP_008456646.1 PREDICTED: large proline-rich protein bag6-B [Cucumis melo]0.0e+0094.08Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFI E TSCGEADGSE TIEIKLKTLDSQ YTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TLPNRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTG +GMDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSLTTLSQNLSNMRRDFE+IGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQLARQLENHRNVTDPTLRMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSP+TASNG+PEERSGAQQ SGQQEA RG GEN TN ATTR+VEGPSVGGESGVRVVPIRTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRLPSNSSGNSFGLYYPVLGRFPHPAS NARAERGSQASSERQSTGLQSEQ TILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER
        NIEDASFQEISGSIPAHHSMASSSVANVQESESRT+DEGMFLSNIFHQIMPFT+Q G EPDM SVEANASERQN PDSSAQASN DAETSRRR D+EA  
Subjt:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER

Query:  LTQQNVTRD
         + +   +D
Subjt:  LTQQNVTRD

XP_022946658.1 large proline-rich protein BAG6 isoform X2 [Cucurbita moschata]0.0e+0089.73Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFIDETTSCGEADGSE TIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TL NRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLS SVTG E MDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSL TLS  L+NMRRDFE+IGRVGGNN QEAN HG EESSSNSS+RPSTT ESFPTPASLAEVM STRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQL RQLENHRNVTDPT RMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQN SLGPV
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMGAMQP  ALIHGLGSGFLPRRIDIQIRRGSP+TA N +PEERSGAQQPSGQQEA R VGENPTNQATTRIVEGPSVGGESGVRVVP+RTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRL SNSSGNS GLYYPVLGRFPHPASG AR ERGSQASS RQSTGLQ+EQHT LESVVEQ N E+AARD  AQG LESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAEA
        NIEDASFQEISGS+PAHHSM ASSSV+NVQES+ RT+DEGMFLSNIFHQIMPF SQRGN+PDM SVEANASE +N+ DSSAQA SNGDAETSRRR DAE+
Subjt:  NIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAEA

Query:  ERLTQQNVTRD
        +  + +   +D
Subjt:  ERLTQQNVTRD

XP_038889776.1 ubiquitin-like domain-containing protein CIP73 isoform X1 [Benincasa hispida]0.0e+0096.04Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFID TTSCGEADGSE TIEIKLKTLDSQ YTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TLPNRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTG EGMDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSLTTLSQNL NMRRDFE+IGRVGGNN+QEANIHGDEESSSNSSSRPST QESFPTPASLAEVMLSTRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQLARQLENHRNVTDPTLRMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQ+GPNPIMVQPLPFQQNASLG V
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSP+TASNG+ EERSGAQQP GQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQ
        NIEDASFQEISGSIP H SMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDM SVEAN SE QN+PDSS Q
Subjt:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQ

XP_038889778.1 ubiquitin-like domain-containing protein CIP73 isoform X2 [Benincasa hispida]0.0e+0094.64Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFID TTSCGEADGSE TIEIKLKTLDSQ YTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TLPNRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTG EGMDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSLTTLSQNL NMRRDFE+IGRVGGNN+QEANIHGDEESSSNSSSRPST QESFPTPASLAEVMLSTRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQLARQLENHRNVTDPTLRMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQ+GPNPIMVQPLPFQQNASLG V
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSP+TASNG+ EERSGAQQP GQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER
        NIEDASFQEISGSIP H SMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDM SVEAN SE QN+PDSS QASNGDAETSRRRAD+EA  
Subjt:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER

Query:  LTQQNVTRD
         + +   +D
Subjt:  LTQQNVTRD

TrEMBL top hitse value%identityAlignment
A0A0A0LL64 Ubiquitin-like domain-containing protein0.0e+0094.08Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFI E TSCGEADGSE TIEIKLKTLDSQ YTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TLPNRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDG+PPEINRIVSAVLSSIGLSNSVTG +GMDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSLTTLSQNL NMRRDFE+IGRVGGNNAQE NIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQLARQLENHRNVTDPTLRMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMG MQPGSALIHGLGSGFLPRRIDIQIRRGSP+TASNG+PEERSGAQQ SGQQEAARG GEN TNQATTR+VEGPSVGGESGVRVVPIRTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER
        NIEDASFQEISGSIPAHHSMASSS+ANVQESESRT+DEGMFLSNIFHQIMPFT+Q GNEPDM SVEANASERQN PDSSAQASN DAETSRRR D+EA  
Subjt:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER

Query:  LTQQNVTRD
         + +   +D
Subjt:  LTQQNVTRD

A0A1S3C513 large proline-rich protein bag6-B0.0e+0094.08Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFI E TSCGEADGSE TIEIKLKTLDSQ YTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TLPNRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTG +GMDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSLTTLSQNLSNMRRDFE+IGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQLARQLENHRNVTDPTLRMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSP+TASNG+PEERSGAQQ SGQQEA RG GEN TN ATTR+VEGPSVGGESGVRVVPIRTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRLPSNSSGNSFGLYYPVLGRFPHPAS NARAERGSQASSERQSTGLQSEQ TILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER
        NIEDASFQEISGSIPAHHSMASSSVANVQESESRT+DEGMFLSNIFHQIMPFT+Q G EPDM SVEANASERQN PDSSAQASN DAETSRRR D+EA  
Subjt:  NIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAER

Query:  LTQQNVTRD
         + +   +D
Subjt:  LTQQNVTRD

A0A6J1G4A8 large proline-rich protein BAG6 isoform X10.0e+0089.61Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFIDETTSCGEADGSE TIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TL NRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLS SVTG E MDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESI-GRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQML
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSL TLS  L+NMRRDFE+I GRVGGNN QEAN HG EESSSNSS+RPSTT ESFPTPASLAEVM STRQML
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESI-GRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQML

Query:  TGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGP
        TGEVSECLLQL RQLENHRNVTDPT RMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQN SLGP
Subjt:  TGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGP

Query:  VPMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGP
        VPMGAMQP  ALIHGLGSGFLPRRIDIQIRRGSP+TA N +PEERSGAQQPSGQQEA R VGENPTNQATTRIVEGPSVGGESGVRVVP+RTMVAALPGP
Subjt:  VPMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGP

Query:  FSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGE
        FSRL SNSSGNS GLYYPVLGRFPHPASG AR ERGSQASS RQSTGLQ+EQHT LESVVEQ N E+AARD  AQG LESERQVPSNVVQFLRTLFP GE
Subjt:  FSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGE

Query:  INIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAE
        INIEDASFQEISGS+PAHHSM ASSSV+NVQES+ RT+DEGMFLSNIFHQIMPF SQRGN+PDM SVEANASE +N+ DSSAQA SNGDAETSRRR DAE
Subjt:  INIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAE

Query:  AERLTQQNVTRD
        ++  + +   +D
Subjt:  AERLTQQNVTRD

A0A6J1G4G3 large proline-rich protein bag6-B isoform X30.0e+0089.61Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFIDETTSCGEADGSE TIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TL NRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLS SVTG E MDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESI-GRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQML
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSL TLS  L+NMRRDFE+I GRVGGNN QEAN HG EESSSNSS+RPSTT ESFPTPASLAEVM STRQML
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESI-GRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQML

Query:  TGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGP
        TGEVSECLLQL RQLENHRNVTDPT RMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQN SLGP
Subjt:  TGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGP

Query:  VPMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGP
        VPMGAMQP  ALIHGLGSGFLPRRIDIQIRRGSP+TA N +PEERSGAQQPSGQQEA R VGENPTNQATTRIVEGPSVGGESGVRVVP+RTMVAALPGP
Subjt:  VPMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGP

Query:  FSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGE
        FSRL SNSSGNS GLYYPVLGRFPHPASG AR ERGSQASS RQSTGLQ+EQHT LESVVEQ N E+AARD  AQG LESERQVPSNVVQFLRTLFP GE
Subjt:  FSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGE

Query:  INIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAE
        INIEDASFQEISGS+PAHHSM ASSSV+NVQES+ RT+DEGMFLSNIFHQIMPF SQRGN+PDM SVEANASE +N+ DSSAQA SNGDAETSRRR DAE
Subjt:  INIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAE

Query:  AERLTQQNVTRD
        ++  + +   +D
Subjt:  AERLTQQNVTRD

A0A6J1G4G5 large proline-rich protein BAG6 isoform X20.0e+0089.73Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
        MGSNFIDETTSCGEADGSE TIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSE

Query:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS
        TL NRPETDPNSSTSR+HS+RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLS SVTG E MDVVREIDQQRSGERVIAAGMIDLNQHQSGD+GS
Subjt:  TLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGS

Query:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT
        RPLSDRFHGTSGHPSIP LGSFPPPVIPDSL TLS  L+NMRRDFE+IGRVGGNN QEAN HG EESSSNSS+RPSTT ESFPTPASLAEVM STRQMLT
Subjt:  RPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLT

Query:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV
        GEVSECLLQL RQLENHRNVTDPT RMNTQS+AWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQN SLGPV
Subjt:  GEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPV

Query:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF
        PMGAMQP  ALIHGLGSGFLPRRIDIQIRRGSP+TA N +PEERSGAQQPSGQQEA R VGENPTNQATTRIVEGPSVGGESGVRVVP+RTMVAALPGPF
Subjt:  PMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPF

Query:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI
        SRL SNSSGNS GLYYPVLGRFPHPASG AR ERGSQASS RQSTGLQ+EQHT LESVVEQ N E+AARD  AQG LESERQVPSNVVQFLRTLFP GEI
Subjt:  SRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEI

Query:  NIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAEA
        NIEDASFQEISGS+PAHHSM ASSSV+NVQES+ RT+DEGMFLSNIFHQIMPF SQRGN+PDM SVEANASE +N+ DSSAQA SNGDAETSRRR DAE+
Subjt:  NIEDASFQEISGSIPAHHSM-ASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQA-SNGDAETSRRRADAEA

Query:  ERLTQQNVTRD
        +  + +   +D
Subjt:  ERLTQQNVTRD

SwissProt top hitse value%identityAlignment
A4IH17 Large proline-rich protein bag61.8e-1134.12Show/hide
Query:  IEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNRPETDPNSSTSRIHSSR
        +E+ +KTLDSQT T  VD ++ V   K  I+S  G+  E+QRLI +G+VL++D+ L  Y+V DG  +HLV R P P ++     P T  ++S S  +++ 
Subjt:  IEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNRPETDPNSSTSRIHSSR

Query:  VAPG-----------VVIETFSMP--VQGDGVPPEINRIVSAVLSSIGLSNSVTGIEG--MDVVREIDQQ
        V PG           V++ TF++P  + G G      R+           ++V+G +G  +DV   +DQQ
Subjt:  VAPG-----------VVIETFSMP--VQGDGVPPEINRIVSAVLSSIGLSNSVTGIEG--MDVVREIDQQ

A7X5R6 Large proline-rich protein BAG63.6e-1234.29Show/hide
Query:  ATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPL--------------PPSETLPNR
        A +E+ +KTLDSQT T  V  +M V   KE IA+   +  ++QRLI +G+VL+DD+ L  Y+V  G  +HLV R P                PS      
Subjt:  ATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPL--------------PPSETLPNR

Query:  PETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEIN
        P   P    + +H       V++ TF++P  G  V   IN
Subjt:  PETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEIN

D5LXJ0 Ubiquitin-like domain-containing protein CIP738.5e-16351.71Show/hide
Query:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-LPPS
        MGSN  +E TS      +  TIEIK+K LDSQT+TLRVDKQMPVPALK QI S+TGV+SE+QRLIC+GKVLKDDQLLSAYHVEDGHTLHLV R P L P 
Subjt:  MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQP-LPPS

Query:  ETLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSG
         +LPN   T+PNSST   +S++VAPGV IETF++PVQGDGVP EINRIVSAVL S+GL N  +G EG+  VRE D    G      G  + ++ Q   +G
Subjt:  ETLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSG

Query:  SRPLSDRFHGTSGHP---SIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTR
         R  SD    + G P   S+  LGS  PPVIPDSLTTL Q LS++  +F++I R GGNN Q A  H +EE     SSR S+T E   +PASLAEV+LSTR
Subjt:  SRPLSDRFHGTSGHP---SIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTR

Query:  QMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNAS
        +++  +  ECLLQLARQLENH ++ DP  R +TQS A R+GV+F NLGAYLLELGRT MTLR+GQ PSEAVVN GPAVFIS +GPN IMVQPLPFQ  AS
Subjt:  QMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNAS

Query:  LGPVPMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAAL
         G +P+GA Q  S+L  GLGS F PRRIDIQIRRG+ +T   G+ +E  G  Q +  Q   R  GE+  NQ T+R  +  S+ GE GVRVVPIRTMVAA+
Subjt:  LGPVPMGAMQPGSALIHGLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAAL

Query:  PGPFSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDG------------------------G
                            PVLGRF   +S N   E+GSQ +S++ +       H+  E  + +Q++ED+AR+G                        G
Subjt:  PGPFSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDG------------------------G

Query:  AQGTLESERQVPSNVVQFLRTLFPSGEINIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQ---RGNEPDMSSVEANA
             ESERQVPS+V+QFLR LFP GEI++ED S Q  +  + +  +  SS  A   E+E   S+EG+FLSN+   IMP  SQ   RG +         +
Subjt:  AQGTLESERQVPSNVVQFLRTLFPSGEINIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQ---RGNEPDMSSVEANA

Query:  SERQNIPDSSAQASNGDAETSRRRADAEA
        SE Q   D S Q   G A TSRR++D+E+
Subjt:  SERQNIPDSSAQASNGDAETSRRRADAEA

Q6PA26 Large proline-rich protein bag6-B6.1e-1237.7Show/hide
Query:  IEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNRPETDPNSSTSRIHSSR
        +++ +KTLDSQT T  V+ ++ V   K  I+S  G+  E+QRLI +G+VL++D+ L+ Y+V DG  +HLV R P P ++T  + P T  ++S S  +++ 
Subjt:  IEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNRPETDPNSSTSRIHSSR

Query:  VAPG---------VVIETFSMP
        V PG         V++ TF++P
Subjt:  VAPG---------VVIETFSMP

Q9Z1R2 Large proline-rich protein BAG62.8e-1235.26Show/hide
Query:  IDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNR
        ++ + S   A     ++E+ +KTLDSQT T  V  QM V   KE IA+   + SE+QRLI +G+VL+DD+ L  Y+V  G  +HLV R   PP   LP+ 
Subjt:  IDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNR

Query:  PETDPNSSTSRIHSSRVAPG----------------VVIETFSMPVQGDGVPPEIN
          +    S S  H     PG                V++ TF++P  G  V   IN
Subjt:  PETDPNSSTSRIHSSRVAPG----------------VVIETFSMPVQGDGVPPEIN

Arabidopsis top hitse value%identityAlignment
AT5G11080.1 Ubiquitin-like superfamily protein2.1e-2329.56Show/hide
Query:  IEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQ-RLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNRPETDPNSSTSRIHSS
        I IK+K L S T+TL V++ +PV  LK+ I    GV  E+Q RL+ RG+VLK+DQ LS YHVE+GHTL+LV   P  P                      
Subjt:  IEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQ-RLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNRPETDPNSSTSRIHSS

Query:  RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGSRPLSDRFHGTSGHPSIPPLG
                                      + SS   +NS  G                      G +  + +Q        L+ R + T          
Subjt:  RVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGSRPLSDRFHGTSGHPSIPPLG

Query:  SFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLTGEVSECLLQLARQLENHRNV
          P  VIPDS+TTLS++L  +R+ F + G    NN Q  N                + ++         E+  +TR++L GEV+ECL  ++  L +  NV
Subjt:  SFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLTGEVSECLLQLARQLENHRNV

Query:  TDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPN
        TDP+ R   Q     SG L  +LG+ +L LG     + MG+   +     G AVFIS TG N
Subjt:  TDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPN

AT5G11080.2 Ubiquitin-like superfamily protein8.2e-2027.94Show/hide
Query:  IEIKLKTLDSQTYTLRVDK---------------------QMPVPALKEQIASVTGVLSEQQ-RLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPS
        I IK+K L S T+TL V++                      +PV  LK+ I    GV  E+Q RL+ RG+VLK+DQ LS YHVE+GHTL+LV   P  P 
Subjt:  IEIKLKTLDSQTYTLRVDK---------------------QMPVPALKEQIASVTGVLSEQQ-RLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPS

Query:  ETLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSG
                                                           + SS   +NS  G                      G +  + +Q     
Subjt:  ETLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSG

Query:  SRPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQML
           L+ R + T            P  VIPDS+TTLS++L  +R+ F + G    NN Q  N                + ++         E+  +TR++L
Subjt:  SRPLSDRFHGTSGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQML

Query:  TGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPN
         GEV+ECL  ++  L +  NVTDP+ R   Q     SG L  +LG+ +L LG     + MG+   +     G AVFIS TG N
Subjt:  TGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPN

AT5G25270.1 Ubiquitin-like superfamily protein6.3e-9740.43Show/hide
Query:  MGSNFIDE-TTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPS
        MG N  DE      +  G  A +EIK+KTLDSQTYTLRVDK +PVPALKEQ+ASVTGV++EQQRLICRGKV+KDDQLLSAYHVEDGHTLHLVVRQP+  S
Subjt:  MGSNFIDE-TTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPS

Query:  ETLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSG
        E+  +    DP  S      S+    VV+ +F++  Q DGV  ++ +IVSAVL S+G+SN   GIEG+D                  M  L++  S  SG
Subjt:  ETLPNRPETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSG

Query:  SRPLSDRFHGTSGHPSI-----PPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQES-FPTPASLAEVML
             D   G S  P+       PL S  P  IPDSLTTLS+ L+++R++F +     G+NA   N+   E S  N     STT ES  P P+ LAEV+ 
Subjt:  SRPLSDRFHGTSGHPSI-----PPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQES-FPTPASLAEVML

Query:  STRQMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQ
        STRQ+L GEV++CL  L+RQL +H NVTDP  R   QS   +SG L  +LG  LLELGR  M LR+GQ P +AVV+AGPAVFIS TG N     PLP   
Subjt:  STRQMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQ

Query:  NASLGPVPMGAMQPGSALIH---GLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIR
        ++ LG   +G++Q G+A  +   G      PR I+I+IR GS   AS  +  E S  QQ  GQ                                 +P  
Subjt:  NASLGPVPMGAMQPGSALIH---GLGSGFLPRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIR

Query:  TMVAALPGPFSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESE--RQVPSNVV
              P P +R PS    N   L  PV+ R+   + G             R STGL      + ES  + Q+V    R+G +  +       ++ + + 
Subjt:  TMVAALPGPFSRLPSNSSGNSFGLYYPVLGRFPHPASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESE--RQVPSNVV

Query:  QFLRTLFPSGEINIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAE
        QFLR L    E        Q  +       +  + +VAN Q   + T+DEG F+S++  QIMPF SQ  N    SS EA      N    S QAS+ +AE
Subjt:  QFLRTLFPSGEINIEDASFQEISGSIPAHHSMASSSVANVQESESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAE

AT5G42220.1 Ubiquitin-like superfamily protein2.1e-5534.82Show/hide
Query:  EATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPL--PPSETLPNRPET-----DPN
        E+T+E+ +KTLDS+TYT +V+K   V   KE+IAS TGV   QQRLI RG+VLKDD  LS YH+E+GHTLHL+VRQP    PS   P++  T     + N
Subjt:  EATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPL--PPSETLPNRPET-----DPN

Query:  SSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLS------NSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGSRPLSD
           SR +   V+  VV+ +F++  Q +G+ P+++R++ AVL+S G+S      +S  G +      +      G        I      +G S  R    
Subjt:  SSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLS------NSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGSRPLSD

Query:  RFHGTSGHPSIP-------------PLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRP----STTQESFPTPASL
         F G S   S+P             P+ SF  P IPDSL TL + ++ M +            A   N +  + SS+ S  RP       +    TP +L
Subjt:  RFHGTSGHPSIP-------------PLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRP----STTQESFPTPASL

Query:  AEVMLSTRQMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQP
        + V+ + + +L+G     L  +A +LE   + +DPTLR   Q+ A + G+   +LGA LLELGRT++TLRM  +P  + VNAGPAV+IS +GPNPIMVQP
Subjt:  AEVMLSTRQMLTGEVSECLLQLARQLENHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQP

Query:  LPFQQNASLGPVPMGAMQPGSALIHGLGSGFLPRRIDIQIR---RGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEG--PSVGGESG
         P Q    + P+  GA    + L   +G G   R I+I I     GSP  +S G+        Q       +      P++     +  G  P +G +  
Subjt:  LPFQQNASLGPVPMGAMQPGSALIHGLGSGFLPRRIDIQIR---RGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEG--PSVGGESG

Query:  VRVV--PIRTMVAALPG
        V  +   IR MV  + G
Subjt:  VRVV--PIRTMVAALPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCAATTTTATTGACGAAACTACCAGCTGTGGTGAGGCTGATGGCTCTGAAGCCACTATTGAGATTAAATTAAAAACTTTAGATTCACAAACTTATACT
CTCAGAGTGGATAAACAGATGCCAGTCCCTGCTTTGAAAGAACAAATTGCTTCTGTAACTGGTGTGTTATCAGAACAACAACGTCTTATATGTCGAGGAAAAGTT
CTCAAGGATGATCAGCTCTTGTCTGCCTACCACGTTGAGGATGGTCACACCTTGCATTTGGTTGTCAGGCAGCCTCTTCCGCCATCAGAGACTTTGCCCAATCGG
CCAGAGACTGATCCAAATTCGAGTACGAGTCGCATTCATAGCAGTCGGGTGGCTCCGGGTGTGGTGATTGAAACTTTTAGTATGCCTGTTCAAGGGGATGGTGTG
CCCCCAGAAATCAACAGGATTGTGTCCGCTGTTCTAAGTTCAATTGGGCTTTCAAATTCTGTAACTGGCATTGAAGGGATGGACGTTGTCAGGGAAATTGACCAA
CAAAGATCTGGAGAACGGGTGATTGCAGCTGGGATGATAGATTTGAACCAGCATCAATCTGGTGACAGTGGTTCAAGGCCACTGTCTGATAGGTTTCATGGCACT
TCTGGACATCCATCAATTCCTCCTTTAGGATCGTTTCCTCCTCCTGTGATTCCTGATTCTTTGACAACTTTGTCTCAAAACCTCAGTAATATGAGGCGTGATTTT
GAAAGTATTGGCAGAGTTGGAGGAAATAATGCTCAAGAAGCTAATATTCATGGAGATGAGGAAAGCAGCTCTAATTCCTCATCTCGTCCAAGCACCACCCAAGAA
AGTTTCCCTACTCCTGCATCGTTGGCAGAAGTCATGCTTTCTACTAGACAAATGCTTACAGGGGAAGTCTCAGAGTGCCTACTTCAACTCGCAAGGCAACTGGAG
AATCACAGGAATGTTACTGATCCTACATTAAGGATGAATACTCAGTCTACTGCATGGAGAAGTGGAGTTCTATTTAATAACCTAGGTGCATATCTCCTAGAACTT
GGTCGCACTATGATGACGTTGCGCATGGGTCAAAATCCTTCAGAAGCTGTTGTTAATGCAGGCCCTGCAGTTTTCATTTCCCAAACTGGTCCTAATCCTATAATG
GTTCAGCCTCTTCCCTTTCAACAAAATGCAAGCTTAGGTCCAGTTCCCATGGGAGCCATGCAGCCTGGCTCTGCACTAATTCATGGTCTTGGTTCTGGATTTCTC
CCAAGACGTATTGACATACAAATTCGAAGAGGTTCACCAAGTACGGCATCAAATGGCAGTCCAGAGGAACGCAGTGGAGCTCAACAGCCTTCAGGGCAACAAGAA
GCAGCCAGAGGTGTGGGTGAAAATCCCACCAATCAAGCAACCACTAGGATTGTAGAGGGCCCAAGTGTGGGAGGTGAATCTGGAGTGCGAGTAGTGCCAATTAGG
ACCATGGTTGCAGCATTACCTGGACCCTTTAGCCGCTTACCTTCAAATTCTTCGGGTAATTCATTTGGGTTATACTATCCTGTTCTTGGAAGATTTCCTCACCCT
GCTTCAGGAAATGCAAGGGCCGAGCGAGGAAGTCAAGCTTCATCAGAGCGTCAGTCTACTGGCCTCCAGAGTGAGCAACATACAATTCTTGAATCTGTCGTGGAA
CAGCAAAATGTAGAAGATGCTGCAAGAGATGGTGGGGCCCAAGGTACCCTAGAATCTGAACGACAAGTTCCCAGCAATGTTGTCCAATTTCTTCGGACTCTCTTT
CCCAGTGGTGAAATCAACATCGAAGATGCCAGTTTCCAGGAGATTAGTGGTTCTATTCCTGCTCATCATTCAATGGCATCAAGTTCTGTTGCAAATGTTCAAGAA
TCAGAATCAAGAACTAGTGATGAAGGAATGTTTTTGTCCAACATATTTCACCAAATCATGCCATTCACATCTCAACGAGGCAACGAGCCAGATATGTCTTCGGTA
GAGGCAAATGCCTCTGAGCGTCAAAACATTCCAGATTCTTCTGCACAAGCTTCAAATGGGGATGCTGAAACATCTCGCAGACGTGCTGATGCTGAAGCAGAAAGA
TTGACACAACAAAATGTGACTAGAGATTATGGGATCTTAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCAATTTTATTGACGAAACTACCAGCTGTGGTGAGGCTGATGGCTCTGAAGCCACTATTGAGATTAAATTAAAAACTTTAGATTCACAAACTTATACT
CTCAGAGTGGATAAACAGATGCCAGTCCCTGCTTTGAAAGAACAAATTGCTTCTGTAACTGGTGTGTTATCAGAACAACAACGTCTTATATGTCGAGGAAAAGTT
CTCAAGGATGATCAGCTCTTGTCTGCCTACCACGTTGAGGATGGTCACACCTTGCATTTGGTTGTCAGGCAGCCTCTTCCGCCATCAGAGACTTTGCCCAATCGG
CCAGAGACTGATCCAAATTCGAGTACGAGTCGCATTCATAGCAGTCGGGTGGCTCCGGGTGTGGTGATTGAAACTTTTAGTATGCCTGTTCAAGGGGATGGTGTG
CCCCCAGAAATCAACAGGATTGTGTCCGCTGTTCTAAGTTCAATTGGGCTTTCAAATTCTGTAACTGGCATTGAAGGGATGGACGTTGTCAGGGAAATTGACCAA
CAAAGATCTGGAGAACGGGTGATTGCAGCTGGGATGATAGATTTGAACCAGCATCAATCTGGTGACAGTGGTTCAAGGCCACTGTCTGATAGGTTTCATGGCACT
TCTGGACATCCATCAATTCCTCCTTTAGGATCGTTTCCTCCTCCTGTGATTCCTGATTCTTTGACAACTTTGTCTCAAAACCTCAGTAATATGAGGCGTGATTTT
GAAAGTATTGGCAGAGTTGGAGGAAATAATGCTCAAGAAGCTAATATTCATGGAGATGAGGAAAGCAGCTCTAATTCCTCATCTCGTCCAAGCACCACCCAAGAA
AGTTTCCCTACTCCTGCATCGTTGGCAGAAGTCATGCTTTCTACTAGACAAATGCTTACAGGGGAAGTCTCAGAGTGCCTACTTCAACTCGCAAGGCAACTGGAG
AATCACAGGAATGTTACTGATCCTACATTAAGGATGAATACTCAGTCTACTGCATGGAGAAGTGGAGTTCTATTTAATAACCTAGGTGCATATCTCCTAGAACTT
GGTCGCACTATGATGACGTTGCGCATGGGTCAAAATCCTTCAGAAGCTGTTGTTAATGCAGGCCCTGCAGTTTTCATTTCCCAAACTGGTCCTAATCCTATAATG
GTTCAGCCTCTTCCCTTTCAACAAAATGCAAGCTTAGGTCCAGTTCCCATGGGAGCCATGCAGCCTGGCTCTGCACTAATTCATGGTCTTGGTTCTGGATTTCTC
CCAAGACGTATTGACATACAAATTCGAAGAGGTTCACCAAGTACGGCATCAAATGGCAGTCCAGAGGAACGCAGTGGAGCTCAACAGCCTTCAGGGCAACAAGAA
GCAGCCAGAGGTGTGGGTGAAAATCCCACCAATCAAGCAACCACTAGGATTGTAGAGGGCCCAAGTGTGGGAGGTGAATCTGGAGTGCGAGTAGTGCCAATTAGG
ACCATGGTTGCAGCATTACCTGGACCCTTTAGCCGCTTACCTTCAAATTCTTCGGGTAATTCATTTGGGTTATACTATCCTGTTCTTGGAAGATTTCCTCACCCT
GCTTCAGGAAATGCAAGGGCCGAGCGAGGAAGTCAAGCTTCATCAGAGCGTCAGTCTACTGGCCTCCAGAGTGAGCAACATACAATTCTTGAATCTGTCGTGGAA
CAGCAAAATGTAGAAGATGCTGCAAGAGATGGTGGGGCCCAAGGTACCCTAGAATCTGAACGACAAGTTCCCAGCAATGTTGTCCAATTTCTTCGGACTCTCTTT
CCCAGTGGTGAAATCAACATCGAAGATGCCAGTTTCCAGGAGATTAGTGGTTCTATTCCTGCTCATCATTCAATGGCATCAAGTTCTGTTGCAAATGTTCAAGAA
TCAGAATCAAGAACTAGTGATGAAGGAATGTTTTTGTCCAACATATTTCACCAAATCATGCCATTCACATCTCAACGAGGCAACGAGCCAGATATGTCTTCGGTA
GAGGCAAATGCCTCTGAGCGTCAAAACATTCCAGATTCTTCTGCACAAGCTTCAAATGGGGATGCTGAAACATCTCGCAGACGTGCTGATGCTGAAGCAGAAAGA
TTGACACAACAAAATGTGACTAGAGATTATGGGATCTTAATCTGAAATTGCACCTTCAAACTTCTGGGAACGGAAAATTATCACAAAACAGGATCAAATCAACGG
TGGAATTCACTTTACAAGTTGCAAGGAGCACCACAGCATTCTGCCCTCTGCCAATTAAGTGTACTATTTTTCATTCTCTTTTTTTGCATTTCTTCATCCTCCCTT
GGCCCCACCCTCCCCATCACCTTGTACGATATGGTGTGACATTACTATTTTACGCCTCGCCCCCGTGCCTGGAACCCAAAATGGAAAAGAAGTAAAAATGAAAGC
ATTCTTCGAAATTTACAGTGTTCCCATATTTGTGAATGGCCATAAAACTATTAGATCAAAGTCCTTATAAAGCACATCTTCTGCTTCTTGAGTTTGTGCTTAATG
GAACTGTGATTTGGGTTTCAGTCTTCCATCTGAGATTGCGGACTGCTGCTCCAGTTGAATTTCATATGTTTTTCCCTTAAGAGCGCCCCAACACTCATTTGCCCT
TTGTATTTGGTTTGTTCAAGTTATTACCACAATATTAAATATGGAAGAGAAATAGAGTAAATCGCACAAAAATTTGTATCATTTTTGCATTTTGCGTTTGTTAAC
TATGTTGTATTCAATTAGGGGAACCACCCACATGATCTCATCTGGGTATATAAATAGTTTGGTT
Protein sequenceShow/hide protein sequence
MGSNFIDETTSCGEADGSEATIEIKLKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPLPPSETLPNR
PETDPNSSTSRIHSSRVAPGVVIETFSMPVQGDGVPPEINRIVSAVLSSIGLSNSVTGIEGMDVVREIDQQRSGERVIAAGMIDLNQHQSGDSGSRPLSDRFHGT
SGHPSIPPLGSFPPPVIPDSLTTLSQNLSNMRRDFESIGRVGGNNAQEANIHGDEESSSNSSSRPSTTQESFPTPASLAEVMLSTRQMLTGEVSECLLQLARQLE
NHRNVTDPTLRMNTQSTAWRSGVLFNNLGAYLLELGRTMMTLRMGQNPSEAVVNAGPAVFISQTGPNPIMVQPLPFQQNASLGPVPMGAMQPGSALIHGLGSGFL
PRRIDIQIRRGSPSTASNGSPEERSGAQQPSGQQEAARGVGENPTNQATTRIVEGPSVGGESGVRVVPIRTMVAALPGPFSRLPSNSSGNSFGLYYPVLGRFPHP
ASGNARAERGSQASSERQSTGLQSEQHTILESVVEQQNVEDAARDGGAQGTLESERQVPSNVVQFLRTLFPSGEINIEDASFQEISGSIPAHHSMASSSVANVQE
SESRTSDEGMFLSNIFHQIMPFTSQRGNEPDMSSVEANASERQNIPDSSAQASNGDAETSRRRADAEAERLTQQNVTRDYGILI