; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022226 (gene) of Chayote v1 genome

Gene IDSed0022226
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationLG03:11600652..11604190
RNA-Seq ExpressionSed0022226
SyntenySed0022226
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa]1.1e-22390.5Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL V+SLALNL QATRPAGAAVHPSATPCFCKI+I +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+TVS DGAEN PKVFQNGWVKLGKDE K+ ARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF   S+ST+KG+FVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo]3.2e-22390.26Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL V+SLALNL QATRPAGAAVHPSATPCFCKI+I +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+TVS DGAEN PKVFQNGWVKLGKDE K+ ARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLATG+PIAE TMSVKKGG+FCIDRKT RDF   S+ST+KG+FVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

XP_022938284.1 uncharacterized protein LOC111444418 [Cucurbita moschata]1.7e-22189.79Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL VESL+LNL QATRPAGAAVHPS TPCFCKIAI +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVR+ +S DGAEN PK FQNGWV LGKDE K  ARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF+P SRS IKGNFVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

XP_022965765.1 uncharacterized protein LOC111465558 [Cucurbita maxima]4.6e-22290.02Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL VESL+LNL QATRPAGAAVHPS TPCFCKIAI +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+ +S DGAEN PK FQNGWV LGKDE K  ARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF+P SRS IKGNFVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida]1.6e-22290.26Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL VESLALNL QATRPAGAAVHPSATPCFCKIAI +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRR+SG PV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+TVS DGAEN P+VFQNGWVKLGKD+ K+ ARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF+  S+S +KGNFVMASSV+GEGK SKPVVQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

TrEMBL top hitse value%identityAlignment
A0A0A0LIQ0 Uncharacterized protein5.5e-22189.55Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL V+SLALNL QATRPAGAAVHPSATPCFCKI+I +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+TVS DGAE+ PKVFQNGWVKLGK E K+ ARLHLVVRSEPDPRFVFQFG EPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RD    S+ST+KG+FVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

A0A1S3C9N5 uncharacterized protein LOC1034982201.5e-22390.26Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL V+SLALNL QATRPAGAAVHPSATPCFCKI+I +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+TVS DGAEN PKVFQNGWVKLGKDE K+ ARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLATG+PIAE TMSVKKGG+FCIDRKT RDF   S+ST+KG+FVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

A0A5A7V777 DUF1005 domain-containing protein5.3e-22490.5Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL V+SLALNL QATRPAGAAVHPSATPCFCKI+I +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+TVS DGAEN PKVFQNGWVKLGKDE K+ ARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF   S+ST+KG+FVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

A0A6J1FDM1 uncharacterized protein LOC1114444188.4e-22289.79Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL VESL+LNL QATRPAGAAVHPS TPCFCKIAI +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVR+ +S DGAEN PK FQNGWV LGKDE K  ARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF+P SRS IKGNFVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

A0A6J1HPN4 uncharacterized protein LOC1114655582.2e-22290.02Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR
        MDPCPF+RL VESL+LNL QATRPAGAAVHPS TPCFCKIAI +FPSQTA LPLSSVSGD+PPDS ASSA FHLDP SLRRLSGKPV MC SVFAGRMG 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGR

Query:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVR+ +S DGAEN PK FQNGWV LGKDE K  ARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNI+QPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPG DRVSRSNPGAWLILRP+GFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATG+PIAE TMSVKKGG+FCIDRKT RDF+P SRS IKGNFVMASSV+GEGK SKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTHKLRRELCHDEHDSSFL
        RHFT KLRRELCHDEHDSSFL
Subjt:  RHFTHKLRRELCHDEHDSSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.4e-12048.28Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPV-----GMCFSVFA
        MDPCPFIRL + +LAL +  A +   + VHPS++PCFCKI + +FP QTA +P   +     P+    +A FHL    ++RL+ + +      +   ++ 
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPV-----GMCFSVFA

Query:  GRMGRTCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGK--MLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFS---
        GR G  CGV+SG+LL +V V +   G ++ P VF NGW+ +GK  GK    A+ HL V++EPDPRFVFQF GEPECSP V QIQGNI+QPVF+CKFS   
Subjt:  GRMGRTCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGK--MLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PG+ERKGW I V+DLSGSPVA AS++TPFV SPG DRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCID----RKTARDFNPISRS----------------------------------TIK
        AWRER G  DGLGY+FEL+ +     G+ +AE T+S  +GG+F I+      ++   + ++RS                                   + 
Subjt:  AWRER-GPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCID----RKTARDFNPISRS----------------------------------TIK

Query:  GNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRRELCHD
          FVM++SV+GEGK SKP V+V VQHV+CM DAA +VAL+AAIDLSMDACR F  ++R+ELCH+
Subjt:  GNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005)8.9e-8343.68Show/hide
Query:  MDPCPFIRLKVESLALNL-------SQATRPAGAAVHP-SATPCFCKIAINHFPSQTAFLPL------SSVSGDTPPDSIASSADFHLDPPSLRRLSGKP
        MDPC F+R+ V +LA+         S ++  +G +V   S+  C+CKI    FP Q   +P+       S S     +    +A F L    +     K 
Subjt:  MDPCPFIRLKVESLALNL-------SQATRPAGAAVHP-SATPCFCKIAINHFPSQTAFLPL------SSVSGDTPPDSIASSADFHLDPPSLRRLSGKP

Query:  VGMCFSV-FAGRMGRTCG---VNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLG---KDEGKMLA--RLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQ
             SV    R   +CG    +  KL+GR +VT+    AE+   +  NGWV LG   K+  K  +   LH+ VR EPD RFVFQF GEPECSP VFQ+Q
Subjt:  VGMCFSV-FAGRMGRTCG---VNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLG---KDEGKMLA--RLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQ

Query:  GNIQQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGF
        GN +Q VF+CKF   RNS  R+L    S + T GK         E+  +ERKGW I ++DLSGSPVA ASM+TPFVPSPG +RVSRS+PGAWLILRP+G+
Subjt:  GNIQQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGF

Query:  SVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTAR-----------DFNPISRSTIKGN--------------
           +WKPW RL+AWRE G  D LGY+FEL  + G+A  V  A  ++S K GG F ID  T+             F+  S S+I+ +              
Subjt:  SVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRKTAR-----------DFNPISRSTIKGN--------------

Query:  --------FVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRREL
                FVM++ V G  K SKP V+VGV+HVTC  DAA  VALAAA+DLSMDACR F+ KLR EL
Subjt:  --------FVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRREL

AT3G19680.1 Protein of unknown function (DUF1005)1.5e-9043.12Show/hide
Query:  MDPCPFIRLKVESLALNL-------SQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASS-------ADFHLDPPSLRRLSGKP
        MDPC F+R+ V +LA+         S ++ P+ + ++P+A  C+CKI   +FP +   +P+   +        +SS       A F L    +     KP
Subjt:  MDPCPFIRLKVESLALNL-------SQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASS-------ADFHLDPPSLRRLSGKP

Query:  VGMCFSVFA----------GRMGRTCGVNSG--KLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLA----RLHLVVRSEPDPRFVFQFGGEPECS
             SV A          G  G +CG+ +   KLLGR  V++    AE    +  NGWV L   + K        LH+ VR EPDPRFVFQF GEPECS
Subjt:  VGMCFSVFA----------GRMGRTCGVNSG--KLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLA----RLHLVVRSEPDPRFVFQFGGEPECS

Query:  PVVFQIQGNIQQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNP
        P VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P +ERKGW I V+DLSGSPVA ASM+TPFVPSPG +RV+RS+P
Subjt:  PVVFQIQGNIQQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNP

Query:  GAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCID------------------------------RK
        GAWLILRP+G    +WKPWGRLEAWRE G  D LGY+FEL  + G+AT V  A  ++S+K GG F ID                              R 
Subjt:  GAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCID------------------------------RK

Query:  TAR-------DFNPI-----SRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRREL
         +R       DF  +     S +     FVM+++V+G GK SKP V+VGV HVTC  DAA  VALAAA+DLS+DACR F+HKLR+EL
Subjt:  TAR-------DFNPI-----SRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRREL

AT4G29310.1 Protein of unknown function (DUF1005)1.9e-15765.57Show/hide
Query:  MDPCPFIRLKVESLALNLSQ--ATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVS-GDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGR
        MDPCPF+RL ++SLAL L +    +  G  VHPS+TPC+CK+ I HFPSQ A LPLSS S   +PP+S  S+  FHLD  ++RR+SGK + +  SV+AGR
Subjt:  MDPCPFIRLKVESLALNLSQ--ATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVS-GDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGR

Query:  MGRTCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRT
         G TCGV SGKLLG+V V V    A +    F NGW KLG D  K  ARLHL+V +EPDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGRTCGVNSGKLLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  RERKGWMI ++DLSGSPVAAASMITPFV SPG DRVSRSNPGAWLILRP+G  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRK-TARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAID
         IDGLGYKFELV +   +TG+PIAEGTMS K+GG+F IDR+ + +  +P   S +KG FVM SSV+GEGK SKPVV VG QHVTCMADAALFVAL+AA+D
Subjt:  PIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGRFCIDRK-TARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAID

Query:  LSMDACRHFTHKLRRELCHDEHDS
        LS+DAC+ F+ KLR+ELCHD+  S
Subjt:  LSMDACRHFTHKLRRELCHDEHDS

AT5G17640.1 Protein of unknown function (DUF1005)2.5e-8541.59Show/hide
Query:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPS---ATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRL------SGKPVGMCF
        MDP  FIRL V SLAL + +    + +  +     ++ C C+I +  FP QT  +PL   S D  PD  + S  F+L+   LR L            +  
Subjt:  MDPCPFIRLKVESLALNLSQATRPAGAAVHPS---ATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRL------SGKPVGMCF

Query:  SVFAGRMGRTCGVNSGK-LLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  E  P +  NGW+ +GK +    A LHL V+ +PDPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGRTCGVNSGK-LLGRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   RERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP+    +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVANTGLATG-VPIAEGTMSVKKGGRFCIDR---------------KTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVG
        AWRERG  D +  +F L++N GL  G V ++E  +S +KGG F ID                +++ DF+ + +    G FVM+S V GEGK+SKPVVQ+ 
Subjt:  AWRERGPIDGLGYKFELVANTGLATG-VPIAEGTMSVKKGGRFCIDR---------------KTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVG

Query:  VQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRRELCH
        ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F    RR   H
Subjt:  VQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRRELCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCATGTCCGTTCATCAGGCTGAAGGTGGAGTCCCTCGCTCTCAACCTCTCTCAGGCAACCCGACCCGCCGGCGCCGCCGTCCACCCGTCGGCCACGCCCTGCTT
CTGCAAGATCGCGATCAACCATTTCCCTTCCCAGACGGCGTTTCTTCCCCTTTCCTCCGTTTCCGGCGACACGCCGCCGGACTCCATCGCGTCGTCCGCCGATTTCCACC
TCGACCCGCCGTCCCTCCGCCGGCTCTCTGGTAAGCCCGTTGGGATGTGTTTTTCGGTTTTTGCCGGCCGGATGGGCCGCACGTGTGGGGTTAATTCTGGAAAATTGCTC
GGCCGGGTTCGGGTCACTGTTTCGACCGACGGTGCTGAGAATGTTCCGAAAGTGTTTCAGAATGGGTGGGTGAAATTGGGGAAAGATGAGGGTAAAATGTTGGCTCGGCT
TCACTTGGTTGTCCGGTCTGAACCGGATCCACGGTTCGTGTTCCAGTTCGGCGGTGAACCGGAATGTAGCCCGGTTGTATTCCAGATCCAAGGCAATATCCAACAGCCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGGAACTCAAGAACCCGATCACTGCCTTCAGATTTCAGCTTCAACAGCACCAAAGGAAAATGGATGAGAACTTTTTCAGGGGAA
AGAGAGAAGCCAGGGAGAGAAAGAAAAGGTTGGATGATCATGGTTTACGACCTCTCCGGCTCCCCGGTCGCAGCTGCCTCCATGATCACGCCGTTCGTCCCTTCCCCGGG
CATGGACCGTGTCTCGCGCTCCAACCCCGGTGCCTGGCTCATCCTCCGCCCCAACGGTTTCTCCGTCAGTAGTTGGAAGCCATGGGGTCGCCTTGAGGCTTGGCGTGAGC
GGGGACCTATAGATGGCCTTGGCTACAAGTTCGAGCTTGTCGCCAACACCGGACTCGCCACAGGCGTTCCCATCGCCGAAGGTACCATGAGCGTAAAAAAAGGTGGCCGG
TTTTGCATCGACCGTAAAACTGCAAGAGATTTCAATCCAATCTCTAGATCCACCATTAAAGGCAACTTTGTAATGGCTTCGAGCGTGGATGGGGAAGGGAAGGCGAGCAA
GCCCGTCGTACAAGTCGGGGTTCAGCACGTGACGTGCATGGCGGATGCTGCTTTATTTGTAGCACTTGCAGCAGCCATTGATCTTAGCATGGATGCTTGTAGACATTTTA
CACATAAGCTAAGAAGGGAGCTTTGTCACGACGAACACGATTCAAGTTTTCTTTGA
mRNA sequenceShow/hide mRNA sequence
TTATTATTCTTTATAAGGATTTGAAGTAAATAATAATATTTTGACAAAAATGTCAAGGTTCTCTATCTGTCACAAGAGAGAGAAACACAATAAAAAAGCAAAACCTCCAA
TTCATCAATAGCAAACAGCTCACCCAAATCGCCAATTTTAGATGTTCACTAATTACTCTTATAAATCTGCAATTCCCAATTCACATTGCCGGCGCCGGCGAATTTTCCGA
TGGATCCATGTCCGTTCATCAGGCTGAAGGTGGAGTCCCTCGCTCTCAACCTCTCTCAGGCAACCCGACCCGCCGGCGCCGCCGTCCACCCGTCGGCCACGCCCTGCTTC
TGCAAGATCGCGATCAACCATTTCCCTTCCCAGACGGCGTTTCTTCCCCTTTCCTCCGTTTCCGGCGACACGCCGCCGGACTCCATCGCGTCGTCCGCCGATTTCCACCT
CGACCCGCCGTCCCTCCGCCGGCTCTCTGGTAAGCCCGTTGGGATGTGTTTTTCGGTTTTTGCCGGCCGGATGGGCCGCACGTGTGGGGTTAATTCTGGAAAATTGCTCG
GCCGGGTTCGGGTCACTGTTTCGACCGACGGTGCTGAGAATGTTCCGAAAGTGTTTCAGAATGGGTGGGTGAAATTGGGGAAAGATGAGGGTAAAATGTTGGCTCGGCTT
CACTTGGTTGTCCGGTCTGAACCGGATCCACGGTTCGTGTTCCAGTTCGGCGGTGAACCGGAATGTAGCCCGGTTGTATTCCAGATCCAAGGCAATATCCAACAGCCGGT
TTTCAGCTGCAAGTTCAGTGCCGATCGGAACTCAAGAACCCGATCACTGCCTTCAGATTTCAGCTTCAACAGCACCAAAGGAAAATGGATGAGAACTTTTTCAGGGGAAA
GAGAGAAGCCAGGGAGAGAAAGAAAAGGTTGGATGATCATGGTTTACGACCTCTCCGGCTCCCCGGTCGCAGCTGCCTCCATGATCACGCCGTTCGTCCCTTCCCCGGGC
ATGGACCGTGTCTCGCGCTCCAACCCCGGTGCCTGGCTCATCCTCCGCCCCAACGGTTTCTCCGTCAGTAGTTGGAAGCCATGGGGTCGCCTTGAGGCTTGGCGTGAGCG
GGGACCTATAGATGGCCTTGGCTACAAGTTCGAGCTTGTCGCCAACACCGGACTCGCCACAGGCGTTCCCATCGCCGAAGGTACCATGAGCGTAAAAAAAGGTGGCCGGT
TTTGCATCGACCGTAAAACTGCAAGAGATTTCAATCCAATCTCTAGATCCACCATTAAAGGCAACTTTGTAATGGCTTCGAGCGTGGATGGGGAAGGGAAGGCGAGCAAG
CCCGTCGTACAAGTCGGGGTTCAGCACGTGACGTGCATGGCGGATGCTGCTTTATTTGTAGCACTTGCAGCAGCCATTGATCTTAGCATGGATGCTTGTAGACATTTTAC
ACATAAGCTAAGAAGGGAGCTTTGTCACGACGAACACGATTCAAGTTTTCTTTGAAGTTCTATCGATTTTTTCGAGATTTTTTTTGGCTTTTTTTCTTTGCAATGATAAC
CAGTTTATCATCCAAGGCAGAGCTGAATTTGAGCAGAGAAATCGTATATTTATGTAATGTTCAATTCATTTTTTTTTTCTCTTTCGAAACCCTTAATTTTCTCCTTTAGA
TTGATTCTTTAATTCCCTTTTGTTTTATTTCCACTTTTAATTTGGATTTTGATTAAGTAGTTTTAGTTTCAGTTTCAAAGGCTTTTTCTGTGTAAATTTAAATTTGATCC
AGCCAAAAGTGTATAGACAGGGAACAAGAATTTCTCCTATGTTTTGTGATATGATATAAACCGAATGAAAATTGATTTATTGCTCTTGCTTTA
Protein sequenceShow/hide protein sequence
MDPCPFIRLKVESLALNLSQATRPAGAAVHPSATPCFCKIAINHFPSQTAFLPLSSVSGDTPPDSIASSADFHLDPPSLRRLSGKPVGMCFSVFAGRMGRTCGVNSGKLL
GRVRVTVSTDGAENVPKVFQNGWVKLGKDEGKMLARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIQQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGMDRVSRSNPGAWLILRPNGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGVPIAEGTMSVKKGGR
FCIDRKTARDFNPISRSTIKGNFVMASSVDGEGKASKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTHKLRRELCHDEHDSSFL