CuGenDBv2

HG10009675 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009675
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Transglut_core2 domain-containing protein
Genome location: Chr06:8772712..8780091
RNA-Seq Expression: HG10009675
Synteny: HG10009675
Gene Ontology terms: NA
InterPro domains: IPR032698 - Protein SirB1, N-terminal
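
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the gene span computed as follows. The function names are hypothetical, and the span assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location: str) -> int:
    """Length of the region, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "Chr06:8772712..8780091"
    print(parse_location(loc))
    print(span_length(loc))
```

Under that assumption the gene region spans 7380 bp on Chr06.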


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579123.1 hypothetical protein SDJN03_23571, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.2e-193; % identity: 84.16
Query:  SKFN------SSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDD
        SKFN      SSSS S    FRVVCS GF  QP++ KDF+FLLHDA+DSSGIDSTYAKEARKGF+TQIHYLS IER+TSISINR VDLAKAALYIAAEDD
Subjt:  SKFN------SSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDD

Query:  SLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWS
        SLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRRT+ KAQ+EPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWS
Subjt:  SLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWS

Query:  LLDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESG
        LLDFDVEIYHP DD+SLPT+YHKLK +ESDQPHIITTQ+LLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC D SNA+EESG
Subjt:  LLDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESG

Query:  FQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALI
        FQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLY+ETK+SSSPT TLS QEEEAVD+L+KRLALI
Subjt:  FQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALI

Query:  MMEDGWSKPSFAPKFIGKNSEPW
        MMEDGWS+P+FA KFIGKNSEPW
Subjt:  MMEDGWSKPSFAPKFIGKNSEPW

XP_008465887.1 PREDICTED: uncharacterized protein LOC103503468 isoform X1 [Cucumis melo]; e value: 7.2e-202; % identity: 88.15
Query:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAK----EARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDS
        F KFNS SSHS PPCFRVVCS GF  QPNSSKDF FLLHDAMDSSGIDST+AK    EARKGF++QIHYLSKIERDTSISI+RRVDLAKAALYIAAEDDS
Subjt:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAK----EARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDS

Query:  LVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSL
        LVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSL
Subjt:  LVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSL

Query:  LDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGF
        LDFDVEIYHP DDYSLP  YHKLKSKESDQ HI+TTQTLLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC DSS+A EESGF
Subjt:  LDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGF

Query:  QLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIM
        QLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSS TS LS QEEEAVDNLMKRLALIM
Subjt:  QLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIM

Query:  MEDGWSKPSFAPKFIGKNSEPW
        MEDGWS+PSF+ KFI K+SEPW
Subjt:  MEDGWSKPSFAPKFIGKNSEPW

XP_008465893.1 PREDICTED: uncharacterized protein LOC103503468 isoform X2 [Cucumis melo]; e value: 1.3e-203; % identity: 89
Query:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSH
        F KFNS SSHS PPCFRVVCS GF  QPNSSKDF FLLHDAMDSSGIDST+AKEARKGF++QIHYLSKIERDTSISI+RRVDLAKAALYIAAEDDSLVSH
Subjt:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSH

Query:  SSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFD
        SSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFD
Subjt:  SSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFD

Query:  VEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLAS
        VEIYHP DDYSLP  YHKLKSKESDQ HI+TTQTLLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC DSS+A EESGFQLAS
Subjt:  VEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLAS

Query:  AKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDG
        AKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSS TS LS QEEEAVDNLMKRLALIMMEDG
Subjt:  AKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDG

Query:  WSKPSFAPKFIGKNSEPW
        WS+PSF+ KFI K+SEPW
Subjt:  WSKPSFAPKFIGKNSEPW

XP_011655252.2 uncharacterized protein LOC101204123 isoform X1 [Cucumis sativus]; e value: 2.5e-202; % identity: 88.46
Query:  KFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSS
        KFNS SSHS PP FRV CS GFL  PNSS  F FLLH A+DSSGIDST+AKEARKGF++QIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSS
Subjt:  KFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSS

Query:  VPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE
        VPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE
Subjt:  VPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE

Query:  IYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAK
        IYHP DDYSLP  YHKLKSKESDQPHI+TTQTLLVE                IL+NLKESFWPFQQNQSRSLFLRAAD ANC DSS+A EESGFQLASAK
Subjt:  IYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAK

Query:  AAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDGWS
        AAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSSPTS LS QEEEAVDNLMKRLALIMMEDGWS
Subjt:  AAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDGWS

Query:  KPSFAPKFIGKNSEPW
        +PSF+ KFIGKNSEPW
Subjt:  KPSFAPKFIGKNSEPW

XP_038875536.1 uncharacterized protein LOC120067957 [Benincasa hispida]; e value: 7.7e-204; % identity: 89.02
Query:  FSKFNSSSSHSPPPCFRVVCSDGFL-HQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVS
        FSKFNSSS HS PPCFRVVCS GFL  QPNS KDFQFLLHDAMDSSGIDST+AKEARKGF++QIHYLSK+ERDTSISINRRVDLAKAALYIAAEDDSLVS
Subjt:  FSKFNSSSSHSPPPCFRVVCSDGFL-HQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVS

Query:  HSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDF
        HSSVPLPVDAFI+R+SDLSMGYCTHYKSSFNSSPEIFLESIE YMYVMKGFRR +SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDF
Subjt:  HSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDF

Query:  DVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLA
        DVE+YHP DDYSLPT YHKLKSKESDQPHI+TTQTLLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC DS NA EESGFQLA
Subjt:  DVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLA

Query:  SAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMED
        SAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTS LS QEEEAVDNLM RLALIMMED
Subjt:  SAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMED

Query:  GWSKPSFAPKFIGKNSEPW
        GWS+PS   KFIGKNSEPW
Subjt:  GWSKPSFAPKFIGKNSEPW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KTQ8 Transglut_core2 domain-containing protein; e value: 1.2e-202; % identity: 88.46
Query:  KFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSS
        KFNS SSHS PP FRV CS GFL  PNSS  F FLLH A+DSSGIDST+AKEARKGF++QIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSS
Subjt:  KFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSS

Query:  VPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE
        VPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE
Subjt:  VPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE

Query:  IYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAK
        IYHP DDYSLP  YHKLKSKESDQPHI+TTQTLLVE                IL+NLKESFWPFQQNQSRSLFLRAAD ANC DSS+A EESGFQLASAK
Subjt:  IYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAK

Query:  AAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDGWS
        AAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSSPTS LS QEEEAVDNLMKRLALIMMEDGWS
Subjt:  AAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDGWS

Query:  KPSFAPKFIGKNSEPW
        +PSF+ KFIGKNSEPW
Subjt:  KPSFAPKFIGKNSEPW

A0A1S3CPY9 uncharacterized protein LOC103503468 isoform X2; e value: 6.3e-204; % identity: 89
Query:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSH
        F KFNS SSHS PPCFRVVCS GF  QPNSSKDF FLLHDAMDSSGIDST+AKEARKGF++QIHYLSKIERDTSISI+RRVDLAKAALYIAAEDDSLVSH
Subjt:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSH

Query:  SSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFD
        SSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFD
Subjt:  SSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFD

Query:  VEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLAS
        VEIYHP DDYSLP  YHKLKSKESDQ HI+TTQTLLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC DSS+A EESGFQLAS
Subjt:  VEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLAS

Query:  AKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDG
        AKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSS TS LS QEEEAVDNLMKRLALIMMEDG
Subjt:  AKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMMEDG

Query:  WSKPSFAPKFIGKNSEPW
        WS+PSF+ KFI K+SEPW
Subjt:  WSKPSFAPKFIGKNSEPW

A0A1S3CRC3 uncharacterized protein LOC103503468 isoform X1; e value: 3.5e-202; % identity: 88.15
Query:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAK----EARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDS
        F KFNS SSHS PPCFRVVCS GF  QPNSSKDF FLLHDAMDSSGIDST+AK    EARKGF++QIHYLSKIERDTSISI+RRVDLAKAALYIAAEDDS
Subjt:  FSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAK----EARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDS

Query:  LVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSL
        LVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSL
Subjt:  LVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSL

Query:  LDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGF
        LDFDVEIYHP DDYSLP  YHKLKSKESDQ HI+TTQTLLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC DSS+A EESGF
Subjt:  LDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGF

Query:  QLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIM
        QLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSS TS LS QEEEAVDNLMKRLALIM
Subjt:  QLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIM

Query:  MEDGWSKPSFAPKFIGKNSEPW
        MEDGWS+PSF+ KFI K+SEPW
Subjt:  MEDGWSKPSFAPKFIGKNSEPW

A0A6J1FEJ1 uncharacterized protein LOC111445003 isoform X2; e value: 2.8e-191; % identity: 84.82
Query:  SKFNSSSSHSPPP----CFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL
        SKFN SSS S  P     FRVVCS GF  QP++ KDF+FLLHDA+DSSGIDSTYAKEARKGF+TQIHYLS IER+TSISINR VDLAKAALYIAAEDDSL
Subjt:  SKFNSSSSHSPPP----CFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL

Query:  VSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLL
        VSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRRT+ KAQ+EPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLL
Subjt:  VSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLL

Query:  DFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQ
        DFDVEIYHP DD+SLPT+YHKLK +ESDQPHIITTQ+LLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC D SNA+EESGFQ
Subjt:  DFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQ

Query:  LASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMM
        LASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETK+SSSPT TLS QEEEAVD+LMKRLALIMM
Subjt:  LASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMM

Query:  EDGWSKPSFAPKFIG
        EDGWS+P+FA KFIG
Subjt:  EDGWSKPSFAPKFIG

A0A6J1JZH6 uncharacterized protein LOC111489694 isoform X1; e value: 8.0e-191; % identity: 84.41
Query:  SKFNSSSSHSPPP------CFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDD
        SKFNSSSS S          FRVVCS GF  QP+  KDF+FLLHDA+DSSGIDSTYAKEARKGF+TQIHYLS IER+TSISINR VDLAKAALYIAAEDD
Subjt:  SKFNSSSSHSPPP------CFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDD

Query:  SLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWS
        SLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRRT+ KAQ+EPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWS
Subjt:  SLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWS

Query:  LLDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESG
        LLDFDVEIYHP DD+SLPT+YHKLK KESDQPHIITTQ+LLVE                ILSNLKESFWPFQQNQSRSLFLRAADVANC D SNA+EESG
Subjt:  LLDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESG

Query:  FQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALI
        FQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCG+YEQSLEYLKLYQETK+SSSPT TLS QEEEAVD+LMKRLALI
Subjt:  FQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALI

Query:  MMEDGWSKPSFAPKFIG
        MMEDGWS+P+FA KFIG
Subjt:  MMEDGWSKPSFAPKFIG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G19160.1 unknown protein; e value: 1.7e-04; % identity: 22.55
Query:  LESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPRDDYSLPTSYHKLKSKE
        LE++   ++ ++GF+RT+     +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPRDDYSLPTSYHKLKSKE

Query:  SDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAKAAQHRLERGVWTSVRYGDMRR
         D P  + +      +  LD   +  +     L+NL    W      S  L L +          N    S F L                 +R  D+R 
Subjt:  SDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAKAAQHRLERGVWTSVRYGDMRR

Query:  ALSACERLILLDVDPKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMM
        A++A ERL++L      L RD  ++LY+   Y ++++ L +      + +P      +EE  ++  ++RL L+ +
Subjt:  ALSACERLILLDVDPKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMM

AT4G19160.2 unknown protein; e value: 2.7e-05; % identity: 22.9
Query:  KEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNS-SP
        K AR+ F  +I   SK   D+ ISI      AK   YIAAED++                   V   S P   D+    L  L     + + S  ++ S 
Subjt:  KEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNS-SP

Query:  EI---------------FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYH
        E+                LE++   ++ ++GF+RT+     +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++ 
Subjt:  EI---------------FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYH

Query:  PRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAKAAQ
             SL   +  +  +  D P  + +      +  LD   +  +     L+NL    W      S  L L +          N    S F L       
Subjt:  PRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAKAAQ

Query:  HRLERGVWTSVRYGDMRRALSACERLILLDVDPKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMM
                  +R  D+R A++A ERL++L      L RD  ++LY+   Y ++++ L +      + +P      +EE  ++  ++RL L+ +
Subjt:  HRLERGVWTSVRYGDMRRALSACERLILLDVDPKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLALIMM

AT4G19160.3 unknown protein; e value: 4.6e-05; % identity: 24.11
Query:  KEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNS-SP
        K AR+ F  +I   SK   D+ ISI      AK   YIAAED++                   V   S P   D+    L  L     + + S  ++ S 
Subjt:  KEARKGFMTQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNS-SP

Query:  EI---------------FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYH
        E+                LE++   ++ ++GF+RT+     +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++ 
Subjt:  EI---------------FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYH

Query:  PRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAKAAQ
             SL   +  +  +  D P  + +      +  LD   +  +     L+NL    W      S  L L +          N    S F L       
Subjt:  PRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNASEESGFQLASAKAAQ

Query:  HRLERGVWTSVRYGDMRRALSACERLILLDVDPKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAV-DNLMKRLALIMM
                  +R  D+R A++A ERL++L      L RD  ++LY+      S +Y +  QE     S     +  EEEAV +  ++RL L+ +
Subjt:  HRLERGVWTSVRYGDMRRALSACERLILLDVDPKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAV-DNLMKRLALIMM


Sequences
CDS sequence
ATGGATGGACCCACCCCAACCCCACTTTCTCTCTCTACACTAGAATGGATGGACCCTTCTCTAAGTTTTCATGCGCAAACTACACCAACAGATCTCCCAAGTCGACCCAT
GTTGCTCGTCGCCGCCGGCGGCCACGTTGGGAGCTCTCCCTCTCTTTCACCTGAGTTTTGCACATCCACCGTCAGTCGACTTCAAAAGGAAGTTGGAAATGGTGAGTCGA
TTCAGTTGTTTCATCATCCTTGGATTTCCTGTTCTGTTTCTTTTCATATGGTGACGCCAAATTTTGGGAATTCGAATGCAACTATGGATTCCTCGATTAGAAATGACGGC
AGGTGGGACGAGGATAAGGTAAAATCCTTACTGTATCCTAAGTTTAATTTGAACAATTCGTTTTCTTGGCTCTCCAACTGTTCGGACTGGGTGTTGGCTAGTTGTAATCA
TGATGTAACTAACTTTGTTCAGCCTTTGTTCTCCAAATTCAATTCATCTTCATCCCATTCCCCTCCCCCGTGTTTTCGAGTGGTCTGTTCTGATGGGTTTCTTCACCAAC
CCAATTCTTCCAAGGATTTCCAGTTCCTTCTCCACGATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGGAGGCTAGGAAGGGTTTCATGACTCAGATTCAC
TATTTATCTAAGATAGAGAGGGACACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCATTC
ATCTGTTCCTCTTCCCGTCGATGCATTTATTCATAGACTAAGTGATCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCATCACCTGAAATTTTTTTGG
AAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAACCAATTCTAAAGCTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCTTGACCCATCGTACA
GGCTCAGCTGCACTACTTTCACTCATATACTCAGAGATCCTGAAAATGCTTCGTTTATGGAGCCTTTTAGATTTTGATGTAGAGATATATCATCCTCGTGATGATTATAG
CCTTCCCACGAGCTATCATAAACTGAAAAGCAAGGAATCTGATCAACCACACATAATAACAACTCAAACTCTCTTGGTGGAGGTAACTACTCTTGATCCTATCGAAGATA
GTTATTTAGAAAGGTCACAGATTTTAAGCAATTTAAAGGAATCATTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCTGCTGATGTTGCTAACTGT
TGTGATAGTTCGAATGCATCTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTGCGTTATGGAGATAT
GAGGCGTGCATTATCTGCATGTGAACGGCTTATTCTCCTTGATGTTGATCCGAAGGAATTGAGAGATTATAGCATCCTTCTGTACCATTGTGGCTTTTATGAGCAATCTC
TGGAGTATTTGAAGTTGTATCAGGAAACAAAGAGCTCCTCAAGTCCAACCAGCACATTAAGTTTCCAGGAGGAAGAGGCTGTGGATAATTTGATGAAACGCCTTGCACTT
ATTATGATGGAAGATGGTTGGAGCAAACCCTCATTTGCTCCAAAGTTCATTGGTAAGAACTCAGAACCGTGGTAA
mRNA sequence
ATGGATGGACCCACCCCAACCCCACTTTCTCTCTCTACACTAGAATGGATGGACCCTTCTCTAAGTTTTCATGCGCAAACTACACCAACAGATCTCCCAAGTCGACCCAT
GTTGCTCGTCGCCGCCGGCGGCCACGTTGGGAGCTCTCCCTCTCTTTCACCTGAGTTTTGCACATCCACCGTCAGTCGACTTCAAAAGGAAGTTGGAAATGGTGAGTCGA
TTCAGTTGTTTCATCATCCTTGGATTTCCTGTTCTGTTTCTTTTCATATGGTGACGCCAAATTTTGGGAATTCGAATGCAACTATGGATTCCTCGATTAGAAATGACGGC
AGGTGGGACGAGGATAAGGTAAAATCCTTACTGTATCCTAAGTTTAATTTGAACAATTCGTTTTCTTGGCTCTCCAACTGTTCGGACTGGGTGTTGGCTAGTTGTAATCA
TGATGTAACTAACTTTGTTCAGCCTTTGTTCTCCAAATTCAATTCATCTTCATCCCATTCCCCTCCCCCGTGTTTTCGAGTGGTCTGTTCTGATGGGTTTCTTCACCAAC
CCAATTCTTCCAAGGATTTCCAGTTCCTTCTCCACGATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGGAGGCTAGGAAGGGTTTCATGACTCAGATTCAC
TATTTATCTAAGATAGAGAGGGACACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCATTC
ATCTGTTCCTCTTCCCGTCGATGCATTTATTCATAGACTAAGTGATCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCATCACCTGAAATTTTTTTGG
AAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAACCAATTCTAAAGCTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCTTGACCCATCGTACA
GGCTCAGCTGCACTACTTTCACTCATATACTCAGAGATCCTGAAAATGCTTCGTTTATGGAGCCTTTTAGATTTTGATGTAGAGATATATCATCCTCGTGATGATTATAG
CCTTCCCACGAGCTATCATAAACTGAAAAGCAAGGAATCTGATCAACCACACATAATAACAACTCAAACTCTCTTGGTGGAGGTAACTACTCTTGATCCTATCGAAGATA
GTTATTTAGAAAGGTCACAGATTTTAAGCAATTTAAAGGAATCATTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCTGCTGATGTTGCTAACTGT
TGTGATAGTTCGAATGCATCTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTGCGTTATGGAGATAT
GAGGCGTGCATTATCTGCATGTGAACGGCTTATTCTCCTTGATGTTGATCCGAAGGAATTGAGAGATTATAGCATCCTTCTGTACCATTGTGGCTTTTATGAGCAATCTC
TGGAGTATTTGAAGTTGTATCAGGAAACAAAGAGCTCCTCAAGTCCAACCAGCACATTAAGTTTCCAGGAGGAAGAGGCTGTGGATAATTTGATGAAACGCCTTGCACTT
ATTATGATGGAAGATGGTTGGAGCAAACCCTCATTTGCTCCAAAGTTCATTGGTAAGAACTCAGAACCGTGGTAA
Protein sequence
MDGPTPTPLSLSTLEWMDPSLSFHAQTTPTDLPSRPMLLVAAGGHVGSSPSLSPEFCTSTVSRLQKEVGNGESIQLFHHPWISCSVSFHMVTPNFGNSNATMDSSIRNDG
RWDEDKVKSLLYPKFNLNNSFSWLSNCSDWVLASCNHDVTNFVQPLFSKFNSSSSHSPPPCFRVVCSDGFLHQPNSSKDFQFLLHDAMDSSGIDSTYAKEARKGFMTQIH
YLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRT
GSAALLSLIYSEILKMLRLWSLLDFDVEIYHPRDDYSLPTSYHKLKSKESDQPHIITTQTLLVEVTTLDPIEDSYLERSQILSNLKESFWPFQQNQSRSLFLRAADVANC
CDSSNASEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDPKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPTSTLSFQEEEAVDNLMKRLAL
IMMEDGWSKPSFAPKFIGKNSEPW
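
The CDS and protein sequences above can be cross-checked by translating codons with the standard genetic code. The following is a minimal sketch rather than the database's own pipeline: the `translate` function and the variable names are hypothetical, the standard nuclear code is assumed, and only the first and last codons of the CDS listed above are used as input.

```python
# Standard genetic code, with codons ordered by base index in "TCAG"
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # str.index raises ValueError on ambiguous bases such as N.
        b1, b2, b3 = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


if __name__ == "__main__":
    cds_start = "ATGGATGGACCCACCCCAACC"  # first 7 codons of the CDS above
    cds_end = "CCGTGGTAA"                # last 3 codons of the CDS above
    print(translate(cds_start))
    print(translate(cds_end))
```

The first fragment translates to MDGPTPT and the last to PW followed by a stop, which agrees with the start and end of the protein sequence listed above.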