; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G10365 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G10365
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTransglut_core2 domain-containing protein
Genome locationctg1675:874210..879719
RNA-Seq ExpressionCucsat.G10365
SyntenyCucsat.G10365
Gene Ontology termsNA
InterPro domainsIPR032698 - Protein SirB1, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050203.1 Transglut_core2 domain-containing protein [Cucumis melo var. makuwa]8.30e-25298.34Show/hide
Query:  LDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIE
        +DSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIE
Subjt:  LDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIE

Query:  RYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILT
        RYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HIMTTQTLLVEIL+
Subjt:  RYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILT

Query:  NLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQS
        NLKESFWPFQQNQSRSLFLRAAD ANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQS
Subjt:  NLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQS

Query:  LEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        LEYLKLYQETKGSSS TSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGK+SEPW
Subjt:  LEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

XP_008465887.1 PREDICTED: uncharacterized protein LOC103503468 isoform X1 [Cucumis melo]1.53e-27693.74Show/hide
Query:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAK----EARKGFLSQIHYLSKIER
        SASASLC PR TSSSSSSS     K FKFNSFSSHSTPP FRV CSGGF Q PNSS  FNFLLH A+DSSGIDSTFAK    EARKGFLSQIHYLSKIER
Subjt:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAK----EARKGFLSQIHYLSKIER

Query:  DTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLT
        DTSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLT
Subjt:  DTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLT

Query:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDS
        HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HIMTTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCSDS
Subjt:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDS

Query:  SDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDN
        SDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSS TSKLSSQEEEAVDN
Subjt:  SDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDN

Query:  LMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        LMKRLALIMMEDGWSRPSFSRKFI K+SEPW
Subjt:  LMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

XP_008465893.1 PREDICTED: uncharacterized protein LOC103503468 isoform X2 [Cucumis melo]6.83e-27994.61Show/hide
Query:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSI
        SASASLC PR TSSSSSSS     K FKFNSFSSHSTPP FRV CSGGF Q PNSS  FNFLLH A+DSSGIDSTFAKEARKGFLSQIHYLSKIERDTSI
Subjt:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSI

Query:  SINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTG
        SI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTG
Subjt:  SINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTG

Query:  SAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAF
        SAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HIMTTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCSDSSDAF
Subjt:  SAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAF

Query:  EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKR
        EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSS TSKLSSQEEEAVDNLMKR
Subjt:  EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKR

Query:  LALIMMEDGWSRPSFSRKFIGKNSEPW
        LALIMMEDGWSRPSFSRKFI K+SEPW
Subjt:  LALIMMEDGWSRPSFSRKFIGKNSEPW

XP_011655252.2 uncharacterized protein LOC101204123 isoform X1 [Cucumis sativus]2.24e-303100Show/hide
Query:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE
        MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE
Subjt:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE

Query:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL
        RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL
Subjt:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL

Query:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD
        THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD
Subjt:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD

Query:  SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD
        SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD
Subjt:  SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD

Query:  NLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        NLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
Subjt:  NLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

XP_038875536.1 uncharacterized protein LOC120067957 [Benincasa hispida]2.88e-26389.61Show/hide
Query:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFL-QHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKI
        M+SFT ASASLC PRL SSSS        K  KFNS S HSTPP FRV CS GFL Q PNS   F FLLH A+DSSGIDST AKEARKGFLSQIHYLSK+
Subjt:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFL-QHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKI

Query:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTV
        ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFI+R+SDLSMGYCTHYKSSFNSSPEIFLESIE YMYVMKGFRR  SKAQSEPRALYLHTV
Subjt:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTV

Query:  LTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCS
        LTHRTGSAALLSLIYSEILKMLRLWSLLDFDVE+YHPHDDYSLP GYHKLKSKESDQPHIMTTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCS
Subjt:  LTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCS

Query:  DSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV
        DS +AFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSPTSKLS QEEEAV
Subjt:  DSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV

Query:  DNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        DNLM RLALIMMEDGWSRPS  RKFIGKNSEPW
Subjt:  DNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

TrEMBL top hitse value%identityAlignment
A0A0A0KTQ8 Transglut_core2 domain-containing protein2.18e-30099.31Show/hide
Query:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE
        MASFTSASASLCFPRL   SSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE
Subjt:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE

Query:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL
        RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL
Subjt:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL

Query:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD
        THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD
Subjt:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD

Query:  SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD
        SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD
Subjt:  SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD

Query:  NLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        NLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
Subjt:  NLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

A0A1S3CPY9 uncharacterized protein LOC103503468 isoform X23.31e-27994.61Show/hide
Query:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSI
        SASASLC PR TSSSSSSS     K FKFNSFSSHSTPP FRV CSGGF Q PNSS  FNFLLH A+DSSGIDSTFAKEARKGFLSQIHYLSKIERDTSI
Subjt:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSI

Query:  SINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTG
        SI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTG
Subjt:  SINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTG

Query:  SAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAF
        SAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HIMTTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCSDSSDAF
Subjt:  SAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAF

Query:  EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKR
        EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSS TSKLSSQEEEAVDNLMKR
Subjt:  EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKR

Query:  LALIMMEDGWSRPSFSRKFIGKNSEPW
        LALIMMEDGWSRPSFSRKFI K+SEPW
Subjt:  LALIMMEDGWSRPSFSRKFIGKNSEPW

A0A1S3CRC3 uncharacterized protein LOC103503468 isoform X17.41e-27793.74Show/hide
Query:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAK----EARKGFLSQIHYLSKIER
        SASASLC PR TSSSSSSS     K FKFNSFSSHSTPP FRV CSGGF Q PNSS  FNFLLH A+DSSGIDSTFAK    EARKGFLSQIHYLSKIER
Subjt:  SASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAK----EARKGFLSQIHYLSKIER

Query:  DTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLT
        DTSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLT
Subjt:  DTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLT

Query:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDS
        HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HIMTTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCSDS
Subjt:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDS

Query:  SDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDN
        SDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSS TSKLSSQEEEAVDN
Subjt:  SDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDN

Query:  LMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        LMKRLALIMMEDGWSRPSFSRKFI K+SEPW
Subjt:  LMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

A0A5D3BJ46 Transglut_core2 domain-containing protein4.02e-25298.34Show/hide
Query:  LDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIE
        +DSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIE
Subjt:  LDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIE

Query:  RYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILT
        RYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HIMTTQTLLVEIL+
Subjt:  RYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILT

Query:  NLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQS
        NLKESFWPFQQNQSRSLFLRAAD ANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQS
Subjt:  NLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQS

Query:  LEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW
        LEYLKLYQETKGSSS TSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGK+SEPW
Subjt:  LEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW

A0A6J1JZH6 uncharacterized protein LOC111489694 isoform X12.12e-24285.21Show/hide
Query:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE
        MASFTSAS  LC PRL SSS  S  +SSS S   +S SS ST  SFRV CSGGF Q P+    F FLLH ALDSSGIDST+AKEARKGFL+QIHYLS IE
Subjt:  MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIE

Query:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL
        R+TSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRRT  KAQ+EPRALYLHTVL
Subjt:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVL

Query:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD
        THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLP  YHKLK KESDQPHI+TTQ+LLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCSD
Subjt:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSD

Query:  SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD
         S+A EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCG+YEQSLEYLKLYQETK SSSPT  LS QEEEAVD
Subjt:  SSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVD

Query:  NLMKRLALIMMEDGWSRPSFSRKFIG
        +LMKRLALIMMEDGWSRP+F+RKFIG
Subjt:  NLMKRLALIMMEDGWSRPSFSRKFIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19160.1 unknown protein9.1e-0623.11Show/hide
Query:  LESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPMGYHKLKSKE
        LE++   ++ ++GF+RT      +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPMGYHKLKSKE

Query:  SDQPHIM----TTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLD
         D P  M    T ++L    +   ++       N  R  + RA+ +++           G  L S  +  + +    +  +R  D+R A++A ERL++L 
Subjt:  SDQPHIM----TTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLD

Query:  VDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV-DNLMKRLALIMM
          +  L RD  ++LY+   Y ++++ L +              +  EEEAV +  ++RL L+ +
Subjt:  VDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV-DNLMKRLALIMM

AT4G19160.2 unknown protein6.3e-0723.44Show/hide
Query:  FLLHHALDSSGIDS--TFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLS
        F   H  DSS   +     K AR+ F  +I   SK   D+ ISI      AK   YIAAED++                   V   S P   D+    L 
Subjt:  FLLHHALDSSGIDS--TFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLS

Query:  DLSMGYCTHYKSSFNS-SPEI---------------FLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------
         L     + + S  ++ S E+                LE++   ++ ++GF+RT      +P   YLH+VL  R  +A L+S+IY E+ K L        
Subjt:  DLSMGYCTHYKSSFNS-SPEI---------------FLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------

Query:  ------LWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIM----TTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQL
              +W   ++  E++      SL   +  +  +  D P  M    T ++L    +   ++       N  R  + RA+ +++           G  L
Subjt:  ------LWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIM----TTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQL

Query:  ASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV-DNLMKRLALIM
         S  +  + +    +  +R  D+R A++A ERL++L   +  L RD  ++LY+   Y ++++ L +              +  EEEAV +  ++RL L+ 
Subjt:  ASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV-DNLMKRLALIM

Query:  M
        +
Subjt:  M

AT4G19160.3 unknown protein1.1e-0624.19Show/hide
Query:  FLLHHALDSSGIDS--TFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLS
        F   H  DSS   +     K AR+ F  +I   SK   D+ ISI      AK   YIAAED++                   V   S P   D+    L 
Subjt:  FLLHHALDSSGIDS--TFAKEARKGFLSQIHYLSKIERDTSISINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPVDAFIHRLS

Query:  DLSMGYCTHYKSSFNS-SPEI---------------FLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------
         L     + + S  ++ S E+                LE++   ++ ++GF+RT      +P   YLH+VL  R  +A L+S+IY E+ K L        
Subjt:  DLSMGYCTHYKSSFNS-SPEI---------------FLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------

Query:  ------LWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIM----TTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQL
              +W   ++  E++      SL   +  +  +  D P  M    T ++L    +   ++       N  R  + RA+ +++           G  L
Subjt:  ------LWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIM----TTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQL

Query:  ASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV-DNLMKRLALIM
         S  +  + +    +  +R  D+R A++A ERL++L   +  L RD  ++LY+      S +Y +  QE     S     +  EEEAV +  ++RL L+ 
Subjt:  ASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAV-DNLMKRLALIM

Query:  M
        +
Subjt:  M


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTTCACTTCTGCTTCCGCTTCCTTATGCTTTCCAAGGCTAACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTAAGTCCTTCAAATTCAATTCATT
TTCCTCCCATTCCACTCCCCCCTCTTTCCGAGTGTTTTGTTCTGGTGGGTTTCTTCAACACCCCAATTCTTCCAACCATTTCAACTTTCTTCTCCATCATGCCTTGGATT
CTTCTGGAATTGACTCCACCTTTGCCAAGGAAGCTAGGAAGGGTTTCTTGAGTCAGATTCACTATTTATCTAAGATTGAGAGGGATACAAGTATTAGCATTAATAGACGT
GTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCTTTGGTATCTCATTCATCTGTTCCTCTTCCCGTTGATGCATTTATTCATAGACTAAGTGATCT
TTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCATCCCCTGAAATATTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTTAGAAGAACCG
GTTCTAAAGCTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCTTGACCCATCGTACAGGCTCAGCTGCATTACTTTCACTCATATACTCAGAGATCCTGAAAATG
CTTCGTTTATGGAGTCTTCTAGATTTTGATGTAGAGATATATCACCCGCATGATGATTATAGCCTTCCCATGGGCTATCATAAACTGAAAAGCAAGGAATCTGATCAACC
ACACATAATGACAACTCAAACTCTCTTGGTGGAGATTCTAACCAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTGTTCTTAAGGGCAGCCG
ATGCTGCTAACTGTAGTGATAGTTCCGATGCATTTGAGGAAAGTGGCTTTCAGCTTGCATCTGCCAAGGCCGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTG
CGTTATGGAGATATGAGGCGTGCATTATCTGCATGTGAACGGCTTATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGTTT
TTATGAGCAATCTCTGGAGTATTTGAAGTTGTATCAGGAAACAAAGGGTTCCTCAAGTCCAACCAGCAAGTTAAGTAGCCAGGAAGAAGAAGCTGTGGATAACTTGATGA
AACGCCTTGCACTTATTATGATGGAAGATGGTTGGAGCAGACCCTCATTTTCTCGAAAGTTCATCGGTAAGAACTCTGAACCTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTTCACTTCTGCTTCCGCTTCCTTATGCTTTCCAAGGCTAACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTAAGTCCTTCAAATTCAATTCATT
TTCCTCCCATTCCACTCCCCCCTCTTTCCGAGTGTTTTGTTCTGGTGGGTTTCTTCAACACCCCAATTCTTCCAACCATTTCAACTTTCTTCTCCATCATGCCTTGGATT
CTTCTGGAATTGACTCCACCTTTGCCAAGGAAGCTAGGAAGGGTTTCTTGAGTCAGATTCACTATTTATCTAAGATTGAGAGGGATACAAGTATTAGCATTAATAGACGT
GTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCTTTGGTATCTCATTCATCTGTTCCTCTTCCCGTTGATGCATTTATTCATAGACTAAGTGATCT
TTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCATCCCCTGAAATATTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTTAGAAGAACCG
GTTCTAAAGCTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCTTGACCCATCGTACAGGCTCAGCTGCATTACTTTCACTCATATACTCAGAGATCCTGAAAATG
CTTCGTTTATGGAGTCTTCTAGATTTTGATGTAGAGATATATCACCCGCATGATGATTATAGCCTTCCCATGGGCTATCATAAACTGAAAAGCAAGGAATCTGATCAACC
ACACATAATGACAACTCAAACTCTCTTGGTGGAGATTCTAACCAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTGTTCTTAAGGGCAGCCG
ATGCTGCTAACTGTAGTGATAGTTCCGATGCATTTGAGGAAAGTGGCTTTCAGCTTGCATCTGCCAAGGCCGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTG
CGTTATGGAGATATGAGGCGTGCATTATCTGCATGTGAACGGCTTATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGTTT
TTATGAGCAATCTCTGGAGTATTTGAAGTTGTATCAGGAAACAAAGGGTTCCTCAAGTCCAACCAGCAAGTTAAGTAGCCAGGAAGAAGAAGCTGTGGATAACTTGATGA
AACGCCTTGCACTTATTATGATGGAAGATGGTTGGAGCAGACCCTCATTTTCTCGAAAGTTCATCGGTAAGAACTCTGAACCTTGGTAA
Protein sequenceShow/hide protein sequence
MASFTSASASLCFPRLTSSSSSSSSSSSSKSFKFNSFSSHSTPPSFRVFCSGGFLQHPNSSNHFNFLLHHALDSSGIDSTFAKEARKGFLSQIHYLSKIERDTSISINRR
VDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRTGSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKM
LRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIMTTQTLLVEILTNLKESFWPFQQNQSRSLFLRAADAANCSDSSDAFEESGFQLASAKAAQHRLERGVWTSV
RYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKGSSSPTSKLSSQEEEAVDNLMKRLALIMMEDGWSRPSFSRKFIGKNSEPW