; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G007530 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G007530
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTransglut_core2 domain-containing protein
Genome locationCG_Chr01:8578336..8583570
RNA-Seq ExpressionClCG01G007530
SyntenyClCG01G007530
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579123.1 hypothetical protein SDJN03_23571, partial [Cucurbita argyrosperma subsp. sororia]7.1e-19081.72Show/hide
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------
        M+SFT+          A LCIPRL SSS  S  + SSSSS      SSSS S+   FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK      
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------

Query:  ---IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQS
           IHYLS IERETSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   KAQ+
Subjt:  ---IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQS

Query:  EPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLF
        EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQSRSLF
Subjt:  EPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLF

Query:  LRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPAS
        LRAADVANC D SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLY+ETK+SSSP  
Subjt:  LRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPAS

Query:  TLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        TLSCQEEEAVD+L+KRLALIMMEDGWSRP+FARKFIGKNSEPW
Subjt:  TLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_008465887.1 PREDICTED: uncharacterized protein LOC103503468 isoform X1 [Cucumis melo]8.3e-20788.18Show/hide
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------
        S+  SASA ASLCIPR T        SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK             
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------

Query:  IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPR
        IHYLSKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPR
Subjt:  IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPR

Query:  ALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRA
        ALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRA
Subjt:  ALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRA

Query:  ADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLS
        ADVANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS
Subjt:  ADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLS

Query:  CQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
         QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  CQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_008465893.1 PREDICTED: uncharacterized protein LOC103503468 isoform X2 [Cucumis melo]2.9e-20788.99Show/hide
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYL
        S+  SASA ASLCIPR T        SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK         IHYL
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYL

Query:  SKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYL
        SKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYL
Subjt:  SKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYL

Query:  HTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVA
        HTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVA
Subjt:  HTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVA

Query:  NCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEE
        NC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEE
Subjt:  NCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEE

Query:  EAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        EAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  EAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_011655252.2 uncharacterized protein LOC101204123 isoform X1 [Cucumis sativus]1.9e-20688.04Show/hide
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------
        M+SFT        SA ASLC PRLTSSSSSSSSSSSS S    KFNS SSHS+PP FR+ CS GFLQ PNSS  F FLLH A+DSSGIDST+AK      
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------

Query:  ---IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQS
           IHYLSKIER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQS
Subjt:  ---IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQS

Query:  EPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLF
        EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHI+TTQTLLVEIL+NLKESFWPFQQNQSRSLF
Subjt:  EPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLF

Query:  LRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPAS
        LRAAD ANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSP S
Subjt:  LRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPAS

Query:  TLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
         LS QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFIGKNSEPW
Subjt:  TLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_038875536.1 uncharacterized protein LOC120067957 [Benincasa hispida]9.8e-20887.84Show/hide
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFL-QQPNSSKDFQFLLHDAMDSSGIDSTYAK-----
        MSSFTAS         ASLCIPRL           SSSS KFSKFNSSS HS+PPCFR+VCSAGFL QQPNS KDFQFLLHDAMDSSGIDST+AK     
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFL-QQPNSSKDFQFLLHDAMDSSGIDSTYAK-----

Query:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQ
            IHYLSK+ER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFI+R+SDLSMGYCTHYKSSFNSSPEIFLESIE YMYVMKGFRR  SKAQ
Subjt:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQ

Query:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL
        SEPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVE+YHPHDDYSLPTGYHKLKSKESDQPHI+TTQTLLVEILSNLKESFWPFQQNQSRSL
Subjt:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL

Query:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA
        FLRAADVANC DS NAFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSP 
Subjt:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA

Query:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        S LSCQEEEAVDNLM RLALIMMEDGWSRPS  RKFIGKNSEPW
Subjt:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

TrEMBL top hitse value%identityAlignment
A0A0A0KTQ8 Transglut_core2 domain-containing protein5.8e-20687.36Show/hide
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------
        M+SFT        SA ASLC PRL      SSSSSSSSSSK  KFNS SSHS+PP FR+ CS GFLQ PNSS  F FLLH A+DSSGIDST+AK      
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------

Query:  ---IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQS
           IHYLSKIER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQS
Subjt:  ---IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQS

Query:  EPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLF
        EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHI+TTQTLLVEIL+NLKESFWPFQQNQSRSLF
Subjt:  EPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLF

Query:  LRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPAS
        LRAAD ANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSP S
Subjt:  LRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPAS

Query:  TLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
         LS QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFIGKNSEPW
Subjt:  TLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A1S3CPY9 uncharacterized protein LOC103503468 isoform X21.4e-20788.99Show/hide
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYL
        S+  SASA ASLCIPR T        SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK         IHYL
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYL

Query:  SKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYL
        SKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYL
Subjt:  SKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYL

Query:  HTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVA
        HTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVA
Subjt:  HTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVA

Query:  NCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEE
        NC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEE
Subjt:  NCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEE

Query:  EAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        EAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  EAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A1S3CRC3 uncharacterized protein LOC103503468 isoform X14.0e-20788.18Show/hide
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------
        S+  SASA ASLCIPR T        SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK             
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------

Query:  IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPR
        IHYLSKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPR
Subjt:  IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPR

Query:  ALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRA
        ALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRA
Subjt:  ALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRA

Query:  ADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLS
        ADVANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS
Subjt:  ADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLS

Query:  CQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
         QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  CQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A6J1FEJ1 uncharacterized protein LOC111445003 isoform X22.2e-18981.41Show/hide
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK--
        M+SFT+          ASLCIPRL             SSSK SKFN SSS SS P     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK  
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK--

Query:  -------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGS
               IHYLS IERETSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   
Subjt:  -------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGS

Query:  KAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQS
        KAQ+EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQS
Subjt:  KAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQS

Query:  RSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSS
        RSLFLRAADVANC D SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SS
Subjt:  RSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSS

Query:  SPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIG
        SP  TLSCQEEEAVD+LMKRLALIMMEDGWSRP+FARKFIG
Subjt:  SPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIG

A0A6J1FL90 uncharacterized protein LOC111445003 isoform X11.1e-18881.36Show/hide
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK--
        M+SFT+          ASLCIPRL             SSSK SKFN SSS SS P     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK  
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK--

Query:  -------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGS
               IHYLS IERETSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   
Subjt:  -------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGS

Query:  KAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQS
        KAQ+EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQS
Subjt:  KAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQS

Query:  RSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSS
        RSLFLRAADVANC D SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SS
Subjt:  RSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSS

Query:  SPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFI
        SP  TLSCQEEEAVD+LMKRLALIMMEDGWSRP+FARKFI
Subjt:  SPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19160.1 unknown protein5.7e-0421.92Show/hide
Query:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE
        LE++   ++ ++GF+R  +    +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE

Query:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK
         D P  + +  L  + L +L              + +  A++        +    G  L S  +  + +    +  +   D+R A++A ERL++L   + 
Subjt:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK

Query:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM
         L RD  ++LY          Y + Y E     S     +  EEEAV +  ++RL L+ +
Subjt:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM

AT4G19160.2 unknown protein3.9e-0522.79Show/hide
Query:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL-----------
        LE++   ++ ++GF+R  +    +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL           
Subjt:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL-----------

Query:  -PTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSA
          +    L +K      + T + ++   L+NL    W      S  L L +          N    S F L   +                 D+R A++A
Subjt:  -PTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSA

Query:  CERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM
         ERL++L   +  L RD  ++LY          Y + Y E     S     +  EEEAV +  ++RL L+ +
Subjt:  CERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM

AT4G19160.3 unknown protein9.7e-0421.24Show/hide
Query:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE
        LE++   ++ ++GF+R  +    +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE

Query:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK
         D P  + +  L  + L +L              + +  A++        +    G  L S  +  + +    +  +   D+R A++A ERL++L   + 
Subjt:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK

Query:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAVDNLMKRLALIMM
         L RD  ++LY+      S +Y +  QE     + A     +EE  ++  ++RL L+ +
Subjt:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAVDNLMKRLALIMM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAATGAGTTCCTTCACTGCTTCTTCTGCTTCTGCTTCTGCTTCTGCTTTTGCTTCCTTATGCATTCCAAGGCTTACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTC
TTCTTCTTCTTCTTCCAAGTTCTCCAAATTCAATTCATCTTCATCTCATTCCTCTCCCCCGTGTTTTCGACTGGTTTGTTCTGCTGGGTTTCTTCAGCAACCCAATTCTT
CCAAGGATTTCCAGTTCCTTCTCCACGATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGATTCACTATTTATCTAAGATAGAGAGGGAAACAAGTATTAGC
ATTAATAGGCGTGTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCATTCATCTGTTCCTCTTCCTGTCGATGCATTTATTCATAG
ACTAAGTGATCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCGTCACCTGAAATCTTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTT
TCAGAAGAATCGGTTCTAAAGCTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCATGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCAGAG
ATTCTGAAAATGCTTCGTTTATGGAGTCTTCTAGATTTTGATGTAGAGATATATCATCCTCATGATGATTATAGCCTTCCCACAGGCTACCATAAACTGAAAAGCAAGGA
ATCTGATCAACCACACATAGTAACAACTCAAACTCTCTTGGTGGAGATCTTAAGTAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCT
TAAGGGCCGCTGATGTTGCTAACTGTTGTGATAGCTCGAATGCATTTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTT
TGGACCAGTGTGCATTATGGAGATATGAGGCGTGCGTTATCTGCATGTGAGCGGCTCATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTA
CCATTGTGGCTTTTATGAGCAATCTCTGGAGTATTTGAAATTGTATCAGGAAACAAAGAGCTCCTCTAGTCCAGCCAGCACGTTAAGTTGCCAGGAGGAAGAAGCTGTGG
ATAACTTGATGAAACGCCTTGCACTTATTATGATGGAAGATGGTTGGAGCAGACCCTCATTTGCTCGAAAGTTCATTGGTAAGAACTCGGAACCATGGGTATACGATCTG
AACATCATTGAGTATAATCTGTTGGAAGGTCGACAGTTCAATCCTTCCCCTAAGTTTTCCCCTAAGTTTGTTGAGAGCCATCTCTGCTGGAAAATTCCCATTGCCATAGG
ACTCAAAGAGAACATGGAACGCTGGCTGGAAGTCGAAGAATATGGCCAAAATTGA
mRNA sequenceShow/hide mRNA sequence
CATGTTCATAAGATATTTTGTATCTCTCCGTCTTCATCTCTGCTCTCCATTTCTCTCTCCGTCCTTTTGGATTCCAAGTGGCAATGACAATGAGTTCCTTCACTGCTTCT
TCTGCTTCTGCTTCTGCTTCTGCTTTTGCTTCCTTATGCATTCCAAGGCTTACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCAAGTTCTCCAA
ATTCAATTCATCTTCATCTCATTCCTCTCCCCCGTGTTTTCGACTGGTTTGTTCTGCTGGGTTTCTTCAGCAACCCAATTCTTCCAAGGATTTCCAGTTCCTTCTCCACG
ATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGATTCACTATTTATCTAAGATAGAGAGGGAAACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAA
GCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCATTCATCTGTTCCTCTTCCTGTCGATGCATTTATTCATAGACTAAGTGATCTTTCCATGGGCTATTG
TACTCACTACAAATCTTCATTCAATTCGTCACCTGAAATCTTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAATCGGTTCTAAAGCTCAAT
CAGAACCACGAGCTCTATATCTTCACACGGTCATGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCAGAGATTCTGAAAATGCTTCGTTTATGGAGT
CTTCTAGATTTTGATGTAGAGATATATCATCCTCATGATGATTATAGCCTTCCCACAGGCTACCATAAACTGAAAAGCAAGGAATCTGATCAACCACACATAGTAACAAC
TCAAACTCTCTTGGTGGAGATCTTAAGTAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCTGATGTTGCTAACTGTT
GTGATAGCTCGAATGCATTTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTGCATTATGGAGATATG
AGGCGTGCGTTATCTGCATGTGAGCGGCTCATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGCTTTTATGAGCAATCTCT
GGAGTATTTGAAATTGTATCAGGAAACAAAGAGCTCCTCTAGTCCAGCCAGCACGTTAAGTTGCCAGGAGGAAGAAGCTGTGGATAACTTGATGAAACGCCTTGCACTTA
TTATGATGGAAGATGGTTGGAGCAGACCCTCATTTGCTCGAAAGTTCATTGGTAAGAACTCGGAACCATGGGTATACGATCTGAACATCATTGAGTATAATCTGTTGGAA
GGTCGACAGTTCAATCCTTCCCCTAAGTTTTCCCCTAAGTTTGTTGAGAGCCATCTCTGCTGGAAAATTCCCATTGCCATAGGACTCAAAGAGAACATGGAACGCTGGCT
GGAAGTCGAAGAATATGGCCAAAATTGA
Protein sequenceShow/hide protein sequence
MTMSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAKIHYLSKIERETSIS
INRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSE
ILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGV
WTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPWVYDL
NIIEYNLLEGRQFNPSPKFSPKFVESHLCWKIPIAIGLKENMERWLEVEEYGQN