CuGenDBv2

Clc01G07910 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G07910
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Transglut_core2 domain-containing protein
Genome location: ClcChr01:8063952..8069189
RNA-Seq Expression: Clc01G07910
Synteny: Clc01G07910
Gene Ontology terms: NA
InterPro domains: NA
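
The genome location above can be read as a chromosome name followed by a start..end range. A minimal Python sketch of parsing it follows. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("ClcChr01:8063952..8069189")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 5238 bp.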


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579123.1 hypothetical protein SDJN03_23571, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.5e-189 | % identity: 81.31
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-----
        M+SFT+          A LCIPRL SSS  S  + SSSSS       SSSS S+   FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK     
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-----

Query:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQ
            IHYLS IERETSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   K Q
Subjt:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQ

Query:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL
        +EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQSRSL
Subjt:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL

Query:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA
        FLRAADVANC D SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLY+ETK+SSSP 
Subjt:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA

Query:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
         TLSCQEEEAVD+L+KRLALIMMEDGWSRP+FARKFIGKNSEPW
Subjt:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_008465887.1 PREDICTED: uncharacterized protein LOC103503468 isoform X1 [Cucumis melo] | e value: 4.1e-206 | % identity: 87.76
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------------
        S+  SASA ASLCIPR T         SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK            
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------------

Query:  -IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEP
         IHYLSKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSK QSEP
Subjt:  -IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEP

Query:  RALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLR
        RALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLR
Subjt:  RALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLR

Query:  AADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTL
        AADVANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S L
Subjt:  AADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTL

Query:  SCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        S QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  SCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_008465893.1 PREDICTED: uncharacterized protein LOC103503468 isoform X2 [Cucumis melo] | e value: 1.4e-206 | % identity: 88.56
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHY
        S+  SASA ASLCIPR T         SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK         IHY
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHY

Query:  LSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEPRALY
        LSKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSK QSEPRALY
Subjt:  LSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEPRALY

Query:  LHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADV
        LHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADV
Subjt:  LHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADV

Query:  ANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQE
        ANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QE
Subjt:  ANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQE

Query:  EEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        EEAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  EEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_011655252.2 uncharacterized protein LOC101204123 isoform X1 [Cucumis sativus] | e value: 9.2e-206 | % identity: 87.61
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-----
        M+SFT        SA ASLC PRLTSSSSSSSSSSSS S     KFNS SSHS+PP FR+ CS GFLQ PNSS  F FLLH A+DSSGIDST+AK     
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-----

Query:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQ
            IHYLSKIER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSK Q
Subjt:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQ

Query:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL
        SEPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHI+TTQTLLVEIL+NLKESFWPFQQNQSRSL
Subjt:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL

Query:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA
        FLRAAD ANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSP 
Subjt:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA

Query:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        S LS QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFIGKNSEPW
Subjt:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_038875536.1 uncharacterized protein LOC120067957 [Benincasa hispida] | e value: 3.7e-207 | % identity: 87.42
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFL-QQPNSSKDFQFLLHDAMDSSGIDSTYAK----
        MSSFTAS         ASLCIPRL            SSSS KFSKFNSSS HS+PPCFR+VCSAGFL QQPNS KDFQFLLHDAMDSSGIDST+AK    
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFL-QQPNSSKDFQFLLHDAMDSSGIDSTYAK----

Query:  -----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKV
             IHYLSK+ER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFI+R+SDLSMGYCTHYKSSFNSSPEIFLESIE YMYVMKGFRR  SK 
Subjt:  -----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKV

Query:  QSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRS
        QSEPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVE+YHPHDDYSLPTGYHKLKSKESDQPHI+TTQTLLVEILSNLKESFWPFQQNQSRS
Subjt:  QSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRS

Query:  LFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSP
        LFLRAADVANC DS NAFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSP
Subjt:  LFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSP

Query:  ASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
         S LSCQEEEAVDNLM RLALIMMEDGWSRPS  RKFIGKNSEPW
Subjt:  ASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KTQ8 Transglut_core2 domain-containing protein | e value: 2.9e-205 | % identity: 86.94
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-----
        M+SFT        SA ASLC PRL       SSSSSSSSSSK  KFNS SSHS+PP FR+ CS GFLQ PNSS  F FLLH A+DSSGIDST+AK     
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-----

Query:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQ
            IHYLSKIER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSK Q
Subjt:  ----IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQ

Query:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL
        SEPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHI+TTQTLLVEIL+NLKESFWPFQQNQSRSL
Subjt:  SEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSL

Query:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA
        FLRAAD ANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSP 
Subjt:  FLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPA

Query:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        S LS QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFIGKNSEPW
Subjt:  STLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A1S3CPY9 uncharacterized protein LOC103503468 isoform X2 | e value: 6.9e-207 | % identity: 88.56
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHY
        S+  SASA ASLCIPR T         SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK         IHY
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHY

Query:  LSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEPRALY
        LSKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSK QSEPRALY
Subjt:  LSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEPRALY

Query:  LHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADV
        LHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADV
Subjt:  LHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADV

Query:  ANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQE
        ANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QE
Subjt:  ANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQE

Query:  EEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        EEAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  EEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A1S3CRC3 uncharacterized protein LOC103503468 isoform X1 | e value: 2.0e-206 | % identity: 87.76
Query:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------------
        S+  SASA ASLCIPR T         SSSSSSSKF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK            
Subjt:  SASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK------------

Query:  -IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEP
         IHYLSKIER+TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSK QSEP
Subjt:  -IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEP

Query:  RALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLR
        RALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLR
Subjt:  RALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLR

Query:  AADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTL
        AADVANC DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S L
Subjt:  AADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTL

Query:  SCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        S QEEEAVDNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  SCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A6J1FEJ1 uncharacterized protein LOC111445003 isoform X2 | e value: 8.5e-189 | % identity: 81
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-
        M+SFT+          ASLCIPRL              SSSK SKFN SSS SS P     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK 
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-

Query:  --------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIG
                IHYLS IERETSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR  
Subjt:  --------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIG

Query:  SKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQ
         K Q+EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQ
Subjt:  SKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQ

Query:  SRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSS
        SRSLFLRAADVANC D SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+S
Subjt:  SRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSS

Query:  SSPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIG
        SSP  TLSCQEEEAVD+LMKRLALIMMEDGWSRP+FARKFIG
Subjt:  SSPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIG

A0A6J1FL90 uncharacterized protein LOC111445003 isoform X1 | e value: 4.2e-188 | % identity: 80.95
Query:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-
        M+SFT+          ASLCIPRL              SSSK SKFN SSS SS P     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK 
Subjt:  MSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPP----CFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-

Query:  --------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIG
                IHYLS IERETSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR  
Subjt:  --------IHYLSKIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIG

Query:  SKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQ
         K Q+EPRALYLHTV+THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQ
Subjt:  SKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQ

Query:  SRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSS
        SRSLFLRAADVANC D SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+S
Subjt:  SRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSS

Query:  SSPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFI
        SSP  TLSCQEEEAVD+LMKRLALIMMEDGWSRP+FARKFI
Subjt:  SSPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G19160.1 unknown protein | e value: 2.0e-04 | % identity: 21.92
Query:  LESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE
        LE++   ++ ++GF+R  + +  +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE

Query:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK
         D P  + +  L  + L +L              + +  A++        +    G  L S  +  + +    +  +   D+R A++A ERL++L   + 
Subjt:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK

Query:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM
         L RD  ++LY          Y + Y E     S     +  EEEAV +  ++RL L+ +
Subjt:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM

AT4G19160.2 unknown protein | e value: 1.0e-05 | % identity: 22.79
Query:  LESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL-----------
        LE++   ++ ++GF+R  + +  +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL           
Subjt:  LESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL-----------

Query:  -PTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSA
          +    L +K      + T + ++   L+NL    W      S  L L +          N    S F L   +                 D+R A++A
Subjt:  -PTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSA

Query:  CERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM
         ERL++L   +  L RD  ++LY          Y + Y E     S     +  EEEAV +  ++RL L+ +
Subjt:  CERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAV-DNLMKRLALIMM

AT4G19160.3 unknown protein | e value: 3.3e-04 | % identity: 21.24
Query:  LESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE
        LE++   ++ ++GF+R  + +  +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE

Query:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK
         D P  + +  L  + L +L              + +  A++        +    G  L S  +  + +    +  +   D+R A++A ERL++L   + 
Subjt:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK

Query:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAVDNLMKRLALIMM
         L RD  ++LY+      S +Y +  QE     + A     +EE  ++  ++RL L+ +
Subjt:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAVDNLMKRLALIMM


Sequences
CDS sequence
ATGACAATGAGTTCCTTCACTGCTTCTTCTGCTTCTGCTTCTGCTTCTGCTTTTGCTTCCTTATGCATTCCAAGGCTTACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTC
TTCTTCTTCTTCTTCTTCCAAGTTCTCCAAATTCAATTCATCTTCATCTCATTCCTCTCCCCCGTGTTTTCGACTGGTTTGTTCTGCTGGGTTTCTTCAGCAACCCAATT
CTTCCAAGGATTTCCAGTTCCTTCTCCACGATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGATTCACTATTTATCTAAGATAGAGAGGGAAACAAGTATT
AGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCATTCATCTGTTCCTCTTCCTGTCGATGCATTTATTCA
TAGACTAAGTGATCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCGTCACCTGAAATCTTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGG
GTTTCAGAAGAATCGGTTCTAAAGTTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCATGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCA
GAGATTCTGAAAATGCTTCGTTTATGGAGTCTTCTAGATTTTGATGTAGAGATATATCATCCTCATGATGATTATAGCCTTCCCACAGGCTACCATAAACTGAAAAGCAA
GGAATCTGATCAACCACACATAGTAACAACTCAAACTCTCTTGGTGGAGATCTTAAGTAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTAT
TCTTAAGGGCCGCTGATGTTGCTAACTGTTGTGATAGCTCGAATGCATTTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGA
GTTTGGACCAGTGTGCATTATGGAGATATGAGGCGTGCGTTATCTGCATGTGAGCGGCTCATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCT
CTACCATTGTGGCTTTTATGAGCAATCTCTGGAGTATTTGAAATTGTATCAGGAAACAAAGAGCTCCTCTAGTCCAGCCAGCACGTTAAGTTGCCAGGAGGAAGAAGCTG
TGGATAACTTGATGAAACGCCTTGCACTTATTATGATGGAAGATGGTTGGAGCAGACCCTCATTTGCTCGAAAGTTCATTGGTAAGAACTCGGAACCATGGGTATACGAT
CTGAACATCATTGAGTATAATCTGTTGGAAGGTCGACAGTTCAATCCTTCCCCTAAGTTTTCCCCTAAGTTTGTTGAGAGCCATCTCTGCTGGAAAATTCCCATTGCCAT
AGGACTCAAAGAGAACATGGAACGCTGGCTGGAAGTCGAAGAATATGGCCAAAATTGA
mRNA sequence
CATGTTCATAAGATATTTTGTATCTCTCCGTCTTCATCTCTGCTCTCCATTTCTCTCTCCGTCCTTTTGGATTCCAAGTGGCAATGACAATGAGTTCCTTCACTGCTTCT
TCTGCTTCTGCTTCTGCTTCTGCTTTTGCTTCCTTATGCATTCCAAGGCTTACTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCAAGTTCTC
CAAATTCAATTCATCTTCATCTCATTCCTCTCCCCCGTGTTTTCGACTGGTTTGTTCTGCTGGGTTTCTTCAGCAACCCAATTCTTCCAAGGATTTCCAGTTCCTTCTCC
ACGATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGATTCACTATTTATCTAAGATAGAGAGGGAAACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCG
AAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCATTCATCTGTTCCTCTTCCTGTCGATGCATTTATTCATAGACTAAGTGATCTTTCCATGGGCTA
TTGTACTCACTACAAATCTTCATTCAATTCGTCACCTGAAATCTTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAATCGGTTCTAAAGTTC
AATCAGAACCACGAGCTCTATATCTTCACACGGTCATGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCAGAGATTCTGAAAATGCTTCGTTTATGG
AGTCTTCTAGATTTTGATGTAGAGATATATCATCCTCATGATGATTATAGCCTTCCCACAGGCTACCATAAACTGAAAAGCAAGGAATCTGATCAACCACACATAGTAAC
AACTCAAACTCTCTTGGTGGAGATCTTAAGTAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCTGATGTTGCTAACT
GTTGTGATAGCTCGAATGCATTTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTGCATTATGGAGAT
ATGAGGCGTGCGTTATCTGCATGTGAGCGGCTCATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGCTTTTATGAGCAATC
TCTGGAGTATTTGAAATTGTATCAGGAAACAAAGAGCTCCTCTAGTCCAGCCAGCACGTTAAGTTGCCAGGAGGAAGAAGCTGTGGATAACTTGATGAAACGCCTTGCAC
TTATTATGATGGAAGATGGTTGGAGCAGACCCTCATTTGCTCGAAAGTTCATTGGTAAGAACTCGGAACCATGGGTATACGATCTGAACATCATTGAGTATAATCTGTTG
GAAGGTCGACAGTTCAATCCTTCCCCTAAGTTTTCCCCTAAGTTTGTTGAGAGCCATCTCTGCTGGAAAATTCCCATTGCCATAGGACTCAAAGAGAACATGGAACGCTG
GCTGGAAGTCGAAGAATATGGCCAAAATTGA
Protein sequence
MTMSSFTASSASASASAFASLCIPRLTSSSSSSSSSSSSSSSSKFSKFNSSSSHSSPPCFRLVCSAGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAKIHYLSKIERETSI
SINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKVQSEPRALYLHTVMTHRTGSAALLSLIYS
EILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERG
VWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSPASTLSCQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPWVYD
LNIIEYNLLEGRQFNPSPKFSPKFVESHLCWKIPIAIGLKENMERWLEVEEYGQN
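
The CDS and protein sequences above should correspond under the standard genetic code. The Python sketch below translates two short excerpts copied from the CDS (its first 30 and last 18 nucleotides) so the result can be compared with the start and end of the protein sequence. The helper name `translate` and the use of the standard codon table are assumptions; the page does not say which table was used.

```python
# Standard genetic code, built in the conventional TCAG ordering (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGACAATGAGTTCCTTCACTGCTTCTTCT"  # first 30 nt of the CDS above
cds_end = "GAATATGGCCAAAATTGA"                # last 18 nt of the CDS above

print(translate(cds_start))  # MTMSSFTASS
print(translate(cds_end))    # EYGQN*
```

The first excerpt reproduces the opening residues of the protein sequence, and the second ends in a stop codon after the final EYGQN residues.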