; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G007960 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G007960
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionTransglut_core2 domain-containing protein
Genome locationCicolChr01:8449730..8454742
RNA-Seq ExpressionCcUC01G007960
SyntenyCcUC01G007960
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579123.1 hypothetical protein SDJN03_23571, partial [Cucurbita argyrosperma subsp. sororia]1.2e-19083.06Show/hide
Query:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPP---CFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIER
        M+SFT+   A LCIPRL SSS  S KF+  +SSSS SSP     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK         IHYLS IER
Subjt:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPP---CFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIER

Query:  DTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMT
        +TSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   KAQ+EPRALYLHTV+T
Subjt:  DTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMT

Query:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDS
        HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQSRSLFLRAADVANC D 
Subjt:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDS

Query:  SNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDN
        SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLY+ETK+SSS   TLS QEEEAVD+
Subjt:  SNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDN

Query:  LMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        L+KRLALIMMEDGWSRP+FARKFIGKNSEPW
Subjt:  LMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_008465887.1 PREDICTED: uncharacterized protein LOC103503468 isoform X1 [Cucumis melo]2.4e-21090.07Show/hide
Query:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------IHYLSKI
        MSS T ASA ASLCIPR TSSSSSS KF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK             IHYLSKI
Subjt:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------IHYLSKI

Query:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTV
        ERDTSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYLHTV
Subjt:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTV

Query:  MTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCC
        +THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANC 
Subjt:  MTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCC

Query:  DSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV
        DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEEEAV
Subjt:  DSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV

Query:  DNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        DNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  DNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_008465893.1 PREDICTED: uncharacterized protein LOC103503468 isoform X2 [Cucumis melo]8.1e-21190.91Show/hide
Query:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERDT
        MSS T ASA ASLCIPR TSSSSSS KF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK         IHYLSKIERDT
Subjt:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERDT

Query:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHR
        SISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYLHTV+THR
Subjt:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHR

Query:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSN
        TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANC DSS+
Subjt:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSN

Query:  AFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLM
        AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEEEAVDNLM
Subjt:  AFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLM

Query:  KRLALIMMEDGWSRPSFARKFIGKNSEPW
        KRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  KRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_011655252.2 uncharacterized protein LOC101204123 isoform X1 [Cucumis sativus]1.6e-20688.68Show/hide
Query:  MSSFTASAFASLCIPRLTSSSSSSYKFS-----KFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKI
        M+SFT SA ASLC PRLTSSSSSS   S     KFNS SSHS+PP FR+ CS GFLQ PNSS  F FLLH A+DSSGIDST+AK         IHYLSKI
Subjt:  MSSFTASAFASLCIPRLTSSSSSSYKFS-----KFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKI

Query:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTV
        ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYLHTV
Subjt:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTV

Query:  MTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCC
        +THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHI+TTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANC 
Subjt:  MTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCC

Query:  DSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV
        DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEEEAV
Subjt:  DSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV

Query:  DNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        DNLMKRLALIMMEDGWSRPSF+RKFIGKNSEPW
Subjt:  DNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

XP_038875536.1 uncharacterized protein LOC120067957 [Benincasa hispida]5.8e-20990.44Show/hide
Query:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFL-QQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERDT
        MSSFTAS  ASLCIPRL   SSSS+KFSKFNSSS HS+PPCFR+VCS+GFL QQPNS KDFQFLLHDAMDSSGIDST+AK         IHYLSK+ERDT
Subjt:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFL-QQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERDT

Query:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHR
        SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFI+R+SDLSMGYCTHYKSSFNSSPEIFLESIE YMYVMKGFRR  SKAQSEPRALYLHTV+THR
Subjt:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHR

Query:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSN
        TGSAALLSLIYSEILKMLRLWSLLDFDVE+YHPHDDYSLPTGYHKLKSKESDQPHI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANC DS N
Subjt:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSN

Query:  AFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLM
        AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSS  S LS QEEEAVDNLM
Subjt:  AFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLM

Query:  KRLALIMMEDGWSRPSFARKFIGKNSEPW
         RLALIMMEDGWSRPS  RKFIGKNSEPW
Subjt:  KRLALIMMEDGWSRPSFARKFIGKNSEPW

TrEMBL top hitse value%identityAlignment
A0A0A0KTQ8 Transglut_core2 domain-containing protein1.3e-20689.07Show/hide
Query:  MSSFTASAFASLCIPRL--TSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERD
        M+SFT SA ASLC PRL  +SSSSSS K  KFNS SSHS+PP FR+ CS GFLQ PNSS  F FLLH A+DSSGIDST+AK         IHYLSKIERD
Subjt:  MSSFTASAFASLCIPRL--TSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERD

Query:  TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTH
        TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYLHTV+TH
Subjt:  TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTH

Query:  RTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSS
        RTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHI+TTQTLLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANC DSS
Subjt:  RTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSS

Query:  NAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNL
        +AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEEEAVDNL
Subjt:  NAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNL

Query:  MKRLALIMMEDGWSRPSFARKFIGKNSEPW
        MKRLALIMMEDGWSRPSF+RKFIGKNSEPW
Subjt:  MKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A1S3CPY9 uncharacterized protein LOC103503468 isoform X23.9e-21190.91Show/hide
Query:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERDT
        MSS T ASA ASLCIPR TSSSSSS KF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK         IHYLSKIERDT
Subjt:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIERDT

Query:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHR
        SISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYLHTV+THR
Subjt:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHR

Query:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSN
        TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANC DSS+
Subjt:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSN

Query:  AFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLM
        AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEEEAVDNLM
Subjt:  AFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLM

Query:  KRLALIMMEDGWSRPSFARKFIGKNSEPW
        KRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  KRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A1S3CRC3 uncharacterized protein LOC103503468 isoform X11.1e-21090.07Show/hide
Query:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------IHYLSKI
        MSS T ASA ASLCIPR TSSSSSS KF KFNS SSHS+PPCFR+VCS GF QQPNSSKDF FLLHDAMDSSGIDST+AK             IHYLSKI
Subjt:  MSSFT-ASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK-------------IHYLSKI

Query:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTV
        ERDTSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRR GSKAQSEPRALYLHTV
Subjt:  ERDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTV

Query:  MTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCC
        +THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HI+TTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANC 
Subjt:  MTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCC

Query:  DSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV
        DSS+AFEESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS  S LS QEEEAV
Subjt:  DSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV

Query:  DNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW
        DNLMKRLALIMMEDGWSRPSF+RKFI K+SEPW
Subjt:  DNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW

A0A6J1FEJ1 uncharacterized protein LOC111445003 isoform X23.2e-18983.57Show/hide
Query:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPP----CFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIE
        M+SFT+   ASLCIPRL SSS    K SKFN SSS SS P     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK         IHYLS IE
Subjt:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPP----CFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIE

Query:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVM
        R+TSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   KAQ+EPRALYLHTV+
Subjt:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVM

Query:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCD
        THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQSRSLFLRAADVANC D
Subjt:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCD

Query:  SSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVD
         SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SSS   TLS QEEEAVD
Subjt:  SSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVD

Query:  NLMKRLALIMMEDGWSRPSFARKFIG
        +LMKRLALIMMEDGWSRP+FARKFIG
Subjt:  NLMKRLALIMMEDGWSRPSFARKFIG

A0A6J1FL90 uncharacterized protein LOC111445003 isoform X11.6e-18883.53Show/hide
Query:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPP----CFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIE
        M+SFT+   ASLCIPRL SSS    K SKFN SSS SS P     FR+VCS GF +QP++ KDF+FLLHDA+DSSGIDSTYAK         IHYLS IE
Subjt:  MSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPP----CFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAK---------IHYLSKIE

Query:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVM
        R+TSISINR VDLAKAALYIAAEDDSLVSHSSVPLPVDAF+HR++DLSMGYCTHYKSSFN SPE  LESIERY+YVMKGFRR   KAQ+EPRALYLHTV+
Subjt:  RDTSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVM

Query:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCD
        THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLPT YHKLK +ESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQSRSLFLRAADVANC D
Subjt:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCD

Query:  SSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVD
         SNA EESGFQLASAKAAQHRLERGVWTSV YGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SSS   TLS QEEEAVD
Subjt:  SSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVD

Query:  NLMKRLALIMMEDGWSRPSFARKFI
        +LMKRLALIMMEDGWSRP+FARKFI
Subjt:  NLMKRLALIMMEDGWSRPSFARKFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19160.1 unknown protein8.3e-0421.92Show/hide
Query:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE
        LE++   ++ ++GF+R  +    +P   YLH+V+  R  +A L+S+IY E+ K L              +W   ++  E++      SL   +  +  + 
Subjt:  LESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSLPTGYHKLKSKE

Query:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK
         D P  + +  L  + L +L              + +  A++        +    G  L S  +  + +    +  +   D+R A++A ERL++L   + 
Subjt:  SDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSK

Query:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV-DNLMKRLALIMM
         L RD  ++LY          Y + Y E     S     +  EEEAV +  ++RL L+ +
Subjt:  EL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAV-DNLMKRLALIMM

AT4G19160.2 unknown protein6.3e-0420.71Show/hide
Query:  PRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAKIHYLSKIERDTSISINRRVD---LAKAALYIAAED
        PR  ++S+S+Y   +   +   S P  ++ V           S    F    ++ S   + + AK+ +    E +  +++NR  D   L K    +  + 
Subjt:  PRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAKIHYLSKIERDTSISINRRVD---LAKAALYIAAED

Query:  DSLVSHSSVPLPVDA-----FIHRLSDLSMGYCTHYKS-SFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEIL
        D   + S   L +D      ++  +  +S        S          LE++   ++ ++GF+R  +    +P   YLH+V+  R  +A L+S+IY E+ 
Subjt:  DSLVSHSSVPLPVDA-----FIHRLSDLSMGYCTHYKS-SFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEIL

Query:  KMLR-------------LWSLLDFDVEIYHPHDDYSL------------PTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAA
        K L              +W   ++  E++      SL             +    L +K      + T + ++   L+NL    W      S  L L + 
Subjt:  KMLR-------------LWSLLDFDVEIYHPHDDYSL------------PTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAA

Query:  DVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLS
                 N    S F L   +                 D+R A++A ERL++L   +  L RD  ++LY          Y + Y E     S     +
Subjt:  DVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLS

Query:  YQEEEAV-DNLMKRLALIMM
          EEEAV +  ++RL L+ +
Subjt:  YQEEEAV-DNLMKRLALIMM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAATGAGTTCCTTCACTGCTTCTGCTTTTGCTTCCTTATGCATTCCAAGGCTTACTTCTTCTTCTTCTTCTTCTTACAAGTTCTCCAAATTCAATTCATCTTCATC
TCATTCCTCTCCCCCGTGTTTTCGACTGGTTTGTTCTTCTGGGTTTCTTCAGCAACCCAATTCTTCCAAGGATTTCCAGTTCCTTCTCCACGATGCCATGGATTCTTCTG
GAATTGACTCCACCTATGCCAAGATTCACTATTTATCTAAGATAGAGAGGGACACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCTCTTTATATTGCA
GCAGAGGATGATTCCTTGGTATCTCATTCATCTGTTCCTCTTCCTGTCGATGCATTTATTCATAGACTAAGTGATCTTTCCATGGGCTATTGTACTCACTACAAATCTTC
ATTCAATTCATCACCTGAAATCTTTTTGGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAATCGGTTCTAAAGCTCAATCAGAACCACGAGCTCTAT
ATCTTCACACGGTCATGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCAGAGATTCTGAAAATGCTTCGTTTATGGAGTCTTCTAGATTTTGATGTA
GAGATATATCATCCTCATGATGATTATAGCCTTCCCACAGGCTACCATAAACTGAAAAGCAAGGAATCTGATCAACCACACATAGTAACAACTCAAACTCTCTTGGTGGA
GATCTTAAGTAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCTGATGTTGCTAACTGTTGTGATAGCTCGAATGCAT
TTGAGGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACATAGGCTAGAACGTGGAGTTTGGACCAGTGTGCATTATGGAGATATGAGGCGTGCGTTATCTGCA
TGTGAGCGGCTCATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGCTTTTATGAGCAATCTCTGGAGTATTTGAAGTTGTA
TCAGGAAACAAAGAGCTCCTCTAGTCGAGCCAGCACATTAAGTTACCAGGAGGAAGAAGCTGTGGATAACTTGATGAAACGCCTTGCACTTATTATGATGGAAGATGGTT
GGAGCAGACCCTCATTTGCTCGAAAGTTCATTGGTAAGAACTCGGAACCATGGTAA
mRNA sequenceShow/hide mRNA sequence
AGTTCATAAGATATTTTGTATCTCTCCATCTTCATCTTTGCTCTCCATTTCTTTCTCTCCGTCCTTTTGGATTCCAAGTGGGAATGACAATGAGTTCCTTCACTGCTTCT
GCTTTTGCTTCCTTATGCATTCCAAGGCTTACTTCTTCTTCTTCTTCTTCTTACAAGTTCTCCAAATTCAATTCATCTTCATCTCATTCCTCTCCCCCGTGTTTTCGACT
GGTTTGTTCTTCTGGGTTTCTTCAGCAACCCAATTCTTCCAAGGATTTCCAGTTCCTTCTCCACGATGCCATGGATTCTTCTGGAATTGACTCCACCTATGCCAAGATTC
ACTATTTATCTAAGATAGAGAGGGACACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCTCTTTATATTGCAGCAGAGGATGATTCCTTGGTATCTCAT
TCATCTGTTCCTCTTCCTGTCGATGCATTTATTCATAGACTAAGTGATCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTCATCACCTGAAATCTTTTT
GGAAAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAATCGGTTCTAAAGCTCAATCAGAACCACGAGCTCTATATCTTCACACGGTCATGACCCATCGTA
CAGGCTCAGCTGCACTACTTTCACTCATATACTCAGAGATTCTGAAAATGCTTCGTTTATGGAGTCTTCTAGATTTTGATGTAGAGATATATCATCCTCATGATGATTAT
AGCCTTCCCACAGGCTACCATAAACTGAAAAGCAAGGAATCTGATCAACCACACATAGTAACAACTCAAACTCTCTTGGTGGAGATCTTAAGTAATTTAAAGGAATCTTT
TTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCTGATGTTGCTAACTGTTGTGATAGCTCGAATGCATTTGAGGAAAGTGGCTTTCAGCTTGCGT
CTGCAAAGGCTGCTCAACATAGGCTAGAACGTGGAGTTTGGACCAGTGTGCATTATGGAGATATGAGGCGTGCGTTATCTGCATGTGAGCGGCTCATCCTCCTTGATGTT
GATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGCTTTTATGAGCAATCTCTGGAGTATTTGAAGTTGTATCAGGAAACAAAGAGCTCCTCTAGTCG
AGCCAGCACATTAAGTTACCAGGAGGAAGAAGCTGTGGATAACTTGATGAAACGCCTTGCACTTATTATGATGGAAGATGGTTGGAGCAGACCCTCATTTGCTCGAAAGT
TCATTGGTAAGAACTCGGAACCATGGTAATCTACTCATCATGTACATTATGTTATACCGGGTTGACTAAACCTCAGTTCCGAGATGGATGGTGCCAGTCAACAGTGATCC
ATCATATGGAATAGTTAACTGAATACAGCTTGTATGGTTTATAACAAATAGGAAGGGGAGCCATAGGACCAGGTGCTGAAAGCTTTAATTTGGCATTTGTGAGAGTTTGG
TCCTTACTACACTATTCCACTAAACATCCACGTATTGTTTCCTGTAGGCTGTAGCTGTAAATATATGTAATATAGTACAGTTTTTTTTTCTTTTATATTTATAATTTTGT
CTGAAGAATCATTAAGGGTGGTATAAAGGGTATACGATCTGAACATCATTGAGTATAATCTGTTGGAAGGTCGACAGTTCAATCCTTCCCCTAAGTTTTCCCCTAAGTTT
GTTGAGGTAAAAGATGTACATAAGCATGGTTACGATCTAATACCAACGAAATTGTGTTTGTTTTGTGGCATTTTATTGAACTGGAGCAACAAGATAACAG
Protein sequenceShow/hide protein sequence
MTMSSFTASAFASLCIPRLTSSSSSSYKFSKFNSSSSHSSPPCFRLVCSSGFLQQPNSSKDFQFLLHDAMDSSGIDSTYAKIHYLSKIERDTSISINRRVDLAKAALYIA
AEDDSLVSHSSVPLPVDAFIHRLSDLSMGYCTHYKSSFNSSPEIFLESIERYMYVMKGFRRIGSKAQSEPRALYLHTVMTHRTGSAALLSLIYSEILKMLRLWSLLDFDV
EIYHPHDDYSLPTGYHKLKSKESDQPHIVTTQTLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCCDSSNAFEESGFQLASAKAAQHRLERGVWTSVHYGDMRRALSA
CERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKSSSSRASTLSYQEEEAVDNLMKRLALIMMEDGWSRPSFARKFIGKNSEPW