CuGenDBv2

CaUC01G018070 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G018070
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Transmembrane protein
Genome location: Ciama_Chr01:31478687..31485492
RNA-Seq Expression: CaUC01G018070
Synteny: CaUC01G018070
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAA0036314.1 uncharacterized protein E6C27_scaffold18G001130 [Cucumis melo var. makuwa]; e-value 9.1e-129; identity 87.63%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL
        MSL FQ LSLTSPS     STLCFST FSRNP +SL F PSRFPNTLHFQILD+K R+PFNFGSI+AHQFCPRVSTSGGVGR      GG GDFDIDSLL
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL

Query:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS
        SAAELFCLV S IGSVGFALNCAK RSKSVF AVFGDG+ VGA+LFLVAGVAIGAWIRRRQWNR+F ET KGVLEVNLMEK NKLEEDLRSSATLIRVLS
Subjt:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS

Query:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        RQLEKLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQEHSGGQS
Subjt:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

KAE8647553.1 hypothetical protein Csa_003483 [Cucumis sativus]; e-value 2.8e-130; identity 87.8%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE
        MSL FQ LSLTSPSPS   ST CFSTF SRNPC+SL F PSRFPNTLHFQILD+K R+PFNFGSINAH FCPRVSTSGGVGRR GG  DFDIDSLLSA E
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE

Query:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE
         FCLV S IGSVGFALNCAK RSKS+F AVFGDG+ VG +LFLVAGVAIGAWIRRRQWNR+F ETAKGVLEVNLMEK NKLEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        KLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQL+LILAIG SGK+WESRQEHSGGQS
Subjt:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

XP_004143460.1 uncharacterized protein LOC101207421 [Cucumis sativus]; e-value 2.8e-130; identity 87.8%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE
        MSL FQ LSLTSPSPS   ST CFSTF SRNPC+SL F PSRFPNTLHFQILD+K R+PFNFGSINAH FCPRVSTSGGVGRR GG  DFDIDSLLSA E
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE

Query:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE
         FCLV S IGSVGFALNCAK RSKS+F AVFGDG+ VG +LFLVAGVAIGAWIRRRQWNR+F ETAKGVLEVNLMEK NKLEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        KLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQL+LILAIG SGK+WESRQEHSGGQS
Subjt:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

XP_008440583.1 PREDICTED: uncharacterized protein LOC103484959 isoform X1 [Cucumis melo]; e-value 9.1e-129; identity 87.63%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL
        MSL FQ LSLTSPS     STLCFST FSRNP +SL F PSRFPNTLHFQILD+K R+PFNFGSI+AHQFCPRVSTSGGVGR      GG GDFDIDSLL
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL

Query:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS
        SAAELFCLV S IGSVGFALNCAK RSKSVF AVFGDG+ VGA+LFLVAGVAIGAWIRRRQWNR+F ET KGVLEVNLMEK NKLEEDLRSSATLIRVLS
Subjt:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS

Query:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        RQLEKLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQEHSGGQS
Subjt:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

XP_038881992.1 uncharacterized protein LOC120073309 [Benincasa hispida]; e-value 2.5e-142; identity 94.74%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE
        MSLAFQ LSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQIL+HK R+PFNFGSINAHQFCPRVSTSGGVGR+HGGDGDFDIDSLLSAAE
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE

Query:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE
        LFCLVTS IGSVGFALN AKARSKSVF AVFGDGIFVGA+LFLVAGVAIGAWIRRRQWNRIF ETAKGVL VNLMEK N+LEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGG
        KLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQEHSGG
Subjt:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGG

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A0A0KK16 Uncharacterized protein; e-value 1.4e-130; identity 87.8%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE
        MSL FQ LSLTSPSPS   ST CFSTF SRNPC+SL F PSRFPNTLHFQILD+K R+PFNFGSINAH FCPRVSTSGGVGRR GG  DFDIDSLLSA E
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE

Query:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE
         FCLV S IGSVGFALNCAK RSKS+F AVFGDG+ VG +LFLVAGVAIGAWIRRRQWNR+F ETAKGVLEVNLMEK NKLEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        KLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQL+LILAIG SGK+WESRQEHSGGQS
Subjt:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

A0A1S3B274 uncharacterized protein LOC103484959 isoform X1; e-value 4.4e-129; identity 87.63%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL
        MSL FQ LSLTSPS     STLCFST FSRNP +SL F PSRFPNTLHFQILD+K R+PFNFGSI+AHQFCPRVSTSGGVGR      GG GDFDIDSLL
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL

Query:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS
        SAAELFCLV S IGSVGFALNCAK RSKSVF AVFGDG+ VGA+LFLVAGVAIGAWIRRRQWNR+F ET KGVLEVNLMEK NKLEEDLRSSATLIRVLS
Subjt:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS

Query:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        RQLEKLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQEHSGGQS
Subjt:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

A0A5A7SYF0 Uncharacterized protein; e-value 4.4e-129; identity 87.63%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL
        MSL FQ LSLTSPS     STLCFST FSRNP +SL F PSRFPNTLHFQILD+K R+PFNFGSI+AHQFCPRVSTSGGVGR      GG GDFDIDSLL
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGR----RHGGDGDFDIDSLL

Query:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS
        SAAELFCLV S IGSVGFALNCAK RSKSVF AVFGDG+ VGA+LFLVAGVAIGAWIRRRQWNR+F ET KGVLEVNLMEK NKLEEDLRSSATLIRVLS
Subjt:  SAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLS

Query:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        RQLEKLGIRFRVTRK LKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQEHSGGQS
Subjt:  RQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

A0A6J1BTL1 uncharacterized protein LOC111005633 isoform X1; e-value 4.3e-124; identity 83.33%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNT-LHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAA
        MS+AFQYLSL+SPSPSPPPST  FS+FFSRNPC SLRFAP  FPN  LHFQ LDHKLR+PFNF SIN HQFCPRVSTSGGVGRR   DGDF++DS LSAA
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNT-LHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAA

Query:  ELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQL
        ELFCLV+S + SVG ALN  KARSKS+F AVFGDGIFVGA LFLVAGVAIGAWIRRRQWNRI+  TAK  LE++L+E+ NKLEEDL+SSATLIRVLSRQL
Subjt:  ELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQL

Query:  EKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        EKLGIRFRVTRK LKKP+EETA LAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQE + GQS
Subjt:  EKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

A0A6J1IMY1 uncharacterized protein LOC111478487 isoform X1; e-value 2.8e-123; identity 85.37%
Query:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE
        MSLAFQY SL+S SPSPPPST  FS FFSRNPCLSL FAP RFPN   FQILD+KLR+ FNFGSINAH  C R STS GV R H GDGDFDIDSLLSAAE
Subjt:  MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAE

Query:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE
        LFCLVTS I SVGFAL+CAKA SKS F AVF   IFVGA+LFLVAGVAIGAWIRRRQWNR F +TAKG+LEVNL+E  NKLEEDLRSSAT+IRVLSRQLE
Subjt:  LFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
        KLGIRFRVTRK LKKPVEETAALAQKT EATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS
Subjt:  KLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQS

SwissProt top hits (hit; e-value; % identity; alignment)
C4B8C4 EPIDERMAL PATTERNING FACTOR-like protein 3; e-value 3.9e-05; identity 35.29%
Query:  KTRLGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRC
        + R+GS PPSC  KC  C PC A+Q P++          S++P            Y+ Y+P GW+C C
Subjt:  KTRLGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRC

Q1PEY6 EPIDERMAL PATTERNING FACTOR-like protein 6; e-value 2.5e-04; identity 31.94%
Query:  LGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP
        LGS+PP C +KC  C PC  V VP  P      +                     Y P  W+C+CGN  + P
Subjt:  LGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP

Q9LFT5 EPIDERMAL PATTERNING FACTOR-like protein 1; e-value 9.5e-20; identity 56.25%
Query:  EDKTRLGSTPPSCHNKCNECHPCMAVQVPSMPARA--GRLDSHSALPMRFFDS-SSQGNRYSFYKPLGWKCRCGNHFFNP
        EDK RLGSTPPSCHN+CN CHPCMA+QVP++P R+   R++  S   +R   S ++  ++YS YKP+GWKC C  HF+NP
Subjt:  EDKTRLGSTPPSCHNKCNECHPCMAVQVPSMPARA--GRLDSHSALPMRFFDS-SSQGNRYSFYKPLGWKCRCGNHFFNP

Q9T068 EPIDERMAL PATTERNING FACTOR-like protein 2; e-value 4.9e-08; identity 29.69%
Query:  SDLASTAVVPSFNQSYRNSQASDGQPK--AVEWKNRASFGIFFEDKTRLGSTPPSCHN-KCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSS----
        S+++S  ++     S   S  ++G+P+  +VE+       +    +  +GS PP C   +C  C  C A+QVP+ P    +   HS L      SS    
Subjt:  SDLASTAVVPSFNQSYRNSQASDGQPK--AVEWKNRASFGIFFEDKTRLGSTPPSCHN-KCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSS----

Query:  ---SQGNRYSFYKPLGWKCRCGNHFFNP
           ++G+  + YKP+ WKC+CGN  +NP
Subjt:  ---SQGNRYSFYKPLGWKCRCGNHFFNP

Arabidopsis top hits (hit; e-value; % identity; alignment)
AT2G30370.1 allergen-related; e-value 1.8e-05; identity 31.94%
Query:  LGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP
        LGS+PP C +KC  C PC  V VP  P      +                     Y P  W+C+CGN  + P
Subjt:  LGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP

AT3G13898.1 unknown protein; e-value 2.7e-06; identity 35.29%
Query:  KTRLGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRC
        + R+GS PPSC  KC  C PC A+Q P++          S++P            Y+ Y+P GW+C C
Subjt:  KTRLGSTPPSCHNKCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRC

AT4G37810.1 unknown protein; e-value 3.5e-09; identity 29.69%
Query:  SDLASTAVVPSFNQSYRNSQASDGQPK--AVEWKNRASFGIFFEDKTRLGSTPPSCHN-KCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSS----
        S+++S  ++     S   S  ++G+P+  +VE+       +    +  +GS PP C   +C  C  C A+QVP+ P    +   HS L      SS    
Subjt:  SDLASTAVVPSFNQSYRNSQASDGQPK--AVEWKNRASFGIFFEDKTRLGSTPPSCHN-KCNECHPCMAVQVPSMPARAGRLDSHSALPMRFFDSS----

Query:  ---SQGNRYSFYKPLGWKCRCGNHFFNP
           ++G+  + YKP+ WKC+CGN  +NP
Subjt:  ---SQGNRYSFYKPLGWKCRCGNHFFNP

AT5G10310.1 unknown protein; e-value 6.7e-21; identity 56.25%
Query:  EDKTRLGSTPPSCHNKCNECHPCMAVQVPSMPARA--GRLDSHSALPMRFFDS-SSQGNRYSFYKPLGWKCRCGNHFFNP
        EDK RLGSTPPSCHN+CN CHPCMA+QVP++P R+   R++  S   +R   S ++  ++YS YKP+GWKC C  HF+NP
Subjt:  EDKTRLGSTPPSCHNKCNECHPCMAVQVPSMPARA--GRLDSHSALPMRFFDS-SSQGNRYSFYKPLGWKCRCGNHFFNP

AT5G65250.1 unknown protein; e-value 4.8e-43; identity 53.14%
Query:  STSGGVGRRHGGDGDFDIDSLLSAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLE---
        S+S  +      DG FD+ S +S AE  C+++S + SV  A+N        V     G  +     + LV  VA G+W+RRRQW RI     KG  E   
Subjt:  STSGGVGRRHGGDGDFDIDSLLSAAELFCLVTSFIGSVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLE---

Query:  VNLMEKINKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGK
         NL+ ++ KLE+DL+SS +++RVLSR LEKLGIRFRVTRK LK+P+ ETAALAQK SEATR L  + +ILEKEL EIQKVLLAMQEQQ+KQLELIL I K
Subjt:  VNLMEKINKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKGLKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGK

Query:  SGKIWES
        S K++ES
Subjt:  SGKIWES


Sequences
CDS sequence
ATGTCGCTTGCTTTTCAATACCTTTCACTCACTTCACCTTCTCCTTCCCCTCCCCCTTCAACCTTGTGCTTTTCCACCTTCTTTTCTAGGAATCCGTGCTTGTCTCTTCG
ATTCGCCCCTAGCCGTTTCCCAAACACCCTGCATTTTCAAATTCTCGATCACAAGCTTCGAAACCCTTTTAATTTTGGTTCCATCAATGCCCATCAGTTCTGTCCTCGAG
TTTCTACATCTGGAGGAGTAGGACGGAGACACGGTGGTGATGGTGATTTCGATATCGATTCTTTACTTTCAGCTGCCGAGTTGTTTTGCCTCGTTACGTCGTTCATCGGT
TCTGTTGGTTTTGCTCTGAATTGCGCGAAAGCCAGGTCTAAGAGCGTGTTCTTCGCGGTGTTCGGTGACGGGATTTTCGTTGGTGCAGTTTTATTTCTGGTGGCTGGAGT
TGCAATTGGTGCTTGGATTCGCAGGCGGCAGTGGAATCGAATTTTTCCAGAGACAGCGAAGGGCGTGTTAGAGGTGAATTTGATGGAAAAGATTAACAAGCTGGAGGAGG
ATTTGAGGAGCTCGGCAACGCTAATTCGAGTCCTGTCGAGGCAGCTGGAGAAGCTAGGGATTAGGTTTAGAGTTACTCGAAAGGGTCTGAAGAAGCCCGTCGAGGAGACT
GCAGCTTTAGCTCAAAAAACTTCCGAGGCCACGCGAGCATTAGCAGTTCGAGGAGATATTTTGGAGAAGGAGCTTGCTGAAATCCAGAAGGTCTTACTGGCTATGCAGGA
ACAACAACAAAAGCAACTTGAGTTGATTCTAGCAATAGGGAAGTCAGGAAAGATATGGGAAAGCAGACAGGAGCATAGTGGAGGACAAAGCCTACAGTATATGCAGAGTT
GCAGCCTGAGCGAGCACTATTCTTCCTGGATAACTGACGGTCCTTTAACTTCAGTCGTTTCGGGAGCTTTTACACGGGCAACGAGGACAACAATTCAAATGGAGAGACGT
CAAGCTGGGAACCTGCAGCTTACTTTTTTTCATGTAATGGAAGCTACGACATGTAACGCTTGCCCTATCAAAACCATACAATTTGGTGGCTACGGAAGGAAGCTGGTGTT
CAGACAGAGGCATGTAACAGCCAAGTTCGTCAGATCTGATCTCGCATCTACAGCTGTTGTCCCCTCCTTCAATCAGAGTTATCGGAACAGCCAGGCTAGTGATGGACAGC
CTAAGGCTGTAGAATGGAAGAATAGGGCGTCCTTTGGTATATTTTTCGAAGACAAAACCAGGCTTGGATCCACCCCACCAAGCTGCCATAATAAGTGCAACGAGTGCCAC
CCATGCATGGCTGTGCAAGTGCCTAGCATGCCCGCCCGCGCCGGCCGGTTGGATTCGCACTCAGCTTTGCCTATGAGATTTTTTGACTCGTCTTCTCAAGGGAACAGATA
CTCATTTTACAAGCCATTGGGTTGGAAATGCCGCTGTGGAAACCACTTCTTCAATCCTTAG
mRNA sequence
ATGTCGCTTGCTTTTCAATACCTTTCACTCACTTCACCTTCTCCTTCCCCTCCCCCTTCAACCTTGTGCTTTTCCACCTTCTTTTCTAGGAATCCGTGCTTGTCTCTTCG
ATTCGCCCCTAGCCGTTTCCCAAACACCCTGCATTTTCAAATTCTCGATCACAAGCTTCGAAACCCTTTTAATTTTGGTTCCATCAATGCCCATCAGTTCTGTCCTCGAG
TTTCTACATCTGGAGGAGTAGGACGGAGACACGGTGGTGATGGTGATTTCGATATCGATTCTTTACTTTCAGCTGCCGAGTTGTTTTGCCTCGTTACGTCGTTCATCGGT
TCTGTTGGTTTTGCTCTGAATTGCGCGAAAGCCAGGTCTAAGAGCGTGTTCTTCGCGGTGTTCGGTGACGGGATTTTCGTTGGTGCAGTTTTATTTCTGGTGGCTGGAGT
TGCAATTGGTGCTTGGATTCGCAGGCGGCAGTGGAATCGAATTTTTCCAGAGACAGCGAAGGGCGTGTTAGAGGTGAATTTGATGGAAAAGATTAACAAGCTGGAGGAGG
ATTTGAGGAGCTCGGCAACGCTAATTCGAGTCCTGTCGAGGCAGCTGGAGAAGCTAGGGATTAGGTTTAGAGTTACTCGAAAGGGTCTGAAGAAGCCCGTCGAGGAGACT
GCAGCTTTAGCTCAAAAAACTTCCGAGGCCACGCGAGCATTAGCAGTTCGAGGAGATATTTTGGAGAAGGAGCTTGCTGAAATCCAGAAGGTCTTACTGGCTATGCAGGA
ACAACAACAAAAGCAACTTGAGTTGATTCTAGCAATAGGGAAGTCAGGAAAGATATGGGAAAGCAGACAGGAGCATAGTGGAGGACAAAGCCTACAGTATATGCAGAGTT
GCAGCCTGAGCGAGCACTATTCTTCCTGGATAACTGACGGTCCTTTAACTTCAGTCGTTTCGGGAGCTTTTACACGGGCAACGAGGACAACAATTCAAATGGAGAGACGT
CAAGCTGGGAACCTGCAGCTTACTTTTTTTCATGTAATGGAAGCTACGACATGTAACGCTTGCCCTATCAAAACCATACAATTTGGTGGCTACGGAAGGAAGCTGGTGTT
CAGACAGAGGCATGTAACAGCCAAGTTCGTCAGATCTGATCTCGCATCTACAGCTGTTGTCCCCTCCTTCAATCAGAGTTATCGGAACAGCCAGGCTAGTGATGGACAGC
CTAAGGCTGTAGAATGGAAGAATAGGGCGTCCTTTGGTATATTTTTCGAAGACAAAACCAGGCTTGGATCCACCCCACCAAGCTGCCATAATAAGTGCAACGAGTGCCAC
CCATGCATGGCTGTGCAAGTGCCTAGCATGCCCGCCCGCGCCGGCCGGTTGGATTCGCACTCAGCTTTGCCTATGAGATTTTTTGACTCGTCTTCTCAAGGGAACAGATA
CTCATTTTACAAGCCATTGGGTTGGAAATGCCGCTGTGGAAACCACTTCTTCAATCCTTAGACCAAAAGTTGCATGTTCAGTGTAGAGGGGTTAGTCTTCCATTCTTCTG
GTTTTGTAAATGACATTGAATTGTTTCTTTCTTTTATTATAAGGTTGGAATTCTTTGTTCAGTGATAAGCAGGAAGTTCGGTTTTTGCATTTATTGTACATATACTCCAA
CTCTATACTATCTATCCCCTAGAGAGATCAACATTTACCCTACTTTTAGTTCTAGTTCTACC
Protein sequence
MSLAFQYLSLTSPSPSPPPSTLCFSTFFSRNPCLSLRFAPSRFPNTLHFQILDHKLRNPFNFGSINAHQFCPRVSTSGGVGRRHGGDGDFDIDSLLSAAELFCLVTSFIG
SVGFALNCAKARSKSVFFAVFGDGIFVGAVLFLVAGVAIGAWIRRRQWNRIFPETAKGVLEVNLMEKINKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKGLKKPVEET
AALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKIWESRQEHSGGQSLQYMQSCSLSEHYSSWITDGPLTSVVSGAFTRATRTTIQMERR
QAGNLQLTFFHVMEATTCNACPIKTIQFGGYGRKLVFRQRHVTAKFVRSDLASTAVVPSFNQSYRNSQASDGQPKAVEWKNRASFGIFFEDKTRLGSTPPSCHNKCNECH
PCMAVQVPSMPARAGRLDSHSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP