; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00860 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00860
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDUF4149 domain-containing protein
Genome locationClcChr08:1489925..1491538
RNA-Seq ExpressionClc08G00860
SyntenyClc08G00860
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025423 - Domain of unknown function DUF4149


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039259.1 DUF4149 domain-containing protein [Cucumis melo var. makuwa]9.0e-15674.34Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMT
        M NLFA+ LIVTTLTAAGLWSPPPP    P++VIVKEGHRVVVVEY  Q QHNTKVSISSEPD D   SE H T+DLICDVYGKCKHK+ASAVEKAKVM 
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMT

Query:  SETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLM
        +ETAQEAH     VA AF+ AKDK+                            KLKEGAKETL+ AK+ EE V++G ER+A ET EKI+TGENK+KENLM
Subjt:  SETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLM

Query:  GIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVV
        G+V RG +V+NY FRHLG GMD LGLLGF+MALGMGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFSRTEW FPIPKN+EVV
Subjt:  GIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVV

Query:  QGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLM
        QGYVLVAAL MIFANSLYMEPRATKVMFERLK+EKEEGRGIEDI  E          AI +STPTQ V+REVVKSR+VGLNKRLKKLNSYSSLLNLLTLM
Subjt:  QGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLM

Query:  ALTWHLVYLSHRFCNPC
        ALTWHLVYLS R CNPC
Subjt:  ALTWHLVYLSHRFCNPC

KAE8649008.1 hypothetical protein Csa_009289 [Cucumis sativus]1.8e-15975.3Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMT
        M NLFA+ LIVTT TAAGLWSPPPP    P++VIVKEGHR+VVVEY  Q QHNTKVSISSEPDQD   SE H T+DLICDVYGKCKHK+ASAVEKAKVM 
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMT

Query:  SETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLM
        +ETAQEAH     V DAF+GAKDK+                           GKLKEGAKETL+ AK+ EE V++G ER+A ET EKIKTGENK+KENLM
Subjt:  SETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLM

Query:  GIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVV
        G+V RG +VI+Y FRHLGFGMD LGLLGF+MALGMGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFSRTEW FPIPKN+EVV
Subjt:  GIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVV

Query:  QGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLM
        QGYVLVAAL MIFANSLYMEPRATKVMFERLK+EKEEGRGIEDIA E          AI +STPTQ V+REVVKSR+VGLNKRLKKLNSYSSLLNLLTLM
Subjt:  QGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLM

Query:  ALTWHLVYLSHRFCNPC
        ALTWHLVYLS R CNPC
Subjt:  ALTWHLVYLSHRFCNPC

XP_004141640.3 uncharacterized protein LOC101208468 [Cucumis sativus]4.6e-16073.89Show/hide
Query:  LHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHK
        LH        F M NLFA+ LIVTT TAAGLWSPPPP    P++VIVKEGHR+VVVEY  Q QHNTKVSISSEPDQD   SE H T+DLICDVYGKCKHK
Subjt:  LHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHK

Query:  IASAVEKAKVMTSETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKI
        +ASAVEKAKVM +ETAQEAH     V DAF+GAKDK+                           GKLKEGAKETL+ AK+ EE V++G ER+A ET EKI
Subjt:  IASAVEKAKVMTSETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKI

Query:  KTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTE
        KTGENK+KENLMG+V RG +VI+Y FRHLGFGMD LGLLGF+MALGMGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFSRTE
Subjt:  KTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTE

Query:  WMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLN
        W FPIPKN+EVVQGYVLVAAL MIFANSLYMEPRATKVMFERLK+EKEEGRGIEDIA E          AI +STPTQ V+REVVKSR+VGLNKRLKKLN
Subjt:  WMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLN

Query:  SYSSLLNLLTLMALTWHLVYLSHRFCNPC
        SYSSLLNLLTLMALTWHLVYLS R CNPC
Subjt:  SYSSLLNLLTLMALTWHLVYLSHRFCNPC

XP_008459625.1 PREDICTED: uncharacterized protein LOC103498695 [Cucumis melo]2.4e-15672.22Show/hide
Query:  LHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHK
        LH       +F M NLFA+ LI+TTLTAAGLWSPPPP    P++VIVKEGHRVVVVEY  Q QHNTKVSISSEPD D   SE H T+DLICDVYGKCKHK
Subjt:  LHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHK

Query:  IASAVEKAKVMTSETAQEAH----HVADAFEGAKDKI-----------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETR
        +ASAVEKAKVM +ETAQEAH     VA AF+ AKDK+                               KLKEGAKETL+ AK+ EE V++G ER+A ET 
Subjt:  IASAVEKAKVMTSETAQEAH----HVADAFEGAKDKI-----------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETR

Query:  EKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFS
        EKI+TGENK+KENLMG+V RG +V+NY FRHLG GMD LGLLGF+MALGMGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFS
Subjt:  EKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFS

Query:  RTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLK
        RTEW FPIPKN+EVVQGYVLVAAL MIFANSLYMEPRATKVMFERLK+EKEEGRGIEDI  E          AI +STPTQ V+REVVKSR+VGLNKRLK
Subjt:  RTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLK

Query:  KLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC
        KLNSYSSLLNLLTLMALTWHLVYLS R CNPC
Subjt:  KLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC

XP_038890648.1 uncharacterized protein LOC120080154 [Benincasa hispida]2.7e-16077.59Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQ---DSETHPTRDLICDVYGKCKHKIASAVEKAKVMT
        M NLFALCLIVTTL+AAGLWS    PPPS +DVIVKEGHRVVVVEYG QDQHNTKVSISS+PDQ   DSE+H TRDLICDVYGKCKHK+ASAVEKAKVM 
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQ---DSETHPTRDLICDVYGKCKHKIASAVEKAKVMT

Query:  SETAQEAHH----VADAFEGAKDKISGK--------------LKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINY
        SET QEAH     VADAF+ AKD IS K               KEGAK+T+K+AK+ EE V+EG ERLA ET EKIKTGEN++K+NLMGIV RG +VI Y
Subjt:  SETAQEAHH----VADAFEGAKDKISGK--------------LKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINY

Query:  SFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMI
        SFRHLGFG DVLGLLGF+MALGMGVWVTFISSYVL SVLPR QL VVQSKIYPVYFKAMA  IGMAL GHLF R EW FPIPKNAE+VQGYVLVAALFMI
Subjt:  SFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMI

Query:  FANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE----------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSH
        FAN LYMEPRATKVMFERLK+EKEEGRGIEDI GE           AI T+TPTQ V+REVVKSR+VGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS 
Subjt:  FANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE----------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSH

Query:  RFCNPC
        RFCNPC
Subjt:  RFCNPC

TrEMBL top hitse value%identityAlignment
A0A0A0KSX1 DUF4149 domain-containing protein2.7e-15880.16Show/hide
Query:  LIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMTSETAQEAH
        LIVTT TAAGLWSPPPP    P++VIVKEGHR+VVVEY  Q QHNTKVSISSEPDQD   SE H T+DLICDVYGKCKHK+ASAVEKAKVM +ETAQEAH
Subjt:  LIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMTSETAQEAH

Query:  ----HVADAFEGAKDKISGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALG
             V DAF+GAKD    KLKEGAKETL+ AK+ EE V++G ER+A ET EKIKTGENK+KENLMG+V RG +VI+Y FRHLGFGMD LGLLGF+MALG
Subjt:  ----HVADAFEGAKDKISGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALG

Query:  MGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKME
        MGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFSRTEW FPIPKN+EVVQGYVLVAAL MIFANSLYMEPRATKVMFERLK+E
Subjt:  MGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKME

Query:  KEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC
        KEEGRGIEDIA E          AI +STPTQ V+REVVKSR+VGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS R CNPC
Subjt:  KEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC

A0A1S3CA56 uncharacterized protein LOC1034986951.2e-15672.22Show/hide
Query:  LHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHK
        LH       +F M NLFA+ LI+TTLTAAGLWSPPPP    P++VIVKEGHRVVVVEY  Q QHNTKVSISSEPD D   SE H T+DLICDVYGKCKHK
Subjt:  LHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHK

Query:  IASAVEKAKVMTSETAQEAH----HVADAFEGAKDKI-----------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETR
        +ASAVEKAKVM +ETAQEAH     VA AF+ AKDK+                               KLKEGAKETL+ AK+ EE V++G ER+A ET 
Subjt:  IASAVEKAKVMTSETAQEAH----HVADAFEGAKDKI-----------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETR

Query:  EKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFS
        EKI+TGENK+KENLMG+V RG +V+NY FRHLG GMD LGLLGF+MALGMGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFS
Subjt:  EKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFS

Query:  RTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLK
        RTEW FPIPKN+EVVQGYVLVAAL MIFANSLYMEPRATKVMFERLK+EKEEGRGIEDI  E          AI +STPTQ V+REVVKSR+VGLNKRLK
Subjt:  RTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLK

Query:  KLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC
        KLNSYSSLLNLLTLMALTWHLVYLS R CNPC
Subjt:  KLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC

A0A5D3BMZ0 DUF4149 domain-containing protein4.4e-15674.34Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMT
        M NLFA+ LIVTTLTAAGLWSPPPP    P++VIVKEGHRVVVVEY  Q QHNTKVSISSEPD D   SE H T+DLICDVYGKCKHK+ASAVEKAKVM 
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQD---SETHPTRDLICDVYGKCKHKIASAVEKAKVMT

Query:  SETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLM
        +ETAQEAH     VA AF+ AKDK+                            KLKEGAKETL+ AK+ EE V++G ER+A ET EKI+TGENK+KENLM
Subjt:  SETAQEAH----HVADAFEGAKDKI--------------------------SGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLM

Query:  GIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVV
        G+V RG +V+NY FRHLG GMD LGLLGF+MALGMGVWVTFISSYVL SVLPR QLGVVQSKIYPVYFKAMA CIGMALLGHLFSRTEW FPIPKN+EVV
Subjt:  GIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVV

Query:  QGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLM
        QGYVLVAAL MIFANSLYMEPRATKVMFERLK+EKEEGRGIEDI  E          AI +STPTQ V+REVVKSR+VGLNKRLKKLNSYSSLLNLLTLM
Subjt:  QGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE---------AAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLM

Query:  ALTWHLVYLSHRFCNPC
        ALTWHLVYLS R CNPC
Subjt:  ALTWHLVYLSHRFCNPC

A0A6J1FL32 uncharacterized protein LOC1114449651.5e-13263.32Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPP-PPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSE-------------------PDQDSETHPTRDLICDVYG
        M NLFALCL++T+LTAAGLWSP P       +DVIVKEGHRVVVVEYG Q QHNTKVSISSE                   P++DSE H TRDLICD  G
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPP-PPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSE-------------------PDQDSETHPTRDLICDVYG

Query:  KCKHKIASAVEKAKVMTSETAQEAHH----VADAFEGAKDKISGK-------------------------------------------LKEGAKETLKQA
        KCKHKIASAV KAKVM SETAQEAH     VA AF+ AK+ +S K                                           LKEGAK+ LK+ 
Subjt:  KCKHKIASAVEKAKVMTSETAQEAHH----VADAFEGAKDKISGK-------------------------------------------LKEGAKETLKQA

Query:  KAGE--EYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHL------GFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGV
        KA +  E  ME G R A +T EKIKTG NKVKENLMGI   G +++N SFR+L         MDVL LLGF MALGMGVW TFISSYVL S LPR QL V
Subjt:  KAGE--EYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHL------GFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGV

Query:  VQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE--------AAI
        VQSKIYP+YF+AMA  IGMAL GHLFSRT+WMFPIPKNAEVVQGYVLVAAL MIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA E         A+
Subjt:  VQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE--------AAI

Query:  ATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC
         TST TQ VERE VKSR+VGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS R C PC
Subjt:  ATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC

A0A6J1JR64 uncharacterized protein LOC1114890831.4e-13363.76Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPPP-PSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSE-------------------PDQDSETHPTRDLICDVYG
        M NLFALCL++T+LTAAGLWSP P       +DVIVKEGHRVVVVEYG Q QHNTKVSISSE                   P++DSE H TRDLICD  G
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPPP-PSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSE-------------------PDQDSETHPTRDLICDVYG

Query:  KCKHKIASAVEKAKVMTSETAQEAHH----VADAFEGAKDKISGK-------------------------------------------LKEGAKETLKQA
        KCKHKIASAV KAKVM SETAQEAH     VA AF+ AK+ +S K                                           LKEGAK+ LK+ 
Subjt:  KCKHKIASAVEKAKVMTSETAQEAHH----VADAFEGAKDKISGK-------------------------------------------LKEGAKETLKQA

Query:  KAGE--EYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLG------FGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGV
        KA E  E  ME G R A +T EKIKTG NKVKENL+GI G G +++N SFR+LG        MDVL LLGFSMALGMGVW TFISSYVL S LPR QL V
Subjt:  KAGE--EYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLG------FGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGV

Query:  VQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE--------AAI
        VQSKIYP+YF+AMA  IGMAL GHLFSRT+WMFPIPKNAEVVQGYVLVAAL  IFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA E         AI
Subjt:  VQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGE--------AAI

Query:  ATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC
         TST TQ V+RE VKSR+VGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS R C PC
Subjt:  ATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC

SwissProt top hitse value%identityAlignment
Q32L10 Transmembrane protein 2051.2e-0932.6Show/hide
Query:  VLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFK-AMACC-IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYME
        V+ LL  S A GM +WVTFIS +VL   LPRH  G+VQSK++P YF  +M C  + + +L    S  +  F      E  Q ++L+ +L +   N+ ++E
Subjt:  VLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFK-AMACC-IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYME

Query:  PRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS
         R T  M+    +EKE G G E          S P   +  +    +   L ++  + +  SSL NL  L++   HL  L+
Subjt:  PRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS

Q5REM8 Transmembrane protein 2051.1e-1031.61Show/hide
Query:  HLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFK-AMACC-IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIF
        +LG  + ++ LL  S A GM +WVTF+S ++L   LPRH  G+VQSK++P YF  +M C  + + +L    +  +  F      E  Q Y+L  +L +  
Subjt:  HLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFK-AMACC-IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIF

Query:  ANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNL
         N+ ++EPR T  M+    +EKE G G     GE   +   P    +      +   L +   + +  SSL NL
Subjt:  ANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNL

Q6GPW4 Transmembrane protein 2053.2e-1030.94Show/hide
Query:  VLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACC--IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYME
        +  LL  S + GM  W+TF++ +VL   +PRH  G+VQSK++P Y   + CC  I +A+      R E + P    +E VQ  +   +L +    + +  
Subjt:  VLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACC--IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYME

Query:  PRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS
        P  TK MF+   +E+E   G     G    A     Q ++ +  K +   L KR  + +  SSL NLL L+    +LVY++
Subjt:  PRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS

Q6UW68 Transmembrane protein 2058.3e-1132.18Show/hide
Query:  HLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFK-AMACC-IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIF
        +LG  + ++ LL  S A GM +WVTF+S ++L   LPRH  G+VQSK++P YF  +M C  I + +L    +  +  F      E  Q Y+L  +L +  
Subjt:  HLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFK-AMACC-IGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIF

Query:  ANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNL
         N+ ++EPR T  M+    +EKE G G     GE   +   P    +      +   L +   + +  SSL NL
Subjt:  ANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNL

Q91XE8 Transmembrane protein 2053.7e-1140.62Show/hide
Query:  VLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACC--IGMALLG------HLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFA
        V+ LL  S A GM VWVTFIS ++L   LPRH  G+VQSK++PVYF     C  I + +L       HL   T W        EV Q  +L+ +L +   
Subjt:  VLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACC--IGMALLG------HLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFA

Query:  NSLYMEPRATKVMFERLKMEKEEGRGIE
        N+ ++E R T VM     +EKE G G E
Subjt:  NSLYMEPRATKVMFERLKMEKEEGRGIE

Arabidopsis top hitse value%identityAlignment
AT1G22600.1 Late embryogenesis abundant protein (LEA) family protein8.4e-5136.32Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSIS---SEPDQDS------------------------ETHPT-R
        M  L AL L+++ L   G  +       S  D+IV++GHRVVVVEY    + NT+V IS    E DQ S                        E H T  
Subjt:  MINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSIS---SEPDQDS------------------------ETHPT-R

Query:  DLICDVYGKCKHKIASAVEKAKVMT-SETAQEAHHVADAFEG--AKDKISGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGI
        +LICD  GKCKHK+ + + + K  T S+ + E   +  A E    ++K+S K +E   +  ++A      V +  E++ +  R     G   V    +G+
Subjt:  DLICDVYGKCKHKIASAVEKAKVMT-SETAQEAHHVADAFEG--AKDKISGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGI

Query:  VGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQG
           G                V+G++G + A GM VWVTF+S YVL SVL   Q GVVQSK+YPVYFKA++  I + LLGH+  R   +F      ++ Q 
Subjt:  VGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQG

Query:  YVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGI----EDIAGEAAIAT---STPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTW
          L++++ M+ AN+ ++  RATK MFE +K EKE+GRG     +  + E+A  T      T+  + +VVK R+  L++R++KLN+YSS LNLLTLM+LTW
Subjt:  YVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGI----EDIAGEAAIAT---STPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTW

Query:  HLVYLSHRFCNPC
        H VYL +R    C
Subjt:  HLVYLSHRFCNPC

AT1G72100.1 late embryogenesis abundant domain-containing protein / LEA domain-containing protein5.8e-5234.81Show/hide
Query:  MINLFALCLIVTTLTAAGLWSPPP------PPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSIS-----------------------------------
        M NL ALCL+++TL AA +WSP P          S  +VIVK+GH VVVVEY    + NT+VSIS                                   
Subjt:  MINLFALCLIVTTLTAAGLWSPPP------PPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSIS-----------------------------------

Query:  -------SEP---DQDSETHPTR-DLICDVYGKCKHKIASAVEKAKVMTSET------------AQEAHH---------------VADAFEGAKDKISGK
               S+P   D+  + H T  ++ICD +GKC+ KIAS V +AK  T ++            A +AH                VAD  + AK +++ K
Subjt:  -------SEP---DQDSETHPTR-DLICDVYGKCKHKIASAVEKAKVMTSET------------AQEAHH---------------VADAFEGAKDKISGK

Query:  L---KEG------------------AKETLKQ-AKAGEEYVMEGG----ERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFG--------
            KEG                  AKE++ Q A   +E V E      E +A +  E  +  +++V+E    +          ++  +  G        
Subjt:  L---KEG------------------AKETLKQ-AKAGEEYVMEGG----ERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFG--------

Query:  ---------MDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFM
                   ++GL G + A G  VWVTF+SSYVL SVL R Q GVVQSK+YPVYFKA +  I + L GH+ SR   +  +    E+ QG  L+++ FM
Subjt:  ---------MDVLGLLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFM

Query:  IFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC
        I AN  ++EPRATK MFER+K EKEEGRG E           T  Q + R++ +     L++RL KLN+YSS LN+LTLM+LTWH VYL  R    C
Subjt:  IFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC

AT3G62580.1 Late embryogenesis abundant protein (LEA) family protein2.9e-1129.51Show/hide
Query:  LLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAM--ACCIGMALLGHLFSRTEWMFPIPKNAEVVQGY---VLVAALFMIFANSLYME
        LL F+ A G  +W TFI   ++   LPRHQ G +QSK++P YF  +   C I ++  G+L     W     K++  V+ Y    L++A      N     
Subjt:  LLGFSMALGMGVWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAM--ACCIGMALLGHLFSRTEWMFPIPKNAEVVQGY---VLVAALFMIFANSLYME

Query:  PRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKS--RMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS
        P    +M +R K+E+E   G E               +  RE  KS  ++  +NK+   ++  SSL N+ +  +L  H  YL+
Subjt:  PRATKVMFERLKMEKEEGRGIEDIAGEAAIATSTPTQAVEREVVKS--RMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCTCCCTGTCTGCATCTTGGCTTCGGCTTCAACTTCAACTTCAAAATGATCAATCTATTTGCTTTGTGTCTCATCGTTACTACTTTAACTGCCGCAGGACTCTG
GTCTCCTCCTCCTCCTCCTCCTCCTTCTCCCAAGGACGTTATTGTAAAAGAAGGGCATCGAGTGGTTGTGGTCGAGTATGGCCACCAAGATCAACACAATACTAAAGTTT
CCATCTCTTCCGAACCCGACCAAGATTCCGAAACCCACCCCACCAGAGATCTTATTTGCGATGTCTATGGAAAATGTAAGCATAAGATAGCCAGTGCTGTAGAGAAAGCT
AAAGTAATGACCTCGGAAACGGCCCAAGAGGCCCACCATGTCGCGGATGCATTTGAGGGAGCCAAAGACAAAATTTCAGGCAAACTGAAAGAGGGTGCAAAGGAAACATT
GAAACAAGCCAAAGCAGGGGAAGAATACGTTATGGAAGGTGGTGAACGGCTGGCAATGGAGACACGGGAGAAGATCAAAACAGGGGAAAACAAAGTGAAGGAGAATCTGA
TGGGTATAGTTGGTAGAGGAGGAAGGGTAATAAATTATTCATTTAGGCACTTGGGTTTTGGTATGGATGTGTTGGGTCTGTTGGGATTTTCCATGGCTTTGGGAATGGGC
GTGTGGGTTACCTTCATTTCTAGCTATGTGTTGGGGAGTGTGCTGCCAAGGCACCAATTGGGTGTGGTACAGAGCAAGATATATCCTGTGTATTTTAAGGCCATGGCTTG
TTGCATTGGGATGGCTCTATTGGGGCATCTATTCAGCCGCACAGAATGGATGTTTCCAATTCCAAAAAATGCGGAAGTGGTTCAAGGATATGTACTTGTGGCTGCACTTT
TCATGATCTTTGCCAATTCTCTCTACATGGAGCCTCGAGCCACCAAGGTAATGTTTGAGAGATTAAAAATGGAAAAGGAGGAAGGAAGAGGAATTGAGGACATAGCCGGT
GAAGCAGCAATCGCAACCAGCACACCCACACAAGCTGTAGAGCGAGAAGTGGTGAAATCTAGAATGGTGGGGTTGAACAAGAGGCTAAAGAAGTTGAATTCATATTCGTC
CTTGTTAAACCTACTCACTCTCATGGCTCTCACCTGGCATCTTGTGTACCTGAGTCATCGTTTTTGCAACCCCTGCTAA
mRNA sequenceShow/hide mRNA sequence
TGCTTGCTTCGTAAATAAATGTCATCTCCCTGTCTGCATCTTGGCTTCGGCTTCAACTTCAACTTCAAAATGATCAATCTATTTGCTTTGTGTCTCATCGTTACTACTTT
AACTGCCGCAGGACTCTGGTCTCCTCCTCCTCCTCCTCCTCCTTCTCCCAAGGACGTTATTGTAAAAGAAGGGCATCGAGTGGTTGTGGTCGAGTATGGCCACCAAGATC
AACACAATACTAAAGTTTCCATCTCTTCCGAACCCGACCAAGATTCCGAAACCCACCCCACCAGAGATCTTATTTGCGATGTCTATGGAAAATGTAAGCATAAGATAGCC
AGTGCTGTAGAGAAAGCTAAAGTAATGACCTCGGAAACGGCCCAAGAGGCCCACCATGTCGCGGATGCATTTGAGGGAGCCAAAGACAAAATTTCAGGCAAACTGAAAGA
GGGTGCAAAGGAAACATTGAAACAAGCCAAAGCAGGGGAAGAATACGTTATGGAAGGTGGTGAACGGCTGGCAATGGAGACACGGGAGAAGATCAAAACAGGGGAAAACA
AAGTGAAGGAGAATCTGATGGGTATAGTTGGTAGAGGAGGAAGGGTAATAAATTATTCATTTAGGCACTTGGGTTTTGGTATGGATGTGTTGGGTCTGTTGGGATTTTCC
ATGGCTTTGGGAATGGGCGTGTGGGTTACCTTCATTTCTAGCTATGTGTTGGGGAGTGTGCTGCCAAGGCACCAATTGGGTGTGGTACAGAGCAAGATATATCCTGTGTA
TTTTAAGGCCATGGCTTGTTGCATTGGGATGGCTCTATTGGGGCATCTATTCAGCCGCACAGAATGGATGTTTCCAATTCCAAAAAATGCGGAAGTGGTTCAAGGATATG
TACTTGTGGCTGCACTTTTCATGATCTTTGCCAATTCTCTCTACATGGAGCCTCGAGCCACCAAGGTAATGTTTGAGAGATTAAAAATGGAAAAGGAGGAAGGAAGAGGA
ATTGAGGACATAGCCGGTGAAGCAGCAATCGCAACCAGCACACCCACACAAGCTGTAGAGCGAGAAGTGGTGAAATCTAGAATGGTGGGGTTGAACAAGAGGCTAAAGAA
GTTGAATTCATATTCGTCCTTGTTAAACCTACTCACTCTCATGGCTCTCACCTGGCATCTTGTGTACCTGAGTCATCGTTTTTGCAACCCCTGCTAATACATAAAAGAGT
ATTTTCTTTTTCTTGTTTTGTTTTTATTAGGATGAAGATGATGGGCATAGCATATTTTAGCTTTTAAGTACATATACTAATGGTAATGGTACTGTTCTTAGGTGTGGTCC
ATGTGCCAAATATTATTAATGTCTAACTTAGAGCAGGAGGATTATCCAATGTATATTTCAGATTGGTCATTAGTTGTTGTTTTGATTATTTTATTCTCAATTCCAAAATT
ATTTCTGGAAGGAAACTAAAGAGAGTGAAAATGGCGATTAGTTTGAAGAACAGGTGGTGAGATGCATGAGAGAGGTG
Protein sequenceShow/hide protein sequence
MSSPCLHLGFGFNFNFKMINLFALCLIVTTLTAAGLWSPPPPPPPSPKDVIVKEGHRVVVVEYGHQDQHNTKVSISSEPDQDSETHPTRDLICDVYGKCKHKIASAVEKA
KVMTSETAQEAHHVADAFEGAKDKISGKLKEGAKETLKQAKAGEEYVMEGGERLAMETREKIKTGENKVKENLMGIVGRGGRVINYSFRHLGFGMDVLGLLGFSMALGMG
VWVTFISSYVLGSVLPRHQLGVVQSKIYPVYFKAMACCIGMALLGHLFSRTEWMFPIPKNAEVVQGYVLVAALFMIFANSLYMEPRATKVMFERLKMEKEEGRGIEDIAG
EAAIATSTPTQAVEREVVKSRMVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSHRFCNPC