; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G18100 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G18100
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
Genome locationClcChr03:30437259..30439679
RNA-Seq ExpressionClc03G18100
SyntenyClc03G18100
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR037219 - Peptidase M41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10198.1 uncharacterized protein E5676_scaffold16G003430 [Cucumis melo var. makuwa]4.8e-12172.67Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MF T A YDFTF+L+FHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKSA+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        +LSLQPLVD ILDS+QQCLQ                           SF L+E+         SAEK ES +AEGRHSSRCEE+E  ICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIM
        YLMGVLPKEY+VPS+QALSQNRFAEGKVSFVGFEFLGE LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVLTW GL KSEADLHL+WAATNTAFIM
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIM

Query:  SQHCETRSRLAEAMALGKPIGLCIDTIENCLQG
        S+HCETR RLAEAM L KPIGLCIDTIENCL+G
Subjt:  SQHCETRSRLAEAMALGKPIGLCIDTIENCLQG

XP_004135797.2 uncharacterized protein LOC101213254 isoform X2 [Cucumis sativus]1.4e-12067.95Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MF TTA YDFTFNL+FH R+PVTGDV+SS          AKRRRALKLVDRALSKRQYKSA+SLVKQLQGKPYGLR FGAAKQI K+   +DESE+NR D
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQ-SLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLV
        ILSLQPLVD ILDS+QQCLQ SL E +                                     S EKLES +AEGRHSSRCEE+E  ICAQHEAGHFLV
Subjt:  ILSLQPLVDLILDSIQQCLQ-SLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLV

Query:  GYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL
        GYLMGVLPK Y+VPSIQAL QNRFAEGKVSFVGFEFLGEI                            LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL
Subjt:  GYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL

Query:  GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
         SVLTWLGL KSEADLHL+WAATNTAFIMS+HCETRSRLAEAMAL KPIGLCID IENCL+G  I
Subjt:  GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

XP_022968755.1 uncharacterized protein LOC111467900 isoform X1 [Cucurbita maxima]4.6e-12469.19Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MFFT AD DFT NL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KD
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        ILSLQPLVD ILDSIQ C                                          LQ SAE+LESLIAEGR+ SRCEEEE LICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------------LNQFSCVALGGLVAELLVAGNSDGHLA
        YLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+I                                  LNQFSCV LGGLVAELLVAGNSDGHLA
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------------LNQFSCVALGGLVAELLVAGNSDGHLA

Query:  DMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
        D+LKL SVL WLGL KS+AD HLKWAA NTAFIMS+H ETR  LA+ MALGK IG CIDTIENCLQG+EI
Subjt:  DMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

XP_022968757.1 uncharacterized protein LOC111467900 isoform X2 [Cucurbita maxima]1.9e-12572.65Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MFFT AD DFT NL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KD
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        ILSLQPLVD ILDSIQ C                                          LQ SAE+LESLIAEGR+ SRCEEEE LICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGE---------------ILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEA
        YLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+                LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVL WLGL KS+A
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGE---------------ILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEA

Query:  DLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
        D HLKWAA NTAFIMS+H ETR  LA+ MALGK IG CIDTIENCLQG+EI
Subjt:  DLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

XP_038879283.1 uncharacterized protein LOC120071224 isoform X1 [Benincasa hispida]2.6e-13573.97Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MFFT A YDFTFNL+FHRRIPVTG+VISSVKRGES DGA KRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQI KR S MDE ELNRKD
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQ-SLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLV
        IL+LQPLV  ILDSIQQCLQ SL E++                                     SAEKL+SL+A+GRHSSRCEEEE  ICAQHEAGHFLV
Subjt:  ILSLQPLVDLILDSIQQCLQ-SLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLV

Query:  GYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL
        GYLMGVLPKEYEVPSIQAL+QNRFAEGKVSFVGFEFLGEI                            LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL
Subjt:  GYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL

Query:  GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
        GSVLTWLG SKSEAD+HLKWAATNTAFIMS+HCETRSRLAEAMALGKPIGLCID IENCLQGME+
Subjt:  GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

TrEMBL top hitse value%identityAlignment
A0A1S3BP83 uncharacterized protein LOC1034922182.3e-11666.76Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MF T A YDFTF+L+FHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKSA+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        +LSLQPLVD ILDS+QQCLQ                           SF L+E+         SAEK ES +AEGRHSSRCEE+E  ICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLG
        YLMGVLPKEY+VPS+QALSQNRFAEGKVSFVGFEFLGEI                            LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL 
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLG

Query:  SVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQG
        SVLTW GL KSEADLHL+WAATNTAFIMS+HCETR RLAEAM L KPIGLCI+ IENCL+G
Subjt:  SVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQG

A0A5D3CG48 Uncharacterized protein2.3e-12172.67Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MF T A YDFTF+L+FHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKSA+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        +LSLQPLVD ILDS+QQCLQ                           SF L+E+         SAEK ES +AEGRHSSRCEE+E  ICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIM
        YLMGVLPKEY+VPS+QALSQNRFAEGKVSFVGFEFLGE LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVLTW GL KSEADLHL+WAATNTAFIM
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIM

Query:  SQHCETRSRLAEAMALGKPIGLCIDTIENCLQG
        S+HCETR RLAEAM L KPIGLCIDTIENCL+G
Subjt:  SQHCETRSRLAEAMALGKPIGLCIDTIENCLQG

A0A6J1HDU1 uncharacterized protein LOC1114619604.1e-11867.3Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MFFT AD DFTFNL+FHRRIPVTGDVISS KRG+S DGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQI KRPSAMD        
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
         LSLQPLVD ILDSIQ C                                          LQ SAE+LESLIAEGR+ SRCEEEE LICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------------LNQFSCVALGGLVAELLVAGNSDGHLA
        YLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLGEI                                  L QFSCV LGGLVAELLVAGNSDGHLA
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------------LNQFSCVALGGLVAELLVAGNSDGHLA

Query:  DMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
        D+LKL SVL WLGL KS+AD   KWAA NTAFIMS+H ETRS LA+ MALGK IG CIDTIENCLQG+EI
Subjt:  DMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

A0A6J1HUE1 uncharacterized protein LOC111467900 isoform X29.1e-12672.65Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MFFT AD DFT NL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KD
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        ILSLQPLVD ILDSIQ C                                          LQ SAE+LESLIAEGR+ SRCEEEE LICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGE---------------ILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEA
        YLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+                LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVL WLGL KS+A
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGE---------------ILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEA

Query:  DLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
        D HLKWAA NTAFIMS+H ETR  LA+ MALGK IG CIDTIENCLQG+EI
Subjt:  DLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

A0A6J1HY40 uncharacterized protein LOC111467900 isoform X12.2e-12469.19Show/hide
Query:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD
        MFFT AD DFT NL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KD
Subjt:  MFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKD

Query:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG
        ILSLQPLVD ILDSIQ C                                          LQ SAE+LESLIAEGR+ SRCEEEE LICAQHEAGHFLVG
Subjt:  ILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVG

Query:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------------LNQFSCVALGGLVAELLVAGNSDGHLA
        YLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+I                                  LNQFSCV LGGLVAELLVAGNSDGHLA
Subjt:  YLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI----------------------------------LNQFSCVALGGLVAELLVAGNSDGHLA

Query:  DMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
        D+LKL SVL WLGL KS+AD HLKWAA NTAFIMS+H ETR  LA+ MALGK IG CIDTIENCLQG+EI
Subjt:  DMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54680.1 unknown protein6.0e-3743.58Show/hide
Query:  EEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI------------------------LNQFSCVALGGLVAELLV
        EE+     QHE+GHFLVGYL+GVLP+ YE+P+++A+ QN     G+V FVGFEFL ++                        LN FSCV LGG+V E ++
Subjt:  EEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI------------------------LNQFSCVALGGLVAELLV

Query:  AGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
         G S+G  +D++KL  VL WLG ++SE + H+KWA +NT  ++  H E R  LAE MA  KPI  CI+ IE+ +   +I
Subjt:  AGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

AT1G54680.2 unknown protein6.0e-3743.58Show/hide
Query:  EEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI------------------------LNQFSCVALGGLVAELLV
        EE+     QHE+GHFLVGYL+GVLP+ YE+P+++A+ QN     G+V FVGFEFL ++                        LN FSCV LGG+V E ++
Subjt:  EEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI------------------------LNQFSCVALGGLVAELLV

Query:  AGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
         G S+G  +D++KL  VL WLG ++SE + H+KWA +NT  ++  H E R  LAE MA  KPI  CI+ IE+ +   +I
Subjt:  AGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

AT1G54680.3 unknown protein1.6e-3745.09Show/hide
Query:  EEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI------------------LNQFSCVALGGLVAELLVAGNSDG
        EE+     QHE+GHFLVGYL+GVLP+ YE+P+++A+ QN     G+V FVGFEFL ++                  LN FSCV LGG+V E ++ G S+G
Subjt:  EEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI------------------LNQFSCVALGGLVAELLVAGNSDG

Query:  HLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI
          +D++KL  VL WLG ++SE + H+KWA +NT  ++  H E R  LAE MA  KPI  CI+ IE+ +   +I
Subjt:  HLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALGKPIGLCIDTIENCLQGMEI

AT5G27290.1 unknown protein6.4e-3936.36Show/hide
Query:  SVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQ
        S  G + RR+AL+ VD  LS    ++ALSLVK LQGKP GLR FGAA+Q+ +R   ++E +LN  +  SL    D  L SI++ LQ  +  V   ++  +
Subjt:  SVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQ

Query:  VFISEDFGVIDDSSFVLQELHSLFVLLQT-----SAEKLESLIAEGR-HSSRCEEEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQ--NRFAEG
         F        D SS  L  L   F+ L T         + SL+ +   H+       R++  QHEAGHFLV YL+G+LP+ Y + S++AL +  +   + 
Subjt:  VFISEDFGVIDDSSFVLQELHSLFVLLQT-----SAEKLESLIAEGR-HSSRCEEEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQ--NRFAEG

Query:  KVSFVGFEFLGEI---------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALG
          +FV +EFL E+         LN+FSC+AL G+  E L+ G ++G L D+ KL  ++  LG ++ +AD  ++W+  NT  ++ +H   RS+LA+AM+ G
Subjt:  KVSFVGFEFLGEI---------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMALG

Query:  KPIGLCIDTIENCLQGMEI
        + +G CI  IE+ +   +I
Subjt:  KPIGLCIDTIENCLQGMEI

AT5G27290.2 unknown protein2.4e-2237.08Show/hide
Query:  SVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQ
        S  G + RR+AL+ VD  LS    ++ALSLVK LQGKP GLR FGAA+Q+ +R   ++E +LN  +  SL    D  L SI++ LQ  +  V   ++  +
Subjt:  SVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQ

Query:  VFISEDFGVIDDSSFVLQELHSLFVLLQT-----SAEKLESLIAEGR-HSSRCEEEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQ--NRFAEG
         F        D SS  L  L   F+ L T         + SL+ +   H+       R++  QHEAGHFLV YL+G+LP+ Y + S++AL +  +   + 
Subjt:  VFISEDFGVIDDSSFVLQELHSLFVLLQT-----SAEKLESLIAEGR-HSSRCEEEERLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQ--NRFAEG

Query:  KVSFVGFEFLGEI---------LNQFSCVALGGLVAELLV
          +FV +EFL E+         LN+FSC+AL G+  E L+
Subjt:  KVSFVGFEFLGEI---------LNQFSCVALGGLVAELLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATGTAATATGTTCTTCACTACTGCTGATTACGATTTCACTTTCAACCTACAGTTTCATCGGAGGATTCCGGTGACCGGAGACGTCATTTCATCGGTGAAG
CGCGGGGAATCTGTTGATGGTGCAGCGAAACGACGTCGAGCTCTGAAGCTTGTGGATCGAGCACTCTCGAAACGGCAATACAAATCCGCTCTTTCGTTGGTTAAG
CAGTTGCAAGGGAAACCCTATGGCCTTCGTGCTTTCGGTGCCGCCAAGCAGATATTCAAGAGGCCTTCAGCAATGGACGAATCAGAGCTCAATAGAAAGGATATT
TTATCCCTTCAACCATTAGTGGATTTGATTCTGGATTCAATTCAACAATGTCTTCAATCTTTATCTGAGAGGGTACATTCCCTGCTTCTTGTTATGCAAGTTTTC
ATTTCTGAAGATTTTGGTGTTATTGATGACTCTTCATTTGTGCTTCAAGAACTTCACTCTCTGTTTGTTCTTTTACAGACCTCTGCTGAAAAGCTAGAGAGTTTA
ATTGCTGAAGGTAGACATTCTTCTCGTTGTGAAGAAGAAGAACGCCTCATATGTGCACAACATGAAGCTGGCCATTTCCTTGTTGGCTATTTGATGGGTGTTCTT
CCAAAAGAATATGAGGTGCCAAGCATTCAAGCTCTAAGCCAAAACAGATTTGCTGAAGGAAAAGTTTCATTTGTTGGTTTTGAATTTCTTGGGGAAATATTGAAC
CAGTTTTCATGTGTAGCATTAGGAGGCTTAGTGGCTGAGCTTCTAGTTGCTGGAAATTCTGATGGCCATCTAGCAGATATGCTCAAGCTGGGGAGTGTTCTTACA
TGGCTTGGCCTTTCCAAGTCTGAAGCTGATCTTCATTTAAAATGGGCTGCAACGAACACGGCATTCATAATGTCCCAGCATTGTGAAACAAGATCAAGACTCGCA
GAGGCCATGGCGCTGGGGAAACCGATTGGGCTCTGTATCGACACAATTGAAAACTGTTTACAGGGAATGGAGATATGA
mRNA sequenceShow/hide mRNA sequence
CGGGAATGTTATGTAATATGTTCTTCACTACTGCTGATTACGATTTCACTTTCAACCTACAGTTTCATCGGAGGATTCCGGTGACCGGAGACGTCATTTCATCGG
TGAAGCGCGGGGAATCTGTTGATGGTGCAGCGAAACGACGTCGAGCTCTGAAGCTTGTGGATCGAGCACTCTCGAAACGGCAATACAAATCCGCTCTTTCGTTGG
TTAAGCAGTTGCAAGGGAAACCCTATGGCCTTCGTGCTTTCGGTGCCGCCAAGCAGATATTCAAGAGGCCTTCAGCAATGGACGAATCAGAGCTCAATAGAAAGG
ATATTTTATCCCTTCAACCATTAGTGGATTTGATTCTGGATTCAATTCAACAATGTCTTCAATCTTTATCTGAGAGGGTACATTCCCTGCTTCTTGTTATGCAAG
TTTTCATTTCTGAAGATTTTGGTGTTATTGATGACTCTTCATTTGTGCTTCAAGAACTTCACTCTCTGTTTGTTCTTTTACAGACCTCTGCTGAAAAGCTAGAGA
GTTTAATTGCTGAAGGTAGACATTCTTCTCGTTGTGAAGAAGAAGAACGCCTCATATGTGCACAACATGAAGCTGGCCATTTCCTTGTTGGCTATTTGATGGGTG
TTCTTCCAAAAGAATATGAGGTGCCAAGCATTCAAGCTCTAAGCCAAAACAGATTTGCTGAAGGAAAAGTTTCATTTGTTGGTTTTGAATTTCTTGGGGAAATAT
TGAACCAGTTTTCATGTGTAGCATTAGGAGGCTTAGTGGCTGAGCTTCTAGTTGCTGGAAATTCTGATGGCCATCTAGCAGATATGCTCAAGCTGGGGAGTGTTC
TTACATGGCTTGGCCTTTCCAAGTCTGAAGCTGATCTTCATTTAAAATGGGCTGCAACGAACACGGCATTCATAATGTCCCAGCATTGTGAAACAAGATCAAGAC
TCGCAGAGGCCATGGCGCTGGGGAAACCGATTGGGCTCTGTATCGACACAATTGAAAACTGTTTACAGGGAATGGAGATATGA
Protein sequenceShow/hide protein sequence
MLCNMFFTTADYDFTFNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDI
LSLQPLVDLILDSIQQCLQSLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVLLQTSAEKLESLIAEGRHSSRCEEEERLICAQHEAGHFLVGYLMGVL
PKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLA
EAMALGKPIGLCIDTIENCLQGMEI