; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G01820 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G01820
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationClcChr10:1682822..1686804
RNA-Seq ExpressionClc10G01820
SyntenyClc10G01820
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]7.4e-5136.43Show/hide
Query:  KLLNKPPITT------------FQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS---------------------NNIRQPCPVGSVPIRRTL
        KLLNKP I T            +QTKEGD+IDC+DINKQPA DHP LK HKVQ   +  +S                     NN  + CP G VPIRRTL
Subjt:  KLLNKPPITT------------FQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS---------------------NNIRQPCPVGSVPIRRTL

Query:  KIDLIRLRSLSS--KQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGI-TYYGISAHISVYNLSV
        K DLIRL+SLSS  K   S +          F    + S+ F Y    Y N                      VVS S+++G   YYG  +++SVYN+S+
Subjt:  KIDLIRLRSLSS--KQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGI-TYYGISAHISVYNLSV

Query:  AQDQSSSANVWIV-ALSNAHDVILAGWQVNPGINGDSLPRLDV-------NRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAF
        + DQSSS N+WIV    ++  V++ GW VNP +NGD + R  V            YNM CQGFVQ +PS     PL P+S  EGQQYDY+F I       
Subjt:  AQDQSSSANVWIV-ALSNAHDVILAGWQVNPGINGDSLPRLDV-------NRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAF

Query:  VNRKYNSDWILAQGV-----VWISR-----VGSSTQVGWGGIAKPSPNGINPPLGNSHLP--------------------------------NSVKYVSN
           +   +W +  G       W        V  + Q+ WGGIA+PS +G++P LG+ H P                                N++ Y SN
Subjt:  VNRKYNSDWILAQGV-----VWISR-----VGSSTQVGWGGIAKPSPNGINPPLGNSHLP--------------------------------NSVKYVSN

Query:  PNCYGLSNNQHCGDECFSDYFTFGGLGGNHC
         +CY L+ N +CG +     FTFGG GG +C
Subjt:  PNCYGLSNNQHCGDECFSDYFTFGGLGGNHC

KCW77946.1 hypothetical protein EUGRSUZ_D02193 [Eucalyptus grandis]1.0e-4434.16Show/hide
Query:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISN-------------NIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTS-
        Q KLLNKPPI TF T+EGD+IDCIDI+KQPA DHP LK HK+Q K    +SN               R+PCP+G+VPI+R  K DLIR RS+    + + 
Subjt:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISN-------------NIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTS-

Query:  -DVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIVALSNAH
         +++D    G         V +S  +  LI                                     YG S +ISVYN+S A DQ SS N+WI      H
Subjt:  -DVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIVALSNAH

Query:  -DVILAGWQVNPGINGDSLPRL------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA--------
          +I+AGW+V+P +N D L RL      D  R   YN  CQGFVQ    ITP++PL P S   G  Y+ +  + +++          +W L         
Subjt:  -DVILAGWQVNPGINGDSLPRL------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA--------

Query:  ---QGVVWISRVGSSTQVGWGGIAKPSPNGINPPLGNSHLP---------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG
              ++++    S    WGG+AK   NG  PP+GN H+P                           N  K V  P+CY L N +   D  +   FTFG
Subjt:  ---QGVVWISRVGSSTQVGWGGIAKPSPNGINPPLGNSHLP---------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG

Query:  GLGG
        G GG
Subjt:  GLGG

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]6.9e-4936.43Show/hide
Query:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS-------------------NNIRQPCPVGSVPIRRTLKIDLIRLRSLS
        RQ KLLNKP I T++TKEGD+IDC+DINKQPA DHP LK HKVQ   +  IS                   NN  + CP+G VPIRRTLK DLIRL+SLS
Subjt:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS-------------------NNIRQPCPVGSVPIRRTLKIDLIRLRSLS

Query:  SKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIY-YNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVA-QDQSSSANVWI
        S     +         S+     + S  F +D + + Y+Q                     VVS S+ +   Y+G  A I+VYN+S++ ++QSSSAN+W+
Subjt:  SKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIY-YNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVA-QDQSSSANVWI

Query:  V-ALSNAHDVILAGWQVNPGINGDSLPRLDV-------NRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA
        V     + +V++A       ++GDSLPR  V            YNMLCQGFV  +P I     + P+SI +G+QYDY+F I + +  +  R  +    L 
Subjt:  V-ALSNAHDVILAGWQVNPGINGDSLPRLDV-------NRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA

Query:  --QGVVWISRVGSSTQVGWGGIAKPS-PNGINPPLGNSHLP----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTF
             ++ + +  + QV WGG A+PS  +  +PPLG+ H P                            N++ YVS+ +CY L +N++C  + F   FTF
Subjt:  --QGVVWISRVGSSTQVGWGGIAKPS-PNGINPPLGNSHLP----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTF

Query:  GGLGGNHCA
        GG GG  CA
Subjt:  GGLGGNHCA

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]1.6e-6141.71Show/hide
Query:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQ----------------AKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQT
        Q KLLN+P ITTFQT+EGD+IDC+DINKQPA DHPSLK HK+Q                ++  S I+NN R  CP G VPIRRT+K DLIR+RSLSSK+ 
Subjt:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQ----------------AKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQT

Query:  TSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIV-ALSN
        T     I   GG  FPYN++                                    VVS++MK+GI YYG S  +SVYNLSVAQDQSSS+N+WI+     
Subjt:  TSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIV-ALSN

Query:  AHDVILAGWQVNPGINGDSLPRLDV--NRRPVYN-MLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVG
        A +VILAGWQVNP INGDSL R+ V    RP  N  L  G  ++H +I  ++P                   +EL   +N                    
Subjt:  AHDVILAGWQVNPGINGDSLPRLDV--NRRPVYN-MLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVG

Query:  SSTQVGWGGIAKPSPNGINPPLGNSHLP-----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFGGLGGNHCAP
         + QV WGGIAKPSPNG++PPLGN H P                             N+  Y+SN +CY L N + CG E F    TFGG GGN+C+P
Subjt:  SSTQVGWGGIAKPSPNGINPPLGNSHLP-----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFGGLGGNHCAP

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]3.4e-5638.33Show/hide
Query:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS------------------NNIRQPCPVGSVPIRRTLKIDLIRLRSLSS
        RQ KLLNKP I T++TKEGD+IDC+DINKQPA DHP LK HKVQ   +  +S                  NN  + CPVG VPIRRTLK DLIRL+SLSS
Subjt:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS------------------NNIRQPCPVGSVPIRRTLKIDLIRLRSLSS

Query:  KQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIY-YNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQD-QSSSANVWIV
                       S  P + ++S    YD + + Y Q                     VVS S+ +   Y+G  A I+V+N+S++ + QSSSAN+W++
Subjt:  KQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIY-YNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQD-QSSSANVWIV

Query:  ALS-NAHDVILAGWQVNPGINGDSLPRLDVN-------RRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA-
          S ++ +V++AGWQVNP +NGD+LPR  V            YNMLCQGFV  +P+I     + P+SI +GQQYDY+F I + +  +  R  ++   L  
Subjt:  ALS-NAHDVILAGWQVNPGINGDSLPRLDVN-------RRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA-

Query:  -QGVVWISRVGSSTQVGWGGIAKPSPNG-INPPLGNSHLP----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG
            ++ + +  + QV WGG A+P+  G  +PPLG+ H P                            N++ YVSN +CY L +N++C  + F   FTFG
Subjt:  -QGVVWISRVGSSTQVGWGGIAKPSPNG-INPPLGNSHLP----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG

Query:  GLGGNHC
        G GG+ C
Subjt:  GLGGNHC

TrEMBL top hitse value%identityAlignment
A0A059CIH2 Uncharacterized protein5.0e-4534.16Show/hide
Query:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISN-------------NIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTS-
        Q KLLNKPPI TF T+EGD+IDCIDI+KQPA DHP LK HK+Q K    +SN               R+PCP+G+VPI+R  K DLIR RS+    + + 
Subjt:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISN-------------NIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTS-

Query:  -DVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIVALSNAH
         +++D    G         V +S  +  LI                                     YG S +ISVYN+S A DQ SS N+WI      H
Subjt:  -DVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIVALSNAH

Query:  -DVILAGWQVNPGINGDSLPRL------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA--------
          +I+AGW+V+P +N D L RL      D  R   YN  CQGFVQ    ITP++PL P S   G  Y+ +  + +++          +W L         
Subjt:  -DVILAGWQVNPGINGDSLPRL------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA--------

Query:  ---QGVVWISRVGSSTQVGWGGIAKPSPNGINPPLGNSHLP---------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG
              ++++    S    WGG+AK   NG  PP+GN H+P                           N  K V  P+CY L N +   D  +   FTFG
Subjt:  ---QGVVWISRVGSSTQVGWGGIAKPSPNGINPPLGNSHLP---------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG

Query:  GLGG
        G GG
Subjt:  GLGG

A0A1R3H4A8 Uncharacterized protein2.4e-3935.99Show/hide
Query:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSL------ISNN-------IRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTS
        RQ K+LNKPPI TFQT  GD+IDCIDINKQPA DHP LK HK+Q K  ++      IS+         R  CP G+VPIRRT K DLIR++SL  K    
Subjt:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSL------ISNN-------IRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTS

Query:  DVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWI----VALS
        D +  S++    FP N +                                    VV   MK+G  YYG+  ++S YNLSV  +Q SSANVWI       S
Subjt:  DVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWI----VALS

Query:  NAHDVILAGWQVNPGINGDSLPRLDVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVG--
        +  +VILAGW                N+   YN  C GFV  +P I P + L P S   G QY    ++ ++           +W    G  W + +G  
Subjt:  NAHDVILAGWQVNPGINGDSLPRLDVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVG--

Query:  ----------SSTQVGWGGIAKPSPNGINPPLGNSHLPN
                   +  V WGGIA     G +PP+G+ H+PN
Subjt:  ----------SSTQVGWGGIAKPSPNGINPPLGNSHLPN

A0A5D3CJM0 Neprosin 23.3e-4936.43Show/hide
Query:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS-------------------NNIRQPCPVGSVPIRRTLKIDLIRLRSLS
        RQ KLLNKP I T++TKEGD+IDC+DINKQPA DHP LK HKVQ   +  IS                   NN  + CP+G VPIRRTLK DLIRL+SLS
Subjt:  RQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLIS-------------------NNIRQPCPVGSVPIRRTLKIDLIRLRSLS

Query:  SKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIY-YNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVA-QDQSSSANVWI
        S     +         S+     + S  F +D + + Y+Q                     VVS S+ +   Y+G  A I+VYN+S++ ++QSSSAN+W+
Subjt:  SKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIY-YNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVA-QDQSSSANVWI

Query:  V-ALSNAHDVILAGWQVNPGINGDSLPRLDV-------NRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA
        V     + +V++A       ++GDSLPR  V            YNMLCQGFV  +P I     + P+SI +G+QYDY+F I + +  +  R  +    L 
Subjt:  V-ALSNAHDVILAGWQVNPGINGDSLPRLDV-------NRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA

Query:  --QGVVWISRVGSSTQVGWGGIAKPS-PNGINPPLGNSHLP----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTF
             ++ + +  + QV WGG A+PS  +  +PPLG+ H P                            N++ YVS+ +CY L +N++C  + F   FTF
Subjt:  --QGVVWISRVGSSTQVGWGGIAKPS-PNGINPPLGNSHLP----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTF

Query:  GGLGGNHCA
        GG GG  CA
Subjt:  GGLGGNHCA

A0A6J1CVJ6 uncharacterized protein LOC1110147777.7e-6241.71Show/hide
Query:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQ----------------AKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQT
        Q KLLN+P ITTFQT+EGD+IDC+DINKQPA DHPSLK HK+Q                ++  S I+NN R  CP G VPIRRT+K DLIR+RSLSSK+ 
Subjt:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQ----------------AKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQT

Query:  TSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIV-ALSN
        T     I   GG  FPYN++                                    VVS++MK+GI YYG S  +SVYNLSVAQDQSSS+N+WI+     
Subjt:  TSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIV-ALSN

Query:  AHDVILAGWQVNPGINGDSLPRLDV--NRRPVYN-MLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVG
        A +VILAGWQVNP INGDSL R+ V    RP  N  L  G  ++H +I  ++P                   +EL   +N                    
Subjt:  AHDVILAGWQVNPGINGDSLPRLDV--NRRPVYN-MLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVG

Query:  SSTQVGWGGIAKPSPNGINPPLGNSHLP-----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFGGLGGNHCAP
         + QV WGGIAKPSPNG++PPLGN H P                             N+  Y+SN +CY L N + CG E F    TFGG GGN+C+P
Subjt:  SSTQVGWGGIAKPSPNGINPPLGNSHLP-----------------------------NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFGGLGGNHCAP

A0A6J1CW60 uncharacterized protein LOC1110147756.5e-4537.46Show/hide
Query:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTSDVKDISSIGGSTFP
        Q KLLNKP ITTFQT+EGD+IDC+DINKQPA DHP LK HKVQ                                                         
Subjt:  QFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTSDVKDISSIGGSTFP

Query:  YNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIV-ALSNAHDVILAGWQVNPGIN
                                                V S++MK+G  YYG +  +SVYNLSVAQDQSSS+N+WI+     A +VIL GWQVNP IN
Subjt:  YNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIV-ALSNAHDVILAGWQVNPGIN

Query:  GDSLPRLDVN-------RRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA----QGVV--WISRV-----GS
        GDS  R+ V            YNM C+ F+Q +PS  P+ PL PSS  +G+QYDY F + ++       +   DW LA    Q  +  W   +       
Subjt:  GDSLPRLDVN-------RRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILA----QGVV--WISRV-----GS

Query:  STQVGWGGIAKPSPNGINPPLGNSHLPNSVKYVSNPNCY
        + QV WGGIAKPSPNG++PPLGN H PN  K+  +  CY
Subjt:  STQVGWGGIAKPSPNGINPPLGNSHLPNSVKYVSNPNCY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)3.6e-2727.34Show/hide
Query:  MDCTNAAEPAPKHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISNNIR--------------QPCPVGSVPIRRTL
        +D  N  +   +HL++     +NKP I T  + +GD+IDC+ ++ QPAFDHPSL+G K          +N R              + CP G+VPIRRT 
Subjt:  MDCTNAAEPAPKHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISNNIR--------------QPCPVGSVPIRRTL

Query:  KIDLIRLRSLSSKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVA-Q
        + D++R  S+SS                   + K         +L +Y + T         ++ G +  +  VS     G  YYG  A I+V+   V  Q
Subjt:  KIDLIRLRSLSSKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVA-Q

Query:  DQSSSANVWIVALSNAHDV--ILAGWQVNPGINGDSLPRL-------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFV
         + S + +WI++ S  +D+  I AGWQV+P + GD+ PR               YN+LC GFVQ +  I     + PSS  +G Q+D   +I      + 
Subjt:  DQSSSANVWIVALSNAHDV--ILAGWQVNPGINGDSLPRL-------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFV

Query:  NRKYNSDWI-LAQGVV---WISRVGS-----STQVGWGG-IAKPSPNG--INPPLGNSHL--------------------------PNSVKYVSNPNCYG
        + K+ + W+    G++   W S + +     ++ V +GG I   SP G   +  +G+ H                           PN      +PNCY 
Subjt:  NRKYNSDWI-LAQGVV---WISRVGS-----STQVGWGG-IAKPSPNG--INPPLGNSHL--------------------------PNSVKYVSNPNCYG

Query:  LSNNQHCGDECFSDYFTFGGLGGNHCAP
        +   Q   +  +  YF +GG G N   P
Subjt:  LSNNQHCGDECFSDYFTFGGLGGNHCAP

AT1G55360.1 Protein of Unknown Function (DUF239)2.4e-2326Show/hide
Query:  KHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHN-----SLISNNIRQP------------------CPVGSVPIRRTLKI
        KHL R     LNKP + + Q+ +GD+IDC+ I+KQPAFDHP LK HK+Q K N         N +  P                  C  G++P+RRT + 
Subjt:  KHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHN-----SLISNNIRQP------------------CPVGSVPIRRTLKI

Query:  DLIRLRSLSSKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSV-AQDQ
        D++R          S VK        + P  K+     +       NQ              G +  I  V     +G  YYG  A I+V+   +  Q++
Subjt:  DLIRLRSLSSKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSV-AQDQ

Query:  SSSANVWIVALSNAHDV--ILAGWQVNPGINGDSLPRL-------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------L
         S + +W++  S   D+  I AGWQV+P + GD+  RL              YN+LC GF+Q +  I     + P S     QYD   +I ++       
Subjt:  SSSANVWIVALSNAHDV--ILAGWQVNPGINGDSLPRL-------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------L

Query:  VAFVNRKYNSDWILAQGVVWISRVGSSTQVGWGG-IAKPSPNG--INPPLGNSHLP-------------------NSVK-------YVSNPNCYGLSNNQ
        + F N      W      ++     S++ + WGG +     +G   +  +G+   P                   N++K       +    NCY +   Q
Subjt:  VAFVNRKYNSDWILAQGVVWISRVGSSTQVGWGG-IAKPSPNG--INPPLGNSHLP-------------------NSVK-------YVSNPNCYGLSNNQ

Query:  HCGDECFSDYFTFGGLGGNHCAP
           ++ +  YF +GG G N   P
Subjt:  HCGDECFSDYFTFGGLGGNHCAP

AT1G70550.1 Protein of Unknown Function (DUF239)6.7e-2626.89Show/hide
Query:  LKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKV--------------QAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSK
        L R++   +NKP + T Q+ +GD IDC+  ++QPAFDHP L+G K                +  NS + +   + CP G++PIRRT + D++R  S    
Subjt:  LKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKV--------------QAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSK

Query:  QTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSV-AQDQSSSANVWIVAL
                +   G       ++                          TN G +  +  V+     G  YYG  A I+V++  V +Q + S + +W++A 
Subjt:  QTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSV-AQDQSSSANVWIVAL

Query:  SNAHDV--ILAGWQVNPGINGDSLPRL------DVNRRP-VYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------------------
        S  HD+  I AGWQ++P + GD+ PR       D  R    YN+LC GFVQ +  I     + P S  +G Q+D   +I ++                  
Subjt:  SNAHDV--ILAGWQVNPGINGDSLPRL------DVNRRP-VYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------------------

Query:  --LVAFVNRKYNSDWILAQGVVWISRVGSS---TQVGWG-----GIAKPS--PNGINPPLGNSHLP--NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG
             F + K +   +   G +  +R G S   TQ+G G     G  K S   N       N+ +P  N      +PNCY +       +  + +YF +G
Subjt:  --LVAFVNRKYNSDWILAQGVVWISRVGSS---TQVGWG-----GIAKPS--PNGINPPLGNSHLP--NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG

Query:  GLGGNHCAP
        G G N   P
Subjt:  GLGGNHCAP

AT1G70550.2 Protein of Unknown Function (DUF239)6.7e-2626.89Show/hide
Query:  LKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKV--------------QAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSK
        L R++   +NKP + T Q+ +GD IDC+  ++QPAFDHP L+G K                +  NS + +   + CP G++PIRRT + D++R  S    
Subjt:  LKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKV--------------QAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSK

Query:  QTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSV-AQDQSSSANVWIVAL
                +   G       ++                          TN G +  +  V+     G  YYG  A I+V++  V +Q + S + +W++A 
Subjt:  QTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSV-AQDQSSSANVWIVAL

Query:  SNAHDV--ILAGWQVNPGINGDSLPRL------DVNRRP-VYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------------------
        S  HD+  I AGWQ++P + GD+ PR       D  R    YN+LC GFVQ +  I     + P S  +G Q+D   +I ++                  
Subjt:  SNAHDV--ILAGWQVNPGINGDSLPRL------DVNRRP-VYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------------------

Query:  --LVAFVNRKYNSDWILAQGVVWISRVGSS---TQVGWG-----GIAKPS--PNGINPPLGNSHLP--NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG
             F + K +   +   G +  +R G S   TQ+G G     G  K S   N       N+ +P  N      +PNCY +       +  + +YF +G
Subjt:  --LVAFVNRKYNSDWILAQGVVWISRVGSS---TQVGWG-----GIAKPS--PNGINPPLGNSHLP--NSVKYVSNPNCYGLSNNQHCGDECFSDYFTFG

Query:  GLGGNHCAP
        G G N   P
Subjt:  GLGGNHCAP

AT3G13510.1 Protein of Unknown Function (DUF239)1.8e-2326.07Show/hide
Query:  KHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHN----SLISNN------------IRQ------PCPVGSVPIRRTLKID
        KHL R     LNKPP+ T Q+ +GD+IDCI I+KQPAFDHP LK HK+Q + +     L  +N            I Q       C  G++P+RRT + D
Subjt:  KHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHN----SLISNN------------IRQ------PCPVGSVPIRRTLKID

Query:  LIRLRSLSSKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQ-DQS
        ++R          S VK        + P  K+     +                       G +  I  V     +G  YYG  A ++V+   +   ++ 
Subjt:  LIRLRSLSSKQTTSDVKDISSIGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQ-DQS

Query:  SSANVWIVALSNAHDV--ILAGWQVNPGINGDSLPRL-------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------LV
        S + +W++  S   D+  I AGWQV+P + GD+  RL              YN+LC GF+Q +  I     + P S     QYD   +I ++       +
Subjt:  SSANVWIVALSNAHDV--ILAGWQVNPGINGDSLPRL-------DVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRE------LV

Query:  AFVNRKYNSDWILAQGVVWISRVGSSTQVGWGG-IAKPSPNGIN--PPLGNSHLP-------------------NSVK-------YVSNPNCYGLSNNQH
         F N      W      ++     S++ + WGG +      G +    +G+ H P                   N++K       +    NCY +   Q 
Subjt:  AFVNRKYNSDWILAQGVVWISRVGSSTQVGWGG-IAKPSPNGIN--PPLGNSHLP-------------------NSVK-------YVSNPNCYGLSNNQH

Query:  CGDECFSDYFTFGGLGGNHCAP
          ++ +  YF +GG G N   P
Subjt:  CGDECFSDYFTFGGLGGNHCAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGCACAAATGCAGCAGAACCAGCACCCAAACATCTCAAGAGAAGGCAGTTCAAACTTCTCAACAAGCCACCCATCACAACATTTCAGACGAAGGAGGGAGATTT
AATCGATTGTATTGACATCAATAAACAACCAGCTTTTGATCATCCGTCCCTAAAAGGTCACAAAGTTCAGGCAAAACATAACTCACTCATAAGTAACAATATCAGACAAC
CTTGTCCAGTTGGGTCTGTTCCCATCCGAAGAACTTTGAAAATTGATTTGATTAGGTTAAGGTCTCTATCATCAAAACAAACAACGTCCGATGTGAAAGATATTAGTAGT
ATTGGTGGAAGTACATTTCCATACAATAAAAATGTAAGCATCTCTTTCGTATATGATCAATTAATTTATTACAATCAAAAGACCTGTTTTTCTTTTAATTTTTTTGTACA
AACAAATAGAGGCAAAAAGACATGTATTTGGGTGGTTTCTCTTAGTATGAAGCAAGGCATAACATATTATGGAATTAGTGCACATATATCTGTATACAATTTGAGTGTGG
CTCAAGATCAATCTTCTTCTGCAAATGTCTGGATAGTGGCCCTTTCTAATGCTCATGATGTAATACTTGCAGGGTGGCAGGTGAATCCAGGAATAAATGGAGATAGTCTC
CCTAGATTGGACGTTAACAGGCGTCCTGTTTACAATATGCTTTGTCAAGGCTTTGTACAAGCACATCCAAGTATTACTCCACACTTTCCTCTTCAACCATCCTCCATCGA
TGAAGGACAACAATATGACTACGAGTTTGTTATTGATCGGGAATTGGTGGCTTTTGTTAACAGAAAATACAACAGCGATTGGATATTGGCCCAAGGAGTTGTTTGGATAT
CTAGAGTTGGGAGCAGCACACAAGTGGGGTGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATCAACCCTCCCTTAGGGAACTCCCACCTTCCAAACAGTGTGAAGTAT
GTTAGTAACCCTAATTGTTATGGTTTGAGCAATAATCAACATTGTGGGGATGAATGCTTTAGCGATTATTTTACCTTTGGAGGACTGGGGGGGAATCATTGTGCTCCAAA
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTGCACAAATGCAGCAGAACCAGCACCCAAACATCTCAAGAGAAGGCAGTTCAAACTTCTCAACAAGCCACCCATCACAACATTTCAGACGAAGGAGGGAGATTT
AATCGATTGTATTGACATCAATAAACAACCAGCTTTTGATCATCCGTCCCTAAAAGGTCACAAAGTTCAGGCAAAACATAACTCACTCATAAGTAACAATATCAGACAAC
CTTGTCCAGTTGGGTCTGTTCCCATCCGAAGAACTTTGAAAATTGATTTGATTAGGTTAAGGTCTCTATCATCAAAACAAACAACGTCCGATGTGAAAGATATTAGTAGT
ATTGGTGGAAGTACATTTCCATACAATAAAAATGTAAGCATCTCTTTCGTATATGATCAATTAATTTATTACAATCAAAAGACCTGTTTTTCTTTTAATTTTTTTGTACA
AACAAATAGAGGCAAAAAGACATGTATTTGGGTGGTTTCTCTTAGTATGAAGCAAGGCATAACATATTATGGAATTAGTGCACATATATCTGTATACAATTTGAGTGTGG
CTCAAGATCAATCTTCTTCTGCAAATGTCTGGATAGTGGCCCTTTCTAATGCTCATGATGTAATACTTGCAGGGTGGCAGGTGAATCCAGGAATAAATGGAGATAGTCTC
CCTAGATTGGACGTTAACAGGCGTCCTGTTTACAATATGCTTTGTCAAGGCTTTGTACAAGCACATCCAAGTATTACTCCACACTTTCCTCTTCAACCATCCTCCATCGA
TGAAGGACAACAATATGACTACGAGTTTGTTATTGATCGGGAATTGGTGGCTTTTGTTAACAGAAAATACAACAGCGATTGGATATTGGCCCAAGGAGTTGTTTGGATAT
CTAGAGTTGGGAGCAGCACACAAGTGGGGTGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATCAACCCTCCCTTAGGGAACTCCCACCTTCCAAACAGTGTGAAGTAT
GTTAGTAACCCTAATTGTTATGGTTTGAGCAATAATCAACATTGTGGGGATGAATGCTTTAGCGATTATTTTACCTTTGGAGGACTGGGGGGGAATCATTGTGCTCCAAA
TTAG
Protein sequenceShow/hide protein sequence
MDCTNAAEPAPKHLKRRQFKLLNKPPITTFQTKEGDLIDCIDINKQPAFDHPSLKGHKVQAKHNSLISNNIRQPCPVGSVPIRRTLKIDLIRLRSLSSKQTTSDVKDISS
IGGSTFPYNKNVSISFVYDQLIYYNQKTCFSFNFFVQTNRGKKTCIWVVSLSMKQGITYYGISAHISVYNLSVAQDQSSSANVWIVALSNAHDVILAGWQVNPGINGDSL
PRLDVNRRPVYNMLCQGFVQAHPSITPHFPLQPSSIDEGQQYDYEFVIDRELVAFVNRKYNSDWILAQGVVWISRVGSSTQVGWGGIAKPSPNGINPPLGNSHLPNSVKY
VSNPNCYGLSNNQHCGDECFSDYFTFGGLGGNHCAPN