; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G01830 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G01830
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionNeprosin domain-containing protein
Genome locationClcChr10:1686953..1688470
RNA-Seq ExpressionClc10G01830
SyntenyClc10G01830
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ESR41651.1 hypothetical protein CICLE_v10012304mg [Citrus clementina]3.5e-2352.17Show/hide
Query:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ
        +D   +G +PKELF+++S GA+ V WGGIA A KNG+SPP+GSG L N +FR  CYIR I+YV+ +N    P    L+Q+   S CYGL+D + CG   +
Subjt:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ

Query:  MYYCFTYGGTGGRCG
        MYYC  +GG GGRCG
Subjt:  MYYCFTYGGTGGRCG

KDO46423.1 hypothetical protein CISIN_1g045979mg [Citrus sinensis]7.9e-2352.17Show/hide
Query:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ
        +D   +G +PKELF+++S GA+ V WGGIA A KNG SPP+GSG L N +FR  CYIR I+YV+ +N    P    L+Q+   S CYGL+D + CG   +
Subjt:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ

Query:  MYYCFTYGGTGGRCG
        MYYC  +GG GGRCG
Subjt:  MYYCFTYGGTGGRCG

OMP03993.1 hypothetical protein COLO4_10040 [Corchorus olitorius]5.1e-2247.27Show/hide
Query:  IGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCF
        IG +PKEL   +SNGA QV WGGIA A K G SPP+GSG+ PN ++  +C+   I ++N++   + P +     Y   S CYGL D + CG  + M+YCF
Subjt:  IGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCF

Query:  TYGGTGGRCG
        T+GG GG+CG
Subjt:  TYGGTGGRCG

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]2.7e-2345.53Show/hide
Query:  STRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGL
        ST    + V +    IG +PKELF ++++GA+QV WGGIA+ + NGMSPPLG+G+ P NG +  ACY + I Y++  N G+ P    +  +  +S+CYGL
Subjt:  STRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGL

Query:  QDGRTCGGFDQMYYCFTYGGTGG
         DG      D MY+CFT+GG GG
Subjt:  QDGRTCGGFDQMYYCFTYGGTGG

XP_024038072.1 uncharacterized protein LOC18039972 [Citrus clementina]3.5e-2352.17Show/hide
Query:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ
        +D   +G +PKELF+++S GA+ V WGGIA A KNG+SPP+GSG L N +FR  CYIR I+YV+ +N    P    L+Q+   S CYGL+D + CG   +
Subjt:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ

Query:  MYYCFTYGGTGGRCG
        MYYC  +GG GGRCG
Subjt:  MYYCFTYGGTGGRCG

TrEMBL top hitse value%identityAlignment
A0A067DUA4 Neprosin domain-containing protein3.8e-2352.17Show/hide
Query:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ
        +D   +G +PKELF+++S GA+ V WGGIA A KNG SPP+GSG L N +FR  CYIR I+YV+ +N    P    L+Q+   S CYGL+D + CG   +
Subjt:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ

Query:  MYYCFTYGGTGGRCG
        MYYC  +GG GGRCG
Subjt:  MYYCFTYGGTGGRCG

A0A1R3KA97 Neprosin domain-containing protein2.5e-2247.27Show/hide
Query:  IGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCF
        IG +PKEL   +SNGA QV WGGIA A K G SPP+GSG+ PN ++  +C+   I ++N++   + P +     Y   S CYGL D + CG  + M+YCF
Subjt:  IGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCF

Query:  TYGGTGGRCG
        T+GG GG+CG
Subjt:  TYGGTGGRCG

A0A2H5QMC5 Uncharacterized protein1.7e-2352.17Show/hide
Query:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ
        +D   +G +PKELF+++S GA+ V WGGIA A KNG+SPP+GSG L N +FR  CYIR I+YV+ +N    P    L+Q+   S CYGL+D + CG   +
Subjt:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ

Query:  MYYCFTYGGTGGRCG
        MYYC  +GG GGRCG
Subjt:  MYYCFTYGGTGGRCG

A0A6J1CVW9 uncharacterized protein LOC1110147741.3e-2345.53Show/hide
Query:  STRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGL
        ST    + V +    IG +PKELF ++++GA+QV WGGIA+ + NGMSPPLG+G+ P NG +  ACY + I Y++  N G+ P    +  +  +S+CYGL
Subjt:  STRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGL

Query:  QDGRTCGGFDQMYYCFTYGGTGG
         DG      D MY+CFT+GG GG
Subjt:  QDGRTCGGFDQMYYCFTYGGTGG

V4SWW2 Uncharacterized protein1.7e-2352.17Show/hide
Query:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ
        +D   +G +PKELF+++S GA+ V WGGIA A KNG+SPP+GSG L N +FR  CYIR I+YV+ +N    P    L+Q+   S CYGL+D + CG   +
Subjt:  EDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQ

Query:  MYYCFTYGGTGGRCG
        MYYC  +GG GGRCG
Subjt:  MYYCFTYGGTGGRCG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G35250.1 Protein of Unknown Function (DUF239)1.6e-1030.97Show/hide
Query:  KAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMY-
        + IG +PKELFS+++NGA  V +GG    + +G+SPP+G+G  P  +F+   +   +  +N++   +  +  +++ Y    NC+      T  G+ +   
Subjt:  KAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMY-

Query:  YCFTYGGTGGRCG
          F++GG GG CG
Subjt:  YCFTYGGTGGRCG

AT4G23390.1 Protein of Unknown Function (DUF239)4.8e-1030.08Show/hide
Query:  KLEVTVYED-----------GKAIGCFPKELFS--NISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGM-PPKQSELQQ
        +LEV++Y+D            + IG +PK LF+   +++GA  V WGG   ++    SP +GSG+ P   F+ A Y+ G++ + +    +  P  S L+ 
Subjt:  KLEVTVYED-----------GKAIGCFPKELFS--NISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGM-PPKQSELQQ

Query:  YNGDSNCYGLQDGRTCGGFDQMYYCFTYGGTGG
        +    NCY +Q     G F        +GG GG
Subjt:  YNGDSNCYGLQDGRTCGGFDQMYYCFTYGGTGG

AT5G11660.1 Protein of Unknown Function (DUF239)2.7e-1336.94Show/hide
Query:  IGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCF
        IG +PKELF  I N    V   G  +A+ +G+SPP+G+G LP+ +   + +++G++ V+++      K+ +L++   D+ CYGL+DG+    F +    F
Subjt:  IGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCF

Query:  TYGGTGGR-CG
        TYGG GG  CG
Subjt:  TYGGTGGR-CG

AT5G25410.1 Protein of Unknown Function (DUF239)8.1e-1029.58Show/hide
Query:  LQLSAHLQENMRSNYMSTRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQ
        L  S H Q+    N+  T+ +      D   +G +PKELF+ I NGA  V  GG  +A+  G SPP+G+G  P G+ + +     I  +N+         
Subjt:  LQLSAHLQENMRSNYMSTRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQ

Query:  SELQQYNGDSNCYGLQDGRTCGGFDQMYYCFTYGGTGGR-CG
          +++      CYG+   +       + + F YGG GG  CG
Subjt:  SELQQYNGDSNCYGLQDGRTCGGFDQMYYCFTYGGTGGR-CG

AT5G25415.1 Protein of Unknown Function (DUF239)6.2e-1035.34Show/hide
Query:  QENMRSNYMSTRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYN
        Q+    N+  T+ L+    ED   IG +PKELF+ I+NGA  V  GG  +A+ +G SPP+G+GN P G    +     I  V + N       S   +  
Subjt:  QENMRSNYMSTRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIAEAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYN

Query:  GDS-NCYGLQDGRT-CGGFDQMYYCFTYGGTGG
         DS  CYGL+ G+       ++ + F YGG GG
Subjt:  GDS-NCYGLQDGRT-CGGFDQMYYCFTYGGTGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTAGAATATGCTTTGTCGAGGCTTTGTCCAAACAGATCGATCATATTATGTAGGTTTCCTTTGAGACCAACTTCTTTTGTTGGGGGAGATGAATATGTGAGATC
AAATTATGGATGCAATCATACCAGCACTAATGCAACGGATCTCATTAGAACTCTGCAGTTAAGCGCGCACCTTCAAGAAAATATGAGATCAAATTATATGTCTACCAGGA
AATTGGAAGTTACAGTTTATGAAGATGGAAAAGCGATTGGATGTTTTCCAAAAGAGTTGTTTTCAAATATAAGCAATGGGGCAAAACAAGTGTGTTGGGGAGGCATTGCA
GAGGCAGCGAAAAATGGAATGAGCCCTCCATTGGGCAGTGGTAATTTACCTAATGGAAACTTTAGGGTTGCATGTTACATTAGAGGAATTAGATATGTGAATAATGAAAA
CTTGGGAATGCCTCCAAAACAATCTGAACTTCAACAATATAATGGGGACTCTAATTGTTATGGTTTGCAAGATGGTAGAACTTGTGGGGGTTTTGACCAAATGTATTATT
GCTTCACATATGGTGGAACAGGTGGGAGATGTGGGGCTGTTCAAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTAGAATATGCTTTGTCGAGGCTTTGTCCAAACAGATCGATCATATTATGTAGGTTTCCTTTGAGACCAACTTCTTTTGTTGGGGGAGATGAATATGTGAGATC
AAATTATGGATGCAATCATACCAGCACTAATGCAACGGATCTCATTAGAACTCTGCAGTTAAGCGCGCACCTTCAAGAAAATATGAGATCAAATTATATGTCTACCAGGA
AATTGGAAGTTACAGTTTATGAAGATGGAAAAGCGATTGGATGTTTTCCAAAAGAGTTGTTTTCAAATATAAGCAATGGGGCAAAACAAGTGTGTTGGGGAGGCATTGCA
GAGGCAGCGAAAAATGGAATGAGCCCTCCATTGGGCAGTGGTAATTTACCTAATGGAAACTTTAGGGTTGCATGTTACATTAGAGGAATTAGATATGTGAATAATGAAAA
CTTGGGAATGCCTCCAAAACAATCTGAACTTCAACAATATAATGGGGACTCTAATTGTTATGGTTTGCAAGATGGTAGAACTTGTGGGGGTTTTGACCAAATGTATTATT
GCTTCACATATGGTGGAACAGGTGGGAGATGTGGGGCTGTTCAAAATTAA
Protein sequenceShow/hide protein sequence
MVLEYALSRLCPNRSIILCRFPLRPTSFVGGDEYVRSNYGCNHTSTNATDLIRTLQLSAHLQENMRSNYMSTRKLEVTVYEDGKAIGCFPKELFSNISNGAKQVCWGGIA
EAAKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNNENLGMPPKQSELQQYNGDSNCYGLQDGRTCGGFDQMYYCFTYGGTGGRCGAVQN