; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G012050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G012050
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr07:17560277..17561186
RNA-Seq ExpressionLsi07G012050
SyntenyLsi07G012050
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB99729.1 hypothetical protein L484_023259 [Morus notabilis]2.6e-2544.19Show/hide
Query:  YGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQDQESGNWWLLLTEKR--IAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNS
        YG    VS+ +L +A DQ SSA+VWI  GPS +LN I AGW D+++GNWW  ++++   I +GYWP++L          V WGGI KP   G+SP +GN 
Subjt:  YGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQDQESGNWWLLLTEKR--IAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNS

Query:  HFPGNGKYKEACYFRNINYITKNLETQKP
        HFP +G Y+ ACYF +++Y+    +   P
Subjt:  HFPGNGKYKEACYFRNINYITKNLETQKP

KAA0053047.1 uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa]2.2e-2435.96Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW-------------------------------------------------------QDQ
        YYG  +++SVYN++++  QSSS+N+WIVGGP+N+L V++ GW                                                       Q Q
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW-------------------------------------------------------QDQ

Query:  E----------SGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPG-NGKYKEACYFRNINYIT-KNLETQK-PFYEN
        +          +GNWW+L+ E  + +GYWPKEL   + +GAEQ+ WGGI KPS +GMSP LG+ H P  NG Y E CY RNI  I+     T K P ++N
Subjt:  E----------SGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPG-NGKYKEACYFRNINYIT-KNLETQK-PFYEN

Query:  TIS
        T+S
Subjt:  TIS

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]5.7e-4458.22Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNV-----------ILAGWQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSS
        YYGV  + SVYNL+VAQDQSSS+N+WIVGGP   LNV           +   W D+ +G+WWL +++ +  IGYWPKELFG++ +GAEQV WGGI KPS 
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNV-----------ILAGWQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSS

Query:  NGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENTIS
        NGMSPPLGN H P NGKY EACYF++INYI  N     P YEN +S
Subjt:  NGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENTIS

XP_022145287.1 uncharacterized protein LOC111014775 [Momordica charantia]1.5e-3642.08Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------
        YYG    VSVYNL+VAQDQSSS+N+WI+GGP  A NVIL GW                                                          
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------

Query:  ---------QDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT
                 QD+ +G+WWL +++ +  IGYWPKELFG++ +GAEQV WGGI KPS NGMSPPLGN H P  GK+ +ACYFR +NYI +N E++    ENT
Subjt:  ---------QDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT

Query:  IS
         S
Subjt:  IS

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]1.8e-4256.95Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQ------------------DQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWG
        YYG +  VSVYNL+VAQDQSSS+N+WI+GGP  A NVILAGWQ                  D+ +GNWWL + E    IGYWPKELFG++ +G EQV WG
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQ------------------DQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWG

Query:  GIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT
        GI KPS NGMSPPLGN H P   KY +ACYFR +NY+ +N + Q P  ENT
Subjt:  GIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT

TrEMBL top hitse value%identityAlignment
A0A5A7UEV4 Uncharacterized protein1.1e-2435.96Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW-------------------------------------------------------QDQ
        YYG  +++SVYN++++  QSSS+N+WIVGGP+N+L V++ GW                                                       Q Q
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW-------------------------------------------------------QDQ

Query:  E----------SGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPG-NGKYKEACYFRNINYIT-KNLETQK-PFYEN
        +          +GNWW+L+ E  + +GYWPKEL   + +GAEQ+ WGGI KPS +GMSP LG+ H P  NG Y E CY RNI  I+     T K P ++N
Subjt:  E----------SGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPG-NGKYKEACYFRNINYIT-KNLETQK-PFYEN

Query:  TIS
        T+S
Subjt:  TIS

A0A6J1CVJ6 uncharacterized protein LOC1110147778.8e-4356.95Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQ------------------DQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWG
        YYG +  VSVYNL+VAQDQSSS+N+WI+GGP  A NVILAGWQ                  D+ +GNWWL + E    IGYWPKELFG++ +G EQV WG
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQ------------------DQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWG

Query:  GIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT
        GI KPS NGMSPPLGN H P   KY +ACYFR +NY+ +N + Q P  ENT
Subjt:  GIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT

A0A6J1CVW9 uncharacterized protein LOC1110147742.7e-4458.22Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNV-----------ILAGWQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSS
        YYGV  + SVYNL+VAQDQSSS+N+WIVGGP   LNV           +   W D+ +G+WWL +++ +  IGYWPKELFG++ +GAEQV WGGI KPS 
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNV-----------ILAGWQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSS

Query:  NGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENTIS
        NGMSPPLGN H P NGKY EACYF++INYI  N     P YEN +S
Subjt:  NGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENTIS

A0A6J1CW60 uncharacterized protein LOC1110147757.2e-3742.08Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------
        YYG    VSVYNL+VAQDQSSS+N+WI+GGP  A NVIL GW                                                          
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------

Query:  ---------QDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT
                 QD+ +G+WWL +++ +  IGYWPKELFG++ +GAEQV WGGI KPS NGMSPPLGN H P  GK+ +ACYFR +NYI +N E++    ENT
Subjt:  ---------QDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQKPFYENT

Query:  IS
         S
Subjt:  IS

W9RN49 Neprosin domain-containing protein1.3e-2544.19Show/hide
Query:  YGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQDQESGNWWLLLTEKR--IAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNS
        YG    VS+ +L +A DQ SSA+VWI  GPS +LN I AGW D+++GNWW  ++++   I +GYWP++L          V WGGI KP   G+SP +GN 
Subjt:  YGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQDQESGNWWLLLTEKR--IAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNS

Query:  HFPGNGKYKEACYFRNINYITKNLETQKP
        HFP +G Y+ ACYF +++Y+    +   P
Subjt:  HFPGNGKYKEACYFRNINYITKNLETQKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27320.1 Protein of Unknown Function (DUF239)2.9e-1442.22Show/hide
Query:  WQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACY----FRNINYITKNLETQK
        +QD  SGNW L + ++   +GYWPKELF ++ +GA  V +GG   PS +G SPP+GN +FP +  YK + +     +N NY T + E Q+
Subjt:  WQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACY----FRNINYITKNLETQK

AT5G11660.1 Protein of Unknown Function (DUF239)8.5e-1425.84Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------
        Y+G+ A++SV++LN+++DQ+S A++++  G +  +N I  GW                                                          
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------

Query:  -----QDQESGNWWLLLTEKR-----IAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEA
             Q+++ GNWW  +T+ R     + IGYWPKELF  +    + VG  G V+ S +G+SPP+GN   P   + K A
Subjt:  -----QDQESGNWWLLLTEKR-----IAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEA

AT5G25410.1 Protein of Unknown Function (DUF239)1.8e-1930Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------
        Y+GV+A  S++ LN+ +DQ+S A +++  G ++ +N I AGW                                                          
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGW----------------------------------------------------------

Query:  --QDQESGNWW---LLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQK
          QD+++GNWW   L+     I +GYWPKELF  +  GA  VG GG V+ S  G SPP+GN  FP  G  KE+  F NI  +  N E ++
Subjt:  --QDQESGNWW---LLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQK

AT5G25415.1 Protein of Unknown Function (DUF239)3.4e-1544.32Show/hide
Query:  QDQESGNWWL---LLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQK
        QD+ +GNWWL   L       IGYWPKELF  +  GA  VG GG V+ S +G SPP+GN +FP  G+  ++  F NI  +  N   +K
Subjt:  QDQESGNWWL---LLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQK

AT5G60380.1 Protein of Unknown Function (DUF239)2.4e-1626.98Show/hide
Query:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAG-----------------------------------------------------------
        Y+G+ A  + YNLN+ +DQ+S + +++  G    +N I  G                                                           
Subjt:  YYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAG-----------------------------------------------------------

Query:  W---QDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQK
        W   QD+++ NWW++       IGYWPKELF  +  GA  VG GG+V+ S +G+SPP+GN  FP  G  + A  F N++ +    E  K
Subjt:  W---QDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYKEACYFRNINYITKNLETQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATATTATGGAGTTAATGCACATGTATCTGTATACAATTTGAATGTGGCTCAAGACCAATCTTCTTCTGCAAATGTATGGATAGTTGGTGGCCCTTCTAACGCTCT
TAATGTAATACTTGCAGGCTGGCAGGATCAAGAATCAGGAAATTGGTGGCTTTTGTTGACAGAAAAGCGAATAGCCATTGGATATTGGCCAAAGGAATTGTTTGGATATG
TAAAGGAAGGAGCAGAGCAAGTAGGATGGGGAGGCATTGTAAAGCCTTCATCAAATGGAATGAGTCCTCCCTTGGGAAATTCTCACTTTCCAGGAAATGGTAAATACAAG
GAGGCTTGTTACTTTAGAAATATTAATTACATTACAAAGAACCTTGAAACCCAAAAACCTTTTTACGAGAATACAATTTCGAGTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACATATTATGGAGTTAATGCACATGTATCTGTATACAATTTGAATGTGGCTCAAGACCAATCTTCTTCTGCAAATGTATGGATAGTTGGTGGCCCTTCTAACGCTCT
TAATGTAATACTTGCAGGCTGGCAGGATCAAGAATCAGGAAATTGGTGGCTTTTGTTGACAGAAAAGCGAATAGCCATTGGATATTGGCCAAAGGAATTGTTTGGATATG
TAAAGGAAGGAGCAGAGCAAGTAGGATGGGGAGGCATTGTAAAGCCTTCATCAAATGGAATGAGTCCTCCCTTGGGAAATTCTCACTTTCCAGGAAATGGTAAATACAAG
GAGGCTTGTTACTTTAGAAATATTAATTACATTACAAAGAACCTTGAAACCCAAAAACCTTTTTACGAGAATACAATTTCGAGTATGTGA
Protein sequenceShow/hide protein sequence
MTYYGVNAHVSVYNLNVAQDQSSSANVWIVGGPSNALNVILAGWQDQESGNWWLLLTEKRIAIGYWPKELFGYVKEGAEQVGWGGIVKPSSNGMSPPLGNSHFPGNGKYK
EACYFRNINYITKNLETQKPFYENTISSM