; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G012080 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G012080
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr07:17597433..17602650
RNA-Seq ExpressionLsi07G012080
SyntenyLsi07G012080
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053047.1 uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa]5.6e-4048.54Show/hide
Query:  KGP-KYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA---------------------------------
        KGP KYYG  + + VYN+SLS  QSSS+NIWIVGG  NSL VLM GW VNP +N D ++R FVYWTA                                 
Subjt:  KGP-KYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA---------------------------------

Query:  ----------------GGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIAS
                         GNWWVLVGEN +G+GYWPKELV +L DGA+++AWGGIAKPS D  SP LG+GHKP  NG++ E CYIRNI  I  A  +    
Subjt:  ----------------GGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIAS

Query:  PNLDNT
        P  DNT
Subjt:  PNLDNT

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]1.6e-3947.12Show/hide
Query:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------
        +K   Y+GA ARI VYN+SLS E QSSSANIW+VGG D SLNVLMA      A++ DSL R FVYWT                                 
Subjt:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------

Query:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN
                        A G+WWV VG++QVG+GYWP EL P+L  GA++VAWGG A+PS   +ESPPLG+GHKPNG   EAC++RNI YIA+++ ++ P 
Subjt:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN

Query:  LDNTQFFV
        LDNT  +V
Subjt:  LDNTQFFV

XP_022145287.1 uncharacterized protein LOC111014775 [Momordica charantia]1.6e-3941.35Show/hide
Query:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAG---------------------------------
        +G KYYG    + VYNLS++ +QSSS+NIWI+GG   + NV++ GWQVNP +N DS +RMFVYWTA                                  
Subjt:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAG---------------------------------

Query:  ------------------GNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPN
                          G+WW+ V ++Q  +GYWPKEL   L+DGA++VAWGGIAKPS +  SPPLGNGHKPN G+H +ACY R ++YI  ++      
Subjt:  ------------------GNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPN

Query:  LDNTQFFV
        ++NT  ++
Subjt:  LDNTQFFV

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]2.6e-4556.77Show/hide
Query:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT--AGGNWWVLVGENQVGVGYWPKELVPSLSDGADR
        KG KYYGA+  + VYNLS++ +QSSS+NIWI+GG   + NV++AGWQVNP +N DSL+RMFVYWT    GNWW+ VGE+   +GYWPKEL   L+DG ++
Subjt:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT--AGGNWWVLVGENQVGVGYWPKELVPSLSDGADR

Query:  VAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT
        VAWGGIAKPS +  SPPLGNGHKPN  ++ +ACY R ++Y+  +++   P  +NT
Subjt:  VAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]8.4e-4448.08Show/hide
Query:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------
        +K   Y+GA ARI V+N+SLS   QSSSANIW++GG+D+SLNVLMAGWQVNPA+N D+L R FVYWT                                 
Subjt:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------

Query:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN
                        A G+WWV VG+NQVG+GYWP EL P+L  GAD+VAWGG A+P+   +ESPPLG+GHKPNG+  EA ++RNI YIA ++ ++ P 
Subjt:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN

Query:  LDNTQFFV
        L+NT  +V
Subjt:  LDNTQFFV

TrEMBL top hitse value%identityAlignment
A0A0A0L400 Neprosin domain-containing protein4.1e-4448.08Show/hide
Query:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------
        +K   Y+GA ARI V+N+SLS   QSSSANIW++GG+D+SLNVLMAGWQVNPA+N D+L R FVYWT                                 
Subjt:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------

Query:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN
                        A G+WWV VG+NQVG+GYWP EL P+L  GAD+VAWGG A+P+   +ESPPLG+GHKPNG+  EA ++RNI YIA ++ ++ P 
Subjt:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN

Query:  LDNTQFFV
        L+NT  +V
Subjt:  LDNTQFFV

A0A5A7UEV4 Uncharacterized protein2.7e-4048.54Show/hide
Query:  KGP-KYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA---------------------------------
        KGP KYYG  + + VYN+SLS  QSSS+NIWIVGG  NSL VLM GW VNP +N D ++R FVYWTA                                 
Subjt:  KGP-KYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA---------------------------------

Query:  ----------------GGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIAS
                         GNWWVLVGEN +G+GYWPKELV +L DGA+++AWGGIAKPS D  SP LG+GHKP  NG++ E CYIRNI  I  A  +    
Subjt:  ----------------GGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIAS

Query:  PNLDNT
        P  DNT
Subjt:  PNLDNT

A0A5D3CJM0 Neprosin 27.9e-4047.12Show/hide
Query:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------
        +K   Y+GA ARI VYN+SLS E QSSSANIW+VGG D SLNVLMA      A++ DSL R FVYWT                                 
Subjt:  MKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT---------------------------------

Query:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN
                        A G+WWV VG++QVG+GYWP EL P+L  GA++VAWGG A+PS   +ESPPLG+GHKPNG   EAC++RNI YIA+++ ++ P 
Subjt:  ----------------AGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPN

Query:  LDNTQFFV
        LDNT  +V
Subjt:  LDNTQFFV

A0A6J1CVJ6 uncharacterized protein LOC1110147771.3e-4556.77Show/hide
Query:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT--AGGNWWVLVGENQVGVGYWPKELVPSLSDGADR
        KG KYYGA+  + VYNLS++ +QSSS+NIWI+GG   + NV++AGWQVNP +N DSL+RMFVYWT    GNWW+ VGE+   +GYWPKEL   L+DG ++
Subjt:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT--AGGNWWVLVGENQVGVGYWPKELVPSLSDGADR

Query:  VAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT
        VAWGGIAKPS +  SPPLGNGHKPN  ++ +ACY R ++Y+  +++   P  +NT
Subjt:  VAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT

A0A6J1CVW9 uncharacterized protein LOC1110147747.9e-4053.9Show/hide
Query:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT--AGGNWWVLVGENQVGVGYWPKELVPSLSDGADR
        +G +YYG      VYNLS++ +QSSS+NIWIVGG   +LN       VNP +N DSL+RMFVYWT  + G+WW+ V ++Q  +GYWPKEL   L+DGA++
Subjt:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWT--AGGNWWVLVGENQVGVGYWPKELVPSLSDGADR

Query:  VAWGGIAKPSKDEESPPLGNGHKP-NGEHKEACYIRNIHYIANDHRIASPNLDN
        VAWGGIAKPS +  SPPLGNGHKP NG++ EACY ++I+YI  ++    P  +N
Subjt:  VAWGGIAKPSKDEESPPLGNGHKP-NGEHKEACYIRNIHYIANDHRIASPNLDN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G35250.1 Protein of Unknown Function (DUF239)1.4e-1729.53Show/hide
Query:  MKGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAGG-----------NWWVLVGEN-QVG-------
        + G  Y GA A I +++L L + Q S + IW+  G    LN + AG  V+P L  DS++R  +YWT  G             +V+V  N ++G       
Subjt:  MKGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAGG-----------NWWVLVGEN-QVG-------

Query:  ------------------------------VGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHR
                                      +GYWPKEL   L++GA  V +GG    S D  SPP+GNG  P  + K+  +  N+  I +D++
Subjt:  ------------------------------VGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHR

AT2G38255.1 Protein of Unknown Function (DUF239)9.3e-1728.27Show/hide
Query:  MKGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA---------------------------------
        + G  Y GA + I ++N+++ + Q S + IW+  G    LN +  GW V+P L  D+L+R+ +YWTA                                 
Subjt:  MKGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA---------------------------------

Query:  -------------------GGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYI
                            GNW +        VGYWPK+L   L+ GA  V +GG    S D  SPP+GNGH P        Y ++ HY+
Subjt:  -------------------GGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYI

AT2G44220.1 Protein of Unknown Function (DUF239)7.1e-1728.57Show/hide
Query:  MKGPKYYGANARIVVYNLSLS-SEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAG-------------------------------
        ++  K+YG    I ++   +   ++ S A  W+V G  +SLN + AGWQV P L DD+  R FVYWT                                 
Subjt:  MKGPKYYGANARIVVYNLSLS-SEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAG-------------------------------

Query:  --------------------GNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKD---EESPPLGNGHKPNGEHKEACYIRNIHYIANDHRI
                            GNWW+ V E  V +GYWP  L  SL   A RV WGG    SK      +  +G+GH  +   K+A Y RN+  +   + +
Subjt:  --------------------GNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKD---EESPPLGNGHKPNGEHKEACYIRNIHYIANDHRI

Query:  ASP
          P
Subjt:  ASP

AT5G05030.1 Protein of Unknown Function (DUF239)3.5e-1626.7Show/hide
Query:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAG---------------------------------
        +GP Y G  A + V++L++S +Q+S ANI+I  G  + +N +  GW +NP+L  D  +  + +W                                    
Subjt:  KGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAG---------------------------------

Query:  --------------GNWW---VLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIAND
                      GNWW   +   E++  +GYWPKEL   +S  A+ V   G  + S   +SPP+GNGH P  +   +  +  + +I ND
Subjt:  --------------GNWW---VLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIAND

AT5G25410.1 Protein of Unknown Function (DUF239)1.2e-1625.53Show/hide
Query:  GPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA-----------------------------------
        GP Y+G +A   ++ L++  +Q+S A +++  G ++ +N + AGW +NP++  D     + +W                                     
Subjt:  GPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTA-----------------------------------

Query:  ---------GGNWWV---LVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIANDH
                  GNWW+   +     + VGYWPKEL   + +GA+ V  GG  + S    SPP+GNG  P G+ KE+    NI  + +++
Subjt:  ---------GGNWWV---LVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIANDH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGTCCGAAATATTATGGAGCTAATGCACGCATTGTTGTGTACAATTTGAGTTTGAGTTCAGAGCAATCTTCTTCTGCTAACATATGGATAGTTGGTGGCACTGA
TAACTCTCTTAATGTTCTTATGGCAGGCTGGCAGGTGAATCCAGCACTAAATGATGATAGTCTATCTAGAATGTTTGTGTATTGGACGGCTGGAGGGAATTGGTGGGTTC
TAGTAGGTGAAAATCAAGTGGGAGTAGGATATTGGCCAAAAGAGTTGGTTCCAAGTTTAAGTGATGGAGCAGATCGAGTTGCATGGGGAGGCATTGCAAAGCCTTCAAAA
GATGAAGAAAGCCCTCCATTGGGAAATGGCCACAAACCAAATGGTGAACACAAGGAAGCTTGTTACATTAGAAACATACATTACATAGCAAATGACCATAGAATTGCATC
ACCCAATTTGGATAACACCCAATTCTTTGTGTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGTCCGAAATATTATGGAGCTAATGCACGCATTGTTGTGTACAATTTGAGTTTGAGTTCAGAGCAATCTTCTTCTGCTAACATATGGATAGTTGGTGGCACTGA
TAACTCTCTTAATGTTCTTATGGCAGGCTGGCAGGTGAATCCAGCACTAAATGATGATAGTCTATCTAGAATGTTTGTGTATTGGACGGCTGGAGGGAATTGGTGGGTTC
TAGTAGGTGAAAATCAAGTGGGAGTAGGATATTGGCCAAAAGAGTTGGTTCCAAGTTTAAGTGATGGAGCAGATCGAGTTGCATGGGGAGGCATTGCAAAGCCTTCAAAA
GATGAAGAAAGCCCTCCATTGGGAAATGGCCACAAACCAAATGGTGAACACAAGGAAGCTTGTTACATTAGAAACATACATTACATAGCAAATGACCATAGAATTGCATC
ACCCAATTTGGATAACACCCAATTCTTTGTGTTATGA
Protein sequenceShow/hide protein sequence
MKGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDDSLSRMFVYWTAGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSK
DEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFVL