; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014673 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014673
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold1315:124507..124884
RNA-Seq ExpressionMS014673
SyntenyMS014673
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053047.1 uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa]1.4e-2551.26Show/hide
Query:  WWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPS-PNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAA
        WW+ VG+ Q  +GYWP ELF +L  GAEQVAWGG A+PS  +  +PPLG+GHKPNG+ +EAC+ ++I YI  N     P  +N ++YV +S CY L +  
Subjt:  WWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPS-PNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAA

Query:  GTCAVDNMYFCFTFGGPGG
          C  D   +CFTFGGPGG
Subjt:  GTCAVDNMYFCFTFGGPGG

KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]1.4e-2552Show/hide
Query:  GNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKP--NGKYNEACYFKSINYIDG--NNNGVDPAYENRVSYVGNSDCY
        GNWW L   N    +GYWPKEL  +L +GA+Q+AWGGIA+PS +G++P LG+GHKP  NG YNE CY ++I  I G   N  V P ++N +SY  N+ CY
Subjt:  GNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKP--NGKYNEACYFKSINYIDG--NNNGVDPAYENRVSYVGNSDCY

Query:  GLYAAAGTCAVDNMYFCFTFGGPGG
         L      C  D M +CFTFGGPGG
Subjt:  GLYAAAGTCAVDNMYFCFTFGGPGG

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]6.0e-6188.98Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKP-NGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC
        DRSTG+ WWLAV +SQTTIGYWPKELFGHLN+GAEQVAWGGIAKPSPNGM+PPLGNGHKP NGKYNEACYFKSINYIDGNNNGVDPAYEN VS+V NSDC
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKP-NGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC

Query:  YGLYAAAGTCAVDNMYFCFTFGGPGGN
        YGL+  AGTCA DNMYFCFTFGGPGGN
Subjt:  YGLYAAAGTCAVDNMYFCFTFGGPGGN

XP_022145287.1 uncharacterized protein LOC111014775 [Momordica charantia]1.8e-3373.68Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYV
        DR TG+ WWLAV +SQTTIGYWPKELFGHLN+GAEQVAWGGIAKPSPNGM+PPLGNGHKPN GK+++ACYF+++NYI+ NN     A EN  SY+
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYV

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]7.4e-4367.72Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC
        DR TGN WWLAVG S  TIGYWPKELFGHLN+G EQVAWGGIAKPSPNGM+PPLGNGHKPN  KY++ACYF+ +NY+D NN G  PA EN  +Y+ N+ C
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC

Query:  YGLYAAAGTCAVDNMYFCFTFGGPGGN
        Y L     TC  +  Y+C TFGGPGGN
Subjt:  YGLYAAAGTCAVDNMYFCFTFGGPGGN

TrEMBL top hitse value%identityAlignment
A0A5A7UEV4 Uncharacterized protein6.8e-2651.26Show/hide
Query:  WWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPS-PNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAA
        WW+ VG+ Q  +GYWP ELF +L  GAEQVAWGG A+PS  +  +PPLG+GHKPNG+ +EAC+ ++I YI  N     P  +N ++YV +S CY L +  
Subjt:  WWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPS-PNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAA

Query:  GTCAVDNMYFCFTFGGPGG
          C  D   +CFTFGGPGG
Subjt:  GTCAVDNMYFCFTFGGPGG

A0A5D3CJM0 Neprosin 26.8e-2651.26Show/hide
Query:  WWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPS-PNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAA
        WW+ VG+ Q  +GYWP ELF +L  GAEQVAWGG A+PS  +  +PPLG+GHKPNG+ +EAC+ ++I YI  N     P  +N ++YV +S CY L +  
Subjt:  WWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPS-PNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAA

Query:  GTCAVDNMYFCFTFGGPGG
          C  D   +CFTFGGPGG
Subjt:  GTCAVDNMYFCFTFGGPGG

A0A6J1CVJ6 uncharacterized protein LOC1110147773.6e-4367.72Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC
        DR TGN WWLAVG S  TIGYWPKELFGHLN+G EQVAWGGIAKPSPNGM+PPLGNGHKPN  KY++ACYF+ +NY+D NN G  PA EN  +Y+ N+ C
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC

Query:  YGLYAAAGTCAVDNMYFCFTFGGPGGN
        Y L     TC  +  Y+C TFGGPGGN
Subjt:  YGLYAAAGTCAVDNMYFCFTFGGPGGN

A0A6J1CVW9 uncharacterized protein LOC1110147742.9e-6188.98Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKP-NGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC
        DRSTG+ WWLAV +SQTTIGYWPKELFGHLN+GAEQVAWGGIAKPSPNGM+PPLGNGHKP NGKYNEACYFKSINYIDGNNNGVDPAYEN VS+V NSDC
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKP-NGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDC

Query:  YGLYAAAGTCAVDNMYFCFTFGGPGGN
        YGL+  AGTCA DNMYFCFTFGGPGGN
Subjt:  YGLYAAAGTCAVDNMYFCFTFGGPGGN

A0A6J1CW60 uncharacterized protein LOC1110147758.8e-3473.68Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYV
        DR TG+ WWLAV +SQTTIGYWPKELFGHLN+GAEQVAWGGIAKPSPNGM+PPLGNGHKPN GK+++ACYF+++NYI+ NN     A EN  SY+
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPN-GKYNEACYFKSINYIDGNNNGVDPAYENRVSYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G35250.1 Protein of Unknown Function (DUF239)1.1e-1538.1Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCY
        D  +GNW   A+      IGYWPKELF HLNNGA  V +GG    SP+G++PP+GNG  P   + +  +F ++  I+ +   V         YV   D Y
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCY

Query:  GLYAAAGTCAVDNMYFCFTFGGPGGN
          + A       +    F+FGGPGGN
Subjt:  GLYAAAGTCAVDNMYFCFTFGGPGGN

AT2G38255.1 Protein of Unknown Function (DUF239)7.7e-1435.38Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCY
        D  +GN W L        +GYWPK+LF HLN GA  V +GG    SP+G++PP+GNGH P   Y ++ ++  +   + N   VD        Y  +  CY
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCY

Query:  -----GLYAAAGTCAVDNMYFCFTFGGPGG
             G + + G          F+FGGPGG
Subjt:  -----GLYAAAGTCAVDNMYFCFTFGGPGG

AT5G11660.1 Protein of Unknown Function (DUF239)1.3e-1332.82Show/hide
Query:  DRSTGNWWWLAV--GNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRV---SYVG
        ++  GNWW   V        IGYWPKELF  + N  + V   G  + SP+G++PP+GNG  P+   N++ + K +  +       D  Y  ++     + 
Subjt:  DRSTGNWWWLAV--GNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRV---SYVG

Query:  NSDCYGLYAAAGTCAVDNMYFCFTFGGPGGN
        ++ CYGL    G          FT+GGPGGN
Subjt:  NSDCYGLYAAAGTCAVDNMYFCFTFGGPGGN

AT5G25415.1 Protein of Unknown Function (DUF239)1.7e-1637.98Show/hide
Query:  DRSTGNWWWLAV--GNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSD
        D+ TGNWW   +        IGYWPKELF  +NNGA  V  GG  + S +G +PP+GNG+ P G   ++  F +I  +D N N            V +  
Subjt:  DRSTGNWWWLAV--GNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSD

Query:  CYGL-YAAAGTCAVDNMYFCFTFGGPGGN
        CYGL            + F F +GGPGGN
Subjt:  CYGL-YAAAGTCAVDNMYFCFTFGGPGGN

AT5G60380.1 Protein of Unknown Function (DUF239)2.2e-1335.43Show/hide
Query:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGN-NNGVDPAYENRVSYVGNSDC
        D  T NWW + + +  T IGYWPKELF  ++NGA  V  GG+ + SP+G++PP+GNG  P      +  F +++ +      G   A+   V  + +S C
Subjt:  DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGN-NNGVDPAYENRVSYVGNSDC

Query:  YGLYAAAGT-CAVDNMYFCFTFGGPGG
        YGL            + + F +GGPGG
Subjt:  YGLYAAAGT-CAVDNMYFCFTFGGPGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATCGATCAACAGGAAATTGGTGGTGGCTTGCCGTAGGCAATAGCCAAACAACAATAGGATATTGGCCAAAGGAGCTGTTTGGACATCTGAATAATGGAGCAGAGCAAGT
GGCATGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATGAACCCTCCATTGGGGAATGGCCACAAGCCAAATGGTAAATACAACGAAGCTTGCTACTTCAAATCCATAA
ACTACATAGATGGTAACAACAATGGCGTAGATCCTGCTTATGAGAACAGAGTGAGTTATGTAGGTAACTCTGATTGTTATGGTTTGTATGCTGCCGCCGGTACATGTGCG
GTTGATAACATGTATTTTTGCTTCACTTTTGGAGGACCCGGTGGAAAC
mRNA sequenceShow/hide mRNA sequence
GATCGATCAACAGGAAATTGGTGGTGGCTTGCCGTAGGCAATAGCCAAACAACAATAGGATATTGGCCAAAGGAGCTGTTTGGACATCTGAATAATGGAGCAGAGCAAGT
GGCATGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATGAACCCTCCATTGGGGAATGGCCACAAGCCAAATGGTAAATACAACGAAGCTTGCTACTTCAAATCCATAA
ACTACATAGATGGTAACAACAATGGCGTAGATCCTGCTTATGAGAACAGAGTGAGTTATGTAGGTAACTCTGATTGTTATGGTTTGTATGCTGCCGCCGGTACATGTGCG
GTTGATAACATGTATTTTTGCTTCACTTTTGGAGGACCCGGTGGAAAC
Protein sequenceShow/hide protein sequence
DRSTGNWWWLAVGNSQTTIGYWPKELFGHLNNGAEQVAWGGIAKPSPNGMNPPLGNGHKPNGKYNEACYFKSINYIDGNNNGVDPAYENRVSYVGNSDCYGLYAAAGTCA
VDNMYFCFTFGGPGGN