; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014671 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014671
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold1315:109128..109746
RNA-Seq ExpressionMS014671
SyntenyMS014671
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]7.5e-3955.03Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
        G  TGCYNM C+GFVQ NPS     PL P+STY+G+QYDY FT+ Q    G+WW+ V ++   +GYWPKEL  +L DGA+Q+AWGGIA+PS +G+SP LG
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG

Query:  NGHKPN-NGKYNEACYFKSINYIDG--NNKGVDPAYENIVSHVSNSDCY
        +GHKPN NG YNE CY ++I  I G   N  V P ++N +S+ SN+ CY
Subjt:  NGHKPN-NGKYNEACYFKSINYIDG--NNKGVDPAYENIVSHVSNSDCY

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]7.5e-5572.08Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNI--------PLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSP
        G  T  YN+     V  + S+  NI         L  + T  G      F  + DRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSP
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNI--------PLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSP

Query:  NGMSPPLGNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        NGMSPPLGNGHKPNNGKYNEACYFKSINYIDGNN GVDPAYENIVSHVSNSDCY
Subjt:  NGMSPPLGNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

XP_022145287.1 uncharacterized protein LOC111014775 [Momordica charantia]8.8e-6480.71Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
        G  TG YNM CR F+QTNPSTPPNIPLYPSSTYQGKQYDY+FTVFQDR TGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG

Query:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHV
        NGHKPN GK+++ACYF+++NYI+ NN+    A EN  S++
Subjt:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHV

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]1.2e-3961.07Show/hide
Query:  PSTPPNIPL---YPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNNGKYNEACY
        P   PN+ L     +    G      F  + DR TG+WWLAV +S  TIGYWPKELFGHLNDG EQVAWGGIAKPSPNGMSPPLGNGHKPN  KY++ACY
Subjt:  PSTPPNIPL---YPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNNGKYNEACY

Query:  FKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        F+ +NY+D NNKG  PA EN  +++SN+ CY
Subjt:  FKSINYIDGNNKGVDPAYENIVSHVSNSDCY

XP_031738648.1 uncharacterized protein LOC105435061 [Cucumis sativus]7.5e-3955.03Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
        G  TGCYNM C+GFVQ NPS     PL P+STY+G+QYDY FT+ Q    G+WW+ V ++   +GYWPKEL  +L DGA+Q+AWGGIA+PS +G+SP LG
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG

Query:  NGHKPN-NGKYNEACYFKSINYIDG--NNKGVDPAYENIVSHVSNSDCY
        +GHKPN NG YNE CY ++I  I G   N  V P ++N +S+ SN+ CY
Subjt:  NGHKPN-NGKYNEACYFKSINYIDG--NNKGVDPAYENIVSHVSNSDCY

TrEMBL top hitse value%identityAlignment
A0A5A7UEV4 Uncharacterized protein1.3e-3653.74Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPS-PNGMSPPL
        GA TGCYNMLC+GFV  NP  P    + P+S YQGKQYDY F++ Q  + G WW+ V D Q  +GYWP ELF +L  GAEQVAWGG A+PS  +  SPPL
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPS-PNGMSPPL

Query:  GNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        G+GHKP NG+ +EAC+ ++I YI  N     P  +N +++VS+S CY
Subjt:  GNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

A0A5D3CJM0 Neprosin 21.3e-3653.74Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPS-PNGMSPPL
        GA TGCYNMLC+GFV  NP  P    + P+S YQGKQYDY F++ Q  + G WW+ V D Q  +GYWP ELF +L  GAEQVAWGG A+PS  +  SPPL
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPS-PNGMSPPL

Query:  GNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        G+GHKP NG+ +EAC+ ++I YI  N     P  +N +++VS+S CY
Subjt:  GNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

A0A6J1CVJ6 uncharacterized protein LOC1110147775.6e-4061.07Show/hide
Query:  PSTPPNIPL---YPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNNGKYNEACY
        P   PN+ L     +    G      F  + DR TG+WWLAV +S  TIGYWPKELFGHLNDG EQVAWGGIAKPSPNGMSPPLGNGHKPN  KY++ACY
Subjt:  PSTPPNIPL---YPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNNGKYNEACY

Query:  FKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        F+ +NY+D NNKG  PA EN  +++SN+ CY
Subjt:  FKSINYIDGNNKGVDPAYENIVSHVSNSDCY

A0A6J1CVW9 uncharacterized protein LOC1110147743.6e-5572.08Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNI--------PLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSP
        G  T  YN+     V  + S+  NI         L  + T  G      F  + DRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSP
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNI--------PLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSP

Query:  NGMSPPLGNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        NGMSPPLGNGHKPNNGKYNEACYFKSINYIDGNN GVDPAYENIVSHVSNSDCY
Subjt:  NGMSPPLGNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

A0A6J1CW60 uncharacterized protein LOC1110147754.3e-6480.71Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
        G  TG YNM CR F+QTNPSTPPNIPLYPSSTYQGKQYDY+FTVFQDR TGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG

Query:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHV
        NGHKPN GK+++ACYF+++NYI+ NN+    A EN  S++
Subjt:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)1.2e-2138.36Show/hide
Query:  TGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGG-IAKPSPNG--MSPPLG
        TGCYN+LC GFVQTN        + PSS+Y+G Q+D +  +++D   G+WWL    S   +GYWP  LF HL + A  V +GG I   SP G   S  +G
Subjt:  TGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGG-IAKPSPNG--MSPPLG

Query:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        +GH    G + ++ YF++I  +D +N  V     N+     + +CY
Subjt:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

AT2G44220.1 Protein of Unknown Function (DUF239)4.0e-2236.73Show/hide
Query:  QTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNG---MSPPL
        +TGCYN+LC GFVQT+        +   S Y+G QYD S  +++D+ TG+WWL V++ +  IGYWP  LF  L   A +V WGG    S  G    +  +
Subjt:  QTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNG---MSPPL

Query:  GNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        G+GH  + G + +A YF+++  +DG N   +P  + +       +CY
Subjt:  GNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

AT2G44240.1 Protein of Unknown Function (DUF239)3.1e-2237.67Show/hide
Query:  QTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGG--IAKPSPNGMSPPLG
        +TGCYN++C GFVQT            +S Y G Q   +  +++D  TG+WWL ++D+   IGYWP  LF  L DGA +V WGG   A  S    +  +G
Subjt:  QTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGG--IAKPSPNGMSPPLG

Query:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        +GH    G   +A Y K+I  +DG N   +P  + + S+  N +CY
Subjt:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

AT4G23360.1 unknown protein1.6e-2334.46Show/hide
Query:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELF--GHLNDGAEQVAWGGIAKPSPNGMSPP
        G+ TGC +M C GFVQ + + P    + P+S Y+G QY+   T++QD   GDWW A++D    +GYWP  LF     ++ A   +WGG         SPP
Subjt:  GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELF--GHLNDGAEQVAWGGIAKPSPNGMSPP

Query:  LGNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        +G+GH P+ G + ++ Y   +  I G+ +  +P    +  + +N +CY
Subjt:  LGNGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY

AT5G18460.1 Protein of Unknown Function (DUF239)1.0e-2538.36Show/hide
Query:  TGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGG---IAKPSPNGMSPPLG
        TGCYN+LC GF+QTN        + P ST++G Q+D +  +++D   G+WW+ + DS T +GYWP ELF HL D A  V WGG     + S    +  +G
Subjt:  TGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGG---IAKPSPNGMSPPLG

Query:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY
        +GH P+ G + +A YF+++  +D +N  V P ++ +     N++CY
Subjt:  NGHKPNNGKYNEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTGCTCAAACGGGATGCTATAATATGCTTTGTCGAGGTTTTGTACAAACAAATCCAAGTACTCCTCCTAATATCCCTCTTTACCCTTCGTCTACATATCAAGGGAAACA
ATATGACTATTCATTTACGGTTTTTCAGGATCGATCAACGGGGGATTGGTGGCTTGCAGTGAGTGATAGCCAAACAACAATAGGGTATTGGCCAAAGGAGCTGTTTGGAC
ATCTGAATGATGGGGCAGAACAAGTGGCATGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATGAGCCCTCCATTGGGGAATGGCCACAAGCCAAATAATGGTAAATAC
AATGAAGCTTGCTACTTCAAATCCATAAACTACATAGATGGCAATAACAAAGGCGTAGATCCTGCTTATGAGAACATAGTGAGTCATGTAAGTAATTCTGATTGTTAT
mRNA sequenceShow/hide mRNA sequence
GGTGCTCAAACGGGATGCTATAATATGCTTTGTCGAGGTTTTGTACAAACAAATCCAAGTACTCCTCCTAATATCCCTCTTTACCCTTCGTCTACATATCAAGGGAAACA
ATATGACTATTCATTTACGGTTTTTCAGGATCGATCAACGGGGGATTGGTGGCTTGCAGTGAGTGATAGCCAAACAACAATAGGGTATTGGCCAAAGGAGCTGTTTGGAC
ATCTGAATGATGGGGCAGAACAAGTGGCATGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATGAGCCCTCCATTGGGGAATGGCCACAAGCCAAATAATGGTAAATAC
AATGAAGCTTGCTACTTCAAATCCATAAACTACATAGATGGCAATAACAAAGGCGTAGATCCTGCTTATGAGAACATAGTGAGTCATGTAAGTAATTCTGATTGTTAT
Protein sequenceShow/hide protein sequence
GAQTGCYNMLCRGFVQTNPSTPPNIPLYPSSTYQGKQYDYSFTVFQDRSTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNNGKY
NEACYFKSINYIDGNNKGVDPAYENIVSHVSNSDCY