CuGenDBv2

MS001317 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001317
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: AT-rich interactive domain-containing protein
Genome location: scaffold36:3236054..3237465
RNA-Seq Expression: MS001317
Synteny: MS001317
Gene Ontology terms: NA
InterPro domains: IPR025124 - Domain of unknown function DUF4050


Homology
GenBank top hits | e-value | % identity
KAA0064925.1 uncharacterized protein E6C27_scaffold82G002430 [Cucumis melo var. makuwa] | 2.9e-67 | 88.19
Query:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR
        GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+
Subjt:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR

Query:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

KAG6585468.1 hypothetical protein SDJN03_18201, partial [Cucurbita argyrosperma subsp. sororia] | 6.5e-67 | 86.99
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQ
        C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SESQK+
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQ

Query:  VREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  VREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_008445211.1 PREDICTED: uncharacterized protein LOC103488310 isoform X1 [Cucumis melo] | 2.9e-67 | 88.19
Query:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR
        GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+
Subjt:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR

Query:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_022131290.1 uncharacterized protein LOC111004556 [Momordica charantia] | 2.4e-69 | 100
Query:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
        MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
Subjt:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE

Query:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
Subjt:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_022951409.1 uncharacterized protein LOC111454240 isoform X1 [Cucurbita moschata] | 6.5e-67 | 86.99
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQ
        C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SESQK+
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQ

Query:  VREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  VREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

TrEMBL top hits | e-value | % identity
A0A0A0LPL3 Uncharacterized protein | 7.0e-67 | 86.81
Query:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR
        GCCTA ALGNAMDGPSKGLRV+++EAKKQCLPENFPSSST EMDNSTVWSQRS+AS ++HDSHSNIGSS DFVNSGLLLWNETRKQW GNK+S SQKQV+
Subjt:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR

Query:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        EPKISWNATY++LL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A1S3BC47 uncharacterized protein LOC103488310 isoform X1 | 1.4e-67 | 88.19
Query:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR
        GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+
Subjt:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR

Query:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein | 1.4e-67 | 88.19
Query:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR
        GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+
Subjt:  GCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVR

Query:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  EPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1BP98 uncharacterized protein LOC111004556 | 1.2e-69 | 100
Query:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
        MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
Subjt:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE

Query:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
Subjt:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X1 | 3.2e-67 | 86.99
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQ
        C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SESQK+
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQ

Query:  VREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  VREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

SwissProt top hits | e-value | % identity
No hits found
Arabidopsis top hits | e-value | % identity
AT1G15350.1 unknown protein | 5.9e-26 | 45.22
Query:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ ++ DS S   N  +  ++VN GLLLWN+TR++W G
Subjt:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG

Query:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATY+SLLG+NK FP+ +PL EM++FLVD+WEQEGLYD
Subjt:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein | 5.9e-26 | 45.22
Query:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ ++ DS S   N  +  ++VN GLLLWN+TR++W G
Subjt:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG

Query:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATY+SLLG+NK FP+ +PL EM++FLVD+WEQEGLYD
Subjt:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein | 9.8e-29 | 48.65
Query:  CSGCCTAP-ALGNAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTYEMD-NSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        C GCC     L   +D PSKGL+++ +  KK     ++F S+ST +MD N T+ SQ   +S    D   +  +S +FVN GL+LWN TR+QW    L+  
Subjt:  CSGCCTAP-ALGNAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTYEMD-NSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLY
        Q  V EP ISWN+TY+SLL TNK FP+ +PL EM+ FLVDVWE+EGLY
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein | 1.5e-40 | 56.38
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+ST EMDNST+ SQRS++S    ++ S   S+    +FVN GL LWN+TR+QW  N  S+ 
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLLG NK F   +PL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein | 1.5e-40 | 56.38
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+ST EMDNST+ SQRS++S    ++ S   S+    +FVN GL LWN+TR+QW  N  S+ 
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLLG NK F   +PL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
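
Note on reading the alignments: in BLAST-style pairwise output, the unlabeled middle row marks each aligned position. A letter means the residues are identical, "+" means a conservative substitution, and a space means a mismatch or gap. The % identity column can be checked against that row. The following is a minimal Python sketch, not part of the database record. The function name `midline_stats` is hypothetical, and the middle rows are assumed to be copied without their leading indent.

```python
def midline_stats(midline_blocks):
    """Count identities and positives across the midline rows of one alignment.

    Letters are identical residue pairs, '+' marks a conservative
    substitution, and a space marks a mismatch or a gap.
    """
    midline = "".join(midline_blocks)
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100.0 * identical / length, 2),
    }


# Midline rows of the KAA0064925.1 hit, copied without the leading indent.
blocks = [
    "GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+",
    "EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD",
]
stats = midline_stats(blocks)
print(stats["identical"], stats["length"], stats["pct_identity"])  # 127 144 88.19
```

The result, 127 identical positions out of 144, agrees with the 88.19 reported for that hit. A middle row that starts or ends with a mismatch must keep its spaces, otherwise the alignment length is undercounted.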


Sequences
CDS sequence
TGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGGATGGACCATCTAAAGGTCTGAGAGTTGAAGACCAAGAAGCGAAAAAACAATGCTTACCGGAAAATTT
CCCAAGTTCTAGCACATATGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATTGCATCAGCCAAGTCTCATGATTCCCACAGCAATATTGGGAGCAGTAGAGACT
TTGTAAATTCTGGTCTGCTTCTTTGGAACGAGACCAGGAAGCAATGGGCTGGAAATAAATTGTCCGAGAGCCAAAAGCAAGTTCGAGAACCGAAAATAAGTTGGAATGCT
ACTTATGAGAGCTTGTTAGGGACGAACAAGCCGTTCCCCGAGGCCGTGCCTCTTGCTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTGTATGAC
mRNA sequence
TGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGGATGGACCATCTAAAGGTCTGAGAGTTGAAGACCAAGAAGCGAAAAAACAATGCTTACCGGAAAATTT
CCCAAGTTCTAGCACATATGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATTGCATCAGCCAAGTCTCATGATTCCCACAGCAATATTGGGAGCAGTAGAGACT
TTGTAAATTCTGGTCTGCTTCTTTGGAACGAGACCAGGAAGCAATGGGCTGGAAATAAATTGTCCGAGAGCCAAAAGCAAGTTCGAGAACCGAAAATAAGTTGGAATGCT
ACTTATGAGAGCTTGTTAGGGACGAACAAGCCGTTCCCCGAGGCCGTGCCTCTTGCTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTGTATGAC
Protein sequence
CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNA
TYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
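
Note on the sequences: the protein sequence corresponds to the CDS read codon by codon under the standard genetic code. The following is a minimal Python sketch of that translation, not part of the database record. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 60 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 60 nucleotides of the CDS sequence listed above.
cds_start = "TGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGGATGGACCATCTAAAGGT"
print(translate(cds_start))  # CSGCCTAPALGNAMDGPSKG
```

The output matches the first 20 residues of the protein sequence. The listed CDS begins with TGT (Cys) rather than an ATG start codon, which is why the protein starts with C.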