CuGenDBv2

Moc08g04530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g04530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: AT-rich interactive domain-containing protein
Genome location: chr8:3272717..3276215
RNA-Seq Expression: Moc08g04530
Synteny: Moc08g04530
Gene Ontology terms: NA
InterPro domains: IPR025124 - Domain of unknown function DUF4050
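
The genome location field packs a sequence name and two coordinates into one string. A minimal Python sketch of splitting it and computing the span follows. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chr:start..end' string into (sequence name, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr8:3272717..3276215")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 3499 bp on chr8.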


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585468.1 hypothetical protein SDJN03_18201, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.8e-68; % identity: 85.53)
Query:  TNGRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKL
        +N RR C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK 
Subjt:  TNGRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKL

Query:  SESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        SESQK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  SESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

KAG7020386.1 hypothetical protein SDJN02_17070 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.1e-67; % identity: 86.58)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        RR C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SES
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_008445211.1 PREDICTED: uncharacterized protein LOC103488310 isoform X1 [Cucumis melo] (e value: 5.4e-67; % identity: 85.91)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        +R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+S
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_022131290.1 uncharacterized protein LOC111004556 [Momordica charantia] (e value: 3.4e-69; % identity: 100)
Query:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
        MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
Subjt:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE

Query:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
Subjt:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_022951409.1 uncharacterized protein LOC111454240 isoform X1 [Cucurbita moschata] (e value: 1.1e-67; % identity: 86.58)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        RR C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SES
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BC47 uncharacterized protein LOC103488310 isoform X1 (e value: 2.6e-67; % identity: 85.91)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        +R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+S
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein (e value: 2.6e-67; % identity: 85.91)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        +R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTVWSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+S
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1BP98 uncharacterized protein LOC111004556 (e value: 1.6e-69; % identity: 100)
Query:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
        MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE
Subjt:  MDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYE

Query:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
Subjt:  SLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X1 (e value: 5.2e-68; % identity: 86.58)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        RR C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SES
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1KQM2 uncharacterized protein LOC111496323 isoform X1 (e value: 3.4e-67; % identity: 85.23)
Query:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        RR C G CT PALG+AMDGPS GLRV+DQEAKKQCLP+NF SSST EMDNSTVWSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SES
Subjt:  RRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        QK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15350.1 unknown protein (e value: 8.4e-26; % identity: 45.22)
Query:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ ++ DS S   N  +  ++VN GLLLWN+TR++W G
Subjt:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG

Query:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATY+SLLG+NK FP+ +PL EM++FLVD+WEQEGLYD
Subjt:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein (e value: 8.4e-26; % identity: 45.22)
Query:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ ++ DS S   N  +  ++VN GLLLWN+TR++W G
Subjt:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWAG

Query:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATY+SLLG+NK FP+ +PL EM++FLVD+WEQEGLYD
Subjt:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein (e value: 1.4e-28; % identity: 48.65)
Query:  CSGCCTAP-ALGNAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTYEMD-NSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES
        C GCC     L   +D PSKGL+++ +  KK     ++F S+ST +MD N T+ SQ   +S    D   +  +S +FVN GL+LWN TR+QW    L+  
Subjt:  CSGCCTAP-ALGNAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTYEMD-NSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLY
        Q  V EP ISWN+TY+SLL TNK FP+ +PL EM+ FLVDVWE+EGLY
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein (e value: 2.0e-40; % identity: 56.38)
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+ST EMDNST+ SQRS++S    ++ S   S+    +FVN GL LWN+TR+QW  N  S+ 
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLLG NK F   +PL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein (e value: 2.0e-40; % identity: 56.38)
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+ST EMDNST+ SQRS++S    ++ S   S+    +FVN GL LWN+TR+QW  N  S+ 
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWAGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLLG NK F   +PL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
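
In the alignments above, the unlabeled middle line carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space otherwise. The Python sketch below shows one way a percent identity could be derived from that line. It assumes identity is defined as identical columns divided by the number of aligned columns, and the function name `percent_identity` is hypothetical. The database's own calculation is not documented on this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: identity = identical columns / aligned columns. The column
    count is taken from the query line, because trailing spaces in the
    midline may have been trimmed when the page was exported.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Second segment of the first GenBank hit (a 52-column stretch only, so the
# result is a per-segment figure, not the whole-alignment % identity).
query = "SESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD"
midline = "SESQK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD"
print(percent_identity(query, midline))
```

To reproduce a whole-hit figure, the query and midline segments of that hit would first have to be concatenated in order.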


Sequences
CDS sequence
ATGGAAGATGAATGGAGTATGGACTCTGTTCTCCACTTGTACGGAGCTTCGGGCCTCTACCCACCAGGATTTAGTGCTTTTCTTCGGACAATTTTGCTTGGAAACCGCGT
AAATGCATTATCGAGGGATTATTTGGATTGCGCCCTGACGGATTATGGCACGAATGGGAGAAGATGCTGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGG
ATGGACCATCTAAAGGTCTGAGAGTTGAAGACCAAGAAGCGAAAAAACAATGCTTACCGGAAAATTTCCCAAGTTCTAGCACATATGAAATGGACAACAGTACAGTTTGG
TCCCAGAGAAGCATTGCATCAGCCAAGTCTCATGATTCCCACAGCAATATTGGGAGCAGTAGAGACTTTGTAAATTCTGGTCTGCTTCTTTGGAACGAGACCAGGAAGCA
ATGGGCTGGAAATAAATTGTCCGAGAGCCAAAAGCAAGTTCGAGAACCGAAAATAAGTTGGAATGCTACTTATGAGAGCTTGTTAGGAACGAACAAGCCGTTCCCCGAGG
CCGTGCCTCTTGCTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTGTATGACTGA
mRNA sequence
ATGGAAGATGAATGGAGTATGGACTCTGTTCTCCACTTGTACGGAGCTTCGGGCCTCTACCCACCAGGATTTAGTGCTTTTCTTCGGACAATTTTGCTTGGAAACCGCGT
AAATGCATTATCGAGGGATTATTTGGATTGCGCCCTGACGGATTATGGCACGAATGGGAGAAGATGCTGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGG
ATGGACCATCTAAAGGTCTGAGAGTTGAAGACCAAGAAGCGAAAAAACAATGCTTACCGGAAAATTTCCCAAGTTCTAGCACATATGAAATGGACAACAGTACAGTTTGG
TCCCAGAGAAGCATTGCATCAGCCAAGTCTCATGATTCCCACAGCAATATTGGGAGCAGTAGAGACTTTGTAAATTCTGGTCTGCTTCTTTGGAACGAGACCAGGAAGCA
ATGGGCTGGAAATAAATTGTCCGAGAGCCAAAAGCAAGTTCGAGAACCGAAAATAAGTTGGAATGCTACTTATGAGAGCTTGTTAGGAACGAACAAGCCGTTCCCCGAGG
CCGTGCCTCTTGCTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTGTATGACTGA
Protein sequence
MEDEWSMDSVLHLYGASGLYPPGFSAFLRTILLGNRVNALSRDYLDCALTDYGTNGRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVW
SQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWAGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
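
The protein sequence corresponds to a codon-by-codon translation of the CDS. The Python sketch below illustrates that relationship using the standard genetic code. It is an illustration only, not the pipeline that produced this record, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight and last four codons of the CDS listed above.
print(translate("ATGGAAGATGAATGGAGTATGGAC"))  # MEDEWSMD
print(translate("CTGTATGACTGA"))              # LYD
```

The two fragments match the start (MEDEWSMD) and end (LYD) of the protein sequence shown above. Passing the full CDS, with its line breaks, goes through the same function.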