; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g24810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g24810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr4:17981283..17983168
RNA-Seq ExpressionMoc04g24810
SyntenyMoc04g24810
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067953.1 RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein [Cucumis melo var. makuwa]1.7e-2240.91Show/hide
Query:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRK--NKVQAVVSKRKGKYQQGFMAY--SPSTVPMNSLTLHNDPQTLP---NNHSSRRQLKRES-
        KEYAQRW +   ++  PL +KE     + T R+ F  +   K +++  +R  KY+Q F +Y  + S +P NS  L +     P   N++S +  +KR+  
Subjt:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRK--NKVQAVVSKRKGKYQQGFMAY--SPSTVPMNSLTLHNDPQTLP---NNHSSRRQLKRES-

Query:  --------FHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM
                F  IP++Y ELL QL Q+  LA IP+ P++PPYPKW+D    CDYHAG VGHSTENC +LK KV+ L+
Subjt:  --------FHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM

TYK21788.1 uncharacterized protein E5676_scaffold1721G00440 [Cucumis melo var. makuwa]8.2e-2236.59Show/hide
Query:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRKNKVQA----------VVSKRK---------------------GKYQQGFMAY--SPSTVPMN
        KEYAQRW +   ++  PL +KE     + T R+ F  +   +A           +SK+K                      KY+Q F +Y  + S +P N
Subjt:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRKNKVQA----------VVSKRK---------------------GKYQQGFMAY--SPSTVPMN

Query:  SLTL------------HNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHK
        S  L             N PQ       S+       F  IP++YTELL QL Q+  LAPIP+ P++PPYPKW+D    CDYHAG VGHSTENC +LK K
Subjt:  SLTL------------HNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHK

Query:  VEVLM
        V+ L+
Subjt:  VEVLM

XP_017979843.1 PREDICTED: uncharacterized protein LOC108662779 [Theobroma cacao]2.4e-2137.14Show/hide
Query:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRKNKVQAVVSKRKG---KYQQG-----------------FMAYSP------------------S
        KEYAQRW +   Q+  PL +KE     + T R+ F  +     V S +KG   K ++G                 + +Y P                   
Subjt:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRKNKVQAVVSKRKG---KYQQG-----------------FMAYSP------------------S

Query:  TVPMNSLTLHNDPQTLP-------NN--HSSR---RQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCT
         VP  +   +  PQT P       NN  H  R     L+R  F  IP+ YT LL QL ++ LLA  P+EPL+PP+PKWYDP  +CDYH G  GHSTENCT
Subjt:  TVPMNSLTLHNDPQTLP-------NN--HSSR---RQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCT

Query:  SLKHKVEVLM
        +LKHKV+ L+
Subjt:  SLKHKVEVLM

XP_022153553.1 uncharacterized protein LOC111021029 [Momordica charantia]7.9e-5772.09Show/hide
Query:  ESFPQHNSSHEAQLSKKGQQAPTICQV--AFKDGQASKAIHSNILKGKEKENVTNGNVEETAGKEYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNK
        E FPQH+SSH+AQLSKKGQQ PT CQV   FKD Q  K  +S+ LKG+EKENVT             QRWG+A+EQLPVPLINKE H + N+SS KRKNK
Subjt:  ESFPQHNSSHEAQLSKKGQQAPTICQV--AFKDGQASKAIHSNILKGKEKENVTNGNVEETAGKEYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNK

Query:  VQAVVSKRKGKYQQGFMAYSPSTVPMNSLTLHNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHL
        V AVVSKRK KYQQGF+AYSPSTVP+NS+TL +DPQTLPNNHSSRRQLKRESFHSIP SYTELLSQLFQ +L
Subjt:  VQAVVSKRKGKYQQGFMAYSPSTVPMNSLTLHNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHL

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]1.4e-2136Show/hide
Query:  EYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNKVQAVVSKRKGKYQQGFM---AYSPSTVP---------MNSLTLHNDPQTLPN------------
        EY  R G     +  PL  K+A       S K++ +VQ V + R    QQ +     Y+P   P         +N+ T H  P T  N            
Subjt:  EYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNKVQAVVSKRKGKYQQGFM---AYSPSTVP---------MNSLTLHNDPQTLPN------------

Query:  --------------------NHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM
                              ++R   K+  F  IP++YTELL QLFQ++ LAP+PV+P+QPPYP+WYD    CDYHAGA+GHSTENCT+LK++V+ L+
Subjt:  --------------------NHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM

TrEMBL top hitse value%identityAlignment
A0A5A7VIM6 RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein8.0e-2340.91Show/hide
Query:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRK--NKVQAVVSKRKGKYQQGFMAY--SPSTVPMNSLTLHNDPQTLP---NNHSSRRQLKRES-
        KEYAQRW +   ++  PL +KE     + T R+ F  +   K +++  +R  KY+Q F +Y  + S +P NS  L +     P   N++S +  +KR+  
Subjt:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRK--NKVQAVVSKRKGKYQQGFMAY--SPSTVPMNSLTLHNDPQTLP---NNHSSRRQLKRES-

Query:  --------FHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM
                F  IP++Y ELL QL Q+  LA IP+ P++PPYPKW+D    CDYHAG VGHSTENC +LK KV+ L+
Subjt:  --------FHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM

A0A5D3DEB3 Retrotrans_gag domain-containing protein4.0e-2236.59Show/hide
Query:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRKNKVQA----------VVSKRK---------------------GKYQQGFMAY--SPSTVPMN
        KEYAQRW +   ++  PL +KE     + T R+ F  +   +A           +SK+K                      KY+Q F +Y  + S +P N
Subjt:  KEYAQRWGEAMEQLPVPLINKE---AHLETNRSSFKRKNKVQA----------VVSKRK---------------------GKYQQGFMAY--SPSTVPMN

Query:  SLTL------------HNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHK
        S  L             N PQ       S+       F  IP++YTELL QL Q+  LAPIP+ P++PPYPKW+D    CDYHAG VGHSTENC +LK K
Subjt:  SLTL------------HNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHK

Query:  VEVLM
        V+ L+
Subjt:  VEVLM

A0A6J1DHS7 uncharacterized protein LOC1110210293.8e-5772.09Show/hide
Query:  ESFPQHNSSHEAQLSKKGQQAPTICQV--AFKDGQASKAIHSNILKGKEKENVTNGNVEETAGKEYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNK
        E FPQH+SSH+AQLSKKGQQ PT CQV   FKD Q  K  +S+ LKG+EKENVT             QRWG+A+EQLPVPLINKE H + N+SS KRKNK
Subjt:  ESFPQHNSSHEAQLSKKGQQAPTICQV--AFKDGQASKAIHSNILKGKEKENVTNGNVEETAGKEYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNK

Query:  VQAVVSKRKGKYQQGFMAYSPSTVPMNSLTLHNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHL
        V AVVSKRK KYQQGF+AYSPSTVP+NS+TL +DPQTLPNNHSSRRQLKRESFHSIP SYTELLSQLFQ +L
Subjt:  VQAVVSKRKGKYQQGFMAYSPSTVPMNSLTLHNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222311.5e-2161.73Show/hide
Query:  NNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM
        NN S+R+Q     F  IP++YTELL QLFQ++ LAP+PV+P+QPPYP WYD    CDYHAGA+GHSTENCT+LK++V+ L+
Subjt:  NNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM

A0A6J1E2J7 Ribonuclease H6.8e-2236Show/hide
Query:  EYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNKVQAVVSKRKGKYQQGFM---AYSPSTVP---------MNSLTLHNDPQTLPN------------
        EY  R G     +  PL  K+A       S K++ +VQ V + R    QQ +     Y+P   P         +N+ T H  P T  N            
Subjt:  EYAQRWGEAMEQLPVPLINKEAHLETNRSSFKRKNKVQAVVSKRKGKYQQGFM---AYSPSTVP---------MNSLTLHNDPQTLPN------------

Query:  --------------------NHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM
                              ++R   K+  F  IP++YTELL QLFQ++ LAP+PV+P+QPPYP+WYD    CDYHAGA+GHSTENCT+LK++V+ L+
Subjt:  --------------------NHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYCDYHAGAVGHSTENCTSLKHKVEVLM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTCTCCACTCAAGTCGACCATACTGCCAAAGCATATGGAAGATTTGAGGGTCATTATCCTCTAGGATGCATTATAGAGAGCTTCCCCCAGCATAATTCATCACA
TGAAGCTCAGCTTTCCAAGAAAGGCCAACAAGCTCCTACAATCTGTCAGGTTGCATTCAAAGATGGACAAGCATCAAAGGCCATTCACTCAAATATTTTAAAAGGTAAAG
AGAAAGAAAATGTTACCAATGGAAATGTTGAAGAGACTGCAGGGAAAGAATATGCTCAGAGATGGGGAGAAGCAATGGAACAACTGCCAGTGCCTTTAATTAATAAAGAG
GCACACCTTGAAACGAATAGAAGCTCTTTTAAAAGGAAAAATAAAGTTCAGGCAGTAGTTTCGAAGCGCAAAGGGAAGTACCAACAAGGATTCATGGCATACAGTCCTTC
TACGGTACCCATGAACTCATTGACTCTGCATAATGATCCACAAACATTACCTAACAATCATAGTAGTCGGAGACAACTAAAGCGAGAGAGCTTTCATTCGATCCCAATAT
CCTATACTGAGTTGTTATCCCAATTATTTCAAAGTCATCTACTAGCTCCGATACCTGTAGAACCATTGCAGCCACCTTATCCAAAATGGTACGACCCGCAAGTCTACTGT
GATTACCATGCAGGAGCTGTAGGTCATTCAACTGAGAACTGTACTTCACTAAAGCATAAAGTGGAAGTTCTAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAATTCTCCACTCAAGTCGACCATACTGCCAAAGCATATGGAAGATTTGAGGGTCATTATCCTCTAGGATGCATTATAGAGAGCTTCCCCCAGCATAATTCATCACA
TGAAGCTCAGCTTTCCAAGAAAGGCCAACAAGCTCCTACAATCTGTCAGGTTGCATTCAAAGATGGACAAGCATCAAAGGCCATTCACTCAAATATTTTAAAAGGTAAAG
AGAAAGAAAATGTTACCAATGGAAATGTTGAAGAGACTGCAGGGAAAGAATATGCTCAGAGATGGGGAGAAGCAATGGAACAACTGCCAGTGCCTTTAATTAATAAAGAG
GCACACCTTGAAACGAATAGAAGCTCTTTTAAAAGGAAAAATAAAGTTCAGGCAGTAGTTTCGAAGCGCAAAGGGAAGTACCAACAAGGATTCATGGCATACAGTCCTTC
TACGGTACCCATGAACTCATTGACTCTGCATAATGATCCACAAACATTACCTAACAATCATAGTAGTCGGAGACAACTAAAGCGAGAGAGCTTTCATTCGATCCCAATAT
CCTATACTGAGTTGTTATCCCAATTATTTCAAAGTCATCTACTAGCTCCGATACCTGTAGAACCATTGCAGCCACCTTATCCAAAATGGTACGACCCGCAAGTCTACTGT
GATTACCATGCAGGAGCTGTAGGTCATTCAACTGAGAACTGTACTTCACTAAAGCATAAAGTGGAAGTTCTAATGTAG
Protein sequenceShow/hide protein sequence
MKFSTQVDHTAKAYGRFEGHYPLGCIIESFPQHNSSHEAQLSKKGQQAPTICQVAFKDGQASKAIHSNILKGKEKENVTNGNVEETAGKEYAQRWGEAMEQLPVPLINKE
AHLETNRSSFKRKNKVQAVVSKRKGKYQQGFMAYSPSTVPMNSLTLHNDPQTLPNNHSSRRQLKRESFHSIPISYTELLSQLFQSHLLAPIPVEPLQPPYPKWYDPQVYC
DYHAGAVGHSTENCTSLKHKVEVLM