CuGenDBv2

MC04g0736 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0736
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: zf-RVT domain-containing protein
Genome location: MC04:8199514..8200004
RNA-Seq Expression: MC04g0736
Synteny: MC04g0736
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
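
The genome location field uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC04:8199514..8200004")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 491 bp on MC04.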


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa] | e-value: 2.27e-11 | identity: 29.01%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNSALQ--SSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG
        P++++++IWI+    + +S+ILQ+ S +    S C LC  +S+    +F  CP +   W  +FS FN+ W        +++ LL    L  K  R+I   
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNSALQ--SSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG

Query:  LMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNA
        ++  +L    ++ R  ++ FHDK++   +        A+ WCS+   F +YS   IC +WNA
Subjt:  LMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNA

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa] | e-value: 4.64e-13 | identity: 29.27%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNSALQ--SSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG
        P++++++IWI+    +N+S+ILQ+ S +    S C LC  +S+    +F  CP +   W  +FS FN+ W        +++ LL    L  K  R+I   
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNSALQ--SSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG

Query:  LMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI
        ++  +L    ++ R  ++ FHDK++   +        A+ WCS+   F +YS   IC +WN F+
Subjt:  LMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia] | e-value: 4.81e-22 | identity: 36.36%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNS---ALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS
        P++V++  WI+FQG LNT+DI+Q+ S   AL  SFC LC+ S E    LFF C  A  CW  LF  FNV W       +N+  LL+ P     + R +  
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNS---ALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS

Query:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI
         ++++ L   +       + F +K +   + F   KFKAS WCS+   F  +S   I  +W AFI
Subjt:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI

XP_024195633.1 uncharacterized protein LOC112198748 [Rosa chinensis] | e-value: 8.89e-09 | identity: 32.57%
Query:  PKKVSMVIWIIFQGNLNTSDILQRN---SALQSSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVL---SQKAHRL
        P KV ++ W++  G  NT D+LQR    S     +CILC +  E+   +F  C  A   W  LF    V W  T   RN+LL    +P+     +KA  L
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRN---SALQSSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVL---SQKAHRL

Query:  ISSGLM---------RSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNA
           G++         R+   + +Y   G++K      +DL   ++  KF AS W SIS  FKDY+  SI  +W A
Subjt:  ISSGLM---------RSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNA

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida] | e-value: 3.41e-16 | identity: 31.52%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNS---ALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS
        P++V+++IWI+  G LN +++LQ+     +L  + C  C   SE    LFF CP +  CW  L  FFN+   L    ++N+  LL  P  S K+ RL+  
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNS---ALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS

Query:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI
          +++ L   D      ++ F++K+       +  + +AS WC +S PF+ YS      +W AFI
Subjt:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI

TrEMBL top hits (e-value, % identity, alignment)
A0A438FUT7 zf-RVT domain-containing protein | e-value: 3.13e-07 | identity: 28.95%
Query:  PKKVSMVIWIIFQGNLNTSDILQRN---SALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS
        P KV  + W++  G +NT+D LQ      AL   +CILC  + E+   LF  CP  +  W  LF+   + WVL G  R   L LL    L          
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRN---SALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS

Query:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDY
         +   +L +  +  R   + F DK +     +DL +F +S W S+    +++
Subjt:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDY

A0A5D3C9J6 LINE-1 retrotransposable element ORF2 protein | e-value: 1.62e-07 | identity: 27.44%
Query:  PKKVSMVIWIIFQGNLNTSDILQR---NSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG
        P K+   +W + Q  LNT +++Q+   NS LQ ++C+LC   SET A LFF C      W+L    ++++         L         S   H+++  G
Subjt:  PKKVSMVIWIIFQGNLNTSDILQR---NSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG

Query:  LMRSSLFYHDYVMRGIKKTFHDKS--KDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNA
        L+  ++ +  +  R   + F   S  K +   ++  K     WCS    FK+YS+ +I  + NA
Subjt:  LMRSSLFYHDYVMRGIKKTFHDKS--KDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNA

A0A5D3DE60 zf-RVT domain-containing protein | e-value: 2.25e-13 | identity: 29.27%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNSALQ--SSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG
        P++++++IWI+    +N+S+ILQ+ S +    S C LC  +S+    +F  CP +   W  +FS FN+ W        +++ LL    L  K  R+I   
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNSALQ--SSFCILCSSSSETQAQLFFQCPSALSCWA-LFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSG

Query:  LMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI
        ++  +L    ++ R  ++ FHDK++   +        A+ WCS+   F +YS   IC +WN F+
Subjt:  LMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI

A0A5E4F2L4 zf-RVT domain-containing protein | e-value: 4.64e-09 | identity: 30%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNSA---LQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS
        P KV + +W++  G +NTSD++QR      L   +C+LC    E+   LF  CP +LS W  L+      WV+           L    +     +L S+
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNSA---LQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS

Query:  --GLMRSSLFYHDYVMRGIKKTFHD-KSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSI
          G +  S+F+  ++ R  ++ F D K   L   +D  K+ A+ W S++  FKDYS  +I
Subjt:  --GLMRSSLFYHDYVMRGIKKTFHD-KSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSI

A0A6J1DIE2 uncharacterized protein LOC111020765 | e-value: 2.33e-22 | identity: 36.36%
Query:  PKKVSMVIWIIFQGNLNTSDILQRNS---ALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS
        P++V++  WI+FQG LNT+DI+Q+ S   AL  SFC LC+ S E    LFF C  A  CW  LF  FNV W       +N+  LL+ P     + R +  
Subjt:  PKKVSMVIWIIFQGNLNTSDILQRNS---ALQSSFCILCSSSSETQAQLFFQCPSALSCW-ALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISS

Query:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI
         ++++ L   +       + F +K +   + F   KFKAS WCS+   F  +S   I  +W AFI
Subjt:  GLMRSSLFYHDYVMRGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFI

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e-value: 2.5e-04 | identity: 32.2%
Query:  KVSMVIWIIFQGNLNTSDILQRNSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFS
        K + + W++    L+T D L+       + C+LC+S  E++A LFF+CP   + W  F+
Subjt:  KVSMVIWIIFQGNLNTSDILQRNSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFS

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e-value: 9.3e-04 | identity: 35.59%
Query:  KVSMVIWIIFQGNLNTSDILQRNSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFS
        + S++ W+ F   L T D L+       S  +LCS+  ET A LFF+C  +L+ W  F+
Subjt:  KVSMVIWIIFQGNLNTSDILQRNSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFS


Sequences
CDS sequence
CCAAAGAAGGTCAGCATGGTTATTTGGATTATTTTTCAGGGGAACTTGAATACCTCTGATATTCTTCAAAGGAATTCAGCTCTTCAGTCCTCTTTTTGCATTTTGTGCTC
TTCTTCCAGCGAAACCCAAGCTCAATTATTTTTTCAATGTCCTTCTGCATTGAGCTGCTGGGCGTTGTTCTCCTTCTTCAATGTCCATTGGGTTTTAACAGGACACCACA
GAAACAACTTGCTGCATTTATTATATGATCCTGTTCTGTCCCAGAAGGCTCATCGGCTCATCTCATCTGGGTTAATGCGATCAAGCCTATTTTATCATGATTATGTTATG
AGAGGAATCAAAAAGACCTTCCACGATAAAAGCAAGGATTTGGTGAAATGCTTTGATCTAGAAAAATTCAAAGCTTCTCAGTGGTGTTCCATTTCATCTCCATTTAAGGA
TTATTCGTCCGGCTCTATTTGTGATTCTTGGAATGCCTTTATTTTCTCC
mRNA sequence
CCAAAGAAGGTCAGCATGGTTATTTGGATTATTTTTCAGGGGAACTTGAATACCTCTGATATTCTTCAAAGGAATTCAGCTCTTCAGTCCTCTTTTTGCATTTTGTGCTC
TTCTTCCAGCGAAACCCAAGCTCAATTATTTTTTCAATGTCCTTCTGCATTGAGCTGCTGGGCGTTGTTCTCCTTCTTCAATGTCCATTGGGTTTTAACAGGACACCACA
GAAACAACTTGCTGCATTTATTATATGATCCTGTTCTGTCCCAGAAGGCTCATCGGCTCATCTCATCTGGGTTAATGCGATCAAGCCTATTTTATCATGATTATGTTATG
AGAGGAATCAAAAAGACCTTCCACGATAAAAGCAAGGATTTGGTGAAATGCTTTGATCTAGAAAAATTCAAAGCTTCTCAGTGGTGTTCCATTTCATCTCCATTTAAGGA
TTATTCGTCCGGCTCTATTTGTGATTCTTGGAATGCCTTTATTTTCTCC
Protein sequence
PKKVSMVIWIIFQGNLNTSDILQRNSALQSSFCILCSSSSETQAQLFFQCPSALSCWALFSFFNVHWVLTGHHRNNLLHLLYDPVLSQKAHRLISSGLMRSSLFYHDYVM
RGIKKTFHDKSKDLVKCFDLEKFKASQWCSISSPFKDYSSGSICDSWNAFIFS
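
The protein sequence corresponds to a frame-1 translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the correspondence can be checked in plain Python. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API, and only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS sequence listed above.
cds_prefix = "CCAAAGAAGGTCAGCATGGTTATTTGGATT"
print(translate(cds_prefix))  # PKKVSMVIWI
```

The output matches the first ten residues of the listed protein. Passing the full CDS, with its line breaks removed, to `translate` extends the same check to the whole sequence.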