CuGenDBv2

Moc07g11700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g11700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:9001157..9002095
RNA-Seq Expression: Moc07g11700
Synteny: Moc07g11700
Gene Ontology terms: NA
InterPro domains: NA
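The genome location above follows a chrom:start..end pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function names parse_location and span_length are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr7:9001157..9002095")
print(chrom, start, end, span_length(start, end))
```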


Homology
GenBank top hits | e value | %identity | Alignment
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] | e value: 9.8e-37 | %identity: 35.39
Query:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW
        P  +G PI  + G+RV+YDAA    A++++F+ PD  WQWPRVS++LIDL   V +V+  +  +DR VW P   G FSI+SA   +RPR   V +  LLW
Subjt:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW

Query:  -----------------------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRL
                                                        HLFF CP+ R VW+ ++    SSH++ YW  ELSWICH  +G SVRR +WR+
Subjt:  -----------------------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRL

Query:  AWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW
           +T   +W+E N R+HGG  R   ++ + I   +  R  +W
Subjt:  AWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] | e value: 1.2e-34 | %identity: 35.06
Query:  GDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW------------
        G+RV+YDAA    AK++DF+ P+  W WPRVS++LIDL   V  V   +   D  VW P   G FSI+SAW  + PR   V +  LLW            
Subjt:  GDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW------------

Query:  -----------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRE
                                            HLFF CP+   VW+ +     SSH++ +W  ELSWICH  +G  VRR +WR+ W +T+  +W E
Subjt:  -----------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRE

Query:  RNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW
        RN R+HGG  R   +L  +I   +  R  +W
Subjt:  RNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW

XP_022158199.1 uncharacterized protein LOC111024737 [Momordica charantia] | e value: 1.9e-40 | %identity: 77.57
Query:  FDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTWPGSCCRSS
        F   + +A W+ +     SSH+VSYWS+EL+WICHVS GSS RRHVWRLAWTSTVSLLWRERNLRIHGG GR SSVLLRVIKAEVYFRCTTWPGSCC+SS
Subjt:  FDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTWPGSCCRSS

Query:  EAMLVSA
        E MLVSA
Subjt:  EAMLVSA

XP_022158861.1 uncharacterized protein LOC111025324 [Momordica charantia] | e value: 1.4e-80 | %identity: 65.11
Query:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW
        P   GGPIHPRCGDRVIYD A SSMAKV DFLLPD SW+WPRVS+DL++LLPEVMSV+ +VG++DR VWTPAVSGLFS+SS WG+LRPRRP V YF LLW
Subjt:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW

Query:  Y----------------------------------------------HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLA
        +                                              HLF DCPYSRAVW  MISWAGSSH+VSYWS+EL+WICHV+VGSS RRHVWRLA
Subjt:  Y----------------------------------------------HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLA

Query:  WTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEV
        WT TVSLLWRE NLRIH G GRSSSVLLRVIKAEV
Subjt:  WTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEV

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus] | e value: 1.3e-33 | %identity: 33.73
Query:  GVGRIFWVSGIHSVLWCVLFLGMVGFVLCGTIPGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLV
        GV  +  VSGI   L       M G + CG I   +GG I  + G+RVIYDA     A++ DF++ D  W+WP VS+DL+D+   +  V+     +DR V
Subjt:  GVGRIFWVSGIHSVLWCVLFLGMVGFVLCGTIPGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLV

Query:  WTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW-----------------------------------------------YHLFFDCPYSRAVWTSMISWA
        W P     FSI+SAW  +RP    V +  LLW                                                HLFF CP+   +W+ ++ + 
Subjt:  WTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW-----------------------------------------------YHLFFDCPYSRAVWTSMISWA

Query:  GSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGR
         SSH++ YW  ELSWIC+  +G  VRR +W L W +T+  +W+ERN  +HGG  R
Subjt:  GSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGR

TrEMBL top hits | e value | %identity | Alignment
A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein | e value: 4.7e-37 | %identity: 35.39
Query:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW
        P  +G PI  + G+RV+YDAA    A++++F+ PD  WQWPRVS++LIDL   V +V+  +  +DR VW P   G FSI+SA   +RPR   V +  LLW
Subjt:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW

Query:  -----------------------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRL
                                                        HLFF CP+ R VW+ ++    SSH++ YW  ELSWICH  +G SVRR +WR+
Subjt:  -----------------------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRL

Query:  AWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW
           +T   +W+E N R+HGG  R   ++ + I   +  R  +W
Subjt:  AWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW

A0A5A7TZS0 Reverse transcriptase domain-containing protein | e value: 5.8e-35 | %identity: 35.06
Query:  GDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW------------
        G+RV+YDAA    AK++DF+ P+  W WPRVS++LIDL   V  V   +   D  VW P   G FSI+SAW  + PR   V +  LLW            
Subjt:  GDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW------------

Query:  -----------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRE
                                            HLFF CP+   VW+ +     SSH++ +W  ELSWICH  +G  VRR +WR+ W +T+  +W E
Subjt:  -----------------------------------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRE

Query:  RNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW
        RN R+HGG  R   +L  +I   +  R  +W
Subjt:  RNLRIHGGPGRSSSVLLRVIKAEVYFRCTTW

A0A5D3BDZ6 Zf-RVT domain-containing protein | e value: 8.4e-34 | %identity: 39.11
Query:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW
        P  +GG I  + G+RV+YDAA    A+++DF+ P+  W WPRVS++LIDL   V  V   +   D  VW P   G FSI+SAW  + PR   V +  LLW
Subjt:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW

Query:  ------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCT
               H F  C +  A+   ++    SSH++ +W  ELSWI H  +G  VRR +WR+ W +T+  +W ERN R+HGG  R   +L  +I   +  R  
Subjt:  ------YHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCT

Query:  TW
        +W
Subjt:  TW

A0A6J1DYP6 uncharacterized protein LOC111024737 | e value: 9.2e-41 | %identity: 77.57
Query:  FDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTWPGSCCRSS
        F   + +A W+ +     SSH+VSYWS+EL+WICHVS GSS RRHVWRLAWTSTVSLLWRERNLRIHGG GR SSVLLRVIKAEVYFRCTTWPGSCC+SS
Subjt:  FDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRCTTWPGSCCRSS

Query:  EAMLVSA
        E MLVSA
Subjt:  EAMLVSA

A0A6J1E271 uncharacterized protein LOC111025324 | e value: 7.0e-81 | %identity: 65.11
Query:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW
        P   GGPIHPRCGDRVIYD A SSMAKV DFLLPD SW+WPRVS+DL++LLPEVMSV+ +VG++DR VWTPAVSGLFS+SS WG+LRPRRP V YF LLW
Subjt:  PGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLW

Query:  Y----------------------------------------------HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLA
        +                                              HLF DCPYSRAVW  MISWAGSSH+VSYWS+EL+WICHV+VGSS RRHVWRLA
Subjt:  Y----------------------------------------------HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLA

Query:  WTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEV
        WT TVSLLWRE NLRIH G GRSSSVLLRVIKAEV
Subjt:  WTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEV

SwissProt top hits | e value | %identity | Alignment
No hits found

Arabidopsis top hits | e value | %identity | Alignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 8.1e-05 | %identity: 33.33
Query:  HLFFDCPYSRAVWTSMISWAG--SSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFR
        HLFF+CP+  AVW      A      Q+ Y    L W+ + S   +    + RLA+ + V  +WRERNL +H G  R    +L+ I+  +  R
Subjt:  HLFFDCPYSRAVWTSMISWAG--SSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFR

AT1G45063.1 copper ion binding;electron carriers | e value: 5.2e-04 | %identity: 34.29
Query:  HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIH
        H+FFDCP+S  VW+   S A  +    +  S   W+ H      V   + +LA+ ++V  +WRERN+R++
Subjt:  HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIH

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 6.2e-05 | %identity: 29.35
Query:  HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRC
        HLFFDC ++R VW    S       + +    + W+ +     +V   + RL+  ++V  +W+ERN R+H    R ++ L+  IK+ +  RC
Subjt:  HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIKAEVYFRC

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 3.9e-07 | %identity: 23.83
Query:  GPIHPRCG----DRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTP---AVSGLFSISSAWGLLRPRRPLVPYFSL
        GP+ PR      D V+ DA R +   +A       S     + + L +LLPE   +       D  +W     A S  FS    W  L P+   VP+   
Subjt:  GPIHPRCG----DRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAMVGQKDRLVWTP---AVSGLFSISSAWGLLRPRRPLVPYFSL

Query:  LWY-----------------------------------------------HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVW
        +W+                                               HLFF+C +S  VW    +    +         L+W+   S   ++   + 
Subjt:  LWY-----------------------------------------------HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVW

Query:  RLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIK
        RLA+ S V  +WRERN R+H G  RS+  +L+ I+
Subjt:  RLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIK

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1) | e value: 1.6e-05 | %identity: 30.59
Query:  HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIK
        HLFF C  S AVW + +  A  +  + +    L+W+   S   ++   + +L + +++  LW+ERN R+H    RSS+ +++ IK
Subjt:  HLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRIHGGPGRSSSVLLRVIK


Sequences
CDS sequence
ATGCTTTGCGGGGTTCTTGTATTTGGACTGTCCGTCCTTCACCTCGGTTTTCCTGGTGTTGGCAGGATATTCTGGGTGTCCGGGATTCACTCCGTCCTCTGGTGCGTACT
GTTCTTGGGGATGGTCGGCTTTGTTTTGTGTGGCACGATTCCTGGTTGCCGGGGGGGTCCGATTCACCCGAGGTGTGGGGATAGGGTCATTTATGATGCTGCTAGATCTT
CTATGGCTAAGGTTGCTGACTTTTTGCTCCCTGACGATTCCTGGCAGTGGCCTCGTGTTTCGATGGATCTTATTGATCTTCTCCCGGAGGTGATGTCGGTGCAAGCCATG
GTTGGTCAGAAGGATCGCCTTGTCTGGACTCCTGCGGTGTCTGGCCTTTTTTCGATCTCTAGTGCTTGGGGGCTTCTTCGTCCTCGTCGTCCGCTAGTTCCTTATTTTTC
CTTGTTATGGTATCATCTGTTTTTTGATTGCCCTTATAGTAGAGCGGTTTGGACTAGTATGATTTCGTGGGCCGGTTCGTCTCATCAGGTTTCATATTGGTCTTCAGAGC
TTTCTTGGATTTGTCATGTTAGTGTTGGGTCGTCTGTGCGGCGGCATGTTTGGCGGTTAGCTTGGACTTCGACTGTCTCTCTCCTTTGGAGGGAGCGGAACCTTCGTATT
CATGGTGGTCCGGGTCGCTCGTCGTCTGTGCTCCTTAGGGTCATTAAAGCTGAGGTGTATTTTCGTTGCACTACTTGGCCAGGGAGTTGTTGTAGGAGCTCTGAGGCCAT
GCTTGTCTCTGCTTGGGACTTATTTTCGTGA
mRNA sequence
ATGCTTTGCGGGGTTCTTGTATTTGGACTGTCCGTCCTTCACCTCGGTTTTCCTGGTGTTGGCAGGATATTCTGGGTGTCCGGGATTCACTCCGTCCTCTGGTGCGTACT
GTTCTTGGGGATGGTCGGCTTTGTTTTGTGTGGCACGATTCCTGGTTGCCGGGGGGGTCCGATTCACCCGAGGTGTGGGGATAGGGTCATTTATGATGCTGCTAGATCTT
CTATGGCTAAGGTTGCTGACTTTTTGCTCCCTGACGATTCCTGGCAGTGGCCTCGTGTTTCGATGGATCTTATTGATCTTCTCCCGGAGGTGATGTCGGTGCAAGCCATG
GTTGGTCAGAAGGATCGCCTTGTCTGGACTCCTGCGGTGTCTGGCCTTTTTTCGATCTCTAGTGCTTGGGGGCTTCTTCGTCCTCGTCGTCCGCTAGTTCCTTATTTTTC
CTTGTTATGGTATCATCTGTTTTTTGATTGCCCTTATAGTAGAGCGGTTTGGACTAGTATGATTTCGTGGGCCGGTTCGTCTCATCAGGTTTCATATTGGTCTTCAGAGC
TTTCTTGGATTTGTCATGTTAGTGTTGGGTCGTCTGTGCGGCGGCATGTTTGGCGGTTAGCTTGGACTTCGACTGTCTCTCTCCTTTGGAGGGAGCGGAACCTTCGTATT
CATGGTGGTCCGGGTCGCTCGTCGTCTGTGCTCCTTAGGGTCATTAAAGCTGAGGTGTATTTTCGTTGCACTACTTGGCCAGGGAGTTGTTGTAGGAGCTCTGAGGCCAT
GCTTGTCTCTGCTTGGGACTTATTTTCGTGA
Protein sequence
MLCGVLVFGLSVLHLGFPGVGRIFWVSGIHSVLWCVLFLGMVGFVLCGTIPGCRGGPIHPRCGDRVIYDAARSSMAKVADFLLPDDSWQWPRVSMDLIDLLPEVMSVQAM
VGQKDRLVWTPAVSGLFSISSAWGLLRPRRPLVPYFSLLWYHLFFDCPYSRAVWTSMISWAGSSHQVSYWSSELSWICHVSVGSSVRRHVWRLAWTSTVSLLWRERNLRI
HGGPGRSSSVLLRVIKAEVYFRCTTWPGSCCRSSEAMLVSAWDLFS
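
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the translate function and CODON_TABLE name are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGCTTTGCGGGGTTCTTGTATTTGGACTG"))  # MLCGVLVFGL
```

Pasting the full CDS into translate should reproduce the protein sequence listed above if the standard code is the right assumption.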