; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:14150732..14151184
RNA-Seq ExpressionMoc02g19010
SyntenyMoc02g19010
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata]3.2e-3767.5Show/hide
Query:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS
        M + RPISLCNVIYK+++KVLANRLK+VL  IIS  QS FVPGR ITDN +  FE +H IN KR+GK G++A+KLDMSKAYDRVEW YL A+M +LGF  
Subjt:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS

Query:  RWISLIMDCVEFVSFSVILN
        RWISL+M CV  VS+SV+LN
Subjt:  RWISLIMDCVEFVSFSVILN

XP_035544594.1 uncharacterized protein LOC118347987 [Juglans regia]1.5e-3758.99Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        D RPISLCNV+YK++AKVLANRLK VL++IISPNQS F+PGRLI+DN +  +E +H++  ++ GK G +A+KLDMSKAYDR+EW YLRAV+ K+GFC +W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP
        I LIM CV  VS+SV++N   S     SR      P SP
Subjt:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP

XP_035546588.1 uncharacterized protein LOC118348634 [Juglans regia]1.5e-3758.99Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        D RPISLCNV+YK++AKVLANRLK VL++IISPNQS F+PGRLI+DN +  +E +H++  ++ GK G +A+KLDMSKAYDR+EW YLRAV+ K+GFC +W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP
        I LIM CV  VS+SV++N   S     SR      P SP
Subjt:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP

XP_040249920.1 uncharacterized protein LOC109742703 isoform X1 [Aegilops tauschii subsp. strangulata]1.1e-3765Show/hide
Query:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS
        MKD+RPISLCNV+YKL++KVLANRLKQ+L  IISPNQS FVPGRLITDN +  +EC H + NKR GKDG  A+KLDMSKAYD+VEW +L  +M +LGF  
Subjt:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS

Query:  RWISLIMDCVEFVSFSVILN
        RWI L+M C+  VS+   +N
Subjt:  RWISLIMDCVEFVSFSVILN

XP_040249922.1 uncharacterized protein LOC109742703 isoform X3 [Aegilops tauschii subsp. strangulata]1.1e-3765Show/hide
Query:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS
        MKD+RPISLCNV+YKL++KVLANRLKQ+L  IISPNQS FVPGRLITDN +  +EC H + NKR GKDG  A+KLDMSKAYD+VEW +L  +M +LGF  
Subjt:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS

Query:  RWISLIMDCVEFVSFSVILN
        RWI L+M C+  VS+   +N
Subjt:  RWISLIMDCVEFVSFSVILN

TrEMBL top hitse value%identityAlignment
A0A2N9G161 Reverse transcriptase domain-containing protein2.0e-3761.86Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        + RPISLCNV+YKL++KVLANRLK+VL +++S  QS FVPGR+ITDN +  FE +H ++N+R GK G +A+KLDMSKAYDRVEWG+L+ VM ++GFC  W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILN
        ISLIM+C+  VS+S+++N
Subjt:  ISLIMDCVEFVSFSVILN

A0A2N9G1P5 Reverse transcriptase domain-containing protein2.0e-3761.86Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        + RPISLCNV+YKL++KVLANRLK+VL +++S  QS FVPGR+ITDN +  FE +H ++N+R GK G +A+KLDMSKAYDRVEWG+L+ VM ++GFC  W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILN
        ISLIM+C+  VS+S+++N
Subjt:  ISLIMDCVEFVSFSVILN

A0A2N9G3M5 Reverse transcriptase domain-containing protein3.5e-3762.71Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        + RPISLCNVIYKL++KVLANRLK+VL  ++S  QS FVPGR+ITDN +  FE +H ++N+R GK G +A+KLDMSKAYDRVEWG+L+ VM ++GFC  W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILN
        ISLIM+C+  VS+S+++N
Subjt:  ISLIMDCVEFVSFSVILN

A0A6P9EAZ3 uncharacterized protein LOC1183479877.0e-3858.99Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        D RPISLCNV+YK++AKVLANRLK VL++IISPNQS F+PGRLI+DN +  +E +H++  ++ GK G +A+KLDMSKAYDR+EW YLRAV+ K+GFC +W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP
        I LIM CV  VS+SV++N   S     SR      P SP
Subjt:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP

A0A6P9ERU8 uncharacterized protein LOC1183486347.0e-3858.99Show/hide
Query:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW
        D RPISLCNV+YK++AKVLANRLK VL++IISPNQS F+PGRLI+DN +  +E +H++  ++ GK G +A+KLDMSKAYDR+EW YLRAV+ K+GFC +W
Subjt:  DIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRW

Query:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP
        I LIM CV  VS+SV++N   S     SR      P SP
Subjt:  ISLIMDCVEFVSFSVILNRFYSSKRSSSRRSFIPVPFSP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-0930.25Show/hide
Query:  KDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSR
        ++ RPISL N+  K++ K+LANR++Q + K+I  +Q  F+PG     N       I  IN  R      V + +D  KA+D+++  ++   + KLG    
Subjt:  KDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSR

Query:  WISLIMDCVEFVSFSVILN
        ++ +I    +  + ++ILN
Subjt:  WISLIMDCVEFVSFSVILN

P08548 LINE-1 reverse transcriptase homolog8.6e-0930.56Show/hide
Query:  KDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSR
        ++ RPISL N+  K++ K+L NR++Q + KII  +Q  F+PG     N       I  I NK   KD ++ + +D  KA+D ++  ++   ++K+G    
Subjt:  KDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSR

Query:  WISLIMDCVEFVSFSVILNRFYSSK---RSSSRRSFIPVPFSPL
        ++ LI       + ++ILN         RS +R+     P SPL
Subjt:  WISLIMDCVEFVSFSVILNRFYSSK---RSSSRRSFIPVPFSPL

P11369 LINE-1 retrotransposable element ORF2 protein1.7e-0931.13Show/hide
Query:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS
        +++ RPISL N+  K++ K+LANR+++ +  II P+Q  F+PG     N       IH IN  ++     + + LD  KA+D+++  ++  V+E+ G   
Subjt:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS

Query:  RWISLI
         ++++I
Subjt:  RWISLI

P14381 Transposon TX1 uncharacterized 149 kDa protein2.7e-1035.85Show/hide
Query:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS
        +K+ RP+SL +  YK++AK ++ RLK VL ++I P+QS  VPGR I DN     + +H    +R G   +  + LD  KA+DRV+  YL   ++   F  
Subjt:  MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCS

Query:  RWISLI
        +++  +
Subjt:  RWISLI

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.0e-1338.55Show/hide
Query:  LANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRWI
        +  RLK ++  +I P Q++F+PGR+ TDN +   E +H++  K+ G  G + +KLD+ KAYDR+ W YL   +   GF   W+
Subjt:  LANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATATTCGCCCTATAAGTTTATGTAATGTGATTTACAAGCTTATGGCAAAAGTCCTGGCTAATAGGTTGAAACAAGTTCTTGATAAGATTATTTCCCCCAACCA
ATCAACGTTTGTGCCAGGAAGATTAATTACAGATAATGCGATGAGTGGGTTTGAGTGTATTCATGCAATTAACAATAAAAGAATAGGGAAAGATGGAGTGGTCGCTATGA
AGTTGGATATGAGCAAGGCGTATGATCGAGTCGAGTGGGGGTACTTGAGGGCAGTTATGGAGAAATTGGGGTTTTGTAGTAGATGGATTTCTTTGATAATGGATTGTGTT
GAGTTTGTTAGCTTCTCAGTCATTCTTAATAGGTTTTATTCTTCAAAGAGGTCTTCGTCAAGACGATCCTTTATCCCCGTACCTTTTTCTCCTTTGTGCAGAAGGTCTTT
CCACACTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATATTCGCCCTATAAGTTTATGTAATGTGATTTACAAGCTTATGGCAAAAGTCCTGGCTAATAGGTTGAAACAAGTTCTTGATAAGATTATTTCCCCCAACCA
ATCAACGTTTGTGCCAGGAAGATTAATTACAGATAATGCGATGAGTGGGTTTGAGTGTATTCATGCAATTAACAATAAAAGAATAGGGAAAGATGGAGTGGTCGCTATGA
AGTTGGATATGAGCAAGGCGTATGATCGAGTCGAGTGGGGGTACTTGAGGGCAGTTATGGAGAAATTGGGGTTTTGTAGTAGATGGATTTCTTTGATAATGGATTGTGTT
GAGTTTGTTAGCTTCTCAGTCATTCTTAATAGGTTTTATTCTTCAAAGAGGTCTTCGTCAAGACGATCCTTTATCCCCGTACCTTTTTCTCCTTTGTGCAGAAGGTCTTT
CCACACTCCTTAA
Protein sequenceShow/hide protein sequence
MKDIRPISLCNVIYKLMAKVLANRLKQVLDKIISPNQSTFVPGRLITDNAMSGFECIHAINNKRIGKDGVVAMKLDMSKAYDRVEWGYLRAVMEKLGFCSRWISLIMDCV
EFVSFSVILNRFYSSKRSSSRRSFIPVPFSPLCRRSFHTP