; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g00720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g00720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationchr1:465331..466017
RNA-Seq ExpressionMoc01g00720
SyntenyMoc01g00720
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061282.1 putative mitochondrial protein [Cucumis melo var. makuwa]6.1e-4454.82Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDD-----GLCLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLVGVTPPVLWCDNASAIQL
        MH P   H  A KRILRY+ G   D     G       NPISW +KKQST+SRSST+AEYR+LA+T ++L+W+RQLL DL +   T P+LWCDN SAI L
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDD-----GLCLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLVGVTPPVLWCDNASAIQL

Query:  ARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRS
        A NP+FH+RTKH+EID+HF+RE+V+RKDI + ++S+  QLAD+ TKPL +   L LR KL+    S
Subjt:  ARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRS

KAG5563892.1 hypothetical protein RHGRI_000173 [Rhododendron griersonianum]5.1e-4348.98Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGL------------------------------CLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ
        MH P HSH +A KRILRYI G++  GL                              C+  GPN +SW AKKQ TV+RSST+AEYR+LA TA+++ W++Q
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGL------------------------------CLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ

Query:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAG
        +L +L +    PPV+WCDN SAI LA NPIFH+RTKHVEID+HFIRE+V+ K + +H V T  Q+AD+FTK L+ AR  FL++KLM     +S  G
Subjt:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAG

TQD96815.1 hypothetical protein C1H46_017547 [Malus baccata]3.3e-4249.05Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGL--------------------------------CLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWV
        MH+P  SH  A KRILRY+ G++D GL                                C+  G + ISW AKKQ TV+RSST+AEYR+LA+TASE+ W+
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGL--------------------------------CLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWV

Query:  RQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAGFR
         QLL D+      PP LWCDN SAI LA+NP+FH+RTKHVE+D+H+IRE+VV K I LH++ ++ Q+AD+ TK L AAR LFLR KL      L    FR
Subjt:  RQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAGFR

Query:  LRGMLTDGNI
        LRG +  GNI
Subjt:  LRGMLTDGNI

XP_022152156.1 uncharacterized protein LOC111019945 [Momordica charantia]1.8e-4856.08Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK---------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW
        MH P   HL AAKRILRY+ GTVD GL  ++                                 G NPISWGAKKQ+TVSRSST+AEYRALAST +EL W
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK---------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW

Query:  VRQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM
        +RQ+L+DL +     P+LWCDN S IQLA NP+FH +TKHVEIDFHF+RERV+RKDI+L ++ST+ QLADLF K +T  RL FLRSKL+
Subjt:  VRQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM

XP_030964983.1 uncharacterized protein LOC115986279, partial [Quercus lobata]2.1e-4451.74Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ
        M SP  +HL AAKR+LRY+ GT+  G+   +GP                              NPISW +KKQ+TVSRSST+AEYRALA+TA+EL W+RQ
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ

Query:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAGFRLR
        L RDL L     PVLWCDN SAI LA NP+FH+RTKHVE+D+HF+RERV+RKD+ + FVS +  LAD+FTKPL     L  R+KLM       S+  RLR
Subjt:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAGFRLR

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A2N9FIJ5 Reverse transcriptase Ty1/copia-type domain-containing protein1.0e-4454.3Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ
        MH P  +HL+AAKRILRY+ GT+  G+  + G                               NPI+W +KKQ TVSRSST+AEYRALA+ A+EL W+RQ
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ

Query:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM
        +L DL +   T P +WCDN SAI LA NP+FHSRTKH+E+D+HF+RERVVR D+ LHF+STE QLADLFTKPLT  R   L SKLM
Subjt:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM

A0A2N9FXA9 Uncharacterized protein2.2e-4453.23Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ
        M++P   HL AAKRILRY+ GT+D GL    GP                              NPI+W AKKQ TVSRSST+AEYRALAS ++E+ W+R 
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ

Query:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM
        LLRDL +    PP+LWCDN SA+ +A NP+FH+RTKH+E+DFHFIRERV+RKD+++ FVST  QLAD+FTK L++ R   L+SKLM
Subjt:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM

A0A2N9HW05 Uncharacterized protein5.9e-4551.02Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ
        MHSP   HL+AAKRILRY+ G++D G+  + GP                              NPI+W +KKQ TVSRSST+AEYR+LA+ A+EL W+RQ
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGP------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQ

Query:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAG
        +LRD+ L   + PV+WCDN SA+ LA NP+FH RTKH+ +DFHF+RERVVR DI+LHF+ST+ Q+ADLFTK  ++ R   LRSKL+ S      AG
Subjt:  LLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAG

A0A2N9IIJ0 Uncharacterized protein5.9e-4559.39Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDD-----GLCLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLVGVTPPVLWCDNASAIQL
        M SP   HL+AAKRILRY+ G  DD     GL +  GPNPI+W AKKQ TVSRSST++EYRALA  ++EL W R LL+DL +     P+LWCDN SA+ +
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDD-----GLCLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLVGVTPPVLWCDNASAIQL

Query:  ARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDR
        A NP+FH+RTKH+E+DFHF+RERV+RKD+ + FVST  QLAD+FTK L   R L LRS LM + R
Subjt:  ARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDR

A0A6J1DGS4 uncharacterized protein LOC1110199458.8e-4956.08Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK---------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW
        MH P   HL AAKRILRY+ GTVD GL  ++                                 G NPISWGAKKQ+TVSRSST+AEYRALAST +EL W
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK---------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW

Query:  VRQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM
        +RQ+L+DL +     P+LWCDN S IQLA NP+FH +TKHVEIDFHF+RERV+RKDI+L ++ST+ QLADLF K +T  RL FLRSKL+
Subjt:  VRQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLM

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-2337.29Show/hide
Query:  KRILRYIVGTVDDGLCLKKGP----------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLV
        KR+LRY+ GT+D  L  KK                                    N I W  K+Q++V+ SST+AEY AL     E  W++ LL  + + 
Subjt:  KRILRYIVGTVDDGLCLKKGP----------------------------------NPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLV

Query:  GVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKL
           P  ++ DN   I +A NP  H R KH++I +HF RE+V    I L ++ TE QLAD+FTKPL AAR + LR KL
Subjt:  GVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-1833.14Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGPNP-------------------------------ISWGAKKQSTVSRSSTKAEYRALASTASELFWVR
        + +P   H  A K ILRY+ GT  D LC   G +P                               ISW +K Q  V+ S+T+AEY A   T  E+ W++
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGPNP-------------------------------ISWGAKKQSTVSRSSTKAEYRALASTASELFWVR

Query:  QLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTK
        + L++L L      V++CD+ SAI L++N ++H+RTKH+++ +H+IRE V  + +++  +ST    AD+ TK
Subjt:  QLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTK

P92519 Uncharacterized mitochondrial protein AtMg008101.1e-0839.8Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW
        MH P  +     KR+LRY+ GT+  GL + K                               G N ISW AK+Q TVSRSST+ EYRALA TA+EL W
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-3343.01Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVR
        MH P   HL A KRILRY+ GT + G+ LKK                               G +PISW +KKQ  V RSST+AEYR++A+T+SE+ W+ 
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVR

Query:  QLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKL
         LL +L +    PPV++CDN  A  L  NP+FHSR KH+ ID+HFIR +V    +R+  VST  QLAD  TKPL+        SK+
Subjt:  QLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-3244Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVR
        MH P   H  A KR+LRY+ GT D G+ LKK                               G +PISW +KKQ  V RSST+AEYR++A+T+SEL W+ 
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVR

Query:  QLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLT
         LL +L +    PPV++CDN  A  L  NP+FHSR KH+ +D+HFIR +V    +R+  VST  QLAD  TKPL+
Subjt:  QLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVVRKDIRLHFVSTELQLADLFTKPLT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.8e-2339.6Show/hide
Query:  SPLHSHLIAAKRILRYIVGTVDDGL-------------------------------CLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQL
        +P  +H  A  +IL YI GTV  GL                               C+  G + ISW +KKQ  VS+SS +AEYRAL+    E+ W+ Q 
Subjt:  SPLHSHLIAAKRILRYIVGTVDDGL-------------------------------CLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQL

Query:  LRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVV
         R+LQL    P +L+CDN +AI +A N +FH RTKH+E D H +RER V
Subjt:  LRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEIDFHFIRERVV

ATMG00810.1 DNA/RNA polymerases superfamily protein8.0e-1039.8Show/hide
Query:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW
        MH P  +     KR+LRY+ GT+  GL + K                               G N ISW AK+Q TVSRSST+ EYRALA TA+EL W
Subjt:  MHSPLHSHLIAAKRILRYIVGTVDDGLCLKK-------------------------------GPNPISWGAKKQSTVSRSSTKAEYRALASTASELFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCCCCTCTTCATTCTCATTTGATTGCTGCAAAACGAATACTTCGATATATTGTTGGTACTGTTGATGATGGTTTATGTTTAAAGAAAGGTCCAAATCCGATTTC
TTGGGGCGCTAAGAAGCAATCTACCGTTTCCCGTAGCTCTACCAAGGCTGAGTATAGAGCTCTGGCTTCCACTGCCTCTGAGTTATTCTGGGTCCGACAGCTTCTTCGTG
ATCTTCAGCTTGTTGGTGTTACGCCACCTGTTTTGTGGTGTGACAATGCATCGGCTATACAACTTGCTAGGAATCCTATCTTCCATAGTCGTACGAAGCACGTGGAAATA
GACTTTCATTTTATTCGGGAAAGAGTTGTGCGGAAAGATATCAGACTGCATTTTGTTTCCACTGAACTACAACTTGCTGACTTATTTACAAAACCACTTACTGCAGCTCG
CCTTCTGTTTCTTCGGTCCAAACTCATGCCATCTGATCGTTCACTATCTTCTGCTGGCTTTCGTTTGAGGGGGATGTTAACTGATGGAAATATTTGGTTTAATATTTGTT
ATTCTGTTATGTTATGGTTATACTGCATTTGTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGCATTCCCCTCTTCATTCTCATTTGATTGCTGCAAAACGAATACTTCGATATATTGTTGGTACTGTTGATGATGGTTTATGTTTAAAGAAAGGTCCAAATCCGATTTC
TTGGGGCGCTAAGAAGCAATCTACCGTTTCCCGTAGCTCTACCAAGGCTGAGTATAGAGCTCTGGCTTCCACTGCCTCTGAGTTATTCTGGGTCCGACAGCTTCTTCGTG
ATCTTCAGCTTGTTGGTGTTACGCCACCTGTTTTGTGGTGTGACAATGCATCGGCTATACAACTTGCTAGGAATCCTATCTTCCATAGTCGTACGAAGCACGTGGAAATA
GACTTTCATTTTATTCGGGAAAGAGTTGTGCGGAAAGATATCAGACTGCATTTTGTTTCCACTGAACTACAACTTGCTGACTTATTTACAAAACCACTTACTGCAGCTCG
CCTTCTGTTTCTTCGGTCCAAACTCATGCCATCTGATCGTTCACTATCTTCTGCTGGCTTTCGTTTGAGGGGGATGTTAACTGATGGAAATATTTGGTTTAATATTTGTT
ATTCTGTTATGTTATGGTTATACTGCATTTGTCTATAA
Protein sequenceShow/hide protein sequence
MHSPLHSHLIAAKRILRYIVGTVDDGLCLKKGPNPISWGAKKQSTVSRSSTKAEYRALASTASELFWVRQLLRDLQLVGVTPPVLWCDNASAIQLARNPIFHSRTKHVEI
DFHFIRERVVRKDIRLHFVSTELQLADLFTKPLTAARLLFLRSKLMPSDRSLSSAGFRLRGMLTDGNIWFNICYSVMLWLYCICL