CuGenDBv2

Moc09g14040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g14040
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr9:11978400..11979052
RNA-Seq Expression: Moc09g14040
Synteny: Moc09g14040
Gene Ontology terms:
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
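
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the field can be split and the span length computed as below. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:11978400..11979052")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 653 bp on chr9.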


Homology
GenBank top hits | e value | % identity | Alignment
KAA0032769.1 uncharacterized protein E6C27_scaffold708G00140 [Cucumis melo var. makuwa] | 3.6e-49 | 49.11
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYFKAT T +EE KVTLATMHL+ DAKLW                           QFFPENVE +A+RKLR+LRHTG I++YV QF  +MLDI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQS------
        DMSEKDKV  FVEGLKPWA+ KLYEQ+VQDL +A A AERL + +S+S  +++N  +  R N+  +P + K+ G DKRP G +  P + +   +      
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQS------

Query:  -QNAQKPISCFLCKGPHHVAECPH
            ++P+SCF+C+GPH   ECP+
Subjt:  -QNAQKPISCFLCKGPHHVAECPH

KAA0045217.1 uncharacterized protein E6C27_scaffold30G002260 [Cucumis melo var. makuwa] | 3.6e-49 | 48.88
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYF+AT T +EE KVTLATMHL+ DAKLW                           QFFPENVE +A+RKLREL+HTG+I++YV QF  +MLDI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGS-----YPQSQ
        DMSEKDKV  FVEGLKPWA+TKLYEQ+VQDL +A A AERL + S+ S  ++++  + + G++  +P + K+ G D+R  G        +      P +Q
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGS-----YPQSQ

Query:  N-AQKPISCFLCKGPHHVAECPH
        N + +P+SCF+CKGPH   ECP+
Subjt:  N-AQKPISCFLCKGPHHVAECPH

KAA0060053.1 uncharacterized protein E6C27_scaffold160G00160 [Cucumis melo var. makuwa] | 1.6e-49 | 49.78
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYF+AT T +EE KVTLATMHL+ DAKLW                           QFFPENVE +A+RKLREL+HTG+I++YV QF  +MLDI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQG---PNADPSRGSYPQSQN-
        DMSEKDKV  FVEGLKPWA+TKLYEQ+VQDL +A A AERL + S++S  ++++  + +RG++  +P + KS G DKR  G    +   +  S+  S N 
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQG---PNADPSRGSYPQSQN-

Query:  --AQKPISCFLCKGPHHVAECPH
          + +P+SCF+CK PH   ECP+
Subjt:  --AQKPISCFLCKGPHHVAECPH

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | 1.1e-79 | 71.89
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYFKATGTTSEEMKVTLATMHLT DAKLW                          GQFFP+NVEFMA+RKLRELRHTGTI+DYV QF  VM+DI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKP
        DMSEKDKV +F++GLK WARTKLYEQ+VQDLATAMA AERLL+Y+SE S  KKN  NPT GNKTFKPFT KSGGADKRPQGPN  PSRG YPQSQNAQ+ 
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKP

Query:  ISCFLCKGPHHVAECPH
        +S FLCKGPH VAECPH
Subjt:  ISCFLCKGPHHVAECPH

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia] | 7.0e-77 | 71.3
Query:  FDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDICD
        FD+EQYFK TGT SE MKVTLATMHLT DAKLW                          GQFF +NVEFMA+RKLRELRHTGTI+DYV QF  VMLDI D
Subjt:  FDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDICD

Query:  MSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKPI
        MSEKDKV +F+EGLK WARTKLYEQ+VQDLATAMA AERLL+YSSE S  KKN  NPT GNKTFKPFT K GGADKRP GPN  PSRG YPQSQNAQ+P 
Subjt:  MSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKPI

Query:  SCFLCKGPHHVAECPH
        SCFLC+GPH VAECPH
Subjt:  SCFLCKGPHHVAECPH

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TTJ4 Reverse transcriptase domain-containing protein | 1.7e-49 | 48.88
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYF+AT T +EE KVTLATMHL+ DAKLW                           QFFPENVE +A+RKLREL+HTG+I++YV QF  +MLDI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGS-----YPQSQ
        DMSEKDKV  FVEGLKPWA+TKLYEQ+VQDL +A A AERL + S+ S  ++++  + + G++  +P + K+ G D+R  G        +      P +Q
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGS-----YPQSQ

Query:  N-AQKPISCFLCKGPHHVAECPH
        N + +P+SCF+CKGPH   ECP+
Subjt:  N-AQKPISCFLCKGPHHVAECPH

A0A5A7UVQ2 Retrotrans_gag domain-containing protein | 7.8e-50 | 49.78
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYF+AT T +EE KVTLATMHL+ DAKLW                           QFFPENVE +A+RKLREL+HTG+I++YV QF  +MLDI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQG---PNADPSRGSYPQSQN-
        DMSEKDKV  FVEGLKPWA+TKLYEQ+VQDL +A A AERL + S++S  ++++  + +RG++  +P + KS G DKR  G    +   +  S+  S N 
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQG---PNADPSRGSYPQSQN-

Query:  --AQKPISCFLCKGPHHVAECPH
          + +P+SCF+CK PH   ECP+
Subjt:  --AQKPISCFLCKGPHHVAECPH

A0A5A7UXR6 Reverse transcriptase | 1.7e-49 | 49.11
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYF+AT T +EE KVTLATMHL+ DAKLW                           QFFPENVE +A+RKLREL+HTG+I++YV QF  +MLDIC
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKR-------PQGPNADPSRGSYPQ
        DMSEKDKV  FVEGLKPWA+TKLYEQ+VQDL +A A AERL + S++S  ++++  + + G++  +P + K+ G D+R        Q    +  RGS  Q
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKR-------PQGPNADPSRGSYPQ

Query:  SQNAQKPISCFLCKGPHHVAECPH
        +  + +P+SCF+CKGPH   E P+
Subjt:  SQNAQKPISCFLCKGPHHVAECPH

A0A6J1D906 Reverse transcriptase | 4.3e-80 | 72.35
Query:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC
        +FD+EQYFKATGTTSEEMKVTLATMHLT DAKLW                          GQFFP+NVEFMA+RKLRELRHTGTI+DYV QF  VM+DI 
Subjt:  MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDIC

Query:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKP
        DMSEKDKV +F+EGLK WARTKLYEQ+VQDLATAMA AERLL+Y+SE S  KKN  NPT GNKTFKPFT KSGGADKRPQGPN  PSRG YPQSQNAQ+ 
Subjt:  DMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKP

Query:  ISCFLCKGPHHVAECPH
        +S FLCKGPH VAECPH
Subjt:  ISCFLCKGPHHVAECPH

A0A6J1DK29 uncharacterized protein LOC111021829 | 3.4e-77 | 71.3
Query:  FDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDICD
        FD+EQYFK TGT SE MKVTLATMHLT DAKLW                          GQFF +NVEFMA+RKLRELRHTGTI+DYV QF  VMLDI D
Subjt:  FDIEQYFKATGTTSEEMKVTLATMHLTVDAKLW--------------------------GQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDICD

Query:  MSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKPI
        MSEKDKV +F+EGLK WARTKLYEQ+VQDLATAMA AERLL+YSSE S  KKN  NPT GNKTFKPFT K GGADKRP GPN  PSRG YPQSQNAQ+P 
Subjt:  MSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMAVAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKPI

Query:  SCFLCKGPHHVAECPH
        SCFLC+GPH VAECPH
Subjt:  SCFLCKGPHHVAECPH

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGTTCGATATAGAACAATACTTCAAGGCTACCGGGACAACGTCAGAAGAAATGAAAGTGACTTTGGCCACCATGCACCTTACTGTTGATGCAAAGCTGTGGGGTCAGTT
CTTCCCCGAGAATGTCGAGTTCATGGCTAAAAGGAAGCTACGTGAACTCCGGCACACTGGAACAATTCAGGACTATGTGAACCAATTCCCTACCGTGATGCTGGATATTT
GCGACATGTCAGAGAAAGACAAGGTGCTCCTCTTTGTTGAAGGGTTGAAACCATGGGCCAGAACAAAGCTGTATGAACAGAAAGTGCAAGACCTTGCCACCGCCATGGCC
GTTGCAGAACGATTACTAAACTATAGCAGTGAGTCGTCCCAATCGAAAAAGAACACTCCAAACCCCACTAGGGGAAACAAGACGTTCAAACCCTTTACCCTAAAGAGTGG
GGGAGCTGACAAGAGACCCCAAGGCCCAAACGCCGATCCTTCTCGAGGATCGTATCCACAAAGTCAAAACGCTCAAAAGCCAATATCATGTTTCTTGTGCAAGGGCCCTC
ACCACGTAGCTGAATGCCCGCACTGA
mRNA sequence
ATGTTCGATATAGAACAATACTTCAAGGCTACCGGGACAACGTCAGAAGAAATGAAAGTGACTTTGGCCACCATGCACCTTACTGTTGATGCAAAGCTGTGGGGTCAGTT
CTTCCCCGAGAATGTCGAGTTCATGGCTAAAAGGAAGCTACGTGAACTCCGGCACACTGGAACAATTCAGGACTATGTGAACCAATTCCCTACCGTGATGCTGGATATTT
GCGACATGTCAGAGAAAGACAAGGTGCTCCTCTTTGTTGAAGGGTTGAAACCATGGGCCAGAACAAAGCTGTATGAACAGAAAGTGCAAGACCTTGCCACCGCCATGGCC
GTTGCAGAACGATTACTAAACTATAGCAGTGAGTCGTCCCAATCGAAAAAGAACACTCCAAACCCCACTAGGGGAAACAAGACGTTCAAACCCTTTACCCTAAAGAGTGG
GGGAGCTGACAAGAGACCCCAAGGCCCAAACGCCGATCCTTCTCGAGGATCGTATCCACAAAGTCAAAACGCTCAAAAGCCAATATCATGTTTCTTGTGCAAGGGCCCTC
ACCACGTAGCTGAATGCCCGCACTGA
Protein sequence
MFDIEQYFKATGTTSEEMKVTLATMHLTVDAKLWGQFFPENVEFMAKRKLRELRHTGTIQDYVNQFPTVMLDICDMSEKDKVLLFVEGLKPWARTKLYEQKVQDLATAMA
VAERLLNYSSESSQSKKNTPNPTRGNKTFKPFTLKSGGADKRPQGPNADPSRGSYPQSQNAQKPISCFLCKGPHHVAECPH
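
The protein sequence above can be checked against the CDS by reading the CDS three nucleotides at a time. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. It translates the first 30 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence in this record.
print(translate("ATGTTCGATATAGAACAATACTTCAAGGCT"))  # MFDIEQYFKA
```

The printed residues match the start of the protein sequence listed in this record. Passing the full CDS, with its line breaks, works the same way because whitespace is removed before translation.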