; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g25800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g25800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:19435477..19438745
RNA-Seq ExpressionMoc06g25800
SyntenyMoc06g25800
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]2.5e-3039.84Show/hide
Query:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRS--------SAVLLHDSWL-----SG
        L  G+E    G KVA+ +VCLPF+EG LGI      N A  +K+L    T  GSLWV W+ AYIL+G  LW V S         A+L     +       
Subjt:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRS--------SAVLLHDSWL-----SG

Query:  VRFFCGLVIESFMMLLALP-----WLRLCL--------VEAVAGRMDVS---VWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLA
        V +      E+ +     P     W R+ L        V+ V+  + VS   VWVP   G FS++SAW  +      V +  LLW  GNIPKHSF AWLA
Subjt:  VRFFCGLVIESFMMLLALP-----WLRLCL--------VEAVAGRMDVS---VWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLA

Query:  VCDQLSTRDHLRRWGAVVHSDCVFCT-GQESRDHLSFECPFSTVVW
        + D+L TRD L RW + +   C+ C  G ESRDHL F CPF   VW
Subjt:  VCDQLSTRDHLRRWGAVVHSDCVFCT-GQESRDHLSFECPFSTVVW

KAB1205646.1 hypothetical protein CJ030_MR7G017818 [Morella rubra]8.0e-2938.53Show/hide
Query:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAG-SLWVTWVTAYILRGACLWTVRSSAVLLHDSWLSGVRFFCGLVIESF
        L +G E    G KVA+ +VCLP +EG LG+C +A  N A+ +K +W  F     S+W  WV AY+LRG   W V+   +    SW             ++
Subjt:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAG-SLWVTWVTAYILRGACLWTVRSSAVLLHDSWLSGVRFFCGLVIESF

Query:  MMLLALPWLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-
          LL+   L+L   +    + D   W  +S G FS+ SA+  LR  RP V +  L+W  G IPK+SFI WLAV + L+T+D L   G V    CV C G 
Subjt:  MMLLALPWLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-

Query:  QESRDHLSFECPFSTVVW
        +ES DHL F C F++ +W
Subjt:  QESRDHLSFECPFSTVVW

XP_022158861.1 uncharacterized protein LOC111025324 [Momordica charantia]8.6e-3158.72Show/hide
Query:  VEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQESRDHLSFECPFS
        V  V G+ D  VW P   G FSVSS WG+LR  RP VSY  LLWF GNIPKHSFI+WLA+ D+L TR+ LR+W A+V + CVFC G ESRDHL  +CP+S
Subjt:  VEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQESRDHLSFECPFS

Query:  TVVWDAQVS
          VW   +S
Subjt:  TVVWDAQVS

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]9.1e-3336.36Show/hide
Query:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRG--------------ACLWTVRSSAVLLHDSWLS
        L  G+E    GAKVA+ EVCLPF EG L I   +  N A+ +K+LWL    +GSLWV WV AYIL+G              +C+  + S  ++L   W  
Subjt:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRG--------------ACLWTVRSSAVLLHDSWLS

Query:  GVRFFCGLV------------------------IESFMM--------LLALP----WLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVS
             CG +                        +  FM+        L++L     W  +  V   +   D  VWVP SL  FS++SAW  +R     V 
Subjt:  GVRFFCGLV------------------------IESFMM--------LLALP----WLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVS

Query:  YIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVWDAQVSLFV
        +  LLW  GNIPKHSF AWLA+ D+L TRD L +W   +   C+ C G  ESRDHL F CPF   +W +++ LF+
Subjt:  YIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVWDAQVSLFV

XP_034928674.1 uncharacterized protein LOC118059820 [Populus alba]3.0e-2829.66Show/hide
Query:  NAPIDQKLQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRSSAV----
        +A +   + + E   ++ L +G     SGAKVA++ +C P  EG LGI  +   N AA +K +W   +   S+WVTWV + +LRG   W ++   +    
Subjt:  NAPIDQKLQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRSSAV----

Query:  --------------------------LLHDSWLSGVRFFCGLVIESFMMLLALPWLRLCLVEAVAGR-------------------------MDVSVWVP
                                  L  D WL   +  C L+    +    LPW        +AGR                         +D  +W  
Subjt:  --------------------------LLHDSWLSGVRFFCGLVIESFMMLLALPWLRLCLVEAVAGR-------------------------MDVSVWVP

Query:  HSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVW
         + G F++ SAW +LR +RP  S   L+WF G+ P+H+FI W+A  D+L T D L  +     S C+ C  Q E+ DHL F CPFS+ VW
Subjt:  HSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVW

TrEMBL top hitse value%identityAlignment
A0A2N9G6U4 Reverse transcriptase domain-containing protein2.5e-2829.93Show/hide
Query:  LQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRSSA------------
        ++  E + ++ L +G+ +   G +VA+ +VCLP +EG LG+  +   N AA +K +W  FT +GSLWV W+  ++++  C WTV+ ++            
Subjt:  LQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRSSA------------

Query:  ------------------VLLHDSW-LSGVRFF-------------CGLVIESFMMLLALPW------------LRLCLVEAVAGRMDVSVWVPHSLGYF
                           L HD W   G+ +                  + S +      W             +LCL+    G  D +VW   S G F
Subjt:  ------------------VLLHDSW-LSGVRFF-------------CGLVIESFMMLLALPW------------LRLCLVEAVAGRMDVSVWVPHSLGYF

Query:  SVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVW
        S ++ W  LR     VS+  LLWF  +IP+HSFI WLA+ ++L T++ + +WG +V  +CVFC    E+R+H+ FEC FS  +W
Subjt:  SVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVW

A0A2N9HQ55 Very-long-chain 3-oxoacyl-CoA synthase2.5e-2829.93Show/hide
Query:  LQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRSSA------------
        ++  E + ++ L +G+ +   G +VA+ +VCLP +EG LG+  +   N AA +K +W  FT +GSLWV W+  ++++  C WTV+ ++            
Subjt:  LQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRSSA------------

Query:  ------------------VLLHDSW-LSGVRFF-------------CGLVIESFMMLLALPW------------LRLCLVEAVAGRMDVSVWVPHSLGYF
                           L HD W   G+ +                  + S +      W             +LCL+    G  D +VW   S G F
Subjt:  ------------------VLLHDSW-LSGVRFF-------------CGLVIESFMMLLALPW------------LRLCLVEAVAGRMDVSVWVPHSLGYF

Query:  SVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVW
        S ++ W  LR     VS+  LLWF  +IP+HSFI WLA+ ++L T++ + +WG +V  +CVFC    E+R+H+ FEC FS  +W
Subjt:  SVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQ-ESRDHLSFECPFSTVVW

A0A5A7TZS0 Reverse transcriptase domain-containing protein1.2e-3039.84Show/hide
Query:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRS--------SAVLLHDSWL-----SG
        L  G+E    G KVA+ +VCLPF+EG LGI      N A  +K+L    T  GSLWV W+ AYIL+G  LW V S         A+L     +       
Subjt:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLWVTWVTAYILRGACLWTVRS--------SAVLLHDSWL-----SG

Query:  VRFFCGLVIESFMMLLALP-----WLRLCL--------VEAVAGRMDVS---VWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLA
        V +      E+ +     P     W R+ L        V+ V+  + VS   VWVP   G FS++SAW  +      V +  LLW  GNIPKHSF AWLA
Subjt:  VRFFCGLVIESFMMLLALP-----WLRLCL--------VEAVAGRMDVS---VWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLA

Query:  VCDQLSTRDHLRRWGAVVHSDCVFCT-GQESRDHLSFECPFSTVVW
        + D+L TRD L RW + +   C+ C  G ESRDHL F CPF   VW
Subjt:  VCDQLSTRDHLRRWGAVVHSDCVFCT-GQESRDHLSFECPFSTVVW

A0A6A1UZY0 zf-RVT domain-containing protein3.9e-2938.53Show/hide
Query:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAG-SLWVTWVTAYILRGACLWTVRSSAVLLHDSWLSGVRFFCGLVIESF
        L +G E    G KVA+ +VCLP +EG LG+C +A  N A+ +K +W  F     S+W  WV AY+LRG   W V+   +    SW             ++
Subjt:  LTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAG-SLWVTWVTAYILRGACLWTVRSSAVLLHDSWLSGVRFFCGLVIESF

Query:  MMLLALPWLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-
          LL+   L+L   +    + D   W  +S G FS+ SA+  LR  RP V +  L+W  G IPK+SFI WLAV + L+T+D L   G V    CV C G 
Subjt:  MMLLALPWLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-

Query:  QESRDHLSFECPFSTVVW
        +ES DHL F C F++ +W
Subjt:  QESRDHLSFECPFSTVVW

A0A6J1E271 uncharacterized protein LOC1110253244.1e-3158.72Show/hide
Query:  VEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQESRDHLSFECPFS
        V  V G+ D  VW P   G FSVSS WG+LR  RP VSY  LLWF GNIPKHSFI+WLA+ D+L TR+ LR+W A+V + CVFC G ESRDHL  +CP+S
Subjt:  VEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQESRDHLSFECPFS

Query:  TVVWDAQVS
          VW   +S
Subjt:  TVVWDAQVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.5e-1440Show/hide
Query:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-QESRDHLSFECPFSTVVW
        FS +     L      V +   +WF  ++PKH+FI W+   ++L TRD LR WG  + + C+ C    ESR HL FECPF   VW
Subjt:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-QESRDHLSFECPFSTVVW

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.8e-1339.02Show/hide
Query:  SSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFC-TGQESRDHLSFECPFSTVVW
        ++ W  L      V +   +WF G IPKH+FIAW+ +  +L T+D +  WG +    C+FC T  E+R HL F+C F+  VW
Subjt:  SSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFC-TGQESRDHLSFECPFSTVVW

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.3e-1640Show/hide
Query:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-QESRDHLSFECPFSTVVW
        FS    W  L     +V +   +WF  ++PKH+FI W+   ++L TRD L+ WG  + ++C+ C    +SR HL FEC FS VVW
Subjt:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-QESRDHLSFECPFSTVVW

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.3e-1240.74Show/hide
Query:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-QESRDHLSFECPFS
        FS ++ W  L      V +   +WF G IPKH+FI+W+ +  +L TRD L  WG  V S C+ C    E+R HL F+C F+
Subjt:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTG-QESRDHLSFECPFS

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.2e-1539.13Show/hide
Query:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCT-GQESRDHLSFECPFSTVVWDAQVSLF
        FS    W  +R   P+V +  ++WF   IP+ S I W++  ++L TRD LR WG  + S  V C+ G E+  HL FEC FS  +W+   S F
Subjt:  FSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSFIAWLAVCDQLSTRDHLRRWGAVVHSDCVFCT-GQESRDHLSFECPFSTVVWDAQVSLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGAGCTCGAAATTGCTGAGCTCGGTTCCGACGTTGATCGGATGTTCAAATTCTTCCAAAAACTTGATTCTCGTTTTACCCGTTATGAGCTAGCAAGGGGACCTAA
TGCACCTATAGATCAGAAGCTCCAACGATGTGAGATTAATCGGTCAAACTCATTGACCGAGGGTCAGGAGAGTGACTCGTCGGGAGCTAAGGTGGCGTACTCTGAAGTGT
GTCTCCCTTTCCAGGAGGGTGATTTGGGGATTTGTCACTTGGCTTTGGGAAACTGTGCTGCTGCTATGAAGCTCCTTTGGCTTACTTTTACGTGTGCAGGTTCTTTGTGG
GTGACCTGGGTGACGGCTTATATCCTTCGAGGGGCGTGTCTCTGGACTGTGCGTTCGTCGGCAGTTCTCCTGCATGATTCTTGGTTGTCGGGGGTCCGATTCTTCTGCGG
TTTGGTGATCGAGTCATTTATGATGCTGCTAGCTCTACCTTGGCTAAGGTTATGTCTTGTGGAGGCGGTGGCTGGGCGTATGGATGTTTCGGTTTGGGTCCCACATTCGT
TGGGCTATTTCTCGGTGTCCAGTGCATGGGGGGTGTTGCGGCGTTCTCGCCCGTCCGTTTCTTACATTGCTCTTCTGTGGTTTTGGGGGAACATCCCTAAGCACTCTTTT
ATTGCTTGGTTGGCGGTTTGTGATCAGTTATCTACGCGTGATCATCTTCGACGTTGGGGTGCAGTTGTTCATTCTGATTGTGTGTTTTGCACTGGTCAGGAGTCTCGTGA
CCATCTTTCCTTTGAGTGTCCTTTCAGTACGGTAGTCTGGGATGCTCAAGTCAGCTTGTTCGTCGACGTTTGTGGCGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGAGCTCGAAATTGCTGAGCTCGGTTCCGACGTTGATCGGATGTTCAAATTCTTCCAAAAACTTGATTCTCGTTTTACCCGTTATGAGCTAGCAAGGGGACCTAA
TGCACCTATAGATCAGAAGCTCCAACGATGTGAGATTAATCGGTCAAACTCATTGACCGAGGGTCAGGAGAGTGACTCGTCGGGAGCTAAGGTGGCGTACTCTGAAGTGT
GTCTCCCTTTCCAGGAGGGTGATTTGGGGATTTGTCACTTGGCTTTGGGAAACTGTGCTGCTGCTATGAAGCTCCTTTGGCTTACTTTTACGTGTGCAGGTTCTTTGTGG
GTGACCTGGGTGACGGCTTATATCCTTCGAGGGGCGTGTCTCTGGACTGTGCGTTCGTCGGCAGTTCTCCTGCATGATTCTTGGTTGTCGGGGGTCCGATTCTTCTGCGG
TTTGGTGATCGAGTCATTTATGATGCTGCTAGCTCTACCTTGGCTAAGGTTATGTCTTGTGGAGGCGGTGGCTGGGCGTATGGATGTTTCGGTTTGGGTCCCACATTCGT
TGGGCTATTTCTCGGTGTCCAGTGCATGGGGGGTGTTGCGGCGTTCTCGCCCGTCCGTTTCTTACATTGCTCTTCTGTGGTTTTGGGGGAACATCCCTAAGCACTCTTTT
ATTGCTTGGTTGGCGGTTTGTGATCAGTTATCTACGCGTGATCATCTTCGACGTTGGGGTGCAGTTGTTCATTCTGATTGTGTGTTTTGCACTGGTCAGGAGTCTCGTGA
CCATCTTTCCTTTGAGTGTCCTTTCAGTACGGTAGTCTGGGATGCTCAAGTCAGCTTGTTCGTCGACGTTTGTGGCGGTTAG
Protein sequenceShow/hide protein sequence
MFELEIAELGSDVDRMFKFFQKLDSRFTRYELARGPNAPIDQKLQRCEINRSNSLTEGQESDSSGAKVAYSEVCLPFQEGDLGICHLALGNCAAAMKLLWLTFTCAGSLW
VTWVTAYILRGACLWTVRSSAVLLHDSWLSGVRFFCGLVIESFMMLLALPWLRLCLVEAVAGRMDVSVWVPHSLGYFSVSSAWGVLRRSRPSVSYIALLWFWGNIPKHSF
IAWLAVCDQLSTRDHLRRWGAVVHSDCVFCTGQESRDHLSFECPFSTVVWDAQVSLFVDVCGG