; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:19164681..19167193
RNA-Seq ExpressionMoc08g26580
SyntenyMoc08g26580
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]4.4e-1842.86Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M  AQ  +  +PLS     +  LA   FW  V  E ASLRQKS+V+WL+LGDQN+AFFH+SVR ++ RN L SL ++   RV +   +AQM V ++   L
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTEC
        GS+ + Y  L+  +  I+ F    EC
Subjt:  GSEPVSYHALTDQLTSILDFVLPTEC

XP_022149381.1 uncharacterized protein LOC111017811 [Momordica charantia]6.9e-2439.6Show/hide
Query:  GLVTNTWKNSFFGASLLDAEVYRLATFLGFRWR--------------RLSRRDCKLFLEWIVSRVRSWSARMLSFAGQFYRVFRSIGPMCLSFRLELFMM
        GLV NT K+SFFG  + D EV +LA F  F                 RLS  DC+  LE IVSRV SWSARM SFA +  ++ +SI     S      + 
Subjt:  GLVTNTWKNSFFGASLLDAEVYRLATFLGFRWR--------------RLSRRDCKLFLEWIVSRVRSWSARMLSFAGQFYRVFRSIGPMCLSFRLELFMM

Query:  SSIFYVLSCGR---------WLGWLGLRWLGWKLIFFVGIVFGPFVLRLGCFGVCKIFFLCGMLFDLWFSLLLEMVFL-----------------GRVYV
         +    L  G          ++GWL + WL      +VG +               IFFL  MLFDLWF L LEMV +                  R  V
Subjt:  SSIFYVLSCGR---------WLGWLGLRWLGWKLIFFVGIVFGPFVLRLGCFGVCKIFFLCGMLFDLWFSLLLEMVFL-----------------GRVYV

Query:  SIWILASSGLFSVLSAWGVLRLARPLIHWFSLVWFDGSIPKHSFITWWCV
         +WI  SSGLFSV SAW VLR  +PL+ WFS +WF G+I KHSFI W  V
Subjt:  SIWILASSGLFSVLSAWGVLRLARPLIHWFSLVWFDGSIPKHSFITWWCV

XP_022157428.1 uncharacterized protein LOC111024128 [Momordica charantia]1.3e-3362.6Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M AAQ+LLL+DP SI +QEEER+A R+FW W   E ASLRQKSRV WLSLGD NSAFFH+SVRG+I  N+L SLT+   + V +R+EIA++ V FYR LL
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTECSVDLC
        GSE V Y  LT +L +I+DFV P EC V+LC
Subjt:  GSEPVSYHALTDQLTSILDFVLPTECSVDLC

XP_022158199.1 uncharacterized protein LOC111024737 [Momordica charantia]1.9e-3458.96Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M A Q+LLL+ P SI +QEEER+A R+FW W   E ASLRQKS+V+WLSLGDQNS FFH+ VR +IVRN+L SLT+   + V +R+EIA++ V FYR L+
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSI
        GSE V Y  LT +L +I+DFV P EC V+LCR +
Subjt:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSI

XP_022159081.1 uncharacterized protein LOC111025522 [Momordica charantia]1.9e-3758.94Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M AAQ+LLL+DP SI +QEEER+A R+FW W   E ASLRQKS V+WLSLGDQNSAFFH+SVRG+IVRN+L SLT+ A Q V +R+EIA++ V FYR LL
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSIWRGLVTNTWKNSFFGAS
        GS+ + Y  LT +L +I+DFV P EC V+LC   W       W+  F  AS
Subjt:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSIWRGLVTNTWKNSFFGAS

TrEMBL top hitse value%identityAlignment
A0A5A7TZS0 Reverse transcriptase domain-containing protein2.1e-1842.86Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M  AQ  +  +PLS     +  LA   FW  V  E ASLRQKS+V+WL+LGDQN+AFFH+SVR ++ RN L SL ++   RV +   +AQM V ++   L
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTEC
        GS+ + Y  L+  +  I+ F    EC
Subjt:  GSEPVSYHALTDQLTSILDFVLPTEC

A0A6J1D875 uncharacterized protein LOC1110178113.4e-2439.6Show/hide
Query:  GLVTNTWKNSFFGASLLDAEVYRLATFLGFRWR--------------RLSRRDCKLFLEWIVSRVRSWSARMLSFAGQFYRVFRSIGPMCLSFRLELFMM
        GLV NT K+SFFG  + D EV +LA F  F                 RLS  DC+  LE IVSRV SWSARM SFA +  ++ +SI     S      + 
Subjt:  GLVTNTWKNSFFGASLLDAEVYRLATFLGFRWR--------------RLSRRDCKLFLEWIVSRVRSWSARMLSFAGQFYRVFRSIGPMCLSFRLELFMM

Query:  SSIFYVLSCGR---------WLGWLGLRWLGWKLIFFVGIVFGPFVLRLGCFGVCKIFFLCGMLFDLWFSLLLEMVFL-----------------GRVYV
         +    L  G          ++GWL + WL      +VG +               IFFL  MLFDLWF L LEMV +                  R  V
Subjt:  SSIFYVLSCGR---------WLGWLGLRWLGWKLIFFVGIVFGPFVLRLGCFGVCKIFFLCGMLFDLWFSLLLEMVFL-----------------GRVYV

Query:  SIWILASSGLFSVLSAWGVLRLARPLIHWFSLVWFDGSIPKHSFITWWCV
         +WI  SSGLFSV SAW VLR  +PL+ WFS +WF G+I KHSFI W  V
Subjt:  SIWILASSGLFSVLSAWGVLRLARPLIHWFSLVWFDGSIPKHSFITWWCV

A0A6J1DTC3 uncharacterized protein LOC1110241286.1e-3462.6Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M AAQ+LLL+DP SI +QEEER+A R+FW W   E ASLRQKSRV WLSLGD NSAFFH+SVRG+I  N+L SLT+   + V +R+EIA++ V FYR LL
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTECSVDLC
        GSE V Y  LT +L +I+DFV P EC V+LC
Subjt:  GSEPVSYHALTDQLTSILDFVLPTECSVDLC

A0A6J1DYP6 uncharacterized protein LOC1110247379.4e-3558.96Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M A Q+LLL+ P SI +QEEER+A R+FW W   E ASLRQKS+V+WLSLGDQNS FFH+ VR +IVRN+L SLT+   + V +R+EIA++ V FYR L+
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSI
        GSE V Y  LT +L +I+DFV P EC V+LCR +
Subjt:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSI

A0A6J1E2U5 uncharacterized protein LOC1110255229.1e-3858.94Show/hide
Query:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL
        M AAQ+LLL+DP SI +QEEER+A R+FW W   E ASLRQKS V+WLSLGDQNSAFFH+SVRG+IVRN+L SLT+ A Q V +R+EIA++ V FYR LL
Subjt:  MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELL

Query:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSIWRGLVTNTWKNSFFGAS
        GS+ + Y  LT +L +I+DFV P EC V+LC   W       W+  F  AS
Subjt:  GSEPVSYHALTDQLTSILDFVLPTECSVDLCRSIWRGLVTNTWKNSFFGAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.7e-0834.34Show/hide
Query:  QSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELLGSE
        QS LL +P S S    E +A +++  +     +  RQKSR++WL  GD N+ FFH+ +     +N +  L      RV N +++ +M+V +Y  LLGS+
Subjt:  QSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELLGSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCTGCCCAATCGTTGTTGTTAAATGATCCTTTGTCTATTTCTTCTCAAGAGGAGGAGAGGCTGGCAGTGAGGGAGTTTTGGGTTTGGGTTGGGAAAGAG
GCGGCGTCGCTTCGTCAGAAGTCTCGGGTCCAGTGGCTTTCTCTAGGTGACCAGAATTCTGCTTTCTTCCATCAGAGTGTTCGAGGTCAGATTGTTCGTAATGAG
TTAACTTCGTTGACTAATGCTGCGAGTCAGCGGGTTATTAATCGTTCTGAGATTGCTCAGATGGTGGTTCGCTTTTATCGTGAACTGTTGGGTTCTGAACCTGTG
AGTTATCATGCCTTGACTGATCAGCTTACTAGTATCCTTGATTTTGTTTTGCCGACTGAGTGCAGTGTAGACTTATGTCGTTCGATCTGGCGGGGTCTTGTCACG
AATACTTGGAAGAACTCATTTTTCGGGGCGAGCCTTCTTGATGCAGAGGTGTATAGGTTGGCGACTTTCTTAGGTTTTCGATGGCGTCGGCTTTCTCGACGCGAT
TGTAAGCTTTTTCTTGAGTGGATTGTCTCTCGTGTTCGAAGTTGGTCGGCTCGGATGCTTTCTTTTGCTGGTCAGTTCTACAGAGTTTTTAGGTCTATTGGGCCA
ATGTGTTTATCCTTCCGGCTCGAGTTGTTCATGATGTCGAGCATATTCTACGTTCTTTCTTGTGGAAGGTGGCTTGGGTGGCTTGGTCTGAGGTGGCTTGGGTGG
AAGCTTATATTCTTCGTGGGGATTGTATTTGGACCGTTCGTGCTTCGCCTCGGTTGTTTTGGTGTTTGCAAGATATTCTTTCTATGTGGGATGCTTTTCGACCTC
TGGTTTAGTTTGCTATTAGAGATGGTTTTCTTAGGGAGGGTTTATGTGTCCATTTGGATTCTGGCATCATCAGGTCTCTTCTCGGTGTTGAGTGCGTGGGGTGTG
TTGCGGCTAGCCCGACCTCTTATTCATTGGTTTTCTTTGGTTTGGTTCGATGGGAGCATTCCTAAGCATTCTTTCATTACTTGGTGGTGCGTGATCGTCTTGTTA
CGAGGGATCACTTATGTCGTTTGGACTCTTCTGTTTCTATTTCCTGTGTGTTTTGTGCTGGCCTGGAGTCTCGGTATCAGTGCCCCTTTAGTTGAGAGTGCCAAG
AAGTCTGCTCATTGTCGTGTGTGGCGCTTGGCATGGACGTCAGTTGTTTCTTTCATTTGGAGGAAGCGTAATGCTAGAGTTAATGCATGTGGGGTGGGCAGGTCG
TCTTCTGTCCTCCTGCACGCTCTAAGAGCTGCTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCTGCCCAATCGTTGTTGTTAAATGATCCTTTGTCTATTTCTTCTCAAGAGGAGGAGAGGCTGGCAGTGAGGGAGTTTTGGGTTTGGGTTGGGAAAGAG
GCGGCGTCGCTTCGTCAGAAGTCTCGGGTCCAGTGGCTTTCTCTAGGTGACCAGAATTCTGCTTTCTTCCATCAGAGTGTTCGAGGTCAGATTGTTCGTAATGAG
TTAACTTCGTTGACTAATGCTGCGAGTCAGCGGGTTATTAATCGTTCTGAGATTGCTCAGATGGTGGTTCGCTTTTATCGTGAACTGTTGGGTTCTGAACCTGTG
AGTTATCATGCCTTGACTGATCAGCTTACTAGTATCCTTGATTTTGTTTTGCCGACTGAGTGCAGTGTAGACTTATGTCGTTCGATCTGGCGGGGTCTTGTCACG
AATACTTGGAAGAACTCATTTTTCGGGGCGAGCCTTCTTGATGCAGAGGTGTATAGGTTGGCGACTTTCTTAGGTTTTCGATGGCGTCGGCTTTCTCGACGCGAT
TGTAAGCTTTTTCTTGAGTGGATTGTCTCTCGTGTTCGAAGTTGGTCGGCTCGGATGCTTTCTTTTGCTGGTCAGTTCTACAGAGTTTTTAGGTCTATTGGGCCA
ATGTGTTTATCCTTCCGGCTCGAGTTGTTCATGATGTCGAGCATATTCTACGTTCTTTCTTGTGGAAGGTGGCTTGGGTGGCTTGGTCTGAGGTGGCTTGGGTGG
AAGCTTATATTCTTCGTGGGGATTGTATTTGGACCGTTCGTGCTTCGCCTCGGTTGTTTTGGTGTTTGCAAGATATTCTTTCTATGTGGGATGCTTTTCGACCTC
TGGTTTAGTTTGCTATTAGAGATGGTTTTCTTAGGGAGGGTTTATGTGTCCATTTGGATTCTGGCATCATCAGGTCTCTTCTCGGTGTTGAGTGCGTGGGGTGTG
TTGCGGCTAGCCCGACCTCTTATTCATTGGTTTTCTTTGGTTTGGTTCGATGGGAGCATTCCTAAGCATTCTTTCATTACTTGGTGGTGCGTGATCGTCTTGTTA
CGAGGGATCACTTATGTCGTTTGGACTCTTCTGTTTCTATTTCCTGTGTGTTTTGTGCTGGCCTGGAGTCTCGGTATCAGTGCCCCTTTAGTTGAGAGTGCCAAG
AAGTCTGCTCATTGTCGTGTGTGGCGCTTGGCATGGACGTCAGTTGTTTCTTTCATTTGGAGGAAGCGTAATGCTAGAGTTAATGCATGTGGGGTGGGCAGGTCG
TCTTCTGTCCTCCTGCACGCTCTAAGAGCTGCTATTTGA
Protein sequenceShow/hide protein sequence
MGAAQSLLLNDPLSISSQEEERLAVREFWVWVGKEAASLRQKSRVQWLSLGDQNSAFFHQSVRGQIVRNELTSLTNAASQRVINRSEIAQMVVRFYRELLGSEPV
SYHALTDQLTSILDFVLPTECSVDLCRSIWRGLVTNTWKNSFFGASLLDAEVYRLATFLGFRWRRLSRRDCKLFLEWIVSRVRSWSARMLSFAGQFYRVFRSIGP
MCLSFRLELFMMSSIFYVLSCGRWLGWLGLRWLGWKLIFFVGIVFGPFVLRLGCFGVCKIFFLCGMLFDLWFSLLLEMVFLGRVYVSIWILASSGLFSVLSAWGV
LRLARPLIHWFSLVWFDGSIPKHSFITWWCVIVLLRGITYVVWTLLFLFPVCFVLAWSLGISAPLVESAKKSAHCRVWRLAWTSVVSFIWRKRNARVNACGVGRS
SSVLLHALRAAI