CuGenDBv2

Moc02g19080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g19080
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:14169085..14170753
RNA-Seq Expression: Moc02g19080
Synteny: Moc02g19080
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits    e value    % identity    Alignment
KAF4366980.1 hypothetical protein F8388_022768 [Cannabis sativa]    5.6e-24    35.58
Query:  GVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMG
        G WD+E+V  HF   +   I  IPI +   D L W Y   G + VKSGYRV ++  +      S++  +  W K FWK+QLP ++K+FGWR+C + LP  
Subjt:  GVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMG

Query:  ENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFVR
         NL  RG+++  +C  CG   E   H  W+C K+++VW  + +     +LG     D+   +R
Subjt:  ENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFVR

ONK66393.1 uncharacterized protein A4U43_C06F7380 [Asparagus officinalis]    3.3e-24    37.23
Query:  LSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQAL-----ISQGPSSSSLDSVLQWSKG
        +S   +P N+ V DL  PS  W+VE +R  F+  EA  ILSIP+  +  VDKL+W Y K G +SVKSGY V  QA       + G S  +     Q  K 
Subjt:  LSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQAL-----ISQGPSSSSLDSVLQWSKG

Query:  FWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQL-GGADFTDMFNFV
         W + LP+KIK+F WR C   +P  + L  + + V   C+ C    E  +H  W C ++++VW  + F H  C +    DF  +FN V
Subjt:  FWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQL-GGADFTDMFNFV

XP_022143319.1 uncharacterized protein LOC111013220 [Momordica charantia]    2.0e-26    45.59
Query:  FVDEEATAILSIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDV
        F  +E   ILSIP+G+    D+LIW++EK GI +VKS Y++          S+S  + + +W K  W++ LPSKIK+F WR CLD LP G NL  RG+DV
Subjt:  FVDEEATAILSIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDV

Query:  LSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHL
             FCG  GE A+H+FW C   +     SKFSHL
Subjt:  LSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]    2.2e-36    48.52
Query:  LSVPTLPRNSFVCDLCT-PSGVWDVEKVRAHFVDEEATAILSIPIG-VSPVDKLIWDYEKRGIFSVKSGYRV-LQQALISQGPSSSSLDSVLQWSKGFWK
        LS P LP  S V  L     G W  + VR  F  +EA  ILSIPIG  +  D+LIW+YEK G++SV+SGY+V L      Q PSSSS + V  W  GFWK
Subjt:  LSVPTLPRNSFVCDLCT-PSGVWDVEKVRAHFVDEEATAILSIPIG-VSPVDKLIWDYEKRGIFSVKSGYRV-LQQALISQGPSSSSLDSVLQWSKGFWK

Query:  IQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLT
        + +P+KIK+F WRLCLD LP G NL  RG+++ + C FCG  GE ++H+FW C     +W+ SKF  L+
Subjt:  IQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLT

XP_030505362.1 uncharacterized protein LOC115720349 [Cannabis sativa]    6.6e-25    32.95
Query:  LPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIK
        L  N+ +  L T  G W  + + A+F  ++   IL  P+ +   D L W    +G + VKSGYRV ++  +      S++D +  W K +W +QLP +IK
Subjt:  LPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIK

Query:  IFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFV
        +FGW+LC + LP   NL  RG+ +  +C  CG   E   H  W+C K + VW    +    C+  G     MF+F+
Subjt:  IFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFV

TrEMBL top hits    e value    % identity    Alignment
A0A6J1CNZ5 uncharacterized protein LOC111013220    9.9e-27    45.59
Query:  FVDEEATAILSIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDV
        F  +E   ILSIP+G+    D+LIW++EK GI +VKS Y++          S+S  + + +W K  W++ LPSKIK+F WR CLD LP G NL  RG+DV
Subjt:  FVDEEATAILSIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDV

Query:  LSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHL
             FCG  GE A+H+FW C   +     SKFSHL
Subjt:  LSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHL

A0A6J1DAR4 uncharacterized protein LOC111018954    1.1e-36    48.52
Query:  LSVPTLPRNSFVCDLCT-PSGVWDVEKVRAHFVDEEATAILSIPIG-VSPVDKLIWDYEKRGIFSVKSGYRV-LQQALISQGPSSSSLDSVLQWSKGFWK
        LS P LP  S V  L     G W  + VR  F  +EA  ILSIPIG  +  D+LIW+YEK G++SV+SGY+V L      Q PSSSS + V  W  GFWK
Subjt:  LSVPTLPRNSFVCDLCT-PSGVWDVEKVRAHFVDEEATAILSIPIG-VSPVDKLIWDYEKRGIFSVKSGYRV-LQQALISQGPSSSSLDSVLQWSKGFWK

Query:  IQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLT
        + +P+KIK+F WRLCLD LP G NL  RG+++ + C FCG  GE ++H+FW C     +W+ SKF  L+
Subjt:  IQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLT

A0A803PPQ0 Uncharacterized protein    4.2e-25    35.15
Query:  LSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPS-SSSLDSVLQWSKGFWKIQ
        L  P L   + + DL    G W ++K++ HF +E+   +  IPI +   D L W Y   G + VKSGYR+ ++  I+  P+ SS+++ + +W K  W + 
Subjt:  LSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPS-SSSLDSVLQWSKGFWKIQ

Query:  LPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSH
        LP ++K+FGWR+C + LP   NL  RG+DV   C  CG   E   H  W C K++ +W    + H
Subjt:  LPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSH

A0A803PR93 Uncharacterized protein    1.6e-24    34.55
Query:  LSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPS-SSSLDSVLQWSKGFWKIQ
        L  P L   + + +L    G W ++K++ HF +E+   +  IPI +   D L W Y   G + VKSGYR+ ++  I+  P+ SS+++ + +W K  W + 
Subjt:  LSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPS-SSSLDSVLQWSKGFWKIQ

Query:  LPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSH
        LP ++K+FGWR+C + LP   NL  RG+DV   C  CG   E   H  W C K++ +W    + H
Subjt:  LPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSH

A0A803PUH4 Uncharacterized protein    3.2e-25    32.95
Query:  LPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIK
        L  N+ +  L T  G W  + + A+F  ++   IL  P+ +   D L W    +G + VKSGYRV ++  +      S++D +  W K +W +QLP +IK
Subjt:  LPRNSFVCDLCTPSGVWDVEKVRAHFVDEEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIK

Query:  IFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFV
        +FGW+LC + LP   NL  RG+ +  +C  CG   E   H  W+C K + VW    +    C+  G     MF+F+
Subjt:  IFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFV

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein    3.1e-12    27.54
Query:  WDVEKVRAHFVDEEATAIL-SIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMG
        WD  K+ + FVD+     +  I +  S   DK+IW+Y   G ++V+SGY +L     +  P+ +     +      W + +  K+K F WR    +L   
Subjt:  WDVEKVRAHFVDEEATAIL-SIPIGVS-PVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMG

Query:  ENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADF----TDMFNFVR
        E L  RG+ +   C  C    E   H  + C      W  S  S +  QL   DF    +++ NFV+
Subjt:  ENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGGADF----TDMFNFVR

AT3G25270.1 Ribonuclease H-like superfamily protein    2.5e-06    34.78
Query:  WKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSH
        WK++   KIK F W+L   +L  G+NL+ R +     C  C    E + H+F+ C   ++VW AS   H
Subjt:  WKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSH

AT4G29090.1 Ribonuclease H-like superfamily protein    3.6e-13    32.52
Query:  VCDLCTPSG-VWDVEKVRAHFVDEEATAILSI-PIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQ-GPSSSSLDSVLQWSKGFWKIQLPSKIKIFG
        V DL   SG  W  + +   F + E   I  + P G   +D   WDY   G ++VKSGY VL Q +  +  P   S  S+    +  WK Q   KI+ F 
Subjt:  VCDLCTPSG-VWDVEKVRAHFVDEEATAILSI-PIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQ-GPSSSSLDSVLQWSKGFWKIQLPSKIKIFG

Query:  WRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGG
        W+   +SLP+   L  R L   S C  C    E   H+ + CT  R  W     S +   LGG
Subjt:  WRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGAMHVFWACTKIRRVWVASKFSHLTCQLGG


Sequences
CDS sequence
ATGGATTCCGCTGCGACGCTACAAGAGATCATGTTTGTTTCGCAACCCCAGTCGAAAAAGGAGGAAGGCATGAGGAGTGCCACACCCGTTATTTGGGTCTTCCTTCGTTT
ATGCCTCGAAACCGGGGCGGGCATTTGCACTTTATCCAGGACCGTATCTGGTCTCATCTTCAAGGCTGGAAAGATTACCGAAGAGCTTGATCCACGATATTCAGATGATG
ATGGCCCGTTTTTGTCGGTCCCAACTTTACCACGTAATAGCTTTGTTTGTGATTTATGTACTCCATCGGGCGTGTGGGATGTGGAGAAAGTTAGGGCACATTTTGTGGAT
GAGGAAGCTACAGCGATCCTGTCAATTCCAATTGGGGTTAGCCCGGTTGATAAGCTTATTTGGGATTATGAAAAAAGAGGGATTTTTTCGGTCAAGAGCGGGTATCGGGT
TCTGCAGCAAGCTTTGATTTCCCAAGGTCCCTCATCTTCGTCTTTGGATTCAGTGTTGCAATGGTCGAAGGGCTTTTGGAAAATTCAACTCCCCAGCAAAATTAAAATCT
TTGGTTGGCGTTTATGCCTTGACAGCCTGCCGATGGGGGAAAATCTCCAAGCTCGGGGTCTGGATGTGTTGTCTATTTGCAGATTCTGTGGATGCACAGGGGAAGGTGCG
ATGCATGTTTTCTGGGCTTGTACAAAAATCCGACGAGTGTGGGTTGCCTCCAAATTCTCACATCTCACCTGTCAATTGGGGGGTGCTGACTTCACTGATATGTTTAACTT
TGTTCGTGGGTGGAAGGAGCTAGTTTCCTAG
mRNA sequence
ATGGATTCCGCTGCGACGCTACAAGAGATCATGTTTGTTTCGCAACCCCAGTCGAAAAAGGAGGAAGGCATGAGGAGTGCCACACCCGTTATTTGGGTCTTCCTTCGTTT
ATGCCTCGAAACCGGGGCGGGCATTTGCACTTTATCCAGGACCGTATCTGGTCTCATCTTCAAGGCTGGAAAGATTACCGAAGAGCTTGATCCACGATATTCAGATGATG
ATGGCCCGTTTTTGTCGGTCCCAACTTTACCACGTAATAGCTTTGTTTGTGATTTATGTACTCCATCGGGCGTGTGGGATGTGGAGAAAGTTAGGGCACATTTTGTGGAT
GAGGAAGCTACAGCGATCCTGTCAATTCCAATTGGGGTTAGCCCGGTTGATAAGCTTATTTGGGATTATGAAAAAAGAGGGATTTTTTCGGTCAAGAGCGGGTATCGGGT
TCTGCAGCAAGCTTTGATTTCCCAAGGTCCCTCATCTTCGTCTTTGGATTCAGTGTTGCAATGGTCGAAGGGCTTTTGGAAAATTCAACTCCCCAGCAAAATTAAAATCT
TTGGTTGGCGTTTATGCCTTGACAGCCTGCCGATGGGGGAAAATCTCCAAGCTCGGGGTCTGGATGTGTTGTCTATTTGCAGATTCTGTGGATGCACAGGGGAAGGTGCG
ATGCATGTTTTCTGGGCTTGTACAAAAATCCGACGAGTGTGGGTTGCCTCCAAATTCTCACATCTCACCTGTCAATTGGGGGGTGCTGACTTCACTGATATGTTTAACTT
TGTTCGTGGGTGGAAGGAGCTAGTTTCCTAG
Protein sequence
MDSAATLQEIMFVSQPQSKKEEGMRSATPVIWVFLRLCLETGAGICTLSRTVSGLIFKAGKITEELDPRYSDDDGPFLSVPTLPRNSFVCDLCTPSGVWDVEKVRAHFVD
EEATAILSIPIGVSPVDKLIWDYEKRGIFSVKSGYRVLQQALISQGPSSSSLDSVLQWSKGFWKIQLPSKIKIFGWRLCLDSLPMGENLQARGLDVLSICRFCGCTGEGA
MHVFWACTKIRRVWVASKFSHLTCQLGGADFTDMFNFVRGWKELVS
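
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` and the `X` placeholder for unrecognised codons are illustrative choices and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: the standard nuclear code applies.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS string into one-letter amino acids ('*' marks a stop).

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as wrapped lines. A trailing partial codon is ignored, and any
    codon with a non-TCAG character is reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First 27 bases of the CDS listed above.
    print(translate("ATGGATTCCGCTGCGACGCTACAAGAG"))  # MDSAATLQE
```

Passing the full CDS (with or without line breaks) and dropping the trailing `*` should give a string that can be compared directly with the protein sequence in this record.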