CuGenDBv2

Moc03g00970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g00970
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:706952..707685
RNA-Seq Expression: Moc03g00970
Synteny: Moc03g00970
Gene Ontology terms: NA
InterPro domains: NA
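
The genome location field can be turned into a chromosome name and a span length. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `Location` are illustrative names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr3:706952..707685'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("chr3:706952..707685")
print(loc.chrom, loc.start, loc.end, loc.length)  # chr3 706952 707685 734
```

Under the inclusive-coordinate assumption, the gene spans 734 bp on chr3.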


Homology
GenBank top hits (e value, % identity, alignment)
KAD4180550.1 hypothetical protein E3N88_29141 [Mikania micrantha] (e value: 1.1e-17; % identity: 36.36)
Query:  EMLACVESDTTFV-DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY
        E   C + D   V      W+VDS A+ HV+S R +YTS      G ++  N  LSK+  + ++  + D+G ELVLH V+++P   +NLISAG LD++GY
Subjt:  EMLACVESDTTFV-DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY

Query:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
         + F    WKL RG  +V  G R + +Y     +S+ S    +H  VN    +L
Subjt:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

KAD5318290.1 hypothetical protein E3N88_18236 [Mikania micrantha] (e value: 3.6e-18; % identity: 35.71)
Query:  EMLACVESD-TTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY
        E   C + D          W+VDS A+ HV+S R +YTS      G+ +  N  LSK+  + ++  + D+G ELVLH V+++P   +NLISAG LD++GY
Subjt:  EMLACVESD-TTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY

Query:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
         + F    WKL  G  +V  G R + +Y+    +S+ S    +H  VN    DL
Subjt:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

KAF5753470.1 putative RNA-directed DNA polymerase [Helianthus annuus] (e value: 2.3e-17; % identity: 38.46)
Query:  EMLACVESDTTFV---DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDE
        E   C + D   +   DPS  W+VDS A+ HV+S R +Y+S      G+++  N  LSKV  + +V  + D+G ELVLH V+++P   +NLISAG LD++
Subjt:  EMLACVESDTTFV---DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDE

Query:  GYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
        GY S F    WKL RG  +V  G R + +Y     +S+ S    ++  VN    DL
Subjt:  GYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

KAF5762848.1 putative RNA-directed DNA polymerase [Helianthus annuus] (e value: 2.3e-17; % identity: 38.46)
Query:  EMLACVESDTTFV---DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDE
        E   C + D   +   DPS  W+VDS A+ HV+S R +Y+S      G+++  N  LSKV  + +V  + D+G ELVLH V+++P   +NLISAG LD++
Subjt:  EMLACVESDTTFV---DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDE

Query:  GYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
        GY S F    WKL RG  +V  G R + +Y     +S+ S    ++  VN    DL
Subjt:  GYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

KAF5800639.1 putative RNA-directed DNA polymerase [Helianthus annuus] (e value: 2.3e-17; % identity: 38.46)
Query:  EMLACVESDTTFV---DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDE
        E   C + D   +   DPS  W+VDS A+ HV+S R +Y+S      G+++  N  LSKV  + +V  + D+G ELVLH V+++P   +NLISAG LD++
Subjt:  EMLACVESDTTFV---DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDE

Query:  GYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
        GY S F    WKL RG  +V  G R + +Y     +S+ S    ++  VN    DL
Subjt:  GYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

TrEMBL top hits (e value, % identity, alignment)
A0A438HJE1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.3e-17; % identity: 36.17)
Query:  VDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEFVKNRWKLIR
        +   ++WI+DS AS HV+S   ++TS      G +R  N  +SK+ ++ ++   +++G +L+L  VR++P   +NLISAGKLDDEGY + F   +WKL +
Subjt:  VDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEFVKNRWKLIR

Query:  GFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
        G  VV  G +   +Y+++  + +G    VM+   N S  DL
Subjt:  GFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

A0A4D8YHC2 CCHC-type domain-containing protein (e value: 5.6e-17; % identity: 33.74)
Query:  ACVESDTTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEF
        ACV S     D    WIVDS AS H++  R  + S    S G +R  N  +++V  I ++   +D+G +LVL  VR++P   +N+IS GKLDD+GY++ F
Subjt:  ACVESDTTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEF

Query:  VKNRWKLIRGFEVVVVGHREALVYVLRFGVSRG--------SDRQVMHRTV-NISREDLKELA
         + +WKLI+G  +   G ++  +Y++   +S G        S  ++ H+ + ++S++ L+ LA
Subjt:  VKNRWKLIRGFEVVVVGHREALVYVLRFGVSRG--------SDRQVMHRTV-NISREDLKELA

A0A4D9AG59 CCHC-type domain-containing protein (e value: 4.3e-17; % identity: 34.36)
Query:  ACVESDTTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEF
        ACV S     D    WIVDS AS H+   R  + S   SS G +R  N  +++V  I  +   +D+G +LVL  VR++P   +N+IS GKLDD+GY++ F
Subjt:  ACVESDTTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEF

Query:  VKNRWKLIRGFEVVVVGHREALVYVLRFGVSRG--------SDRQVMHRTV-NISREDLKELA
         + +WKLI+G  +   G ++  +Y++   +S G        S  ++ H+ + ++S++ L+ LA
Subjt:  VKNRWKLIRGFEVVVVGHREALVYVLRFGVSRG--------SDRQVMHRTV-NISREDLKELA

A0A5N6N2Q3 Integrase catalytic domain-containing protein (e value: 5.1e-18; % identity: 36.36)
Query:  EMLACVESDTTFV-DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY
        E   C + D   V      W+VDS A+ HV+S R +YTS      G ++  N  LSK+  + ++  + D+G ELVLH V+++P   +NLISAG LD++GY
Subjt:  EMLACVESDTTFV-DPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY

Query:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
         + F    WKL RG  +V  G R + +Y     +S+ S    +H  VN    +L
Subjt:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

A0A5N6NVJ4 Uncharacterized protein (e value: 1.7e-18; % identity: 35.71)
Query:  EMLACVESD-TTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY
        E   C + D          W+VDS A+ HV+S R +YTS      G+ +  N  LSK+  + ++  + D+G ELVLH V+++P   +NLISAG LD++GY
Subjt:  EMLACVESD-TTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGY

Query:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL
         + F    WKL  G  +V  G R + +Y+    +S+ S    +H  VN    DL
Subjt:  LSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDL

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.4e-12; % identity: 31.97)
Query:  PSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEFVKNRWKLIRGF
        P  EW+VD+AAS H +  R  +   +    G ++  N   SK+  I ++  +++ G  LVL  VR++P   +NLIS   LD +GY S F   +W+L +G 
Subjt:  PSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELVLHGVRYIPSFEINLISAGKLDDEGYLSEFVKNRWKLIRGF

Query:  EVVVVGHREALVYVLRFGVSRG
         V+  G     +Y     + +G
Subjt:  EVVVVGHREALVYVLRFGVSRG

Arabidopsis top hits (e value, % identity, alignment)
No hits found
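
The % identity column can be related to the middle row of each alignment, where a letter marks an identical residue and `+` marks a conservative substitution. The sketch below assumes the BLAST-style definition (identical positions divided by aligned columns, gaps included); the page does not state how its values were computed, and `percent_identity` is an illustrative helper name.

```python
from typing import Iterable


def percent_identity(midline_rows: Iterable[str], aligned_columns: int) -> float:
    """Estimate % identity from the middle rows of a pairwise alignment.

    midline_rows: the rows between 'Query:' and 'Subjt:' (one per block).
    aligned_columns: total alignment length, counting gap columns.
    """
    if aligned_columns <= 0:
        raise ValueError("aligned_columns must be positive")
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_columns


# Toy example: four identical positions over eight aligned columns.
print(round(percent_identity(["AC +D", " E"], 8), 2))  # 50.0
```

Applied to a hit above, the rows passed in would be the two middle rows of its alignment, and `aligned_columns` the combined length of its Query rows.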

Sequences
CDS sequence
ATGGGAGAACTCTATGGAGTCAGCATTGGTTGCTCAGAGCAAGGGCAAAGGGAAGATGAATTATGGGAAACAGTAGAGACATGGCAACAGGAGTGGGAGCGGCGAGGTGA
AATGTTAGCATGCGTTGAGAGTGACACAACATTTGTGGATCCATCATTAGAGTGGATAGTGGACAGTGCAGCATCGGTGCATGTATCTTCAGACAGGAGTTGGTACACGT
CTCCCATGACAAGCAGTCATGGCATGATGAGGACAAGGAATGGGAGACTCTCCAAGGTTAGAGAAATTATGGAGGTTCGTCGGAGGAGTGATAGTGGGACCGAGTTAGTT
TTGCATGGTGTCAGGTACATACCCAGTTTCGAGATAAATTTGATATCAGCGGGGAAGTTGGATGACGAAGGCTATTTAAGTGAGTTTGTAAAAAATAGGTGGAAACTCAT
AAGGGGATTCGAGGTAGTGGTTGTTGGCCACAGAGAAGCTTTAGTATACGTATTGAGGTTTGGCGTTTCCAGAGGATCAGATAGACAGGTTATGCATAGGACTGTAAATA
TTTCAAGAGAAGACTTGAAAGAATTAGCAGCATTGACAATCAGGACTGTTCAGGAGAATCTGCCATTAGTTCAAATACAATAG
mRNA sequence
ATGGGAGAACTCTATGGAGTCAGCATTGGTTGCTCAGAGCAAGGGCAAAGGGAAGATGAATTATGGGAAACAGTAGAGACATGGCAACAGGAGTGGGAGCGGCGAGGTGA
AATGTTAGCATGCGTTGAGAGTGACACAACATTTGTGGATCCATCATTAGAGTGGATAGTGGACAGTGCAGCATCGGTGCATGTATCTTCAGACAGGAGTTGGTACACGT
CTCCCATGACAAGCAGTCATGGCATGATGAGGACAAGGAATGGGAGACTCTCCAAGGTTAGAGAAATTATGGAGGTTCGTCGGAGGAGTGATAGTGGGACCGAGTTAGTT
TTGCATGGTGTCAGGTACATACCCAGTTTCGAGATAAATTTGATATCAGCGGGGAAGTTGGATGACGAAGGCTATTTAAGTGAGTTTGTAAAAAATAGGTGGAAACTCAT
AAGGGGATTCGAGGTAGTGGTTGTTGGCCACAGAGAAGCTTTAGTATACGTATTGAGGTTTGGCGTTTCCAGAGGATCAGATAGACAGGTTATGCATAGGACTGTAAATA
TTTCAAGAGAAGACTTGAAAGAATTAGCAGCATTGACAATCAGGACTGTTCAGGAGAATCTGCCATTAGTTCAAATACAATAG
Protein sequence
MGELYGVSIGCSEQGQREDELWETVETWQQEWERRGEMLACVESDTTFVDPSLEWIVDSAASVHVSSDRSWYTSPMTSSHGMMRTRNGRLSKVREIMEVRRRSDSGTELV
LHGVRYIPSFEINLISAGKLDDEGYLSEFVKNRWKLIRGFEVVVVGHREALVYVLRFGVSRGSDRQVMHRTVNISREDLKELAALTIRTVQENLPLVQIQ
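
The relationship between the CDS and the protein sequence can be checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; the page does not state which translation table was used, and `translate` is an illustrative helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases (e.g. N) become 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS above.
print(translate("ATGGGAGAACTCTAT"))  # MGELY
```

Passing the full CDS (line breaks are ignored) and comparing the result with the protein sequence is a quick consistency check on the record.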