CuGenDBv2

Moc04g10020 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g10020
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:7438764..7442692
RNA-Seq Expression: Moc04g10020
Synteny: Moc04g10020
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR025724 - GAG-pre-integrase domain
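The genome location field uses a `chromosome:start..end` notation. As a rough sketch, it can be split into its parts and turned into a span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:7438764..7442692")
print(chrom, start, end, span_length(start, end))
```

If the database instead uses 0-based half-open coordinates, the `+ 1` in `span_length` should be dropped.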


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KZV18236.1 hypothetical protein F511_28102 [Dorcoceras hygrometricum] (e-value: 1.7e-39; identity: 65.32%)
Query:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG
        M++ARG+K GTLY+  + +D +A VD  + + LWH RL HMSEKGMK+L S GKL  LK+V+ K+CE CIFGKQ +VSFSK G  PK+ KLELVH DVWG
Subjt:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG

Query:  PSEVSSIGGSRYYVTFIDDSSRKL
        PS V S+GGSRYYVTFIDDSSRK+
Subjt:  PSEVSSIGGSRYYVTFIDDSSRKL

RVW30183.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 1.9e-38; identity: 65.85%)
Query:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP
        V+ARG+K GTLY+    +D IAV D S+ T LWH RL HMSEKGMK+L S GKL  LK++   +CE CI GKQ RVSF KTG TPK+ KLELVH D+WGP
Subjt:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP

Query:  SEVSSIGGSRYYVTFIDDSSRKL
        S V+S+GGSRYY+TFIDDSSRK+
Subjt:  SEVSSIGGSRYYVTFIDDSSRKL

RVW35576.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 1.9e-38; identity: 65.85%)
Query:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP
        V+ARG+K GTLY+    +D IAV D S+ T LWH RL HMSEKGMK+L S GKL  LK++   +CE CI GKQ +VSF KTG TPKS KLELVH D+WGP
Subjt:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP

Query:  SEVSSIGGSRYYVTFIDDSSRKL
        S V+S+GGSRYY+TFIDDSSRK+
Subjt:  SEVSSIGGSRYYVTFIDDSSRKL

RVW58633.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 2.4e-38; identity: 66.39%)
Query:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP
        V+ARG+K GTLY+    +D IAV D S+ T LWH RL HMSEKGMKIL S GKL  LK++   +CE CI GKQ +VSF KTG TPK+ KLELVH D+WGP
Subjt:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP

Query:  SEVSSIGGSRYYVTFIDDSSRK
        S V+S+GGSRYY+TFIDDSSRK
Subjt:  SEVSSIGGSRYYVTFIDDSSRK

RVW70488.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 1.4e-38; identity: 65.85%)
Query:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP
        V+ARG+K GTLY+    +D IAVVD S+ T LWH RL HMSEKGMK+L S GKL  LK++   +CE CI GKQ +VSF KTG TPK+ KLELVH D+WGP
Subjt:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP

Query:  SEVSSIGGSRYYVTFIDDSSRKL
        S V+S+GGSRYY+TFIDDSSRK+
Subjt:  SEVSSIGGSRYYVTFIDDSSRKL

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A2N9GTI4 Uncharacterized protein (e-value: 6.9e-39; identity: 64.52%)
Query:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG
        MV+ARG+K GTLY+  + +D IAV +  + T LWH RL HMSEKGMK+L S GKL  LK+V+  +CE CI GKQ +VSF K G TPKS KLELVH D+WG
Subjt:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG

Query:  PSEVSSIGGSRYYVTFIDDSSRKL
        PS ++S+GGSRYYVTFIDDSSRK+
Subjt:  PSEVSSIGGSRYYVTFIDDSSRKL

A0A2N9IKI1 Uncharacterized protein (e-value: 6.9e-39; identity: 64.52%)
Query:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG
        MV+ARG+K GTLY+  + +D IAV +  + T LWH RL HMSEKGMK+L S GKL  LK+V+  +CE CI GKQ +VSF K G TPKS KLELVH D+WG
Subjt:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG

Query:  PSEVSSIGGSRYYVTFIDDSSRKL
        PS ++S+GGSRYYVTFIDDSSRK+
Subjt:  PSEVSSIGGSRYYVTFIDDSSRKL

A0A2Z7AGM8 Integrase catalytic domain-containing protein (e-value: 8.2e-40; identity: 65.32%)
Query:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG
        M++ARG+K GTLY+  + +D +A VD  + + LWH RL HMSEKGMK+L S GKL  LK+V+ K+CE CIFGKQ +VSFSK G  PK+ KLELVH DVWG
Subjt:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG

Query:  PSEVSSIGGSRYYVTFIDDSSRKL
        PS V S+GGSRYYVTFIDDSSRK+
Subjt:  PSEVSSIGGSRYYVTFIDDSSRKL

A0A438GE83 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 6.9e-39; identity: 65.85%)
Query:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP
        V+ARG+K GTLY+    +D IAVVD S+ T LWH RL HMSEKGMK+L S GKL  LK++   +CE CI GKQ +VSF KTG TPK+ KLELVH D+WGP
Subjt:  VIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGP

Query:  SEVSSIGGSRYYVTFIDDSSRKL
        S V+S+GGSRYY+TFIDDSSRK+
Subjt:  SEVSSIGGSRYYVTFIDDSSRKL

A0A5B7BAK4 Uncharacterized protein (e-value: 1.1e-41; identity: 67.74%)
Query:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG
        MV+ARG+K GTLYV  N +D I + +  S + LWH RL HMS+KGMK+LHS GKL+GLK+V   LCE CIFGKQ +VSFSK G TPK+ KLELVH DVWG
Subjt:  MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWG

Query:  PSEVSSIGGSRYYVTFIDDSSRKL
        PS VSS+GGS YYVTFIDDS+RK+
Subjt:  PSEVSSIGGSRYYVTFIDDSSRKL

SwissProt top hits (columns: e-value, % identity, alignment)
P04146 Copia protein (e-value: 1.3e-05; identity: 30.61%)
Query:  HSSQTQLWHSRLRHMSEKGM-----KILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSK-TGSTPKSTKLELVHPDVWGPSEVSSIGGSRYYVTFID
        H +  +LWH R  H+S+  +     K + S   L     +  ++CE C+ GKQ+R+ F +    T     L +VH DV GP    ++    Y+V F+D
Subjt:  HSSQTQLWHSRLRHMSEKGM-----KILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSK-TGSTPKSTKLELVHPDVWGPSEVSSIGGSRYYVTFID

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 8.7e-23; identity: 47.24%)
Query:  MVIARGRKLGTLYVNDND---KDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPD
        +VIA+G   GTLY  + +    ++ A  D  S   LWH R+ HMSEKG++IL     +   K    K C+ C+FGKQ RVSF +T S  K   L+LV+ D
Subjt:  MVIARGRKLGTLYVNDND---KDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPD

Query:  VWGPSEVSSIGGSRYYVTFIDDSSRKL
        V GP E+ S+GG++Y+VTFIDD+SRKL
Subjt:  VWGPSEVSSIGGSRYYVTFIDDSSRKL

P93293 Uncharacterized mitochondrial protein AtMg00300 (e-value: 8.2e-13; identity: 48%)
Query:  QTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEV
        +T+LWHSRL HMS++GM++L   G L+  K    K CE CI+GK  RV+FS TG       L+ VH D+WG   V
Subjt:  QTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 4.4e-06; identity: 38%)
Query:  HSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPK----LCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEVSSIGGSRYYVTFIDDSSR
        HSS    WHSRL H S   + IL+S      L  + P      C  C   K  +V FS +  T  S  LE ++ DVW  S + SI   RYYV F+D  +R
Subjt:  HSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPK----LCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEVSSIGGSRYYVTFIDDSSR

Arabidopsis top hits (columns: e-value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e-value: 5.8e-14; identity: 48%)
Query:  QTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEV
        +T+LWHSRL HMS++GM++L   G L+  K    K CE CI+GK  RV+FS TG       L+ VH D+WG   V
Subjt:  QTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEV
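The % identity values in the hit tables can be related to the middle row of each alignment. The sketch below assumes the usual BLAST-style convention, where a letter in the middle row marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch. It works from the middle row only, because the Subjt rows as shown here repeat the query text. The function name `midline_stats` is hypothetical, and the alignment length is taken to be the length of the query row. The strings are the ATMG00300.1 alignment above.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one alignment.

    ASSUMPTION: BLAST-style middle row (letter = identity, '+' = positive).
    The alignment length is taken as the number of columns in the query row.
    """
    aligned_columns = len(query)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    percent_identity = round(100 * identities / aligned_columns, 2)
    return identities, positives, percent_identity


QUERY = "QTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEV"
MIDLINE = "+T+LWHSRL HMS++GM++L   G L+  K    K CE CI+GK  RV+FS TG       L+ VH D+WG   V"

print(midline_stats(QUERY, MIDLINE))  # (36, 47, 48.0)
```

For this alignment, 36 identical positions over 75 columns gives 48%, which agrees with the identity reported for ATMG00300.1.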


Sequences
CDS sequence
ATGGTGATCGCTCGAGGAAGAAAGTTAGGAACTTTATATGTCAACGACAATGACAAAGATATGATAGCTGTTGTAGATCATTCGAGTCAGACCCAATTATGGCAC
AGTAGGCTGAGACATATGAGTGAAAAAGGTATGAAAATTCTTCACTCTACAGGGAAGCTAGAAGGACTTAAGGCAGTGAAACCCAAGTTATGCGAAAGATGTATA
TTTGGGAAACAGAGTCGTGTCAGTTTCTCAAAGACAGGTTCAACGCCAAAGTCTACAAAATTGGAGTTGGTTCACCCCGATGTGTGGGGGCCTTCTGAAGTTTCT
TCAATTGGAGGTTCCAGATACTATGTAACTTTCATAGACGACTCAAGTAGGAAGTTGGACTTCTCCACGACTAGAGAGGAGTCGCTATCGTTGCAGAAGAGACAT
ACTGCTTGCTTTGTCTCCAAGTTCTCATCAAACCAACCCAACCCCTTTGCAGAAAAATCAAGTGCAGCCTGA
mRNA sequence
ATGGTGATCGCTCGAGGAAGAAAGTTAGGAACTTTATATGTCAACGACAATGACAAAGATATGATAGCTGTTGTAGATCATTCGAGTCAGACCCAATTATGGCAC
AGTAGGCTGAGACATATGAGTGAAAAAGGTATGAAAATTCTTCACTCTACAGGGAAGCTAGAAGGACTTAAGGCAGTGAAACCCAAGTTATGCGAAAGATGTATA
TTTGGGAAACAGAGTCGTGTCAGTTTCTCAAAGACAGGTTCAACGCCAAAGTCTACAAAATTGGAGTTGGTTCACCCCGATGTGTGGGGGCCTTCTGAAGTTTCT
TCAATTGGAGGTTCCAGATACTATGTAACTTTCATAGACGACTCAAGTAGGAAGTTGGACTTCTCCACGACTAGAGAGGAGTCGCTATCGTTGCAGAAGAGACAT
ACTGCTTGCTTTGTCTCCAAGTTCTCATCAAACCAACCCAACCCCTTTGCAGAAAAATCAAGTGCAGCCTGA
Protein sequence
MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEVS
SIGGSRYYVTFIDDSSRKLDFSTTREESLSLQKRHTACFVSKFSSNQPNPFAEKSSAA
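As a consistency check, the CDS can be translated and compared with the protein sequence listed above. This is a minimal sketch that assumes the standard nuclear genetic code and a reading frame starting at the first base. The names `CODON_TABLE` and `translate` are hypothetical helpers and not part of the database.

```python
# Standard genetic code, built in TCAG order (ASSUMPTION: nuclear code applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon reached
            break
        protein.append(residue)
    return "".join(protein)


# CDS and protein sequences copied from the record above.
CDS = (
    "ATGGTGATCGCTCGAGGAAGAAAGTTAGGAACTTTATATGTCAACGACAATGACAAAGATATGATAGCTGTTGTAGATCATTCGAGTCAGACCCAATTATGGCAC"
    "AGTAGGCTGAGACATATGAGTGAAAAAGGTATGAAAATTCTTCACTCTACAGGGAAGCTAGAAGGACTTAAGGCAGTGAAACCCAAGTTATGCGAAAGATGTATA"
    "TTTGGGAAACAGAGTCGTGTCAGTTTCTCAAAGACAGGTTCAACGCCAAAGTCTACAAAATTGGAGTTGGTTCACCCCGATGTGTGGGGGCCTTCTGAAGTTTCT"
    "TCAATTGGAGGTTCCAGATACTATGTAACTTTCATAGACGACTCAAGTAGGAAGTTGGACTTCTCCACGACTAGAGAGGAGTCGCTATCGTTGCAGAAGAGACAT"
    "ACTGCTTGCTTTGTCTCCAAGTTCTCATCAAACCAACCCAACCCCTTTGCAGAAAAATCAAGTGCAGCCTGA"
)
PROTEIN = (
    "MVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQLWHSRLRHMSEKGMKILHSTGKLEGLKAVKPKLCERCIFGKQSRVSFSKTGSTPKSTKLELVHPDVWGPSEVS"
    "SIGGSRYYVTFIDDSSRKLDFSTTREESLSLQKRHTACFVSKFSSNQPNPFAEKSSAA"
)

print(translate(CDS) == PROTEIN)
```

The comparison prints `True` when the translated CDS reproduces the listed protein residue for residue.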