CuGenDBv2

Moc06g29850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g29850
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:22469166..22470248
RNA-Seq Expression: Moc06g29850
Synteny: Moc06g29850
Gene Ontology terms: NA
InterPro domains: NA
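The genome location above uses a `chromosome:start..end` layout. As a minimal sketch, the string can be split and the span length computed as below. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr6:22469166..22470248")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1083 bp on chr6, which is longer than the 615 nt CDS listed under Sequences.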


Homology
GenBank top hits (e-value, % identity, alignment)
KAD4180550.1 hypothetical protein E3N88_29141 [Mikania micrantha]; e-value 1.7e-20; identity 39.44%
Query:  FLACVEKNTSVDH-SSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNG
        F+ C     +V H  S WVVD+  + +VTS R +++S+T GD+GSV+MGN  +SK++ + ++ LK   G ELVL +V+++   RLNL+SA  LD+D Y+ 
Subjt:  FLACVEKNTSVDH-SSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNG

Query:  ESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIMH
            G WKL R S IVA G +   +Y +   ++K S+  +++
Subjt:  ESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIMH

KAF7121453.1 hypothetical protein RHSIM_Rhsim13G0116100 [Rhododendron simsii]; e-value 1.3e-20; identity 43.9%
Query:  WVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRIVA
        WV+D+  S +VTS R +F+S+T GD+G VRMGN  VSK++ + +V L+   G +L+L DVR++   RLNL+SA KLDD+ Y  +   G WKL + S +VA
Subjt:  WVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRIVA

Query:  TGHKRYSVYVSELGVAKGSLRQI
         G K  ++Y+ +  ++KG +  I
Subjt:  TGHKRYSVYVSELGVAKGSLRQI

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii]; e-value 1.7e-20; identity 36.54%
Query:  LQDRRDKLNVFAESSKFLACVEKNTSV-DHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFR
        +  ++D   V ++    + C +    +      WV+D+  S +VTS R +F+S+T GD+G VRMGN  VSK++ + +V L+   G +L+L DVR++ + R
Subjt:  LQDRRDKLNVFAESSKFLACVEKNTSV-DHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFR

Query:  LNLLSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQI
        LNL+SA KLDD+ Y  +   G WKL + S +VA G K  ++Y+ +  ++KG +  I
Subjt:  LNLLSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQI

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii]; e-value 9.9e-21; identity 37.18%
Query:  LQDRRDKLNVFAESSKFLACVEKNTSV-DHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFR
        +  ++D   V ++    + C +    +      WV+D+  S +VTS R +F+S+T GD+G VRMGN  VSK++ + +V L+   G +L+L DVR++   R
Subjt:  LQDRRDKLNVFAESSKFLACVEKNTSV-DHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFR

Query:  LNLLSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQI
        LNL+SA KLDD+ Y  +   G WKL + S IVA G K  ++Y+ ++ ++KG +  I
Subjt:  LNLLSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQI

RVW84557.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e-value 9.9e-21; identity 39.53%
Query:  VDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKR
        +    +W++D+  S +VTS   +F+S++ GD+G VRMGN  VSK++ + +++L+   G +L+L DVR++   RLNL+SA KLDD+ YN     G WKL +
Subjt:  VDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKR

Query:  ESRIVATGHKRYSVYVSELGVAKGSLRQI
         S +VA G K  S+Y+ +  + KG +  +
Subjt:  ESRIVATGHKRYSVYVSELGVAKGSLRQI

TrEMBL top hits (e-value, % identity, alignment)
A0A251SV86 Putative zinc finger, CCHC-type, Ribonuclease H-like domain, GAG-pre-integrase domain protein; e-value 1.1e-20; identity 40.26%
Query:  DKLNVFAESSKFLACVEK---NTSVDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNL
        D   V   + +F  C +    N + D SS WVVD+  + +VTS R +FSS+T GD+G V+MGN  +SK+I + +V LK   G ELVL +V+++   RLNL
Subjt:  DKLNVFAESSKFLACVEK---NTSVDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNL

Query:  LSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIM
        +SA  LDDD Y+     G WKL R S IVA G +   +Y++   ++  S+  ++
Subjt:  LSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIM

A0A438HJE1 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 4.8e-21; identity 39.53%
Query:  VDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKR
        +    +W++D+  S +VTS   +F+S++ GD+G VRMGN  VSK++ + +++L+   G +L+L DVR++   RLNL+SA KLDD+ YN     G WKL +
Subjt:  VDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKR

Query:  ESRIVATGHKRYSVYVSELGVAKGSLRQI
         S +VA G K  S+Y+ +  + KG +  +
Subjt:  ESRIVATGHKRYSVYVSELGVAKGSLRQI

A0A438I3N7 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 1.4e-20; identity 41.94%
Query:  VDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKR
        +   ++WV+D+  S +VTS   +F+S+  GD+G+VRMGN +VSK++ + ++ L+   G +L+L DVR++   RLNL+SA KLDD+ YN     G WKL +
Subjt:  VDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKR

Query:  ESRIVATGHKRYSVYVSELGVAKG
         S +VA G+K  S+Y  +  + KG
Subjt:  ESRIVATGHKRYSVYVSELGVAKG

A0A484KC40 Uncharacterized protein; e-value 1.4e-20; identity 46.61%
Query:  WVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRIVA
        WV+D+  S ++TS R +FSS+T GD+G VRMGN   SK+I +++V L+   G  LVL DVR++   RL+L+SA  LDD+ Y  +   G WKL R S I+A
Subjt:  WVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRIVA

Query:  TGHKRYSVYVSELGVAKG
         G K+ S+YV +  V  G
Subjt:  TGHKRYSVYVSELGVAKG

A0A5N6N2Q3 Integrase catalytic domain-containing protein; e-value 8.1e-21; identity 39.44%
Query:  FLACVEKNTSVDH-SSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNG
        F+ C     +V H  S WVVD+  + +VTS R +++S+T GD+GSV+MGN  +SK++ + ++ LK   G ELVL +V+++   RLNL+SA  LD+D Y+ 
Subjt:  FLACVEKNTSVDH-SSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNG

Query:  ESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIMH
            G WKL R S IVA G +   +Y +   ++K S+  +++
Subjt:  ESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIMH

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 1.4e-17; identity 35.61%
Query:  SEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRI
        SEWVVDT  S + T  R  F  +  GD+G+V+MGN S SK+  I ++ +K   G  LVL DVR++   R+NL+S   LD D Y        W+L + S +
Subjt:  SEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVSKVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRI

Query:  VATGHKRYSVYVSELGVAKGSLRQIMHRVAAN
        +A G  R ++Y +   + +G L      ++ +
Subjt:  VATGHKRYSVYVSELGVAKGSLRQIMHRVAAN

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACTAGCAAGGATTGGAAAGAGATGGATGAGCATGCCATTGCAAACATCAGGATGTCTTTATCGATGGGTGTATCTAGCCTCGTGGCGAACGAGACGACAATGAAAGA
ACTAATGCATGCCTTACAAGACAGAAGGGACAAATTAAATGTTTTTGCTGAAAGTTCAAAGTTTCTAGCTTGCGTTGAGAAGAACACATCTGTAGACCATTCATCAGAAT
GGGTAGTGGACACTACAACATCAGCTTATGTTACTTCAGATAGACGTTGGTTCTCATCTTTTACTGGAGGTGATTATGGCTCAGTGAGGATGGGAAATGGGAGTGTCTCC
AAAGTGATAAGAATTGAGAATGTTTATTTGAAGATAGGTGACGGGGCCGAGTTGGTGTTGCTAGACGTTAGGTATATTCATAGCTTCAGATTGAATTTGTTATCCGCAAG
GAAACTAGACGATGATAACTACAATGGTGAGTCTGTTGGGGGTTTTTGGAAGCTCAAGAGGGAATCCAGGATAGTGGCGACAGGCCACAAGAGATATTCTGTTTATGTGT
CAGAGCTTGGTGTTGCCAAGGGTTCATTGAGACAAATAATGCACAGAGTAGCTGCAAATAGTTAA
mRNA sequence
ATGACTAGCAAGGATTGGAAAGAGATGGATGAGCATGCCATTGCAAACATCAGGATGTCTTTATCGATGGGTGTATCTAGCCTCGTGGCGAACGAGACGACAATGAAAGA
ACTAATGCATGCCTTACAAGACAGAAGGGACAAATTAAATGTTTTTGCTGAAAGTTCAAAGTTTCTAGCTTGCGTTGAGAAGAACACATCTGTAGACCATTCATCAGAAT
GGGTAGTGGACACTACAACATCAGCTTATGTTACTTCAGATAGACGTTGGTTCTCATCTTTTACTGGAGGTGATTATGGCTCAGTGAGGATGGGAAATGGGAGTGTCTCC
AAAGTGATAAGAATTGAGAATGTTTATTTGAAGATAGGTGACGGGGCCGAGTTGGTGTTGCTAGACGTTAGGTATATTCATAGCTTCAGATTGAATTTGTTATCCGCAAG
GAAACTAGACGATGATAACTACAATGGTGAGTCTGTTGGGGGTTTTTGGAAGCTCAAGAGGGAATCCAGGATAGTGGCGACAGGCCACAAGAGATATTCTGTTTATGTGT
CAGAGCTTGGTGTTGCCAAGGGTTCATTGAGACAAATAATGCACAGAGTAGCTGCAAATAGTTAA
Protein sequence
MTSKDWKEMDEHAIANIRMSLSMGVSSLVANETTMKELMHALQDRRDKLNVFAESSKFLACVEKNTSVDHSSEWVVDTTTSAYVTSDRRWFSSFTGGDYGSVRMGNGSVS
KVIRIENVYLKIGDGAELVLLDVRYIHSFRLNLLSARKLDDDNYNGESVGGFWKLKRESRIVATGHKRYSVYVSELGVAKGSLRQIMHRVAANS
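
The protein sequence above should correspond to a translation of the CDS. The following is a minimal Python sketch for checking this. It assumes the standard nuclear genetic code; the record does not say which table was used. The `translate` function is a hypothetical helper, not part of any CuGenDB tooling, and the sample input is the first 30 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS above; the protein above begins MTSKDWKEMD.
print(translate("ATGACTAGCAAGGATTGGAAAGAGATGGAT"))
```

Passing the full 615 nt CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence would confirm that the two fields agree.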