CuGenDBv2

Moc05g10360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g10360
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related pol polyprotein from transposon
Genome location: chr5:8152219..8152747
RNA-Seq Expression: Moc05g10360
Synteny: Moc05g10360
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
                     GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
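
The genome location string can be split into a sequence ID and coordinates, for example to compute the length of the gene span. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the record itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr5:8152219..8152747")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the span is 529 bp; with 0-based half-open coordinates it would be 528 bp instead.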


Homology
GenBank top hits (e value, % identity, alignment)
MCH89318.1 hypothetical protein [Trifolium medium] (e value: 1.3e-39, % identity: 61.02)
Query:  VKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGY
        VKYP+QNYL+YD+L P YN F+  V T Y+P F+HQAVS   WR  M  E+ A+E+N TWT  PLP G+ ++GC+WIYKVK+R+DGT++RYKA+LVA+G+
Subjt:  VKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGY

Query:  TQQEGLDYIETFSPVAKL
        TQQ G+D+++TFSPVAKL
Subjt:  TQQEGLDYIETFSPVAKL

MCI05130.1 retrovirus-related pol polyprotein from transposon [Trifolium medium] (e value: 9.2e-38, % identity: 58.47)
Query:  VKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGY
        VKYP+QN+LSYD+L P+Y  F+  V T Y+P F+HQAVS   WR  M  EL A+E N TW+  PL  G+ ++GC+WIYKVK+R+DG+++RYKA+LVA+G+
Subjt:  VKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGY

Query:  TQQEGLDYIETFSPVAKL
        TQQ G+D+++TFSPVAKL
Subjt:  TQQEGLDYIETFSPVAKL

TYK16758.1 Copia protein [Cucumis melo var. makuwa] (e value: 9.8e-40, % identity: 63.03)
Query:  TVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKG
        + KYP+  YLSY  LSPTY   +L V T  +  F+H+AV    WR+ M AEL+A+ETN+TW+ VPLP G++SIGC+W+YK+KH+ DG+IERYKA+LVAKG
Subjt:  TVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKG

Query:  YTQQEGLDYIETFSPVAKL
        YTQQEGLDY ETFSPVAK+
Subjt:  YTQQEGLDYIETFSPVAKL

XP_022147774.1 uncharacterized protein LOC111016631 isoform X1 [Momordica charantia] (e value: 8.6e-44, % identity: 69.92)
Query:  MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKL
        +A+S +KY LQ YLSYD+LSP Y  F+LNV T ++P F+H AVS+ HWRD M+AEL A+E+N TW+ V LP G HSIGCKW+YKVKH  DG+IERYKA+L
Subjt:  MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKL

Query:  VAKGYTQQEGLDYIETFSPVAKL
        VAKGYTQQEGLDYIETFS VAKL
Subjt:  VAKGYTQQEGLDYIETFSPVAKL

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia] (e value: 7.0e-46, % identity: 72.95)
Query:  AASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLV
        +AS+V YPLQ YL Y+ LS +Y  FVL+V   Y+PQF+HQAV FSHWR+ M AEL A+E N TW+ VPLP   HSIGCKWIYKVKH+SDG+IERYKA+LV
Subjt:  AASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLV

Query:  AKGYTQQEGLDYIETFSPVAKL
        AKGYTQQEGLDYIETFSPVAKL
Subjt:  AKGYTQQEGLDYIETFSPVAKL

TrEMBL top hits (e value, % identity, alignment)
A0A2N9H2Y3 Integrase catalytic domain-containing protein (e value: 8.1e-40, % identity: 63.93)
Query:  MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKL
        +A+S   YPL   LSYD LSPT+ +F L+V    +P FFHQA    HW++ M AEL A+E N TWT  PLP G+H IGCKW+YKVK +SDG++ERYKA+L
Subjt:  MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKL

Query:  VAKGYTQQEGLDYIETFSPVAK
        VAKGYTQQEGLDY ETFSPVAK
Subjt:  VAKGYTQQEGLDYIETFSPVAK

A0A392MRN7 Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment) (e value: 6.2e-40, % identity: 61.02)
Query:  VKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGY
        VKYP+QNYL+YD+L P YN F+  V T Y+P F+HQAVS   WR  M  E+ A+E+N TWT  PLP G+ ++GC+WIYKVK+R+DGT++RYKA+LVA+G+
Subjt:  VKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGY

Query:  TQQEGLDYIETFSPVAKL
        TQQ G+D+++TFSPVAKL
Subjt:  TQQEGLDYIETFSPVAKL

A0A5D3CZP1 Copia protein (e value: 4.8e-40, % identity: 63.03)
Query:  TVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKG
        + KYP+  YLSY  LSPTY   +L V T  +  F+H+AV    WR+ M AEL+A+ETN+TW+ VPLP G++SIGC+W+YK+KH+ DG+IERYKA+LVAKG
Subjt:  TVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKG

Query:  YTQQEGLDYIETFSPVAKL
        YTQQEGLDY ETFSPVAK+
Subjt:  YTQQEGLDYIETFSPVAKL

A0A6J1D203 uncharacterized protein LOC111016631 isoform X1 (e value: 4.2e-44, % identity: 69.92)
Query:  MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKL
        +A+S +KY LQ YLSYD+LSP Y  F+LNV T ++P F+H AVS+ HWRD M+AEL A+E+N TW+ V LP G HSIGCKW+YKVKH  DG+IERYKA+L
Subjt:  MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKL

Query:  VAKGYTQQEGLDYIETFSPVAKL
        VAKGYTQQEGLDYIETFS VAKL
Subjt:  VAKGYTQQEGLDYIETFSPVAKL

A0A6J1DNP7 uncharacterized protein LOC111022065 (e value: 3.4e-46, % identity: 72.95)
Query:  AASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLV
        +AS+V YPLQ YL Y+ LS +Y  FVL+V   Y+PQF+HQAV FSHWR+ M AEL A+E N TW+ VPLP   HSIGCKWIYKVKH+SDG+IERYKA+LV
Subjt:  AASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLV

Query:  AKGYTQQEGLDYIETFSPVAKL
        AKGYTQQEGLDYIETFSPVAKL
Subjt:  AKGYTQQEGLDYIETFSPVAKL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.0e-15, % identity: 35.65)
Query:  LSYDRLSPTYNSFVLNVPTTYK--PQFFHQAV---SFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQ
        +SY+    + N  VLN  T +   P  F +       S W + ++ EL A + N TWT    P  ++ +  +W++ VK+   G   RYKA+LVA+G+TQ+
Subjt:  LSYDRLSPTYNSFVLNVPTTYK--PQFFHQAV---SFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQ

Query:  EGLDYIETFSPVAKL
          +DY ETF+PVA++
Subjt:  EGLDYIETFSPVAKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.3e-14, % identity: 43.06)
Query:  MDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYIETFSPVAKL
        M  E+++++ N T+  V LP G+  + CKW++K+K   D  + RYKA+LV KG+ Q++G+D+ E FSPV K+
Subjt:  MDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYIETFSPVAKL

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 1.8e-20, % identity: 43.52)
Query:  DRLSPTYNSFVLNVPTTYK--PQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYI
        ++L+P Y+   L + TT K  P+    A+    W   M  EL A+  NKTW  VP P  ++ +GCKW++K K  SDGT++R KA+LVAKG+ Q+EG+ ++
Subjt:  DRLSPTYNSFVLNVPTTYK--PQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYI

Query:  ETFSPVAK
        ET+SPV +
Subjt:  ETFSPVAK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 6.9e-20, % identity: 46.15)
Query:  SPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSI-GCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYIETFS
        +P Y S  +++    +P+   QA+    WR+ M +E+ A   N TW  VP P    +I GC+WI+  K+ SDG++ RYKA+LVAKGY Q+ GLDY ETFS
Subjt:  SPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSI-GCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYIETFS

Query:  PVAK
        PV K
Subjt:  PVAK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.4e-19, % identity: 46.46)
Query:  SFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSI-GCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYIETFSPVAK
        S+  ++    +P+   QA+    WR  M +E+ A   N TW  VP P    +I GC+WI+  K  SDG++ RYKA+LVAKGY Q+ GLDY ETFSPV K
Subjt:  SFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSI-GCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYIETFSPVAK

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.3e-34, % identity: 52.07)
Query:  ASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVA
        AS   + +  +LSY+++SP Y+SF++ +    +P  +++A  F  W   MD E+ A+ET  TW    LP  +  IGCKW+YK+K+ SDGTIERYKA+LVA
Subjt:  ASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVA

Query:  KGYTQQEGLDYIETFSPVAKL
        KGYTQQEG+D+IETFSPV KL
Subjt:  KGYTQQEGLDYIETFSPVAKL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 1.3e-21, % identity: 43.52)
Query:  DRLSPTYNSFVLNVPTTYK--PQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYI
        ++L+P Y+   L + TT K  P+    A+    W   M  EL A+  NKTW  VP P  ++ +GCKW++K K  SDGT++R KA+LVAKG+ Q+EG+ ++
Subjt:  DRLSPTYNSFVLNVPTTYK--PQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEGLDYI

Query:  ETFSPVAK
        ET+SPV +
Subjt:  ETFSPVAK
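
The % identity values listed for the hits can be related to the alignment midlines. The sketch below is a rough check, not the database's own method: it assumes that a letter on the midline marks an identical residue, that `+` marks a conservative substitution, and that identity is the number of identical positions divided by the aligned length. The function name `percent_identity` is hypothetical. The input is the alignment shown for the GenBank hit XP_022154919.1.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity from BLAST-style query and midline rows.

    ASSUMPTION: letters on the midline are identities, and the aligned
    length equals the total length of the query rows (gaps included).
    """
    aligned_length = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identical / aligned_length, 2)


# Query and midline rows copied from the XP_022154919.1 alignment.
QUERY_ROWS = [
    "AASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLV",
    "AKGYTQQEGLDYIETFSPVAKL",
]
MIDLINE_ROWS = [
    "+AS+V YPLQ YL Y+ LS +Y  FVL+V   Y+PQF+HQAV FSHWR+ M AEL A+E N TW+ VPLP   HSIGCKWIYKVKH+SDG+IERYKA+LV",
    "AKGYTQQEGLDYIETFSPVAKL",
]

print(percent_identity(QUERY_ROWS, MIDLINE_ROWS))
```

Only the query and midline rows are used, so the check does not depend on the subject rows.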


Sequences
CDS sequence
ATGGCTGCTTCTACTGTCAAGTACCCGCTGCAGAATTATTTGTCTTATGATAGACTTTCTCCAACTTATAATTCTTTTGTTCTTAATGTTCCTACTACTTATAAGCCTCA
ATTTTTTCATCAAGCAGTCTCATTTTCTCATTGGAGGGATGTTATGGATGCAGAATTGAAAGCCATAGAAACTAACAAGACTTGGACCTTTGTACCACTACCATCGGGAC
GTCATTCTATTGGATGTAAGTGGATCTACAAAGTCAAACACCGCTCTGATGGGACCATCGAACGTTATAAAGCAAAGTTGGTTGCCAAAGGCTACACTCAGCAAGAGGGA
CTTGACTACATTGAAACATTCTCACCAGTTGCAAAATTGGAATAA
mRNA sequence
ATGGCTGCTTCTACTGTCAAGTACCCGCTGCAGAATTATTTGTCTTATGATAGACTTTCTCCAACTTATAATTCTTTTGTTCTTAATGTTCCTACTACTTATAAGCCTCA
ATTTTTTCATCAAGCAGTCTCATTTTCTCATTGGAGGGATGTTATGGATGCAGAATTGAAAGCCATAGAAACTAACAAGACTTGGACCTTTGTACCACTACCATCGGGAC
GTCATTCTATTGGATGTAAGTGGATCTACAAAGTCAAACACCGCTCTGATGGGACCATCGAACGTTATAAAGCAAAGTTGGTTGCCAAAGGCTACACTCAGCAAGAGGGA
CTTGACTACATTGAAACATTCTCACCAGTTGCAAAATTGGAATAA
Protein sequence
MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEG
LDYIETFSPVAKLE
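
As a consistency check, the CDS above can be translated and compared with the listed protein sequence. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from the record above.
CDS = (
    "ATGGCTGCTTCTACTGTCAAGTACCCGCTGCAGAATTATTTGTCTTATGATAGACTTTCTCCAACTTATAATTCTTTTGTTCTTAATGTTCCTACTACTTATAAGCCTCA"
    "ATTTTTTCATCAAGCAGTCTCATTTTCTCATTGGAGGGATGTTATGGATGCAGAATTGAAAGCCATAGAAACTAACAAGACTTGGACCTTTGTACCACTACCATCGGGAC"
    "GTCATTCTATTGGATGTAAGTGGATCTACAAAGTCAAACACCGCTCTGATGGGACCATCGAACGTTATAAAGCAAAGTTGGTTGCCAAAGGCTACACTCAGCAAGAGGGA"
    "CTTGACTACATTGAAACATTCTCACCAGTTGCAAAATTGGAATAA"
)
PROTEIN = (
    "MAASTVKYPLQNYLSYDRLSPTYNSFVLNVPTTYKPQFFHQAVSFSHWRDVMDAELKAIETNKTWTFVPLPSGRHSIGCKWIYKVKHRSDGTIERYKAKLVAKGYTQQEG"
    "LDYIETFSPVAKLE"
)

translated = translate(CDS)
# The final codon (TAA) is a stop, so strip the trailing '*' before comparing.
print(translated.rstrip("*") == PROTEIN)
```

The 375-nucleotide CDS corresponds to 124 amino acids plus the stop codon.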