; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g34120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g34120
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:26067085..26067834
RNA-Seq ExpressionMoc09g34120
SyntenyMoc09g34120
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0046274 - lignin catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0048046 - apoplast (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005507 - copper ion binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0052716 - hydroquinone:oxygen oxidoreductase activity (molecular function)
InterPro domainsIPR007750 - Protein of unknown function DUF674
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.2e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

RVW35654.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]9.2e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

RVX04530.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.2e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

RVX04589.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.2e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

XP_022139306.1 uncharacterized protein LOC111010253 isoform X2 [Momordica charantia]6.0e-13799.2Show/hide
Query:  MGETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYA
        MGETNVRLKLVIESHT RVVYCEAGKSF DFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYA
Subjt:  MGETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYA

Query:  ETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLIL
        ETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLIL
Subjt:  ETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLIL

Query:  ALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN
        ALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN
Subjt:  ALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN

TrEMBL top hitse value%identityAlignment
A0A438C9J9 Retrovirus-related Pol polyprotein from transposon RE14.4e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

A0A438DJK3 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

A0A438J6E1 Retrovirus-related Pol polyprotein from transposon RE14.4e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

A0A438J6K3 Retrovirus-related Pol polyprotein from transposon RE14.4e-3774.51Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        GSKW+FKTKLKEDGTI+RYKARLVA+G+SQI G+D+ ETFSPVIK TTIR+I +LAVT  W +RQLDVKNAFLHGFLKEEV+M QPPG  + + P+ VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  LN
        LN
Subjt:  LN

A0A6J1CCA1 uncharacterized protein LOC111010253 isoform X22.9e-13799.2Show/hide
Query:  MGETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYA
        MGETNVRLKLVIESHT RVVYCEAGKSF DFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYA
Subjt:  MGETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYA

Query:  ETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLIL
        ETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLIL
Subjt:  ETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLIL

Query:  ALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN
        ALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN
Subjt:  ALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-2150.5Show/hide
Query:  SKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKL
        S+W+F  K  E G   RYKARLVA+G++Q   +DYEETF+PV + ++ R IL+L + +N  + Q+DVK AFL+G LKEE+YM  P GI        VCKL
Subjt:  SKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKL

Query:  N
        N
Subjt:  N

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-2655Show/hide
Query:  KWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN
        KW+FK K   D  + RYKARLV +G+ Q +G+D++E FSPV+K T+IR IL+LA + +  + QLDVK AFLHG L+EE+YM QP G +   K H VCKLN
Subjt:  KWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN

P92520 Uncharacterized mitochondrial protein AtMg008204.8e-1258.93Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALA
        G KW+FKTKL  DGT++R KARLVA+G+ Q EG+ + ET+SPV++  TIR IL +A
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-3261.39Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK
        G +WIF  K   DG++ RYKARLVA+GY+Q  G+DY ETFSPVIK T+IR++L +AV  +WP+RQLDV NAFL G L ++VYMSQPPG  D ++P++VCK
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCK

Query:  L
        L
Subjt:  L

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.2e-3242.22Show/hide
Query:  TYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYAETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKA
        T+  +  AK+ + KP   ++ +T      S A   + +T  Q     R     GS+    I +                G +WIF  K   DG++ RYKA
Subjt:  TYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYAETPQTQTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKA

Query:  RLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKL
        RLVA+GY+Q  G+DY ETFSPVIK T+IR++L +AV  +WP+RQLDV NAFL G L +EVYMSQPPG  D ++P +VC+L
Subjt:  RLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKL

Arabidopsis top hitse value%identityAlignment
AT3G09110.1 Protein of unknown function (DUF674)3.5e-1039.02Show/hide
Query:  LKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLK-----TEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPK
        L+L+I+    RV+  EAGK F D L  LL+LP+G ++RLL+       ++VGCL NLY S+   +     S   K+ +L P+
Subjt:  LKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLK-----TEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.9e-2856.19Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQ----DLEKPH
        G KW++K K   DGTIERYKARLVA+GY+Q EG+D+ ETFSPV K T+++LILA++  +N+ L QLD+ NAFL+G L EE+YM  PPG      D   P+
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPLRQLDVKNAFLHGFLKEEVYMSQPPGIQ----DLEKPH

Query:  FVCKL
         VC L
Subjt:  FVCKL

AT5G01130.1 Protein of unknown function (DUF674)7.8e-1037.93Show/hide
Query:  ETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTE-----AMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPK
        E  V L+L I+    +VV  EA K+F D LF LL+LP+G +IRLL+         VGC  NLY S+         ++  K+ +L P+
Subjt:  ETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTE-----AMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPK

AT5G43240.1 Protein of unknown function (DUF674)1.3e-0933.73Show/hide
Query:  VRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLK-----TEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKP
        ++LKL+I+    +VV+ EAGK F D LF   +LP+G ++RLL+      +  +GC  N+Y S+ +    +  +   K  +L P
Subjt:  VRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLK-----TEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.4e-1358.93Show/hide
Query:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALA
        G KW+FKTKL  DGT++R KARLVA+G+ Q EG+ + ET+SPV++  TIR IL +A
Subjt:  GSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGAAACGAACGTGAGATTGAAGCTTGTGATAGAGTCGCACACAAAAAGAGTTGTTTATTGCGAAGCAGGGAAGAGCTTCGCCGACTTTCTTTTCGACCTA
CTTTCCCTCCCACTTGGGGCTGTAATTAGGCTGTTGAAAACGGAAGCCATGGTGGGATGCTTGGGAAATCTCTACCACAGCATAGAAACCTTTAACCAAACATAT
ATACACTCAAACGAGGCCAAAAACGCAGTCTTGAAACCAAAACTCTCGTTCAACTCCTCCACTATGCCCACGCTTTTGCCTTCATATGCCGAGACTCCGCAAACC
CAAACCCAAACCCAAATCCAAATCCCCTCACGTGGCCCATTTTATTTTGGTTCTGATGAAGAATTTACTATTCGAAGTCCAAGGTCAAGTAGACAGTCATTTACT
TTTGCAGGAAATGATTCAACTGGATCAAAGTGGATATTCAAAACTAAACTAAAGGAGGATGGAACAATTGAACGTTACAAGGCAAGATTGGTCGCTCAAGGATAC
TCTCAGATAGAAGGAGTGGACTACGAAGAAACTTTTAGTCCTGTAATCAAGCCAACAACAATTCGGCTCATCCTTGCACTGGCAGTGACCTTCAACTGGCCCTTG
CGACAACTAGATGTCAAAAATGCATTTCTCCATGGCTTTCTCAAGGAAGAAGTCTACATGTCTCAACCACCTGGAATTCAAGATCTAGAAAAACCACACTTCGTA
TGCAAGCTCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGAAACGAACGTGAGATTGAAGCTTGTGATAGAGTCGCACACAAAAAGAGTTGTTTATTGCGAAGCAGGGAAGAGCTTCGCCGACTTTCTTTTCGACCTA
CTTTCCCTCCCACTTGGGGCTGTAATTAGGCTGTTGAAAACGGAAGCCATGGTGGGATGCTTGGGAAATCTCTACCACAGCATAGAAACCTTTAACCAAACATAT
ATACACTCAAACGAGGCCAAAAACGCAGTCTTGAAACCAAAACTCTCGTTCAACTCCTCCACTATGCCCACGCTTTTGCCTTCATATGCCGAGACTCCGCAAACC
CAAACCCAAACCCAAATCCAAATCCCCTCACGTGGCCCATTTTATTTTGGTTCTGATGAAGAATTTACTATTCGAAGTCCAAGGTCAAGTAGACAGTCATTTACT
TTTGCAGGAAATGATTCAACTGGATCAAAGTGGATATTCAAAACTAAACTAAAGGAGGATGGAACAATTGAACGTTACAAGGCAAGATTGGTCGCTCAAGGATAC
TCTCAGATAGAAGGAGTGGACTACGAAGAAACTTTTAGTCCTGTAATCAAGCCAACAACAATTCGGCTCATCCTTGCACTGGCAGTGACCTTCAACTGGCCCTTG
CGACAACTAGATGTCAAAAATGCATTTCTCCATGGCTTTCTCAAGGAAGAAGTCTACATGTCTCAACCACCTGGAATTCAAGATCTAGAAAAACCACACTTCGTA
TGCAAGCTCAACTGA
Protein sequenceShow/hide protein sequence
MGETNVRLKLVIESHTKRVVYCEAGKSFADFLFDLLSLPLGAVIRLLKTEAMVGCLGNLYHSIETFNQTYIHSNEAKNAVLKPKLSFNSSTMPTLLPSYAETPQT
QTQTQIQIPSRGPFYFGSDEEFTIRSPRSSRQSFTFAGNDSTGSKWIFKTKLKEDGTIERYKARLVAQGYSQIEGVDYEETFSPVIKPTTIRLILALAVTFNWPL
RQLDVKNAFLHGFLKEEVYMSQPPGIQDLEKPHFVCKLN