; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g00710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g00710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationchr1:464753..465142
RNA-Seq ExpressionMoc01g00710
SyntenyMoc01g00710
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054997.1 putative mitochondrial protein [Cucumis melo var. makuwa]1.1e-3364.96Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ QGFV  + P+ VCKL KSLYGLKQAPRAWF+CFT+HLLTLGF  S+ +SSLFVR+   S TYLLLY DDII+TG++  YI+ L+++L+ +FD TDL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
        G  KYFLGLEI  + SG
Subjt:  GSLKYFLGLEITCSLSG

KAA0061282.1 putative mitochondrial protein [Cucumis melo var. makuwa]8.7e-3163.03Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        M Q  GF + + P+HVC L KSLYGLKQAPRAWF+ FTS+L TLGFV S AD SLF+R    S+TYLLLYVDDIIVTG   LYI  L  +L L F ++DL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSGFL
        G LKYFLGLEI  S+ G +
Subjt:  GSLKYFLGLEITCSLSGFL

XP_020415542.1 uncharacterized protein LOC109948051 [Prunus persica]7.9e-3261.54Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ  GF  S+HPHHVC+LLKSLYGLKQAPRAW + FT+HLLTLGFV+S AD+SLF+R S   +  LLLYVDDII+TG++S+ I+  +  L   FDM DL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
        G L YFLGL++  + +G
Subjt:  GSLKYFLGLEITCSLSG

XP_022143489.1 uncharacterized protein LOC111013365 [Momordica charantia]5.8e-3567.52Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ QGFV  ++P +VCKL KSLYG KQAPRAWF+CFT+HLL LGFV S  DSSLFVR    S TYLLLYVDDI+VTGS   YI +L+ +L+ RFDMTDL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
          LKYFLGLEI+ + +G
Subjt:  GSLKYFLGLEITCSLSG

XP_022151604.1 uncharacterized protein LOC111019517 [Momordica charantia]6.9e-3667.52Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ QGF+    P+ VCKL+KSLY LKQAPRAWF CF SHLLTLGF  S ADSSLFVRR++DS+TYLLLYVDDI +T + + YI+ L+++L+LRFDMTDL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
        G L++FLGLEI  S  G
Subjt:  GSLKYFLGLEITCSLSG

TrEMBL top hitse value%identityAlignment
A0A2N9FMC6 Integrase catalytic domain-containing protein5.0e-3261.86Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        M Q QGFV SS PHHVCKL KSLYGLKQAPRAWF+ FTS LL LGF  S AD SLF+ RS+ ++ +LL+YVDDII+TG+S   + +LV +L   F++ DL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSGF
        G L YFLGLE+  S +GF
Subjt:  GSLKYFLGLEITCSLSGF

A0A2N9GRJ0 Uncharacterized protein5.0e-3261.86Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        M Q QGFV SS PHHVCKL KSLYGLKQAPRAWF+ FTS LL LGF  S AD SLF+ RS+ ++ +LL+YVDDII+TG+S   + +LV +L   F++ DL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSGF
        G L YFLGLE+  S +GF
Subjt:  GSLKYFLGLEITCSLSGF

A0A5A7UKB0 Putative mitochondrial protein5.3e-3464.96Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ QGFV  + P+ VCKL KSLYGLKQAPRAWF+CFT+HLLTLGF  S+ +SSLFVR+   S TYLLLY DDII+TG++  YI+ L+++L+ +FD TDL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
        G  KYFLGLEI  + SG
Subjt:  GSLKYFLGLEITCSLSG

A0A6J1CPG5 uncharacterized protein LOC1110133652.8e-3567.52Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ QGFV  ++P +VCKL KSLYG KQAPRAWF+CFT+HLL LGFV S  DSSLFVR    S TYLLLYVDDI+VTGS   YI +L+ +L+ RFDMTDL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
          LKYFLGLEI+ + +G
Subjt:  GSLKYFLGLEITCSLSG

A0A6J1DDJ2 uncharacterized protein LOC1110195173.3e-3667.52Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        MQQ QGF+    P+ VCKL+KSLY LKQAPRAWF CF SHLLTLGF  S ADSSLFVRR++DS+TYLLLYVDDI +T + + YI+ L+++L+LRFDMTDL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSG
        G L++FLGLEI  S  G
Subjt:  GSLKYFLGLEITCSLSG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-1236.54Show/hide
Query:  SSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFV--RRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFL
        S +  +VCKL K++YGLKQA R WF+ F   L    FV S  D  +++  + + +   Y+LLYVDD+++       +      L  +F MTDL  +K+F+
Subjt:  SSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFV--RRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFL

Query:  GLEI
        G+ I
Subjt:  GLEI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-1741.07Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRR-SADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTD
        M+Q +GF  +   H VCKL KSLYGLKQAPR W+  F S + +  ++++++D  ++ +R S ++   LLLYVDD+++ G     I  L  +L   FDM D
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRR-SADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTD

Query:  LGSLKYFLGLEI
        LG  +  LG++I
Subjt:  LGSLKYFLGLEI

P25600 Putative transposon Ty5-1 protein YCL074W2.2e-1334.71Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        ++Q  GFV+  +P +V +L   +YGLKQAP  W +   + L  +GF     +  L+ R ++D   Y+ +YVDD++V   S    + +  EL   + M DL
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLEITCSLSGFLFL
        G +  FLGL I  S +G + L
Subjt:  GSLKYFLGLEITCSLSGFLFL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.8e-2449.09Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        M Q  GF+    P++VCKL K+LYGLKQAPRAW+    ++LLT+GFV S +D+SLFV +   SI Y+L+YVDDI++TG+    +   +  L  RF + D 
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLE
          L YFLG+E
Subjt:  GSLKYFLGLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.0e-2347.27Show/hide
Query:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL
        M Q  GFV    P +VC+L K++YGLKQAPRAW+    ++LLT+GFV S +D+SLFV +   SI Y+L+YVDDI++TG+ ++ ++  +  L  RF + + 
Subjt:  MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDL

Query:  GSLKYFLGLE
          L YFLG+E
Subjt:  GSLKYFLGLE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.6e-2146.67Show/hide
Query:  PHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFLGLEIT
        P+ VC L KS+YGLKQA R WF  F+  L+  GFV+SH+D + F++ +A     +L+YVDDII+  ++   ++ L ++L+  F + DLG LKYFLGLEI 
Subjt:  PHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFLGLEIT

Query:  CSLSG
         S +G
Subjt:  CSLSG

ATMG00810.1 DNA/RNA polymerases superfamily protein6.5e-0853.57Show/hide
Query:  YLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFLGLEITCSLSGFLFL
        YLLLYVDDI++TGSS+  +  L+ +L   F M DLG + YFLG++I    SG LFL
Subjt:  YLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFLGLEITCSLSGFLFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACAACTCCAAGGTTTTGTAAGTTCTTCTCATCCTCATCATGTTTGTAAGCTGCTAAAGTCCTTATATGGTTTAAAGCAGGCTCCTCGTGCTTGGTTCAAATGTTT
TACTAGTCATTTGCTTACACTTGGGTTTGTTGAATCTCATGCTGATTCTTCATTGTTTGTTCGTCGCTCTGCTGACTCTATTACCTACCTATTACTGTACGTTGATGACA
TAATTGTCACTGGTAGTAGTTCTCTTTATATTGAGACTCTTGTTGCTGAACTTCGACTTAGATTTGATATGACTGATCTTGGCTCTCTCAAGTATTTTTTGGGGTTGGAA
ATTACTTGCAGCCTCTCTGGATTTTTGTTTCTCATTGTAAATATGCAAAATATGTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAACAACTCCAAGGTTTTGTAAGTTCTTCTCATCCTCATCATGTTTGTAAGCTGCTAAAGTCCTTATATGGTTTAAAGCAGGCTCCTCGTGCTTGGTTCAAATGTTT
TACTAGTCATTTGCTTACACTTGGGTTTGTTGAATCTCATGCTGATTCTTCATTGTTTGTTCGTCGCTCTGCTGACTCTATTACCTACCTATTACTGTACGTTGATGACA
TAATTGTCACTGGTAGTAGTTCTCTTTATATTGAGACTCTTGTTGCTGAACTTCGACTTAGATTTGATATGACTGATCTTGGCTCTCTCAAGTATTTTTTGGGGTTGGAA
ATTACTTGCAGCCTCTCTGGATTTTTGTTTCTCATTGTAAATATGCAAAATATGTGTTAG
Protein sequenceShow/hide protein sequence
MQQLQGFVSSSHPHHVCKLLKSLYGLKQAPRAWFKCFTSHLLTLGFVESHADSSLFVRRSADSITYLLLYVDDIIVTGSSSLYIETLVAELRLRFDMTDLGSLKYFLGLE
ITCSLSGFLFLIVNMQNMC