; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g00660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g00660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr7:397387..398290
RNA-Seq ExpressionMoc07g00660
SyntenyMoc07g00660
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141216.1 uncharacterized protein LOC111011669 [Momordica charantia]7.4e-3753.03Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQR---AIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDR
        M FLMGLN+SF+Q+RAQLLLMEP+ TINRAFSLVAQEV+QR   A AS    SS  A+ +A  + SS +T+A SSQ +R   P    CTHC L GHTVDR
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQR---AIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDR

Query:  CYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLA--KAQTSNPDSGTSHVAGRVFGGKLTQDTWR
        CYKLHGYPPG+RS+      +    +         S+  SSS  DSL+N  ADQ QGLL+ LQSHLA  K  + +  S +SHVAG+V    L +D W+
Subjt:  CYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLA--KAQTSNPDSGTSHVAGRVFGGKLTQDTWR

XP_022148562.1 uncharacterized protein LOC111017196 [Momordica charantia]5.0e-4150.98Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSG----------PNRMVCTHCGL
        M FLMGLNDSFSQIRA LLLM P PTIN AF L+AQEVQQR I+ I S ++S ASS A     + +  + +  N  TS             + +CTHC L
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSG----------PNRMVCTHCGL

Query:  IGHTVDRCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFGGKLTQ
        + HTVDRCYKLHGYPPGYR+S  R T   P           A++  ++++PDSL+++NADQC GL ++LQSHLAK +T S  +SGTSHVAG+V    L +
Subjt:  IGHTVDRCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFGGKLTQ

Query:  DTWR
        D W+
Subjt:  DTWR

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]6.5e-4156.22Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYK
        M FLMGLN SFSQIRAQLLLMEP PTINRAF+LVAQE+QQR+I S+PS +S  AS+V   T +S +++  SS        ++ +CTHCG+ GHTVD+CYK
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYK

Query:  LHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFG
        LH YPPGYRSS  + T +        ++ + + +AT S + +SLA L ADQCQ LL+LLQSHL   +T S+ DSGTSHVA   FG
Subjt:  LHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFG

XP_022158788.1 uncharacterized protein LOC111025254 [Momordica charantia]1.3e-3651.06Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASI----PSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVD
        M FLMGLNDSFSQ  AQLLLMEP+P+INR  SLVAQE QQRAI S+    P+T     +SV      +  T + S  NK+    ++ VCTHCG+IGHT D
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASI----PSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVD

Query:  RCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVF
        +CY+LHGYPPG+R  G +++F+             ++++ SS+  DSLA+  ADQCQGLL+LL SHL+  Q  ++ DS T HVAG VF
Subjt:  RCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVF

XP_038905564.1 uncharacterized protein LOC120091546 [Benincasa hispida]1.3e-2848.91Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPS-TSSSGASSVALLTR--SSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDR
        MTFLMGLN+SFSQI  QLLLME +P+IN+AFS V QEV+QR I S  +  S+    + ALL +  SS     QSS +      +R+  THC + GHTVD+
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPS-TSSSGASSVALLTR--SSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDR

Query:  CYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATS-SSLPDSLANLNADQCQGLLSLLQSHLAKAQTSNPDSGTSHVAG
        CYK+H YPP YRS+          K S  +SE+ +ST+TS SS   S  N +A Q QGLL + QSHLAKA+     S  +H+AG
Subjt:  CYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATS-SSLPDSLANLNADQCQGLLSLLQSHLAKAQTSNPDSGTSHVAG

TrEMBL top hitse value%identityAlignment
A0A6J1CIG1 uncharacterized protein LOC1110116693.6e-3753.03Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQR---AIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDR
        M FLMGLN+SF+Q+RAQLLLMEP+ TINRAFSLVAQEV+QR   A AS    SS  A+ +A  + SS +T+A SSQ +R   P    CTHC L GHTVDR
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQR---AIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDR

Query:  CYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLA--KAQTSNPDSGTSHVAGRVFGGKLTQDTWR
        CYKLHGYPPG+RS+      +    +         S+  SSS  DSL+N  ADQ QGLL+ LQSHLA  K  + +  S +SHVAG+V    L +D W+
Subjt:  CYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLA--KAQTSNPDSGTSHVAGRVFGGKLTQDTWR

A0A6J1CXR2 uncharacterized protein LOC1110152396.3e-2641.79Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYK
        MTFLMGLN+S+++IRAQ+LLM+P P +N+ FSL+ QE +QRAI +I     S A +VA +++ +  T+ +          NR  CTHCGL GH +D+CYK
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYK

Query:  LHGYPPGYRSSGTRATFAP--------------PVKVSERQSETVASTA---TSSSLPDSLANLNADQCQGLLSLLQSHL--AKAQTSNPDSGTSHVAGR
        LHGYPPGYR++   A                    +VSE+  +  +S A    S+S P    +LN+ Q   L+ +LQSHL  AK +T  P    +HVAG+
Subjt:  LHGYPPGYRSSGTRATFAP--------------PVKVSERQSETVASTA---TSSSLPDSLANLNADQCQGLLSLLQSHL--AKAQTSNPDSGTSHVAGR

Query:  V
        V
Subjt:  V

A0A6J1D5E3 uncharacterized protein LOC1110171962.4e-4150.98Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSG----------PNRMVCTHCGL
        M FLMGLNDSFSQIRA LLLM P PTIN AF L+AQEVQQR I+ I S ++S ASS A     + +  + +  N  TS             + +CTHC L
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSG----------PNRMVCTHCGL

Query:  IGHTVDRCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFGGKLTQ
        + HTVDRCYKLHGYPPGYR+S  R T   P           A++  ++++PDSL+++NADQC GL ++LQSHLAK +T S  +SGTSHVAG+V    L +
Subjt:  IGHTVDRCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFGGKLTQ

Query:  DTWR
        D W+
Subjt:  DTWR

A0A6J1DNP7 uncharacterized protein LOC1110220653.1e-4156.22Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYK
        M FLMGLN SFSQIRAQLLLMEP PTINRAF+LVAQE+QQR+I S+PS +S  AS+V   T +S +++  SS        ++ +CTHCG+ GHTVD+CYK
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYK

Query:  LHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFG
        LH YPPGYRSS  + T +        ++ + + +AT S + +SLA L ADQCQ LL+LLQSHL   +T S+ DSGTSHVA   FG
Subjt:  LHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVFG

A0A6J1DX32 uncharacterized protein LOC1110252546.1e-3751.06Show/hide
Query:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASI----PSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVD
        M FLMGLNDSFSQ  AQLLLMEP+P+INR  SLVAQE QQRAI S+    P+T     +SV      +  T + S  NK+    ++ VCTHCG+IGHT D
Subjt:  MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASI----PSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVD

Query:  RCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVF
        +CY+LHGYPPG+R  G +++F+             ++++ SS+  DSLA+  ADQCQGLL+LL SHL+  Q  ++ DS T HVAG VF
Subjt:  RCYKLHGYPPGYRSSGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQT-SNPDSGTSHVAGRVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTTCTTGATGGGATTGAATGACTCTTTCAGTCAGATTAGGGCTCAATTACTTCTTATGGAGCCTGATCCCACCATTAATCGTGCTTTTTCTTTGGTTGCACAAGA
AGTACAACAGCGCGCGATTGCATCGATTCCGTCTACCTCTTCTTCAGGTGCTTCGTCTGTTGCTTTGTTGACTCGAAGCTCTCCGCATACGAAGGCACAATCATCTCAGA
ATAAGCGAACTTCTGGTCCCAATCGGATGGTTTGTACTCATTGCGGTCTTATCGGACACACAGTGGATCGTTGTTACAAGCTACATGGGTATCCTCCAGGTTATCGCTCA
TCTGGTACTCGTGCTACTTTCGCACCACCAGTTAAGGTATCTGAGAGGCAGTCCGAAACAGTGGCTTCTACTGCTACTTCATCTAGTCTTCCAGATTCTCTGGCCAATCT
TAATGCTGATCAGTGTCAAGGACTCCTGTCTTTGCTTCAATCACATTTAGCCAAAGCTCAGACATCTAACCCCGACTCTGGCACGTCTCATGTTGCAGGACGAGTGTTTG
GGGGTAAACTTACCCAGGACACGTGGAGGACCATGATTCGTGTTGGAAGAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTTTCTTGATGGGATTGAATGACTCTTTCAGTCAGATTAGGGCTCAATTACTTCTTATGGAGCCTGATCCCACCATTAATCGTGCTTTTTCTTTGGTTGCACAAGA
AGTACAACAGCGCGCGATTGCATCGATTCCGTCTACCTCTTCTTCAGGTGCTTCGTCTGTTGCTTTGTTGACTCGAAGCTCTCCGCATACGAAGGCACAATCATCTCAGA
ATAAGCGAACTTCTGGTCCCAATCGGATGGTTTGTACTCATTGCGGTCTTATCGGACACACAGTGGATCGTTGTTACAAGCTACATGGGTATCCTCCAGGTTATCGCTCA
TCTGGTACTCGTGCTACTTTCGCACCACCAGTTAAGGTATCTGAGAGGCAGTCCGAAACAGTGGCTTCTACTGCTACTTCATCTAGTCTTCCAGATTCTCTGGCCAATCT
TAATGCTGATCAGTGTCAAGGACTCCTGTCTTTGCTTCAATCACATTTAGCCAAAGCTCAGACATCTAACCCCGACTCTGGCACGTCTCATGTTGCAGGACGAGTGTTTG
GGGGTAAACTTACCCAGGACACGTGGAGGACCATGATTCGTGTTGGAAGAAATTAG
Protein sequenceShow/hide protein sequence
MTFLMGLNDSFSQIRAQLLLMEPDPTINRAFSLVAQEVQQRAIASIPSTSSSGASSVALLTRSSPHTKAQSSQNKRTSGPNRMVCTHCGLIGHTVDRCYKLHGYPPGYRS
SGTRATFAPPVKVSERQSETVASTATSSSLPDSLANLNADQCQGLLSLLQSHLAKAQTSNPDSGTSHVAGRVFGGKLTQDTWRTMIRVGRN