; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g14580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g14580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon opus
Genome locationchr10:11120904..11126162
RNA-Seq ExpressionMoc10g14580
SyntenyMoc10g14580
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3473721.1 retroelement pol polyprotein-like [Gossypium australe]5.2e-2035.65Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALP---------------------------H
        +E+L++ YM KNDALIQSQAATL+N+E Q+GQ+A EL+NR Q  LP D ENP+  G E C+ALTL S K +                             
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALP---------------------------H

Query:  PYPSLMQEDNVIHA-----------------------ILIGKFPQKMGDQGNFTIPVSIGGKNL---ADRSITHPGGRPFLSTGRALTDVHNGELTVRVN
        P P   + D V                           +  K P K+ D G FTIP +IG        D+ +    GRPFL+TGR + DV  GELT+RV 
Subjt:  PYPSLMQEDNVIHA-----------------------ILIGKFPQKMGDQGNFTIPVSIGGKNL---ADRSITHPGGRPFLSTGRALTDVHNGELTVRVN

Query:  DQQLEGTVEGQTAIRD
          Q     + + +I +
Subjt:  DQQLEGTVEGQTAIRD

XP_022157917.1 uncharacterized protein LOC111024527 [Momordica charantia]4.4e-1937.57Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALPHPYPSLMQEDNVIHAILIGKFPQKMGDQ
        +E LM+ YM  ND ++QSQAA+LRN+E+Q+GQ+A +LK+RPQ  +   +E  ++  N       + + K     Y ++         ILI K P KM D 
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALPHPYPSLMQEDNVIHAILIGKFPQKMGDQ

Query:  GNFTIPVSIGGKNLA--------------------------DRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTV
        G+FTIPVSIGG+ +                           D+ ++   GRPFL T R L DVH GELT+RV DQ+++ +V
Subjt:  GNFTIPVSIGGKNLA--------------------------DRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTV

XP_030495102.1 uncharacterized protein LOC115710889 [Cannabis sativa]6.8e-2037.61Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGK-------ALPHPYPSLMQEDNVIH---AILI
        +E+LMRDYMTKNDA+IQSQAA+LRN+EVQLGQ+A +LKNRPQ  LP D ENP+++G E C+A+TL SGK       A     PS +Q++  +    AI I
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGK-------ALPHPYPSLMQEDNVIH---AILI

Query:  G-KFP--------------------------QKMGDQGNF------------TIP-----------------VSIGGKNLAD----------RSITHPGG
          KFP                          +K  D G F             IP                 +    + L +          R +    G
Subjt:  G-KFP--------------------------QKMGDQGNF------------TIP-----------------VSIGGKNLAD----------RSITHPGG

Query:  RPFLSTGRALTDVHNGELTVRVNDQQ
        RPFL+TGR L DV  GELT+R  D+Q
Subjt:  RPFLSTGRALTDVHNGELTVRVNDQQ

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.1e-1727.15Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKAL-----------------------------
        +E+LMRDYM KNDA+IQSQAA LRN+E+QLG +A ELK RPQ  LP D ENP+++G EQC+++ L SGK L                             
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKAL-----------------------------

Query:  -----------------------------PHPYP------------------------------SLMQEDNVI------------------------HAI
                                     P P+P                              +L Q  N +                         A+
Subjt:  -----------------------------PHPYP------------------------------SLMQEDNVI------------------------HAI

Query:  LIGKFPQKMGDQGNFTIPVSIGGKN--------------LADRSITHPGG----------------------------------RPFLSTGRALTDVHNG
        L  K P K+ D G+FTIP+SIGG++              LADRS+ HP G                                  RPFL+TGR L DV  G
Subjt:  LIGKFPQKMGDQGNFTIPVSIGGKN--------------LADRSITHPGG----------------------------------RPFLSTGRALTDVHNG

Query:  ELTVRVNDQQLEGTVEGQTAIRDAFPDEQLLVVIENKKDLEPCALSFSFISALGSLQHRQFVSIVIPICLGLKRKQGRVRKLSP-QDLTPSLSPKKSPEK
        ELT+R  D+Q    V       DA   E L +      D++P             ++  +F            +   +VRK  P +++T    P++  +K
Subjt:  ELTVRVNDQQLEGTVEGQTAIRDAFPDEQLLVVIENKKDLEPCALSFSFISALGSLQHRQFVSIVIPICLGLKRKQGRVRKLSP-QDLTPSLSPKKSPEK

Query:  ALEKSPPNLTRYPKKPTQKKKSLPRLRRNWQ
        AL          PK+P +KK+   +L R +Q
Subjt:  ALEKSPPNLTRYPKKPTQKKKSLPRLRRNWQ

XP_030505532.1 uncharacterized protein LOC115720524 [Cannabis sativa]2.4e-1729.84Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGK------------------------AL-----
        +E+LMRDYM K DA+IQSQ A+LRN+E+QLG +A ELK RPQ  LP D +NP+++G EQC+++ L SGK                        AL     
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGK------------------------AL-----

Query:  --PHPYP------------------------------SLMQEDNVI-------------------------HAILIGKFPQKMGDQGNFTIPVSIGGKN-
          P P+P                              +L Q  N +                         +A+L  K P K+ D G+FTIP SI G++ 
Subjt:  --PHPYP------------------------------SLMQEDNVI-------------------------HAILIGKFPQKMGDQGNFTIPVSIGGKN-

Query:  -----------------------------------LADRSITHPG----------------------------------GRPFLSTGRALTDVHNGELTV
                                           LADRS+ HP                                   GRPFL+TGR+L DV NGELT+
Subjt:  -----------------------------------LADRSITHPG----------------------------------GRPFLSTGRALTDVHNGELTV

Query:  RVNDQ
        RVND+
Subjt:  RVNDQ

TrEMBL top hitse value%identityAlignment
A0A5B6VKQ8 Retrovirus-related Pol polyprotein from transposon 17.67.6e-1735.11Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALPHPY-PSLMQEDNVIHAILIGKFPQ----
        MEAL+++YM KND +IQSQA +LR +E Q+GQ++  L +R Q  LP D +N + +G E C+A+T  SG  LP     ++++ED++         PQ    
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALPHPY-PSLMQEDNVIHAILIGKFPQ----

Query:  --------KMGDQGNFTIPVSIGGKNLADRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTVEGQTAIRDAFPDEQLLVVIE
                +M +   F   + +    L +  I      PFL+TGR L D+  GELT+RVNDQQ    V       D   D Q + +++
Subjt:  --------KMGDQGNFTIPVSIGGKNLADRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTVEGQTAIRDAFPDEQLLVVIE

A0A5B6VWJ0 Retroelement pol polyprotein-like2.5e-2035.65Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALP---------------------------H
        +E+L++ YM KNDALIQSQAATL+N+E Q+GQ+A EL+NR Q  LP D ENP+  G E C+ALTL S K +                             
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALP---------------------------H

Query:  PYPSLMQEDNVIHA-----------------------ILIGKFPQKMGDQGNFTIPVSIGGKNL---ADRSITHPGGRPFLSTGRALTDVHNGELTVRVN
        P P   + D V                           +  K P K+ D G FTIP +IG        D+ +    GRPFL+TGR + DV  GELT+RV 
Subjt:  PYPSLMQEDNVIHA-----------------------ILIGKFPQKMGDQGNFTIPVSIGGKNL---ADRSITHPGGRPFLSTGRALTDVHNGELTVRVN

Query:  DQQLEGTVEGQTAIRD
          Q     + + +I +
Subjt:  DQQLEGTVEGQTAIRD

A0A6J1DVS9 uncharacterized protein LOC1110245272.1e-1937.57Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALPHPYPSLMQEDNVIHAILIGKFPQKMGDQ
        +E LM+ YM  ND ++QSQAA+LRN+E+Q+GQ+A +LK+RPQ  +   +E  ++  N       + + K     Y ++         ILI K P KM D 
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQ-RLPCDIENPKKEGNEQCQALTLHSGKALPHPYPSLMQEDNVIHAILIGKFPQKMGDQ

Query:  GNFTIPVSIGGKNLA--------------------------DRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTV
        G+FTIPVSIGG+ +                           D+ ++   GRPFL T R L DVH GELT+RV DQ+++ +V
Subjt:  GNFTIPVSIGGKNLA--------------------------DRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTV

A0A6J1DWK1 uncharacterized protein LOC1110250533.2e-1545.83Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRP-QRLPCDIENPKKEGNEQCQALTLHSGKALP--HP-YPSLMQEDNVIHAILIGKFPQKM
        +E +M+ YM  NDA +QSQAA+LRN+E+Q+GQ+A++LK+RP   LP D E PK++  EQC ALTL SGKALP  HP  P+L +E     A ++   PQ  
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRP-QRLPCDIENPKKEGNEQCQALTLHSGKALP--HP-YPSLMQEDNVIHAILIGKFPQKM

Query:  GDQGNFTIPVSIGGKNLADR
         D     + V I  + +A++
Subjt:  GDQGNFTIPVSIGGKNLADR

A0A6J1GJ68 uncharacterized protein LOC1114543447.1e-1559.72Show/hide
Query:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRP-QRLPCDIENPKKEGNEQCQALTLHSGKALP
        +E+L+++YM KND +IQSQ A+L+N+EVQ+GQ+A EL+NRP  +LP D E PK+EG EQCQA+ L SGK +P
Subjt:  MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRP-QRLPCDIENPKKEGNEQCQALTLHSGKALP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTCTGATGAGAGATTATATGACCAAGAATGATGCTTTGATTCAAAGCCAAGCTGCTACTTTAAGAAATATGGAAGTTCAACTGGGACAGATGGCTATTGAATT
GAAAAACAGGCCACAACGCTTACCATGTGATATTGAGAATCCTAAGAAGGAAGGGAATGAGCAATGTCAAGCATTGACTCTTCACAGTGGAAAGGCATTACCACACCCAT
ACCCGTCTCTGATGCAAGAGGATAATGTAATTCATGCAATCCTCATAGGAAAGTTTCCTCAGAAAATGGGTGACCAAGGGAATTTCACCATTCCTGTGTCTATAGGAGGA
AAGAATTTGGCAGATAGGTCCATTACACACCCTGGAGGAAGACCGTTCTTATCCACGGGTAGAGCCTTAACAGATGTACATAATGGAGAGCTGACCGTGAGAGTTAATGA
CCAGCAGCTTGAGGGAACAGTGGAAGGGCAGACTGCTATACGAGATGCATTCCCTGATGAACAGCTGTTAGTGGTGATAGAAAATAAGAAAGACTTGGAACCATGCGCCC
TGTCTTTTTCCTTCATCTCAGCCCTAGGCTCCCTGCAGCATCGACAATTCGTTAGCATTGTCATCCCAATATGCCTCGGGTTAAAGCGAAAGCAAGGAAGGGTAAGAAAA
CTATCCCCCCAAGACCTGACACCATCCCTTTCGCCGAAAAAATCACCAGAAAAAGCTCTAGAGAAGTCTCCCCCCAACCTGACACGTTACCCGAAGAAACCGACCCAGAA
AAAGAAATCCCTTCCCCGCCTAAGAAGAAACTGGCAGCCAAAAGAGGAAGAAAACAGCGAAAAGCAAGAGGCTACCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTCTGATGAGAGATTATATGACCAAGAATGATGCTTTGATTCAAAGCCAAGCTGCTACTTTAAGAAATATGGAAGTTCAACTGGGACAGATGGCTATTGAATT
GAAAAACAGGCCACAACGCTTACCATGTGATATTGAGAATCCTAAGAAGGAAGGGAATGAGCAATGTCAAGCATTGACTCTTCACAGTGGAAAGGCATTACCACACCCAT
ACCCGTCTCTGATGCAAGAGGATAATGTAATTCATGCAATCCTCATAGGAAAGTTTCCTCAGAAAATGGGTGACCAAGGGAATTTCACCATTCCTGTGTCTATAGGAGGA
AAGAATTTGGCAGATAGGTCCATTACACACCCTGGAGGAAGACCGTTCTTATCCACGGGTAGAGCCTTAACAGATGTACATAATGGAGAGCTGACCGTGAGAGTTAATGA
CCAGCAGCTTGAGGGAACAGTGGAAGGGCAGACTGCTATACGAGATGCATTCCCTGATGAACAGCTGTTAGTGGTGATAGAAAATAAGAAAGACTTGGAACCATGCGCCC
TGTCTTTTTCCTTCATCTCAGCCCTAGGCTCCCTGCAGCATCGACAATTCGTTAGCATTGTCATCCCAATATGCCTCGGGTTAAAGCGAAAGCAAGGAAGGGTAAGAAAA
CTATCCCCCCAAGACCTGACACCATCCCTTTCGCCGAAAAAATCACCAGAAAAAGCTCTAGAGAAGTCTCCCCCCAACCTGACACGTTACCCGAAGAAACCGACCCAGAA
AAAGAAATCCCTTCCCCGCCTAAGAAGAAACTGGCAGCCAAAAGAGGAAGAAAACAGCGAAAAGCAAGAGGCTACCAGTTGA
Protein sequenceShow/hide protein sequence
MEALMRDYMTKNDALIQSQAATLRNMEVQLGQMAIELKNRPQRLPCDIENPKKEGNEQCQALTLHSGKALPHPYPSLMQEDNVIHAILIGKFPQKMGDQGNFTIPVSIGG
KNLADRSITHPGGRPFLSTGRALTDVHNGELTVRVNDQQLEGTVEGQTAIRDAFPDEQLLVVIENKKDLEPCALSFSFISALGSLQHRQFVSIVIPICLGLKRKQGRVRK
LSPQDLTPSLSPKKSPEKALEKSPPNLTRYPKKPTQKKKSLPRLRRNWQPKEEENSEKQEATS