; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g29280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g29280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon opus
Genome locationchr6:22041429..22042829
RNA-Seq ExpressionMoc06g29280
SyntenyMoc06g29280
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157836.1 uncharacterized protein LOC111024449 [Momordica charantia]1.5e-2627.58Show/hide
Query:  SPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------PQQYNQQRAQNNIQQGGSNASVEVM------------
        +PV  + +  C Y GD+H  ENCP+NP  + YVGQ S                          PQQ+NQQ+  +   Q   +    +M            
Subjt:  SPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------PQQYNQQRAQNNIQQGGSNASVEVM------------

Query:  ------MKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEETDAVTHVPVSTS
              M+EF TR++  I+++ +R DAA+RNLE Q+GQ+A++ K+RP+GTL   TE PK   EG EHCK + TRS L+Y+ P +P E +   T    + +
Subjt:  ------MKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEETDAVTHVPVSTS

Query:  NPQQE-----------------------------------------------------------------------------------------------
         P +                                                                                                
Subjt:  NPQQE-----------------------------------------------------------------------------------------------

Query:  --EKAERVSSEEKGKKAD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE
          + A+R  ++ +GK  D     DK + P        + D +V II+GRPFL+TG+ + +V+KGE+TM V+D++VTFN+LDAM+ PD+
Subjt:  --EKAERVSSEEKGKKAD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]1.2e-3957.14Show/hide
Query:  MPEPSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQ-------------------------------------------------------ASQPQQ
        M EPS  LQISDISCVY GDN LYENCPANP S+FYVGQ                                                       ASQPQQ
Subjt:  MPEPSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQ-------------------------------------------------------ASQPQQ

Query:  YNQQRAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPK
        YNQQRAQN  QQGGSN S+E M KEFMTRSEAT KEFMTRTD  IR LEMQVGQIAND+KSRPQGTL G+TENPK
Subjt:  YNQQRAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]9.9e-2325.85Show/hide
Query:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------------------------------------PQQYNQ
        PSPV QI++ +C Y GD H  ENCP+NP+S++YVGQ +Q                                                       P QYNQ
Subjt:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------------------------------------PQQYNQ

Query:  QRAQNNIQQGGSNAS-VEVMMKEFMTRSEATIKEFMTRT-----------------DAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCK
        Q+  N +Q    N S +E++MKE +T+++AT+KE MTRT                 D  +R LEMQ+GQ+ N+ ++RPQG+L   TE P+  R G EHC 
Subjt:  QRAQNNIQQGGSNAS-VEVMMKEFMTRSEATIKEFMTRT-----------------DAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCK

Query:  AVITRSILSYKGPSLPTE---------ETDAV--------THVP----VSTSNPQQE-------------------------------------------
        ++ TRS L Y+GP +P E         +T AV          VP    VS S P                                              
Subjt:  AVITRSILSYKGPSLPTE---------ETDAV--------THVP----VSTSNPQQE-------------------------------------------

Query:  --------------------------------------------------------------------------------------EKAERVSSEEKGKK
                                                                                              + A+R  ++ +GK 
Subjt:  --------------------------------------------------------------------------------------EKAERVSSEEKGKK

Query:  AD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE
         D     DK + P        + D +V II+GRPFL TG+ + +V+KGE+TM+V+D++VTFN+LDAM+  D+
Subjt:  AD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE

XP_024038239.1 uncharacterized protein LOC112097286 [Citrus clementina]3.3e-2630.75Show/hide
Query:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVG--------------------------------------------QASQPQQYNQQRAQNNIQQGG
        P+ V Q++++SCVY G++H ++NCP NPASI YVG                                            Q  QP  + Q + Q ++ Q  
Subjt:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVG--------------------------------------------QASQPQQYNQQRAQNNIQQGG

Query:  SNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAV-------------ITRSILSYKGPSLP
           S+EV++KE++ ++E+ ++        ++RNLE Q+GQ+A    SR QG+L  +TE P+  RE  EHCK +             IT++ +       P
Subjt:  SNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAV-------------ITRSILSYKGPSLP

Query:  TEETDAVTHVP-------------VSTSNPQQEEK----------------AERVSSEEKGKKAD----KDKQVVP----NTTPQVDLEVTIIIGRPFLT
        T++   +   P                  P+  EK                A+R  +  +G   D     DK ++P        + D EV II+GRPFL 
Subjt:  TEETDAVTHVP-------------VSTSNPQQEEK----------------AERVSSEEKGKKAD----KDKQVVP----NTTPQVDLEVTIIIGRPFLT

Query:  TGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE
        TG  + +V+KGE+TM+VND+QVTFNVL+AMR PDE
Subjt:  TGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE

XP_030497888.1 uncharacterized protein LOC115713544 [Cannabis sativa]8.1e-2529.74Show/hide
Query:  EPSPVLQISDISCVYWGDNHLYENCPANPAS----------------IFYVGQASQPQQYNQQ-RAQNNIQ-QGGSNASVEVMMKEFMTRSEATIKEFMT
        +P+  +Q ++ISCVY+GD H +ENCP+NP S                    G+ S P +++QQ R Q   Q QG   +S+E +M+++ T+++A I+  + 
Subjt:  EPSPVLQISDISCVYWGDNHLYENCPANPAS----------------IFYVGQASQPQQYNQQ-RAQNNIQ-QGGSNASVEVMMKEFMTRSEATIKEFMT

Query:  RTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRS---------ILSYKGPSLPTEETD--------AVTHVPVSTSNPQQE-
           A+++NLE+Q+GQ+AND KSRPQGTL   T+NP+RD  G EHCKAV  RS             K PSL  +E +        A    PV T++ Q   
Subjt:  RTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRS---------ILSYKGPSLPTEETD--------AVTHVPVSTSNPQQE-

Query:  ----------------------------------------------------------------------------------------------EKAERV
                                                                                                      +  +R 
Subjt:  ----------------------------------------------------------------------------------------------EKAERV

Query:  SSEEKGKKADKDKQVVPNTTP--------QVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE
         +  +GK  D   QV     P        + D EV II+GRPFL TG  + +V+ GE+TM+VND++VTFNV +AMR PDE
Subjt:  SSEEKGKKADKDKQVVPNTTP--------QVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE

TrEMBL top hitse value%identityAlignment
A0A6J1DAE9 uncharacterized protein LOC1110185144.8e-2339.22Show/hide
Query:  SPVLQISDISCVYWGDNHLYENCPANPASIFYVGQ---------------------------------ASQ------------------------PQQYN
        SPV QI++I C Y  DNH+YENCP NPAS +YVG                                  A+Q                        PQQYN
Subjt:  SPVLQISDISCVYWGDNHLYENCPANPASIFYVGQ---------------------------------ASQ------------------------PQQYN

Query:  QQRAQNNIQQGGSNASVEVMMKEFMTRSEATI--KEFMTRTDAA-IRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPS
        Q +   +     +N S+E M KE+M R++A +  +    +T AA +RNLE+Q+GQ AND K+RPQG+  GHTE  KRD  G E CKAV  RS LSY+GP 
Subjt:  QQRAQNNIQQGGSNASVEVMMKEFMTRSEATI--KEFMTRTDAA-IRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPS

Query:  LPTE
        +P E
Subjt:  LPTE

A0A6J1DY39 uncharacterized protein LOC1110256534.8e-2325.85Show/hide
Query:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------------------------------------PQQYNQ
        PSPV QI++ +C Y GD H  ENCP+NP+S++YVGQ +Q                                                       P QYNQ
Subjt:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------------------------------------PQQYNQ

Query:  QRAQNNIQQGGSNAS-VEVMMKEFMTRSEATIKEFMTRT-----------------DAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCK
        Q+  N +Q    N S +E++MKE +T+++AT+KE MTRT                 D  +R LEMQ+GQ+ N+ ++RPQG+L   TE P+  R G EHC 
Subjt:  QRAQNNIQQGGSNAS-VEVMMKEFMTRSEATIKEFMTRT-----------------DAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCK

Query:  AVITRSILSYKGPSLPTE---------ETDAV--------THVP----VSTSNPQQE-------------------------------------------
        ++ TRS L Y+GP +P E         +T AV          VP    VS S P                                              
Subjt:  AVITRSILSYKGPSLPTE---------ETDAV--------THVP----VSTSNPQQE-------------------------------------------

Query:  --------------------------------------------------------------------------------------EKAERVSSEEKGKK
                                                                                              + A+R  ++ +GK 
Subjt:  --------------------------------------------------------------------------------------EKAERVSSEEKGKK

Query:  AD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE
         D     DK + P        + D +V II+GRPFL TG+ + +V+KGE+TM+V+D++VTFN+LDAM+  D+
Subjt:  AD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE

A0A6J1DYG0 uncharacterized protein LOC1110257643.4e-2133.94Show/hide
Query:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVG-----------------------------------------QASQP-------------QQYNQQ
        P+PV Q++D+ C +  +NH+Y+ CP NPAS+FYVG                                         Q  QP             Q+YNQ+
Subjt:  PSPVLQISDISCVYWGDNHLYENCPANPASIFYVG-----------------------------------------QASQP-------------QQYNQQ

Query:  RAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEE
             +Q   SN  +E MMKE+M R++A I+       A++RN E Q+GQ+AN+ K+RPQG+   HTE PK  REG E CKAV  RS L+Y  P++PT +
Subjt:  RAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEE

Query:  -----TDAVTHVPVSTSNPQQ
             T     +P + + P++
Subjt:  -----TDAVTHVPVSTSNPQQ

A0A6J1DZ19 uncharacterized protein LOC1110248245.6e-4057.14Show/hide
Query:  MPEPSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQ-------------------------------------------------------ASQPQQ
        M EPS  LQISDISCVY GDN LYENCPANP S+FYVGQ                                                       ASQPQQ
Subjt:  MPEPSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQ-------------------------------------------------------ASQPQQ

Query:  YNQQRAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPK
        YNQQRAQN  QQGGSN S+E M KEFMTRSEAT KEFMTRTD  IR LEMQVGQIAND+KSRPQGTL G+TENPK
Subjt:  YNQQRAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPK

A0A6J1DZC3 uncharacterized protein LOC1110244497.1e-2727.58Show/hide
Query:  SPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------PQQYNQQRAQNNIQQGGSNASVEVM------------
        +PV  + +  C Y GD+H  ENCP+NP  + YVGQ S                          PQQ+NQQ+  +   Q   +    +M            
Subjt:  SPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQ-------------------------PQQYNQQRAQNNIQQGGSNASVEVM------------

Query:  ------MKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEETDAVTHVPVSTS
              M+EF TR++  I+++ +R DAA+RNLE Q+GQ+A++ K+RP+GTL   TE PK   EG EHCK + TRS L+Y+ P +P E +   T    + +
Subjt:  ------MKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQGTLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEETDAVTHVPVSTS

Query:  NPQQE-----------------------------------------------------------------------------------------------
         P +                                                                                                
Subjt:  NPQQE-----------------------------------------------------------------------------------------------

Query:  --EKAERVSSEEKGKKAD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE
          + A+R  ++ +GK  D     DK + P        + D +V II+GRPFL+TG+ + +V+KGE+TM V+D++VTFN+LDAM+ PD+
Subjt:  --EKAERVSSEEKGKKAD----KDKQVVPN----TTPQVDLEVTIIIGRPFLTTGDMVFNVRKGEITMKVNDEQVTFNVLDAMRLPDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGAACCGTCTCCTGTTTTACAAATTTCAGATATATCTTGTGTCTATTGGGGTGATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCTTCTATTTTCTATGT
AGGTCAAGCATCGCAGCCTCAACAATACAATCAGCAAAGAGCTCAAAATAACATTCAGCAAGGTGGTAGCAACGCAAGTGTGGAGGTCATGATGAAAGAATTCATGACAA
GAAGTGAAGCTACAATAAAAGAGTTCATGACAAGAACTGATGCTGCGATAAGGAACTTGGAGATGCAAGTGGGGCAGATAGCAAATGACCAAAAATCTAGACCCCAAGGT
ACATTGTCTGGACACACAGAGAACCCGAAGCGAGATCGTGAGGGAAATGAGCATTGTAAGGCGGTTATCACAAGAAGCATACTAAGTTATAAAGGACCCTCACTTCCAAC
TGAAGAAACTGATGCAGTTACACATGTTCCTGTATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACGCGTAAGTTCAGAAGAAAAAGGTAAAAAGGCAGATAAAG
ATAAGCAAGTAGTGCCCAACACTACCCCACAGGTAGATCTTGAGGTGACGATCATTATAGGAAGGCCATTTTTAACAACTGGAGATATGGTATTCAATGTTAGGAAAGGA
GAGATCACTATGAAGGTTAATGATGAGCAAGTAACCTTCAATGTCCTTGATGCGATGCGTCTCCCGGATGAGTCGAGGACTGCTCTACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTGAACCGTCTCCTGTTTTACAAATTTCAGATATATCTTGTGTCTATTGGGGTGATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCTTCTATTTTCTATGT
AGGTCAAGCATCGCAGCCTCAACAATACAATCAGCAAAGAGCTCAAAATAACATTCAGCAAGGTGGTAGCAACGCAAGTGTGGAGGTCATGATGAAAGAATTCATGACAA
GAAGTGAAGCTACAATAAAAGAGTTCATGACAAGAACTGATGCTGCGATAAGGAACTTGGAGATGCAAGTGGGGCAGATAGCAAATGACCAAAAATCTAGACCCCAAGGT
ACATTGTCTGGACACACAGAGAACCCGAAGCGAGATCGTGAGGGAAATGAGCATTGTAAGGCGGTTATCACAAGAAGCATACTAAGTTATAAAGGACCCTCACTTCCAAC
TGAAGAAACTGATGCAGTTACACATGTTCCTGTATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACGCGTAAGTTCAGAAGAAAAAGGTAAAAAGGCAGATAAAG
ATAAGCAAGTAGTGCCCAACACTACCCCACAGGTAGATCTTGAGGTGACGATCATTATAGGAAGGCCATTTTTAACAACTGGAGATATGGTATTCAATGTTAGGAAAGGA
GAGATCACTATGAAGGTTAATGATGAGCAAGTAACCTTCAATGTCCTTGATGCGATGCGTCTCCCGGATGAGTCGAGGACTGCTCTACAATAG
Protein sequenceShow/hide protein sequence
MPEPSPVLQISDISCVYWGDNHLYENCPANPASIFYVGQASQPQQYNQQRAQNNIQQGGSNASVEVMMKEFMTRSEATIKEFMTRTDAAIRNLEMQVGQIANDQKSRPQG
TLSGHTENPKRDREGNEHCKAVITRSILSYKGPSLPTEETDAVTHVPVSTSNPQQEEKAERVSSEEKGKKADKDKQVVPNTTPQVDLEVTIIIGRPFLTTGDMVFNVRKG
EITMKVNDEQVTFNVLDAMRLPDESRTALQ