; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g25110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g25110
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetroelement pol polyprotein-like
Genome locationchr4:18237632..18241133
RNA-Seq ExpressionMoc04g25110
SyntenyMoc04g25110
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031937.1 retroelement pol polyprotein-like [Cucumis melo var. makuwa]7.7e-1136.18Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS
        +++FLMGLN+S+A T  Q+LLM+P  S+++AFSL+ QEEQQR+ISS S A++  + A+    N     S     KD P   H  +         +DK TS
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS

Query:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG
        + IG      GLY LD      ++  ++ ++AS   +R+++   FS   Y G
Subjt:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG

TYK16758.1 Copia protein [Cucumis melo var. makuwa]7.7e-1136.18Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS
        +++FLMGLN+S+A T  Q+LLM+P  S+++AFSL+ QEEQQR+ISS S A++  + A+    N     S     KD P   H  +         +DK TS
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS

Query:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG
        + IG      GLY LD      ++  ++ ++AS   +R+++   FS   Y G
Subjt:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]3.6e-0863.79Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAM
        +IN LMGL+E + ST A++LLMDPP SVNKA SLVRQ+EQQRSI +S++  +A SFA+
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAM

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]1.1e-0435Show/hide
Query:  HASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRGSSLHGLYPLDN
        H  +P  + L + P  V +    VR         S   ++    +   F+FN++S+++ L+DMP L VEF+N + I++DKS S+ I +G   HGLY LDN
Subjt:  HASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRGSSLHGLYPLDN

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]5.2e-0730.43Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRG
        +++FLMGLN+S+A T  Q+LLM+P  S+++AFSL+ QEEQQR+ISS S A++  +F                               L DK TS+ IG  
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRG

Query:  SSLHGLYPLDNMHVPVAVSNLSLSRASERVELIVQFSESAYKGTLIYFPKVGKKRIPSSNS---PLGLVP----KMQPSTLDPNSSTDPLS-TTLGPTVN
            GLY LD      ++  ++ ++AS                  ++  ++G     + NS   PL L P    KM   T+ P +    LS T+     +
Subjt:  SSLHGLYPLDNMHVPVAVSNLSLSRASERVELIVQFSESAYKGTLIYFPKVGKKRIPSSNS---PLGLVP----KMQPSTLDPNSSTDPLS-TTLGPTVN

Query:  NLFDALH
        N FD +H
Subjt:  NLFDALH

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]8.6e-1040.91Show/hide
Query:  FLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSI----------------SSSSSAVSAS----------SFAMTFHFNILSISMPLKDMPHL
        FLMGLNES +    Q+LLM+P  ++N+ FSLV QE QQR+I                SSSSS  S S          +F  T  +N+L +S    D   +
Subjt:  FLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSI----------------SSSSSAVSAS----------SFAMTFHFNILSISMPLKDMPHL

Query:  LVEFSNTSYILEDKSTSRMIGRGSSLHGLYPL
         V F++   IL+DKS+S+MIG+  S HGLY L
Subjt:  LVEFSNTSYILEDKSTSRMIGRGSSLHGLYPL

TrEMBL top hitse value%identityAlignment
A0A5A7SRC2 Retroelement pol polyprotein-like3.8e-1136.18Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS
        +++FLMGLN+S+A T  Q+LLM+P  S+++AFSL+ QEEQQR+ISS S A++  + A+    N     S     KD P   H  +         +DK TS
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS

Query:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG
        + IG      GLY LD      ++  ++ ++AS   +R+++   FS   Y G
Subjt:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG

A0A5D3CZP1 Copia protein3.8e-1136.18Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS
        +++FLMGLN+S+A T  Q+LLM+P  S+++AFSL+ QEEQQR+ISS S A++  + A+    N     S     KD P   H  +         +DK TS
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFN---ILSISMPLKDMP---HLLVEFSNTSYILEDKSTS

Query:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG
        + IG      GLY LD      ++  ++ ++AS   +R+++   FS   Y G
Subjt:  RMIGRGSSLHGLYPLDNMHVPVAVSNLSLSRAS---ERVELIVQFSESAYKG

A0A6J1CR17 uncharacterized protein LOC1110134411.7e-0863.79Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAM
        +IN LMGL+E + ST A++LLMDPP SVNKA SLVRQ+EQQRSI +S++  +A SFA+
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAM

A0A6J1CR17 uncharacterized protein LOC1110134415.2e-0535Show/hide
Query:  HASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRGSSLHGLYPLDN
        H  +P  + L + P  V +    VR         S   ++    +   F+FN++S+++ L+DMP L VEF+N + I++DKS S+ I +G   HGLY LDN
Subjt:  HASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRGSSLHGLYPLDN

A0A6J1CR17 uncharacterized protein LOC1110134412.5e-0730.43Show/hide
Query:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRG
        +++FLMGLN+S+A T  Q+LLM+P  S+++AFSL+ QEEQQR+ISS S A++  +F                               L DK TS+ IG  
Subjt:  MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRG

Query:  SSLHGLYPLDNMHVPVAVSNLSLSRASERVELIVQFSESAYKGTLIYFPKVGKKRIPSSNS---PLGLVP----KMQPSTLDPNSSTDPLS-TTLGPTVN
            GLY LD      ++  ++ ++AS                  ++  ++G     + NS   PL L P    KM   T+ P +    LS T+     +
Subjt:  SSLHGLYPLDNMHVPVAVSNLSLSRASERVELIVQFSESAYKGTLIYFPKVGKKRIPSSNS---PLGLVP----KMQPSTLDPNSSTDPLS-TTLGPTVN

Query:  NLFDALH
        N FD +H
Subjt:  NLFDALH

A0A6J1DIP8 uncharacterized protein LOC1110203994.1e-1040.91Show/hide
Query:  FLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSI----------------SSSSSAVSAS----------SFAMTFHFNILSISMPLKDMPHL
        FLMGLNES +    Q+LLM+P  ++N+ FSLV QE QQR+I                SSSSS  S S          +F  T  +N+L +S    D   +
Subjt:  FLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSI----------------SSSSSAVSAS----------SFAMTFHFNILSISMPLKDMPHL

Query:  LVEFSNTSYILEDKSTSRMIGRGSSLHGLYPL
         V F++   IL+DKS+S+MIG+  S HGLY L
Subjt:  LVEFSNTSYILEDKSTSRMIGRGSSLHGLYPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAATTTTTTGATGGGCCTCAATGAGTCTCATGCCTCTACTCCGGCTCAAATTTTGCTGATGGATCCTCCTCTGTCTGTCAACAAGGCGTTCTCGTTGGTGCGACA
AGAAGAGCAACAACGTTCGATTAGTTCTAGTTCCAGTGCTGTCTCTGCTTCTTCTTTTGCTATGACATTTCACTTTAACATACTTTCCATTAGTATGCCGTTAAAGGATA
TGCCTCATCTATTGGTGGAATTCTCTAATACCTCTTATATTCTTGAGGACAAGTCCACTTCGAGGATGATTGGCAGGGGTAGCTCGCTCCATGGACTTTATCCGCTTGAT
AATATGCATGTTCCTGTTGCAGTCTCCAACTTAAGTCTCTCTCGGGCCAGTGAGAGGGTGGAGCTCATTGTTCAATTCTCGGAGTCAGCATATAAGGGAACACTCATCTA
TTTCCCTAAAGTCGGGAAGAAGCGAATTCCATCTTCCAACTCCCCACTCGGTCTCGTCCCCAAAATGCAGCCTAGCACACTTGATCCTAATAGTAGCACTGATCCACTGT
CTACCACACTTGGTCCTACTGTTAACAACTTGTTCGACGCTCTCCACAGGGGCGCCTTCTTATATGCAAGATTATCACTGCAGTTCATTGGCTCATTCTTTGCCATCTCC
AGAATTTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTAATTTTTTGATGGGCCTCAATGAGTCTCATGCCTCTACTCCGGCTCAAATTTTGCTGATGGATCCTCCTCTGTCTGTCAACAAGGCGTTCTCGTTGGTGCGACA
AGAAGAGCAACAACGTTCGATTAGTTCTAGTTCCAGTGCTGTCTCTGCTTCTTCTTTTGCTATGACATTTCACTTTAACATACTTTCCATTAGTATGCCGTTAAAGGATA
TGCCTCATCTATTGGTGGAATTCTCTAATACCTCTTATATTCTTGAGGACAAGTCCACTTCGAGGATGATTGGCAGGGGTAGCTCGCTCCATGGACTTTATCCGCTTGAT
AATATGCATGTTCCTGTTGCAGTCTCCAACTTAAGTCTCTCTCGGGCCAGTGAGAGGGTGGAGCTCATTGTTCAATTCTCGGAGTCAGCATATAAGGGAACACTCATCTA
TTTCCCTAAAGTCGGGAAGAAGCGAATTCCATCTTCCAACTCCCCACTCGGTCTCGTCCCCAAAATGCAGCCTAGCACACTTGATCCTAATAGTAGCACTGATCCACTGT
CTACCACACTTGGTCCTACTGTTAACAACTTGTTCGACGCTCTCCACAGGGGCGCCTTCTTATATGCAAGATTATCACTGCAGTTCATTGGCTCATTCTTTGCCATCTCC
AGAATTTACTAA
Protein sequenceShow/hide protein sequence
MINFLMGLNESHASTPAQILLMDPPLSVNKAFSLVRQEEQQRSISSSSSAVSASSFAMTFHFNILSISMPLKDMPHLLVEFSNTSYILEDKSTSRMIGRGSSLHGLYPLD
NMHVPVAVSNLSLSRASERVELIVQFSESAYKGTLIYFPKVGKKRIPSSNSPLGLVPKMQPSTLDPNSSTDPLSTTLGPTVNNLFDALHRGAFLYARLSLQFIGSFFAIS
RIY