; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10610
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr7:8119084..8122124
RNA-Seq ExpressionMoc07g10610
SyntenyMoc07g10610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045330.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]6.9e-0637.82Show/hide
Query:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATA-TSKKGKAKE----------------------------------
        ++ KGQKGEAN ATS ++F+RGS+SGTKS PSSSG+K +KK K  G+E K + A A TSKK KA +                                  
Subjt:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATA-TSKKGKAKE----------------------------------

Query:  ------------------------------ISSWRQLDTGEMILKVGTGEVVSVVA
                                      ISSWRQL+TGEM ++VGTG VVS +A
Subjt:  ------------------------------ISSWRQLDTGEMILKVGTGEVVSVVA

KAA0054637.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-0656.18Show/hide
Query:  KGQK-GEANFATSKRFNRGSSSGTKSAPSSSGSKTFKKKKAA-GKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA
        KGQK GEAN A SKRF +G S  TK  P SSG K  +KKK   GK     SAT    + KAKE SS +QL+ GEM LKVGTG+V+S  A
Subjt:  KGQK-GEANFATSKRFNRGSSSGTKSAPSSSGSKTFKKKKAA-GKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA

KAA0063049.1 gag/pol protein [Cucumis melo var. makuwa]8.8e-0954.95Show/hide
Query:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA
        ++ KGQKGE N ATS ++F RGS+SGTKS PSSS +K +KKKK         +A  TSKKG    IS WRQL+T EM +KVG   VVS +A
Subjt:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-0849.52Show/hide
Query:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKE--------------ISSWRQLDTGEMILKVGTGEV
        ++ KGQKGEAN ATS ++F RGS SGTKS P SS +K  KKKK         +A   SKK KA +              ISSW+QL+TGEM ++VGTG V
Subjt:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKE--------------ISSWRQLDTGEMILKVGTGEV

Query:  VSVVA
        VS +A
Subjt:  VSVVA

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.1e-1031.13Show/hide
Query:  MANSSSSSSTGRLGRTVST-----SSPRKAGYGEDLVNSDASTSGQGLKFPSSMRENFLGLLCRHYQIPNTISLCLPKAGERADDP--------------
        M++S SS+    L R + +      + R +  GED   SDASTSGQGL++PS + E++LG L R + IP  I L LP+ GERAD+P              
Subjt:  MANSSSSSSTGRLGRTVST-----SSPRKAGYGEDLVNSDASTSGQGLKFPSSMRENFLGLLCRHYQIPNTISLCLPKAGERADDP--------------

Query:  -----------HMDGRTGFRPSQVAPMDEETLALLLVRKKGCMRDQQRSNL---------------------------------------YKGMGKDLVL
                       RTG  P+QVAP     +  L +      RD + + L                                        KG  +    
Subjt:  -----------HMDGRTGFRPSQVAPMDEETLALLLVRKKGCMRDQQRSNL---------------------------------------YKGMGKDLVL

Query:  HLEDWLAKDKSDQSFFNVPLRFGNLES---KGQKGEANFATSK----RFNRGSSSGT
           +WLAKD+S +SFF+VP RFGNL S     +  +A+F T K    RF RG   GT
Subjt:  HLEDWLAKDKSDQSFFNVPLRFGNLES---KGQKGEANFATSK----RFNRGSSSGT

TrEMBL top hitse value%identityAlignment
A0A5A7TQ86 Retrotransposon protein, putative, Ty1-copia subclass3.4e-0637.82Show/hide
Query:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATA-TSKKGKAKE----------------------------------
        ++ KGQKGEAN ATS ++F+RGS+SGTKS PSSSG+K +KK K  G+E K + A A TSKK KA +                                  
Subjt:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATA-TSKKGKAKE----------------------------------

Query:  ------------------------------ISSWRQLDTGEMILKVGTGEVVSVVA
                                      ISSWRQL+TGEM ++VGTG VVS +A
Subjt:  ------------------------------ISSWRQLDTGEMILKVGTGEVVSVVA

A0A5A7UJ71 Gag/pol protein1.2e-0656.18Show/hide
Query:  KGQK-GEANFATSKRFNRGSSSGTKSAPSSSGSKTFKKKKAA-GKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA
        KGQK GEAN A SKRF +G S  TK  P SSG K  +KKK   GK     SAT    + KAKE SS +QL+ GEM LKVGTG+V+S  A
Subjt:  KGQK-GEANFATSKRFNRGSSSGTKSAPSSSGSKTFKKKKAA-GKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA

A0A5A7VBR5 Gag/pol protein4.2e-0954.95Show/hide
Query:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA
        ++ KGQKGE N ATS ++F RGS+SGTKS PSSS +K +KKKK         +A  TSKKG    IS WRQL+T EM +KVG   VVS +A
Subjt:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVA

A0A5D3BE74 Gag/pol protein2.1e-0849.52Show/hide
Query:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKE--------------ISSWRQLDTGEMILKVGTGEV
        ++ KGQKGEAN ATS ++F RGS SGTKS P SS +K  KKKK         +A   SKK KA +              ISSW+QL+TGEM ++VGTG V
Subjt:  LESKGQKGEANFATS-KRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKE--------------ISSWRQLDTGEMILKVGTGEV

Query:  VSVVA
        VS +A
Subjt:  VSVVA

A0A6J1DXS5 uncharacterized protein LOC1110255022.9e-1031.13Show/hide
Query:  MANSSSSSSTGRLGRTVST-----SSPRKAGYGEDLVNSDASTSGQGLKFPSSMRENFLGLLCRHYQIPNTISLCLPKAGERADDP--------------
        M++S SS+    L R + +      + R +  GED   SDASTSGQGL++PS + E++LG L R + IP  I L LP+ GERAD+P              
Subjt:  MANSSSSSSTGRLGRTVST-----SSPRKAGYGEDLVNSDASTSGQGLKFPSSMRENFLGLLCRHYQIPNTISLCLPKAGERADDP--------------

Query:  -----------HMDGRTGFRPSQVAPMDEETLALLLVRKKGCMRDQQRSNL---------------------------------------YKGMGKDLVL
                       RTG  P+QVAP     +  L +      RD + + L                                        KG  +    
Subjt:  -----------HMDGRTGFRPSQVAPMDEETLALLLVRKKGCMRDQQRSNL---------------------------------------YKGMGKDLVL

Query:  HLEDWLAKDKSDQSFFNVPLRFGNLES---KGQKGEANFATSK----RFNRGSSSGT
           +WLAKD+S +SFF+VP RFGNL S     +  +A+F T K    RF RG   GT
Subjt:  HLEDWLAKDKSDQSFFNVPLRFGNLES---KGQKGEANFATSK----RFNRGSSSGT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTATACTCCTTCCAAGCGTGTCAGCGTCCTCCACGTGGTACGACAATTTCATTCCCCAAACATTGGCTTCCTCCCTGTCAGGCTCGAGGTCGACAAAGCA
GAGAAGATCTATAAAGGGGAAATTTGTAGACGTGGCGAACTCCAAATTCGAGTATCTTCTTCTGCACTTCATTTTGACTTTATTTTCGTCATGGCAAACTCTTCT
AGTAGTAGTTCTACCGGTCGTTTAGGTCGAACCGTTAGTACCTCGTCCCCTAGGAAAGCTGGTTATGGTGAAGACTTAGTGAATAGTGACGCCTCGACCTCGGGC
CAGGGTTTGAAGTTCCCTTCATCAATGCGTGAGAACTTCCTTGGGTTGCTCTGTAGGCACTACCAAATCCCTAATACTATAAGCCTGTGCTTACCTAAAGCTGGG
GAAAGAGCTGACGATCCCCATATGGATGGCCGAACTGGCTTTAGACCCTCTCAAGTGGCCCCAATGGATGAGGAAACCTTGGCACTACTACTTGTGCGCAAGAAA
GGGTGCATGAGGGATCAGCAAAGGTCTAACCTCTATAAAGGGATGGGTAAAGATTTGGTTTTACACCTCGAAGATTGGTTGGCGAAGGACAAGTCCGACCAGTCA
TTCTTCAACGTTCCCCTTAGATTTGGGAATTTAGAAAGTAAAGGACAAAAAGGGGAGGCAAATTTTGCCACCTCAAAGAGGTTCAACCGAGGTTCGTCCTCTGGA
ACAAAGTCTGCACCCTCTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAAGGCAGCTGGTAAGGAGTTTAAACCTGACTCTGCTACTGCCACTTCCAAGAAAGGC
AAGGCCAAGGAAATTAGTTCCTGGAGGCAGCTTGACACCGGAGAGATGATTCTCAAGGTTGGAACGGGAGAGGTCGTTTCAGTTGTGGCAAGTAAGGTAGTGATT
AACGAGATTTTCGAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAATCACAT
CCACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTATACTCCTTCCAAGCGTGTCAGCGTCCTCCACGTGGTACGACAATTTCATTCCCCAAACATTGGCTTCCTCCCTGTCAGGCTCGAGGTCGACAAAGCA
GAGAAGATCTATAAAGGGGAAATTTGTAGACGTGGCGAACTCCAAATTCGAGTATCTTCTTCTGCACTTCATTTTGACTTTATTTTCGTCATGGCAAACTCTTCT
AGTAGTAGTTCTACCGGTCGTTTAGGTCGAACCGTTAGTACCTCGTCCCCTAGGAAAGCTGGTTATGGTGAAGACTTAGTGAATAGTGACGCCTCGACCTCGGGC
CAGGGTTTGAAGTTCCCTTCATCAATGCGTGAGAACTTCCTTGGGTTGCTCTGTAGGCACTACCAAATCCCTAATACTATAAGCCTGTGCTTACCTAAAGCTGGG
GAAAGAGCTGACGATCCCCATATGGATGGCCGAACTGGCTTTAGACCCTCTCAAGTGGCCCCAATGGATGAGGAAACCTTGGCACTACTACTTGTGCGCAAGAAA
GGGTGCATGAGGGATCAGCAAAGGTCTAACCTCTATAAAGGGATGGGTAAAGATTTGGTTTTACACCTCGAAGATTGGTTGGCGAAGGACAAGTCCGACCAGTCA
TTCTTCAACGTTCCCCTTAGATTTGGGAATTTAGAAAGTAAAGGACAAAAAGGGGAGGCAAATTTTGCCACCTCAAAGAGGTTCAACCGAGGTTCGTCCTCTGGA
ACAAAGTCTGCACCCTCTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAAGGCAGCTGGTAAGGAGTTTAAACCTGACTCTGCTACTGCCACTTCCAAGAAAGGC
AAGGCCAAGGAAATTAGTTCCTGGAGGCAGCTTGACACCGGAGAGATGATTCTCAAGGTTGGAACGGGAGAGGTCGTTTCAGTTGTGGCAAGTAAGGTAGTGATT
AACGAGATTTTCGAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAATCACAT
CCACCTTAA
Protein sequenceShow/hide protein sequence
MNYTPSKRVSVLHVVRQFHSPNIGFLPVRLEVDKAEKIYKGEICRRGELQIRVSSSALHFDFIFVMANSSSSSSTGRLGRTVSTSSPRKAGYGEDLVNSDASTSG
QGLKFPSSMRENFLGLLCRHYQIPNTISLCLPKAGERADDPHMDGRTGFRPSQVAPMDEETLALLLVRKKGCMRDQQRSNLYKGMGKDLVLHLEDWLAKDKSDQS
FFNVPLRFGNLESKGQKGEANFATSKRFNRGSSSGTKSAPSSSGSKTFKKKKAAGKEFKPDSATATSKKGKAKEISSWRQLDTGEMILKVGTGEVVSVVASKVVI
NEIFEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPP