; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:9738476..9746608
RNA-Seq ExpressionMoc03g14440
SyntenyMoc03g14440
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031937.1 retroelement pol polyprotein-like [Cucumis melo var. makuwa]8.8e-0435.43Show/hide
Query:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAV-QQSPSSGKAHICCSASNFHSLAKSSPVV
        E+  FL  E     +   +D   + R QLLL EP P+I++AFSL+ QEEQQR+I ++ SP   +PT ALA    +P +  AH         +        
Subjt:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAV-QQSPSSGKAHICCSASNFHSLAKSSPVV

Query:  VGLPNKDVSTTRVIGRGNLRNDLYFLE
        V    KD  T++ IG   L   LY L+
Subjt:  VGLPNKDVSTTRVIGRGNLRNDLYFLE

RVW92730.1 hypothetical protein CK203_042571 [Vitis vinifera]5.1e-0426.47Show/hide
Query:  DDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINT---ASSPNSASPTFALAVQQSPSS--------------------------------------
        +D   ++R QLLL +P P+INK FSL++QEE Q+ I +   A S ++ +  FA+    S +S                                      
Subjt:  DDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINT---ASSPNSASPTFALAVQQSPSS--------------------------------------

Query:  --------GKAHICCSASNFHSLAKSSPVVVGLPNK---------DVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCP
                   HIC SA+ F SL  +    V LPN          D++ + ++   ++     F  N  S   +T + S+L  S  S +  S  + ++ P
Subjt:  --------GKAHICCSASNFHSLAKSSPVVVGLPNK---------DVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCP

Query:  NSVPPNPTGVVQPDDVDLTRDASTTILSNGNNGHVALADLNPHTTDVS-SNGQLRQSSRPIKPPSYLQDYHC
           PP  +G+    D            S  NN    L+DL+ H    S +   LR+S+R  KPPS L+D+HC
Subjt:  NSVPPNPTGVVQPDDVDLTRDASTTILSNGNNGHVALADLNPHTTDVS-SNGQLRQSSRPIKPPSYLQDYHC

XP_019071858.1 PREDICTED: uncharacterized protein LOC100853407 [Vitis vinifera]7.9e-0526.77Show/hide
Query:  ETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHI--------CCSASNF--------HSL
        E+  + +   ++    IRAQ+LL EP+P +NK FSLV QEE+QRS+ T++SP   +P  +     S +S   +          C+  N         + +
Subjt:  ETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHI--------CCSASNF--------HSL

Query:  AKSSPVVVGLPNKDVSTTR---VIGRGNLRNDLYFLENSASSGYVTPLHSTLHT------SMASSAFASLHTDVSCP------NSVPPNPTGVVQPDDVD
            P     PN   + +R   ++      N L   + S +S    PL    H       S+ +S+ +S   D S P      N        V+    V 
Subjt:  AKSSPVVVGLPNKDVSTTR---VIGRGNLRNDLYFLENSASSGYVTPLHSTLHT------SMASSAFASLHTDVSCP------NSVPPNPTGVVQPDDVD

Query:  LTRDASTTILSNGNNGHVALADLNPHTTDVSSNGQLRQSSRPIKPPSYLQDYHC
            +   + S+ N+   +  D +PHT    S+    +SSR  +PP YL DYHC
Subjt:  LTRDASTTILSNGNNGHVALADLNPHTTDVSSNGQLRQSSRPIKPPSYLQDYHC

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]2.3e-0430.77Show/hide
Query:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNF------HSL--
        E+  F  +E     +   ++   ++R QLLL EP PTIN+ FSLV+QE QQR+I T++SP +  PT AL  + S SSG +    ++S++      H+L  
Subjt:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNF------HSL--

Query:  ---------AKSSPVVVGLPN----KDVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCPN
                 A SS  V+   N    +D S++++IG+    + LY L           L++ L   + +SA+ + H  +  P+
Subjt:  ---------AKSSPVVVGLPN----KDVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCPN

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia]3.9e-0444.57Show/hide
Query:  FLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNFHSLAKSS
        F+  E   K +   ++    IRAQ+LL +P P+I KAFSL++QEEQQR I   S+P   SP   LAV QS SS       SASN  S  ++S
Subjt:  FLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNFHSLAKSS

TrEMBL top hitse value%identityAlignment
A0A438I7P6 Retrotran_gag_3 domain-containing protein2.5e-0426.47Show/hide
Query:  DDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINT---ASSPNSASPTFALAVQQSPSS--------------------------------------
        +D   ++R QLLL +P P+INK FSL++QEE Q+ I +   A S ++ +  FA+    S +S                                      
Subjt:  DDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINT---ASSPNSASPTFALAVQQSPSS--------------------------------------

Query:  --------GKAHICCSASNFHSLAKSSPVVVGLPNK---------DVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCP
                   HIC SA+ F SL  +    V LPN          D++ + ++   ++     F  N  S   +T + S+L  S  S +  S  + ++ P
Subjt:  --------GKAHICCSASNFHSLAKSSPVVVGLPNK---------DVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCP

Query:  NSVPPNPTGVVQPDDVDLTRDASTTILSNGNNGHVALADLNPHTTDVS-SNGQLRQSSRPIKPPSYLQDYHC
           PP  +G+    D            S  NN    L+DL+ H    S +   LR+S+R  KPPS L+D+HC
Subjt:  NSVPPNPTGVVQPDDVDLTRDASTTILSNGNNGHVALADLNPHTTDVS-SNGQLRQSSRPIKPPSYLQDYHC

A0A5A7SRC2 Retroelement pol polyprotein-like4.2e-0435.43Show/hide
Query:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAV-QQSPSSGKAHICCSASNFHSLAKSSPVV
        E+  FL  E     +   +D   + R QLLL EP P+I++AFSL+ QEEQQR+I ++ SP   +PT ALA    +P +  AH         +        
Subjt:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAV-QQSPSSGKAHICCSASNFHSLAKSSPVV

Query:  VGLPNKDVSTTRVIGRGNLRNDLYFLE
        V    KD  T++ IG   L   LY L+
Subjt:  VGLPNKDVSTTRVIGRGNLRNDLYFLE

A0A5D3CZP1 Copia protein4.2e-0435.43Show/hide
Query:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAV-QQSPSSGKAHICCSASNFHSLAKSSPVV
        E+  FL  E     +   +D   + R QLLL EP P+I++AFSL+ QEEQQR+I ++ SP   +PT ALA    +P +  AH         +        
Subjt:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAV-QQSPSSGKAHICCSASNFHSLAKSSPVV

Query:  VGLPNKDVSTTRVIGRGNLRNDLYFLE
        V    KD  T++ IG   L   LY L+
Subjt:  VGLPNKDVSTTRVIGRGNLRNDLYFLE

A0A6J1DIP8 uncharacterized protein LOC1110203991.1e-0430.77Show/hide
Query:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNF------HSL--
        E+  F  +E     +   ++   ++R QLLL EP PTIN+ FSLV+QE QQR+I T++SP +  PT AL  + S SSG +    ++S++      H+L  
Subjt:  EMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNF------HSL--

Query:  ---------AKSSPVVVGLPN----KDVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCPN
                 A SS  V+   N    +D S++++IG+    + LY L           L++ L   + +SA+ + H  +  P+
Subjt:  ---------AKSSPVVVGLPN----KDVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHSTLHTSMASSAFASLHTDVSCPN

A0A6J1DLQ9 uncharacterized protein LOC1110221171.9e-0444.57Show/hide
Query:  FLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNFHSLAKSS
        F+  E   K +   ++    IRAQ+LL +P P+I KAFSL++QEEQQR I   S+P   SP   LAV QS SS       SASN  S  ++S
Subjt:  FLLEETYKKLVFATDDDAVKIRAQLLLTEPSPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNFHSLAKSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAACCCGGTTGTGGTCCAACCAATAACCGGAAGTCCCTCTCGGTTATGTTCCCAGCTCCCCACTCAGTCTTATCCCAAAAATGGTAGGCATATTGAGTCG
GCGACTCGAGCCACTCTCACCTATACAAATCAAAGGACGAGTCCTAACAGGGCGTTGCGGAAGGAGTGGACGAACCATCTAGGAAAGAAGTACTTCAACAATATG
TTGGAGATGAAGATTTTTTTGTTAGAAGAAACGTATAAGAAGCTTGTGTTTGCGACTGATGATGATGCAGTTAAGATTCGTGCTCAATTATTACTCACGGAACCT
TCGCCTACCATCAATAAAGCCTTTTCCCTTGTTAATCAAGAGGAACAACAACGATCGATCAACACAGCTTCTTCACCTAATTCAGCTTCCCCTACTTTTGCGCTT
GCTGTGCAACAGTCGCCATCTTCTGGAAAAGCACACATTTGCTGTTCAGCATCTAATTTTCACTCTTTAGCGAAGAGTTCTCCGGTTGTTGTGGGTTTACCTAAC
AAAGATGTGTCCACTACGAGGGTGATTGGCAGGGGTAACTTGAGAAATGACCTCTATTTTCTTGAGAATTCTGCTTCTTCTGGTTATGTTACTCCTTTACATTCG
ACACTTCATACTTCTATGGCTTCCTCTGCTTTTGCTAGTCTTCATACTGATGTGTCTTGTCCCAATAGTGTACCACCTAATCCTACTGGTGTTGTACAGCCTGAT
GATGTTGACTTGACAAGGGATGCTTCGACTACTATACTTTCTAATGGTAATAATGGTCATGTTGCTTTAGCTGATCTTAATCCCCATACTACTGATGTTTCATCC
AATGGTCAATTGAGGCAGTCTTCTAGGCCAATTAAGCCGCCTTCTTACCTCCAAGATTATCACTGTCTCTCTTACATGGGATTAGGTGCTGACCAATCCATGATC
GCACTTGGCCCTTCAATAATGTGTCATGGCCCTTCTATTGCTACTACGATCGTCCATAGTAGCACACATGGGGCAAGTATTCCCACCCCCTCAATTAGACCTCAG
CCCCTTGGAGCAGCTGTGGAGTTGATGAAGCTTATCCGCGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAACCCGGTTGTGGTCCAACCAATAACCGGAAGTCCCTCTCGGTTATGTTCCCAGCTCCCCACTCAGTCTTATCCCAAAAATGGTAGGCATATTGAGTCG
GCGACTCGAGCCACTCTCACCTATACAAATCAAAGGACGAGTCCTAACAGGGCGTTGCGGAAGGAGTGGACGAACCATCTAGGAAAGAAGTACTTCAACAATATG
TTGGAGATGAAGATTTTTTTGTTAGAAGAAACGTATAAGAAGCTTGTGTTTGCGACTGATGATGATGCAGTTAAGATTCGTGCTCAATTATTACTCACGGAACCT
TCGCCTACCATCAATAAAGCCTTTTCCCTTGTTAATCAAGAGGAACAACAACGATCGATCAACACAGCTTCTTCACCTAATTCAGCTTCCCCTACTTTTGCGCTT
GCTGTGCAACAGTCGCCATCTTCTGGAAAAGCACACATTTGCTGTTCAGCATCTAATTTTCACTCTTTAGCGAAGAGTTCTCCGGTTGTTGTGGGTTTACCTAAC
AAAGATGTGTCCACTACGAGGGTGATTGGCAGGGGTAACTTGAGAAATGACCTCTATTTTCTTGAGAATTCTGCTTCTTCTGGTTATGTTACTCCTTTACATTCG
ACACTTCATACTTCTATGGCTTCCTCTGCTTTTGCTAGTCTTCATACTGATGTGTCTTGTCCCAATAGTGTACCACCTAATCCTACTGGTGTTGTACAGCCTGAT
GATGTTGACTTGACAAGGGATGCTTCGACTACTATACTTTCTAATGGTAATAATGGTCATGTTGCTTTAGCTGATCTTAATCCCCATACTACTGATGTTTCATCC
AATGGTCAATTGAGGCAGTCTTCTAGGCCAATTAAGCCGCCTTCTTACCTCCAAGATTATCACTGTCTCTCTTACATGGGATTAGGTGCTGACCAATCCATGATC
GCACTTGGCCCTTCAATAATGTGTCATGGCCCTTCTATTGCTACTACGATCGTCCATAGTAGCACACATGGGGCAAGTATTCCCACCCCCTCAATTAGACCTCAG
CCCCTTGGAGCAGCTGTGGAGTTGATGAAGCTTATCCGCGTATAG
Protein sequenceShow/hide protein sequence
MNNPVVVQPITGSPSRLCSQLPTQSYPKNGRHIESATRATLTYTNQRTSPNRALRKEWTNHLGKKYFNNMLEMKIFLLEETYKKLVFATDDDAVKIRAQLLLTEP
SPTINKAFSLVNQEEQQRSINTASSPNSASPTFALAVQQSPSSGKAHICCSASNFHSLAKSSPVVVGLPNKDVSTTRVIGRGNLRNDLYFLENSASSGYVTPLHS
TLHTSMASSAFASLHTDVSCPNSVPPNPTGVVQPDDVDLTRDASTTILSNGNNGHVALADLNPHTTDVSSNGQLRQSSRPIKPPSYLQDYHCLSYMGLGADQSMI
ALGPSIMCHGPSIATTIVHSSTHGASIPTPSIRPQPLGAAVELMKLIRV