; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g1223 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g1223
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationMC02:11094860..11095694
RNA-Seq ExpressionMC02g1223
SyntenyMC02g1223
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.63e-1529Show/hide
Query:  SVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCIL--RKMDSTSHCQSFSIVLLLLLVGLSFSMSLGI
        SV  A        +  +P + K +WK + PKK K F+  +  G  NT+D+ Q+R+PN  LSP+WC +  +  +  +H          L +   +S  L  
Subjt:  SVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCIL--RKMDSTSHCQSFSIVLLLLLVGLSFSMSLGI

Query:  G----VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL
             ++W    +   S+ Q +   N++    ++  NT NAT    IW ERN RI K +E +    WE     +  W    KLF NY+  S+  N   F+
Subjt:  G----VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]2.45e-1832.37Show/hide
Query:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSF-SIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGP-NL
        +WK+K+P++V  F  I+FQGK NT+D  Q++ P+  L PS+C L       H   F           L F       V WCF L    +V+QLL GP +L
Subjt:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSF-SIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGP-NL

Query:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL
              LW N + A     +W+ERN R+ + K      S+   +   S WC +   F +++   + ANW  F+
Subjt:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL

XP_030479135.1 uncharacterized protein LOC115696374 [Cannabis sativa]9.79e-1929.56Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K +         HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS +  W+ ++ + ++W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

XP_030483308.1 uncharacterized protein LOC115699905 [Cannabis sativa]5.36e-1631.84Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAI-IKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSL
        G FS  S T N  T           K +WKS  P KVK F  ++  GK N     Q+R P   LSP WC+  K  S      F        +G   +  L
Subjt:  GCFSVNSATLNLSTKHSRINPAI-IKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSL

Query:  GIG------VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANW
         I       + W  P S    +   + G   + G S LW   + A  W  IW ERN RI +G E S+   WE VR + + W    K F  ++F+ LN +W
Subjt:  GIG------VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANW

Query:  K
        +
Subjt:  K

XP_030505044.1 uncharacterized protein LOC115720016 [Cannabis sativa]1.78e-1529.06Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K           HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS+       +    +W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein7.91e-1629Show/hide
Query:  SVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCIL--RKMDSTSHCQSFSIVLLLLLVGLSFSMSLGI
        SV  A        +  +P + K +WK + PKK K F+  +  G  NT+D+ Q+R+PN  LSP+WC +  +  +  +H          L +   +S  L  
Subjt:  SVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCIL--RKMDSTSHCQSFSIVLLLLLVGLSFSMSLGI

Query:  G----VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL
             ++W    +   S+ Q +   N++    ++  NT NAT    IW ERN RI K +E +    WE     +  W    KLF NY+  S+  N   F+
Subjt:  G----VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL

A0A6J1DIE2 uncharacterized protein LOC1110207651.19e-1832.37Show/hide
Query:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSF-SIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGP-NL
        +WK+K+P++V  F  I+FQGK NT+D  Q++ P+  L PS+C L       H   F           L F       V WCF L    +V+QLL GP +L
Subjt:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSF-SIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGP-NL

Query:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL
              LW N + A     +W+ERN R+ + K      S+   +   S WC +   F +++   + ANW  F+
Subjt:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL

A0A803PZR8 Uncharacterized protein8.62e-1629.06Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K           HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS+       +    +W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

A0A803QEA6 Uncharacterized protein4.43e-1629.48Show/hide
Query:  KCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGPNL
        K +WKS+ P KVK F  ++   K N   + Q++ P   +SP WC+  K+    +   F  +   L   L   +     + W  P S D  +   + G   
Subjt:  KCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGPNL

Query:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL
           S+ LW  T+ +  W  +W ERN RI +G E+SI   W+ ++ + +SW    K F N +F+ L  +W+  L
Subjt:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL

A0A803QGT5 Uncharacterized protein1.76e-1829.56Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K +         HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS +  W+ ++ + ++W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AATGGCTGCTTTTCAGTTAACTCGGCCACCCTTAATTTGTCCACCAAGCACAGTAGAATTAACCCAGCTATTATCAAGTGCATTTGGAAGTCTAAAAATCCCAAGAAAGT
AAAAGATTTCCTCAGGATTATGTTTCAAGGCAAGTCGAATACTAGTGACAAGTTCCAAAGAAGAATGCCGAATTGTCATCTCTCCCCTAGCTGGTGCATCTTACGCAAGA
TGGACTCCACAAGCCACTGCCAATCCTTTTCCATTGTCCTGTTGTTACTGCTTGTTGGGCTAAGCTTTTCAATGTCTTTGGGAATTGGGGTAAGCTGGTGTTTTCCTCTT
TCATACGATACCTCAGTTTTTCAACTACTCCAAGGCCCTAATCTCAAAGGTGGTTCTAGTATTTTGTGGGACAACACTATGAATGCTACTTTTTGGTGCTCGATTTGGTA
TGAGAGGAATCAAAGAATCGTCAAAGGGAAAGAAAACTCCATAGTCCATTCCTGGGAAGTTGTTCGTACTTTTGTCTCTTCTTGGTGCCTCATTGATAAATTGTTTTGTA
ATTATAATTTCATGAGTCTAAACGCCAACTGGAAACCTTTTTTG
mRNA sequenceShow/hide mRNA sequence
AATGGCTGCTTTTCAGTTAACTCGGCCACCCTTAATTTGTCCACCAAGCACAGTAGAATTAACCCAGCTATTATCAAGTGCATTTGGAAGTCTAAAAATCCCAAGAAAGT
AAAAGATTTCCTCAGGATTATGTTTCAAGGCAAGTCGAATACTAGTGACAAGTTCCAAAGAAGAATGCCGAATTGTCATCTCTCCCCTAGCTGGTGCATCTTACGCAAGA
TGGACTCCACAAGCCACTGCCAATCCTTTTCCATTGTCCTGTTGTTACTGCTTGTTGGGCTAAGCTTTTCAATGTCTTTGGGAATTGGGGTAAGCTGGTGTTTTCCTCTT
TCATACGATACCTCAGTTTTTCAACTACTCCAAGGCCCTAATCTCAAAGGTGGTTCTAGTATTTTGTGGGACAACACTATGAATGCTACTTTTTGGTGCTCGATTTGGTA
TGAGAGGAATCAAAGAATCGTCAAAGGGAAAGAAAACTCCATAGTCCATTCCTGGGAAGTTGTTCGTACTTTTGTCTCTTCTTGGTGCCTCATTGATAAATTGTTTTGTA
ATTATAATTTCATGAGTCTAAACGCCAACTGGAAACCTTTTTTG
Protein sequenceShow/hide protein sequence
NGCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIGVSWCFPL
SYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVRTFVSSWCLIDKLFCNYNFMSLNANWKPFL