; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023350 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023350
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold378:283032..283863
RNA-Seq ExpressionMS023350
SyntenyMS023350
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-1329.29Show/hide
Query:  SVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIG-
        SV  A        +  +P + K +WK + PKK K F+  +  G  NT+D+ Q+R+PN  LSP+WC +        C      +  L +   +S  L    
Subjt:  SVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIG-

Query:  ---VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL
           ++W    +   S+ Q +   N++    ++  NT NAT    IW ERN RI K +E +    WE     +  W    KLF NY+  S+  N   F+
Subjt:  ---VSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]7.9e-1532.37Show/hide
Query:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQ-SFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQG-PNL
        +WK+K+P++V  F  I+FQGK NT+D  Q++ P+  L PS+C L       H    F          L F       V WCF L    +V+QLL G P+L
Subjt:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQ-SFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQG-PNL

Query:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL
              LW N + A     +W+ERN R+ + K      S+       S WC +   F +++   + ANW  F+
Subjt:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL

XP_030479135.1 uncharacterized protein LOC115696374 [Cannabis sativa]1.2e-1529.56Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K +         HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS +  W+ +  + ++W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

XP_030483308.1 uncharacterized protein LOC115699905 [Cannabis sativa]9.6e-1331.63Show/hide
Query:  NGCFSVNSATLNLSTKHSRINPAI-IKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMS
        +G FS  S T N  T           K +WKS  P KVK F  ++  GK N     Q+R P   LSP WC+  K  S      F  +   L   L   + 
Subjt:  NGCFSVNSATLNLSTKHSRINPAI-IKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMS

Query:  LGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK
            + W  P S    +   + G   + G S LW   + A  W  IW ERN RI +G E S+   WE V  + + W    K F  ++F+ LN +W+
Subjt:  LGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK

XP_030505044.1 uncharacterized protein LOC115720016 [Cannabis sativa]9.6e-1329.06Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K           HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS+            +W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

TrEMBL top hitse value%identityAlignment
A0A6J1DIE2 uncharacterized protein LOC1110207653.8e-1532.37Show/hide
Query:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQ-SFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQG-PNL
        +WK+K+P++V  F  I+FQGK NT+D  Q++ P+  L PS+C L       H    F          L F       V WCF L    +V+QLL G P+L
Subjt:  IWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQ-SFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQG-PNL

Query:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL
              LW N + A     +W+ERN R+ + K      S+       S WC +   F +++   + ANW  F+
Subjt:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL

A0A803QEA6 Uncharacterized protein5.5e-1429.48Show/hide
Query:  KCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGPNL
        K +WKS+ P KVK F  ++   K N   + Q++ P   +SP WC+  K+    +   F  +   L   L   +     + W  P S D  +   + G   
Subjt:  KCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIGVSWCFPLSYDTSVFQLLQGPNL

Query:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL
           S+ LW  T+ +  W  +W ERN RI +G E+SI   W+ +  + +SW    K F N +F+ L  +W+  L
Subjt:  KGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL

A0A803QGT5 Uncharacterized protein5.9e-1629.56Show/hide
Query:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS
        G FS  SA    ++  +   P   K +WKS    +VK F+ ++  GK N  D  QRR P   +SP WC+  K +         HC+  S +  +LL    
Subjt:  GCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGLS

Query:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK
               G+ W  P S    +   ++G   +    ILW   + AT W +IW ERN RI +  ENS +  W+ +  + ++W    K F + +F+ L+  W+
Subjt:  FSMSLGIGVSWCFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWK

Query:  PFL
          L
Subjt:  PFL

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)1.6e-1333.17Show/hide
Query:  GCFSVNS-ATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGL
        G FS  S  +  LST      P     IWK+K P K++ F+ +   G+ NT D  QRR P   LSPSWC+L K ++ +      HC S+S+ L   ++G 
Subjt:  GCFSVNS-ATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGL

Query:  SFSMSLGIGVSW-----CFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHS-WEVVHTFVSSWCLIDKLFCNYNFM
               +GV W     CF L    S+   + G   + G  IL D  ++A FW +IW ERNQRI +G     V   W+ +  + S W  +   F +Y++ 
Subjt:  SFSMSLGIGVSW-----CFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHS-WEVVHTFVSSWCLIDKLFCNYNFM

Query:  SL
        ++
Subjt:  SL

M5XV38 zf-RVT domain-containing protein1.6e-1333.17Show/hide
Query:  GCFSVNS-ATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGL
        G FS  S  +  LST      P     IWK+K P K++ F+ +   G+ NT D  QRR P   LSPSWC+L K ++ +      HC S+S+ L   ++G 
Subjt:  GCFSVNS-ATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTS------HCQSFSIVLLLLLVGL

Query:  SFSMSLGIGVSW-----CFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHS-WEVVHTFVSSWCLIDKLFCNYNFM
               +GV W     CF L    S+   + G   + G  IL D  ++A FW +IW ERNQRI +G     V   W+ +  + S W  +   F +Y++ 
Subjt:  SFSMSLGIGVSW-----CFPLSYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHS-WEVVHTFVSSWCLIDKLFCNYNFM

Query:  SL
        ++
Subjt:  SL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AATGGCTGCTTTTCAGTTAACTCGGCCACCCTTAATTTGTCCACCAAGCACAGTAGAATTAACCCAGCTATTATCAAGTGCATTTGGAAGTCTAAAAATCCCAAGAAAGT
AAAAGATTTCCTCAGGATTATGTTTCAAGGCAAGTCGAATACTAGTGACAAGTTCCAAAGAAGAATGCCGAATTGTCATCTCTCCCCTAGCTGGTGCATCTTACGCAAGA
TGGACTCCACAAGCCACTGCCAATCCTTTTCCATTGTCCTGTTGTTACTGCTTGTTGGGCTAAGCTTTTCAATGTCTTTGGGAATTGGGGTAAGCTGGTGTTTTCCTCTT
TCATACGATACCTCAGTTTTTCAACTACTCCAAGGCCCTAATCTCAAAGGTGGTTCTAGTATTTTGTGGGACAACACTATGAATGCTACTTTTTGGTGCTCGATTTGGTA
TGAAAGGAATCAAAGAATCGTCAAAGGGAAAGAAAACTCCATAGTCCATTCCTGGGAAGTTGTTCATACTTTTGTCTCTTCTTGGTGCCTCATTGATAAATTGTTTTGTA
ATTATAATTTCATGAGTCTAAACGCCAACTGGAAACCTTTTTTG
mRNA sequenceShow/hide mRNA sequence
AATGGCTGCTTTTCAGTTAACTCGGCCACCCTTAATTTGTCCACCAAGCACAGTAGAATTAACCCAGCTATTATCAAGTGCATTTGGAAGTCTAAAAATCCCAAGAAAGT
AAAAGATTTCCTCAGGATTATGTTTCAAGGCAAGTCGAATACTAGTGACAAGTTCCAAAGAAGAATGCCGAATTGTCATCTCTCCCCTAGCTGGTGCATCTTACGCAAGA
TGGACTCCACAAGCCACTGCCAATCCTTTTCCATTGTCCTGTTGTTACTGCTTGTTGGGCTAAGCTTTTCAATGTCTTTGGGAATTGGGGTAAGCTGGTGTTTTCCTCTT
TCATACGATACCTCAGTTTTTCAACTACTCCAAGGCCCTAATCTCAAAGGTGGTTCTAGTATTTTGTGGGACAACACTATGAATGCTACTTTTTGGTGCTCGATTTGGTA
TGAAAGGAATCAAAGAATCGTCAAAGGGAAAGAAAACTCCATAGTCCATTCCTGGGAAGTTGTTCATACTTTTGTCTCTTCTTGGTGCCTCATTGATAAATTGTTTTGTA
ATTATAATTTCATGAGTCTAAACGCCAACTGGAAACCTTTTTTG
Protein sequenceShow/hide protein sequence
NGCFSVNSATLNLSTKHSRINPAIIKCIWKSKNPKKVKDFLRIMFQGKSNTSDKFQRRMPNCHLSPSWCILRKMDSTSHCQSFSIVLLLLLVGLSFSMSLGIGVSWCFPL
SYDTSVFQLLQGPNLKGGSSILWDNTMNATFWCSIWYERNQRIVKGKENSIVHSWEVVHTFVSSWCLIDKLFCNYNFMSLNANWKPFL