; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023824 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023824
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold207:990997..992283
RNA-Seq ExpressionMS023824
SyntenyMS023824
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041367.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.4e-3439.9Show/hide
Query:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT
        ++W +     PI+YLG+ LGG   +K FW  + E+++K   SWKY+ + KGG++TL++S L +LPTY LS+F+AP S  K++E+   NFLW+       T
Subjt:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT

Query:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
          L   +      PK+  GL I  ++ +N  LL+KWLWR++ E + LW  +I+AKY S     +P  C   SSRS WF I K    F +   WK+ NG
Subjt:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

KAA0050814.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-3440.4Show/hide
Query:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT
        + W +   + P++YLG+ LGGNPKSK FW  I +R+ K   +WKY +I KGGRLTL++S L++LP Y LSVFQAP S +K++E+L  NFLW+G      +
Subjt:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT

Query:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
          ++++ +     PK+   L I  ++ +N  LLSKWLWR+  E NSLW  LI  KY  +    +P +    S ++ W  I+ +   F +   W + NG
Subjt:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

KAA0065894.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.8e-3541.33Show/hide
Query:  WVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPS-CHKSVEQLLINFLWEGVYAKNVTGH
        W  P   FPI YLG+ LGG P SK FW  I++++ K    WKY+ + KGG+LTL+Q+ L++LPTY LSVF+AP+   KS+E+   +FLW+       T  
Subjt:  WVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPS-CHKSVEQLLINFLWEGVYAKNVTGH

Query:  LSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
        + +S + +   PK   GL I N++ +N  LL KWLWRF  EK+SLW  LIS KY       +P   K+ + ++ W  I+K    F   ++WK+  G
Subjt:  LSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.6e-3443.16Show/hide
Query:  SFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVTGHLSFSSM
        + P++YLG+ LGGNPKS  FW  I +R+ K   +WKY +I KGGRLTL++S L++LP Y LSVFQAP S +K++E+L  NFLW+G      +  +++S +
Subjt:  SFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVTGHLSFSSM

Query:  GSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
             PK+  GL I  ++ +N  LLSKWLWR+  E NSLW  LI  KY  +    LP +    SS++ W  I+ +   F     W + NG
Subjt:  GSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

TYK05690.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-3541.92Show/hide
Query:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQA-PSCHKSVEQLLINFLWEGVYAKNVT
        + W +   + P++YLG+ LGGNPKSK FW  I +R+ K   +WKY +I KGGRLTL++S L++LP Y LSVFQA  S +K++E+L  NFLW+G      +
Subjt:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQA-PSCHKSVEQLLINFLWEGVYAKNVT

Query:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
          +++S +     PK+  GL I  ++ +N  LLSKWLWR+  E NSLW  LI  KY  +    LP +    SS++ W  I+ +   F +   W + NG
Subjt:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

TrEMBL top hitse value%identityAlignment
A0A5A7TI93 LINE-1 retrotransposable element ORF2 protein1.6e-3439.9Show/hide
Query:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT
        ++W +     PI+YLG+ LGG   +K FW  + E+++K   SWKY+ + KGG++TL++S L +LPTY LS+F+AP S  K++E+   NFLW+       T
Subjt:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT

Query:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
          L   +      PK+  GL I  ++ +N  LL+KWLWR++ E + LW  +I+AKY S     +P  C   SSRS WF I K    F +   WK+ NG
Subjt:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

A0A5A7VF18 LINE-1 retrotransposable element ORF2 protein8.6e-3641.33Show/hide
Query:  WVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPS-CHKSVEQLLINFLWEGVYAKNVTGH
        W  P   FPI YLG+ LGG P SK FW  I++++ K    WKY+ + KGG+LTL+Q+ L++LPTY LSVF+AP+   KS+E+   +FLW+       T  
Subjt:  WVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPS-CHKSVEQLLINFLWEGVYAKNVTGH

Query:  LSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
        + +S + +   PK   GL I N++ +N  LL KWLWRF  EK+SLW  LIS KY       +P   K+ + ++ W  I+K    F   ++WK+  G
Subjt:  LSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein1.2e-3443.16Show/hide
Query:  SFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVTGHLSFSSM
        + P++YLG+ LGGNPKS  FW  I +R+ K   +WKY +I KGGRLTL++S L++LP Y LSVFQAP S +K++E+L  NFLW+G      +  +++S +
Subjt:  SFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVTGHLSFSSM

Query:  GSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
             PK+  GL I  ++ +N  LLSKWLWR+  E NSLW  LI  KY  +    LP +    SS++ W  I+ +   F     W + NG
Subjt:  GSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

A0A5D3C2W8 LINE-1 retrotransposable element ORF2 protein6.6e-3641.92Show/hide
Query:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQA-PSCHKSVEQLLINFLWEGVYAKNVT
        + W +   + P++YLG+ LGGNPKSK FW  I +R+ K   +WKY +I KGGRLTL++S L++LP Y LSVFQA  S +K++E+L  NFLW+G      +
Subjt:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQA-PSCHKSVEQLLINFLWEGVYAKNVT

Query:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
          +++S +     PK+  GL I  ++ +N  LLSKWLWR+  E NSLW  LI  KY  +    LP +    SS++ W  I+ +   F +   W + NG
Subjt:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

A0A5D3C9J6 LINE-1 retrotransposable element ORF2 protein9.5e-3540.4Show/hide
Query:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT
        + W +   + P++YLG+ LGGNPKSK FW  I +R+ K   +WKY +I KGGRLTL++S L++LP Y LSVFQAP S +K++E+L  NFLW+G      +
Subjt:  TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVT

Query:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG
          ++++ +     PK+   L I  ++ +N  LLSKWLWR+  E NSLW  LI  KY  +    +P +    S ++ W  I+ +   F +   W + NG
Subjt:  GHLSFSSMGSFFLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRS-WFHILKHKYLFLQLFQWKMGNG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.4e-1333.33Show/hide
Query:  SKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVTGHLSFSSMGSFFLPKDAWGLDIEN
        +K  +  I+ERV      W+   +   GRLTL ++VL+++P + +S    P S    ++QL   FLW     K     + +S + S   PK   GL +  
Subjt:  SKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAP-SCHKSVEQLLINFLWEGVYAKNVTGHLSFSSMGSFFLPKDAWGLDIEN

Query:  IRRSNTTLLSKWLWRFLVEKNSLWASLISAKY
         +  N  L+SK  WR L EKNSLW  ++  KY
Subjt:  IRRSNTTLLSKWLWRFLVEKNSLWASLISAKY

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.9e-1524.84Show/hide
Query:  SFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPS-CHKSVEQLLINFLWEGVYAKNVTGHLSFSSM
        + P+ YLG+ L     +   + P++E++      W   ++   GRL L+ SV+++L  +++S F+ PS C K ++ +  +FLW G         +++S +
Subjt:  SFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPS-CHKSVEQLLINFLWEGVYAKNVTGHLSFSSM

Query:  GSFFLPKDAWGLDIENIRRSN---------TTLLSKWLWRFLVEKNSLWASLI
         +   PKD  GL I +++ +N          T L  W+W+ +++  +L +  +
Subjt:  GSFFLPKDAWGLDIENIRRSN---------TTLLSKWLWRFLVEKNSLWASLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACAAAGTGGGTTGTCCCAATCACCTCTTTCCCCATCTCTTACCTTGGCATGTGCCTTGGTGGTAACCCAAAATCAAAACCTTTTTGGGACCCAATTATTGAAAGAGTACA
TAAAAATTTTGAAAGTTGGAAGTACACATATATTTACAAAGGAGGGAGGCTTACTCTTGTTCAATCAGTCCTCAATAATTTGCCCACTTACTATCTTTCGGTCTTTCAAG
CCCCTAGTTGTCACAAAAGTGTTGAGCAATTGTTGATAAATTTTCTCTGGGAAGGTGTTTATGCAAAAAATGTGACGGGACATCTCTCATTTAGTTCGATGGGATCATTC
TTTCTTCCTAAAGATGCTTGGGGATTAGACATCGAAAATATCAGAAGGTCAAATACGACTCTCCTCTCAAAATGGCTATGGCGTTTTCTGGTGGAAAAAAACAGTCTCTG
GGCCTCACTGATTTCAGCAAAATACACATCAAGGCCGTTTGATCTTCTCCCTTATGATTGTAAATTTGGGAGTTCTAGATCATGGTTTCACATCCTAAAGCACAAATACC
TTTTCCTCCAATTATTTCAATGGAAGATGGGAAATGGT
mRNA sequenceShow/hide mRNA sequence
ACAAAGTGGGTTGTCCCAATCACCTCTTTCCCCATCTCTTACCTTGGCATGTGCCTTGGTGGTAACCCAAAATCAAAACCTTTTTGGGACCCAATTATTGAAAGAGTACA
TAAAAATTTTGAAAGTTGGAAGTACACATATATTTACAAAGGAGGGAGGCTTACTCTTGTTCAATCAGTCCTCAATAATTTGCCCACTTACTATCTTTCGGTCTTTCAAG
CCCCTAGTTGTCACAAAAGTGTTGAGCAATTGTTGATAAATTTTCTCTGGGAAGGTGTTTATGCAAAAAATGTGACGGGACATCTCTCATTTAGTTCGATGGGATCATTC
TTTCTTCCTAAAGATGCTTGGGGATTAGACATCGAAAATATCAGAAGGTCAAATACGACTCTCCTCTCAAAATGGCTATGGCGTTTTCTGGTGGAAAAAAACAGTCTCTG
GGCCTCACTGATTTCAGCAAAATACACATCAAGGCCGTTTGATCTTCTCCCTTATGATTGTAAATTTGGGAGTTCTAGATCATGGTTTCACATCCTAAAGCACAAATACC
TTTTCCTCCAATTATTTCAATGGAAGATGGGAAATGGT
Protein sequenceShow/hide protein sequence
TKWVVPITSFPISYLGMCLGGNPKSKPFWDPIIERVHKNFESWKYTYIYKGGRLTLVQSVLNNLPTYYLSVFQAPSCHKSVEQLLINFLWEGVYAKNVTGHLSFSSMGSF
FLPKDAWGLDIENIRRSNTTLLSKWLWRFLVEKNSLWASLISAKYTSRPFDLLPYDCKFGSSRSWFHILKHKYLFLQLFQWKMGNG