; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008093 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008093
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold703:194010..194655
RNA-Seq ExpressionMS008093
SyntenyMS008093
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]6.5e-1531.98Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS
        L+D E  EF +L+  L    + + +D + W ++S    S K L     ++S + K   +A+ +S SP++ N          V + EIL+KK S + + PS
Subjt:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS

Query:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK
        I      + +   HIF +CP ++  W+ +F +F ++W   + +S +V  LL G  L  TP  ++W    KA L  +W ERNQRIF  K     E    A 
Subjt:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK

Query:  FKVSQWCALSNLFTNYSPNLIC
           + WC+L   F NYS   IC
Subjt:  FKVSQWCALSNLFTNYSPNLIC

RVW92839.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.6e-1129.46Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDE-EDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAAL-WKSKSPKK----------GNVNTEEILRKKLSSMVIH
        L D+E E+  +L+ SL    +     D++ W L SS+  +VK         S LP  +   L W S+ P K            VNT ++L+ +     I 
Subjt:  LKDNESEEFMALISSLKLFSIKDE-EDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAAL-WKSKSPKK----------GNVNTEEILRKKLSSMVIH

Query:  PSISIFYANSCETQQHIFFHCPYAAACWKLLFEIFVL-WTRAR-----VSTNVNSLLFGPVLTPANLLWCNGVKALFNLWAERNQRIFKGKCISLGESFT
        P I +      ET  H+F HCP     W  LF++  + W   R     +STN N   FG       L     +  L+ +W ERN RIF+GK  +L   + 
Subjt:  PSISIFYANSCETQQHIFFHCPYAAACWKLLFEIFVL-WTRAR-----VSTNVNSLLFGPVLTPANLLWCNGVKALFNLWAERNQRIFKGKCISLGESFT

Query:  LAKFKVSQWCALSNLFTNYSPNLI
        +  F  S W + S +F     N+I
Subjt:  LAKFKVSQWCALSNLFTNYSPNLI

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]5.9e-1632.43Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS
        L+D E  EF +L+  L    + + +D + W ++S    S K L     ++S + K   +A+ +S SP++ N          VN+ EIL+KK S + + PS
Subjt:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS

Query:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK
        I      + +   HIF +CP ++  W+ +F +F ++W   + +S +V  LL G  L  TP  ++W    KA L  +W ERNQRIF  K     E    A 
Subjt:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK

Query:  FKVSQWCALSNLFTNYSPNLIC
           + WC+L   F NYS   IC
Subjt:  FKVSQWCALSNLFTNYSPNLIC

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]1.2e-2442.2Show/hide
Query:  SSSLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSMVIHPSISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVN
        S++ +PKE+ AALWK+KSP++          G +NT +I++KK  S  + PS       S E   H+FFHC +A+ CW LLF  F V W    +   NV 
Subjt:  SSSLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSMVIHPSISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVN

Query:  SLLFGP--VLTPANLLWCNGVKALFN-LWAERNQRIFKGKCISLGESFTLAKFKVSQWCALSNLFTNYSPNLI
         LL GP  + +    LW N VKAL + LW ERN R+F+ K     ESF  AKFK S WC+L + F ++SP++I
Subjt:  SLLFGP--VLTPANLLWCNGVKALFN-LWAERNQRIFKGKCISLGESFTLAKFKVSQWCALSNLFTNYSPNLI

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]1.8e-2034.26Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSMVIHPS
        LKD E   F  L+  +   S     D++ W + ++   +VK L       S L K   + +WK+KSP++          G +N  E+L+KK  +  + P+
Subjt:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSMVIHPS

Query:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF--VLWTRARVSTNVNSLLFGPVL-TPANLLWCNGVKALF-NLWAERNQRIFKGKCISLGESFTLAKF
        +  F  +  E   H+FF CPY++ CW  L   F   L       +NV  LL  P       LLWCN VKAL  +LW ERNQRIF  K  S  +    A+ 
Subjt:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF--VLWTRARVSTNVNSLLFGPVL-TPANLLWCNGVKALF-NLWAERNQRIFKGKCISLGESFTLAKF

Query:  KVSQWCALSNLFTNYS
        + S WC LS+ F  YS
Subjt:  KVSQWCALSNLFTNYS

TrEMBL top hitse value%identityAlignment
A0A438I862 LINE-1 retrotransposable element ORF2 protein1.2e-1129.46Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDE-EDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAAL-WKSKSPKK----------GNVNTEEILRKKLSSMVIH
        L D+E E+  +L+ SL    +     D++ W L SS+  +VK         S LP  +   L W S+ P K            VNT ++L+ +     I 
Subjt:  LKDNESEEFMALISSLKLFSIKDE-EDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAAL-WKSKSPKK----------GNVNTEEILRKKLSSMVIH

Query:  PSISIFYANSCETQQHIFFHCPYAAACWKLLFEIFVL-WTRAR-----VSTNVNSLLFGPVLTPANLLWCNGVKALFNLWAERNQRIFKGKCISLGESFT
        P I +      ET  H+F HCP     W  LF++  + W   R     +STN N   FG       L     +  L+ +W ERN RIF+GK  +L   + 
Subjt:  PSISIFYANSCETQQHIFFHCPYAAACWKLLFEIFVL-WTRAR-----VSTNVNSLLFGPVLTPANLLWCNGVKALFNLWAERNQRIFKGKCISLGESFT

Query:  LAKFKVSQWCALSNLFTNYSPNLI
        +  F  S W + S +F     N+I
Subjt:  LAKFKVSQWCALSNLFTNYSPNLI

A0A5A7T2Y0 zf-RVT domain-containing protein3.1e-1531.98Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS
        L+D E  EF +L+  L    + + +D + W ++S    S K L     ++S + K   +A+ +S SP++ N          V + EIL+KK S + + PS
Subjt:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS

Query:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK
        I      + +   HIF +CP ++  W+ +F +F ++W   + +S +V  LL G  L  TP  ++W    KA L  +W ERNQRIF  K     E    A 
Subjt:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK

Query:  FKVSQWCALSNLFTNYSPNLIC
           + WC+L   F NYS   IC
Subjt:  FKVSQWCALSNLFTNYSPNLIC

A0A5A7TES3 LINE-1 retrotransposable element ORF2 protein1.2e-0927.85Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSS---KNISVKLLCKPADSS--SLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSM
        L++ E   +  L +SL     ++ +D   W L+S+      SVK   +  D S   L  +     LWK+  PKK           +VNT E L K+L ++
Subjt:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSS---KNISVKLLCKPADSS--SLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSM

Query:  VIHPSISIFYANSCETQQHIFFHCPYAAACWKLLFEIFVLWTRARVSTNVNSLLFGPVL---------TPANLLWCNG-VKALFNLWAERNQRIFKGKCI
           PS  +    + E + H+F  CP A + W+L+         + +++NVN L    +          T  N++  N    AL+N+W ERN RIF GK  
Subjt:  VIHPSISIFYANSCETQQHIFFHCPYAAACWKLLFEIFVLWTRARVSTNVNSLLFGPVL---------TPANLLWCNG-VKALFNLWAERNQRIFKGKCI

Query:  SLGESFTLAKFKVSQWCALSNLFTNYSPNLICSKIRS
        ++ E +   K     W + S+LF+NY  + I   + +
Subjt:  SLGESFTLAKFKVSQWCALSNLFTNYSPNLICSKIRS

A0A5D3DE60 zf-RVT domain-containing protein2.8e-1632.43Show/hide
Query:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS
        L+D E  EF +L+  L    + + +D + W ++S    S K L     ++S + K   +A+ +S SP++ N          VN+ EIL+KK S + + PS
Subjt:  LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGN----------VNTEEILRKKLSSMVIHPS

Query:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK
        I      + +   HIF +CP ++  W+ +F +F ++W   + +S +V  LL G  L  TP  ++W    KA L  +W ERNQRIF  K     E    A 
Subjt:  ISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVNSLLFGPVL--TPANLLWCNGVKA-LFNLWAERNQRIFKGKCISLGESFTLAK

Query:  FKVSQWCALSNLFTNYSPNLIC
           + WC+L   F NYS   IC
Subjt:  FKVSQWCALSNLFTNYSPNLIC

A0A6J1DIE2 uncharacterized protein LOC1110207655.7e-2542.2Show/hide
Query:  SSSLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSMVIHPSISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVN
        S++ +PKE+ AALWK+KSP++          G +NT +I++KK  S  + PS       S E   H+FFHC +A+ CW LLF  F V W    +   NV 
Subjt:  SSSLLPKEYVAALWKSKSPKK----------GNVNTEEILRKKLSSMVIHPSISIFYANSCETQQHIFFHCPYAAACWKLLFEIF-VLWT-RARVSTNVN

Query:  SLLFGP--VLTPANLLWCNGVKALFN-LWAERNQRIFKGKCISLGESFTLAKFKVSQWCALSNLFTNYSPNLI
         LL GP  + +    LW N VKAL + LW ERN R+F+ K     ESF  AKFK S WC+L + F ++SP++I
Subjt:  SLLFGP--VLTPANLLWCNGVKALFN-LWAERNQRIFKGKCISLGESFTLAKFKVSQWCALSNLFTNYSPNLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTGAAGGATAATGAATCTGAGGAGTTTATGGCCTTGATTTCAAGTCTGAAATTGTTTTCTATCAAGGACGAAGAAGATCAGAAAGGCTGGTTGTTGGATTCTTCTAAAAA
CATTTCGGTAAAATTGTTGTGCAAGCCAGCTGATTCCTCTTCTCTTTTGCCAAAAGAGTATGTCGCTGCACTTTGGAAATCCAAAAGTCCAAAGAAAGGTAATGTTAATA
CGGAGGAGATTCTTCGGAAAAAGCTTTCTTCAATGGTCATCCATCCATCTATTTCCATCTTTTATGCTAATTCTTGTGAAACTCAGCAGCATATCTTCTTTCATTGCCCT
TATGCGGCGGCTTGTTGGAAGCTCTTATTCGAGATTTTTGTTTTGTGGACTCGTGCTAGAGTCTCAACAAACGTAAATTCTTTACTGTTTGGTCCAGTTTTAACCCCTGC
AAATCTTCTTTGGTGCAATGGAGTTAAGGCCTTGTTCAATTTATGGGCTGAAAGAAATCAAAGGATCTTCAAAGGGAAATGTATTTCTTTGGGAGAAAGCTTCACCCTTG
CCAAATTCAAGGTCTCACAATGGTGTGCTCTTTCTAATTTGTTTACAAATTATTCTCCTAATCTGATTTGCTCAAAAATTAGGAGCCTTTTA
mRNA sequenceShow/hide mRNA sequence
TTGAAGGATAATGAATCTGAGGAGTTTATGGCCTTGATTTCAAGTCTGAAATTGTTTTCTATCAAGGACGAAGAAGATCAGAAAGGCTGGTTGTTGGATTCTTCTAAAAA
CATTTCGGTAAAATTGTTGTGCAAGCCAGCTGATTCCTCTTCTCTTTTGCCAAAAGAGTATGTCGCTGCACTTTGGAAATCCAAAAGTCCAAAGAAAGGTAATGTTAATA
CGGAGGAGATTCTTCGGAAAAAGCTTTCTTCAATGGTCATCCATCCATCTATTTCCATCTTTTATGCTAATTCTTGTGAAACTCAGCAGCATATCTTCTTTCATTGCCCT
TATGCGGCGGCTTGTTGGAAGCTCTTATTCGAGATTTTTGTTTTGTGGACTCGTGCTAGAGTCTCAACAAACGTAAATTCTTTACTGTTTGGTCCAGTTTTAACCCCTGC
AAATCTTCTTTGGTGCAATGGAGTTAAGGCCTTGTTCAATTTATGGGCTGAAAGAAATCAAAGGATCTTCAAAGGGAAATGTATTTCTTTGGGAGAAAGCTTCACCCTTG
CCAAATTCAAGGTCTCACAATGGTGTGCTCTTTCTAATTTGTTTACAAATTATTCTCCTAATCTGATTTGCTCAAAAATTAGGAGCCTTTTA
Protein sequenceShow/hide protein sequence
LKDNESEEFMALISSLKLFSIKDEEDQKGWLLDSSKNISVKLLCKPADSSSLLPKEYVAALWKSKSPKKGNVNTEEILRKKLSSMVIHPSISIFYANSCETQQHIFFHCP
YAAACWKLLFEIFVLWTRARVSTNVNSLLFGPVLTPANLLWCNGVKALFNLWAERNQRIFKGKCISLGESFTLAKFKVSQWCALSNLFTNYSPNLICSKIRSLL