CuGenDBv2

Clc11G09865 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G09865
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrotransposon protein
Genome location: ClcChr11:12753052..12760167
RNA-Seq Expression: Clc11G09865
Synteny: Clc11G09865
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
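
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the span of the gene model could be computed as below. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("ClcChr11:12753052..12760167")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(seqid, span)
```

Under that assumption the locus spans 7,116 bp on ClcChr11.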


Homology
GenBank top hits | e value | % identity
CAD1817157.1 unnamed protein product [Ananas comosus var. bracteatus] | 9.5e-20 | 54.74
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFYI
        MLGP  SGFGWN   KCI CE  +FDAWVKSHP+A  L  KSFPY + L++VFGKDRATG+ A + A+     V EEE     + Q PD E F++
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFYI

KAA0043158.1 retrotransposon protein [Cucumis melo var. makuwa] | 1.6e-19 | 66.67
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV
        M GP CSGFGWN E KCI  E E+FD WV+SHP+AK L +KSFPYYD+L  VFG+DRATG  A T A+V
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa] | 5.6e-20 | 65.22
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV
        M GP CSGFGWN E +CI  E ++FD+WVKSHP+ K L HKSFPYYDDL+ VFGKDRATG+ + T  +V
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] | 5.6e-20 | 65.22
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV
        M GP CSGFGWN E +CI  E ++FD+W+KSHP+AK L HKSFPYYDDL+ VFGKDRATG+ + T   V
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV

XP_030483301.1 uncharacterized protein LOC115699898 [Cannabis sativa] | 7.3e-20 | 53.76
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENF
        MLGP  SGFGWN + KC+  +  +FD WVKSHP+AK L HK FPYYD+LAIV+GKDRATG  A     + F   ++E  E+I N  + DF+ F
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENF

TrEMBL top hits | e value | % identity
A0A1S3B4L3 uncharacterized protein LOC103485953 | 2.7e-20 | 65.22
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV
        M GP CSGFGWN E +CI  E ++FD+W+KSHP+AK L HKSFPYYDDL+ VFGKDRATG+ + T   V
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV

A0A5A7U0H7 Retrotransposon protein | 2.7e-20 | 65.22
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV
        M GP CSGFGWN E +CI  E ++FD+W+KSHP+AK L HKSFPYYDDL+ VFGKDRATG+ + T   V
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV

A0A5D3DG22 Retrotransposon protein | 2.7e-20 | 65.22
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV
        M GP CSGFGWN E +CI  E ++FD+WVKSHP+ K L HKSFPYYDDL+ VFGKDRATG+ + T  +V
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV

A0A6V7NF77 Myb_DNA-bind_3 domain-containing protein | 4.6e-20 | 54.74
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFYI
        MLGP  SGFGWN   KCI CE  +FDAWVKSHP+A  L  KSFPY + L++VFGKDRATG+ A + A+     V EEE     + Q PD E F++
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFYI

A0A803QNC5 Uncharacterized protein | 3.5e-20 | 53.76
Query:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENF
        MLGP  SGFGWN + KC+  +  +FD WVKSHP+AK L HK FPYYD+LAIV+GKDRATG  A     + F   ++E  E+I N  + DF+ F
Subjt:  MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G30140.1 unknown protein | 1.2e-07 | 40.35
Query:  SGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHA
        SGFGW+ E K      E++  ++K+HP+ K +  +S  +++DL I+FG   ATGS A
Subjt:  SGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHA

AT2G24960.1 unknown protein | 6.4e-06 | 31.25
Query:  GFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVF------GKDRATGSHATTTAEVEFEPVMEEENED
        GF W+  R  I  +  ++D+++K HP A+    KS P Y+DL  +F      G D      A  T+E +     +E+N D
Subjt:  GFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVF------GKDRATGSHATTTAEVEFEPVMEEENED

AT2G24960.2 unknown protein | 2.9e-06 | 27.59
Query:  SGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENF
        +GF W+A R  +  + +I++ ++++HP A+    K+ P Y +L  +FGK+ + G +  T     F+P      E +  N+S   + F
Subjt:  SGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENF

AT4G02210.1 unknown protein | 2.0e-04 | 21.88
Query:  GFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTT-----AEVEFEPVMEEENEDILNNQSPDFENFYIPDP
        GF W+ ER+ +  +  ++  ++K+H  A++   +  PYY DL ++ G      +           E EF+        D+  +   +  N  + DP
Subjt:  GFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTT-----AEVEFEPVMEEENEDILNNQSPDFENFYIPDP

AT5G27260.1 unknown protein | 4.0e-08 | 30.68
Query:  SGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFY
        SGFGW+   K      E++  ++K+HP+ K+L + +F ++D+L I+FG+  ATG +A    +   + +     E+       DF+N Y
Subjt:  SGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFY


Sequences
CDS sequence
ATGTTGGGCCCAGGCTGTAGTGGGTTTGGGTGGAATGCGGAGCGCAAATGTATTGATTGTGAGGCGGAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAAG
ACTGTGCCATAAGTCATTTCCATACTATGATGACTTGGCCATCGTATTCGGAAAAGATAGAGCCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGAATTTGAACCTG
TTATGGAAGAGGAGAACGAGGACATCCTAAACAACCAGTCCCCAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGCTAGTTATCTTTAA
mRNA sequence
ATGTTGGGCCCAGGCTGTAGTGGGTTTGGGTGGAATGCGGAGCGCAAATGTATTGATTGTGAGGCGGAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAAG
ACTGTGCCATAAGTCATTTCCATACTATGATGACTTGGCCATCGTATTCGGAAAAGATAGAGCCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGAATTTGAACCTG
TTATGGAAGAGGAGAACGAGGACATCCTAAACAACCAGTCCCCAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGCTAGTTATCTTTAA
Protein sequence
MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEVEFEPVMEEENEDILNNQSPDFENFYIPDPPFASYL
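
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1 starting at the first base; `translate` and `CODON_TABLE` are illustrative names, not anything provided by CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# CDS lines as listed above, joined into one string.
CDS = (
    "ATGTTGGGCCCAGGCTGTAGTGGGTTTGGGTGGAATGCGGAGCGCAAATGTATTGATTGTGAGGCGGAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAAG"
    "ACTGTGCCATAAGTCATTTCCATACTATGATGACTTGGCCATCGTATTCGGAAAAGATAGAGCCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGAATTTGAACCTG"
    "TTATGGAAGAGGAGAACGAGGACATCCTAAACAACCAGTCCCCAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGCTAGTTATCTTTAA"
)
PROTEIN = ("MLGPGCSGFGWNAERKCIDCEAEIFDAWVKSHPSAKRLCHKSFPYYDDLAIVFGKDRATGSHATTTAEV"
           "EFEPVMEEENEDILNNQSPDFENFYIPDPPFASYL")

print(translate(CDS) == PROTEIN)
```

The 315 nt CDS corresponds to 104 codons plus a terminal `TAA` stop codon, matching the 104-residue protein sequence.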