; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G023760 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G023760
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCG_Chr01:37014962..37022803
RNA-Seq ExpressionClCG01G023760
SyntenyClCG01G023760
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]8.5e-2132.08Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI
        M  + ++   PL  A S+L++I K+  ++F+  M ++              S +  +F A L +    F NYSVD  +SSKVSL SF +A+L G  FSS+
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI

Query:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQ----KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAE
        +IH+LE   +++LR+    + S +  PLH  EL L P      P    L Q    K F+++S+  ++II+   L+ +    + + +  T V FSI    E
Subjt:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQ----KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAE

Query:  ITLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFGGE
        IT+++E G C   G E EDE ++ I   P  FF++   +  RV F+   N  S I++  F   G+
Subjt:  ITLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFGGE

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]5.0e-2135.37Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI
        M  + +E   PL  A S+L+++ K+  +KF+P M  I  + + SP           +F A L L    F N+SVD N SSKVSL+ F +A+L G  FSS+
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI

Query:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQ----KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAE
        TIHLL+  N+++LRF    + S+   PLH HEL L P      P    L Q      F++ S+  +RII+   L+   T S+++   G+ V FSI    E
Subjt:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQ----KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAE

Query:  ITLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFF
        I L++E G C   G E E E  +++ + P  FF++   +  + S F
Subjt:  ITLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFF

XP_022156149.1 uncharacterized protein LOC111023105 [Momordica charantia]3.8e-2133.71Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI
        M  I V NI  L  A S L+EI  EV +K +P +F++              S    +F  +L LP + F  YSV   H+S++S+ +F  ALL      S+
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI

Query:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHEL--ELFPLEDWKDPARIRLNQ-KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEI
        TIH+ + +NR +LRF     E+ S Q    HEL   L P +D  +    +L+  + F I  Q F +II  FS++D++T  + + S  + VTFS  P   +
Subjt:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHEL--ELFPLEDWKDPARIRLNQ-KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEI

Query:  TLSEELGQCSFRGEEEDEV-DIKININPASFFISLANQVQRVSFFHKNND--------DSHIAIMYF
        TL +E G+C+  G  EDE+    I I P  FF+ +A Q Q+V FF   +         D+++ ++YF
Subjt:  TLSEELGQCSFRGEEEDEV-DIKININPASFFISLANQVQRVSFFHKNND--------DSHIAIMYF

XP_031736681.1 uncharacterized protein LOC116402062 [Cucumis sativus]8.2e-3239.7Show/hide
Query:  MICINVENIVPLTKAASMLSEI-GKEVCLKFSP-TMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFS
        M  I +++I P   A  + +E+   E+CLKF P TM  I     P              F   + LP   F  YSVDRNH S++SL SF +ALL+ + F+
Subjt:  MICINVENIVPLTKAASMLSEI-GKEVCLKFSP-TMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFS

Query:  SITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEIT
        ++ IHL E  N++LL F  +PS S S  PL+     + P+EDW   A++  + K FSI+SQ F  IIQ FS +  + ++I I S G+ V+FSILPFAE  
Subjt:  SITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEIT

Query:  LSEELGQCSFRGEEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFGGESHGMV
        L+EE G C   G+ E E  I++ + P+SFF +   Q QRV FF   + DS    ++FPF    H +V
Subjt:  LSEELGQCSFRGEEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFGGESHGMV

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus]1.2e-2234.11Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI
        M  + ++N  P   A S L+ I +E  +KF+P  F+IT             S    +F A L++  + F NY VD +H+S++SL SF +ALL G    S+
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI

Query:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITLS
        TIHLL  +N+++LRF     ES S  P   HEL L P ++ +D   I    K FSIDS+  +R+I+   ++     SI + +  + V FSI    EI L+
Subjt:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITLS

Query:  EELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPF
        +E  +C   G E E+E  + IN+NP  FF++  +   RV F+        + +  F F
Subjt:  EELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPF

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980102.4e-2135.37Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI
        M  + +E   PL  A S+L+++ K+  +KF+P M  I  + + SP           +F A L L    F N+SVD N SSKVSL+ F +A+L G  FSS+
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI

Query:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQ----KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAE
        TIHLL+  N+++LRF    + S+   PLH HEL L P      P    L Q      F++ S+  +RII+   L+   T S+++   G+ V FSI    E
Subjt:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQ----KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAE

Query:  ITLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFF
        I L++E G C   G E E E  +++ + P  FF++   +  + S F
Subjt:  ITLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFF

A0A1S3CL88 uncharacterized protein LOC1035022509.2e-2134.93Show/hide
Query:  MICINVENIVPLTKAASMLSEIG-KEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSS
        M  + ++N  PL  A S L++I      LKF+P+ F I              S    +F A L L    F  +SVD +HSSKVSL SF +A+L G  F+S
Subjt:  MICINVENIVPLTKAASMLSEIG-KEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSS

Query:  ITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFP--LEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEI
        +TIHLL+  N+++LRF    SE    QPLH HEL L P   ED +        +K+F + S+  +RII+   ++ N  S IS+    + V FSI    EI
Subjt:  ITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFP--LEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEI

Query:  TLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIA-----------IMYFPF
         L+E    C   G EEE E   +I + P  FF++   +  RV F+   N+   +            ++YFPF
Subjt:  TLSEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIA-----------IMYFPF

A0A6J1DSH6 uncharacterized protein LOC1110231051.8e-2133.71Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI
        M  I V NI  L  A S L+EI  EV +K +P +F++              S    +F  +L LP + F  YSV   H+S++S+ +F  ALL      S+
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSI

Query:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHEL--ELFPLEDWKDPARIRLNQ-KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEI
        TIH+ + +NR +LRF     E+ S Q    HEL   L P +D  +    +L+  + F I  Q F +II  FS++D++T  + + S  + VTFS  P   +
Subjt:  TIHLLEPLNRLLLRFHLQPSESYSAQPLHYHEL--ELFPLEDWKDPARIRLNQ-KHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEI

Query:  TLSEELGQCSFRGEEEDEV-DIKININPASFFISLANQVQRVSFFHKNND--------DSHIAIMYF
        TL +E G+C+  G  EDE+    I I P  FF+ +A Q Q+V FF   +         D+++ ++YF
Subjt:  TLSEELGQCSFRGEEEDEV-DIKININPASFFISLANQVQRVSFFHKNND--------DSHIAIMYF

A0A6J1H2Z8 uncharacterized protein LOC1114600113.5e-2033.85Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFT-ITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSS
        M  + + +  PL +A S+L++I  E  LKFS + F+ IT+ PS             H+F A   +    FANY VDRNHSS+VSL+SF NA+  G  FSS
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFT-ITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSS

Query:  ITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITL
        +TIH  E  +R++L+F    S +++   +H   L+L P ++ ++  +I+ + + FSI SQ F+ II     + N +  +S+ S      ++     E  L
Subjt:  ITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITL

Query:  SEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFG
        ++E G+C   G E + E+  +IN+NP  FF +L+    R+ F+     DS   I    FG
Subjt:  SEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFG

A0A6J1KZ05 uncharacterized protein LOC1114988877.8e-2034.62Show/hide
Query:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFT-ITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSS
        M  + + +  PL +A S+L++I  E  LKFS + F+ IT+ PS              +F A   +    FANYSVDRNHSS+VSL+SF +A+  G  FSS
Subjt:  MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFT-ITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSS

Query:  ITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITL
        +TIH  E  +R++L+F    S +++   +H   L+L P ++ ++  +I+ + + FSI SQ F+ II     + N +  +S+ S  + V F      E  L
Subjt:  ITIHLLEPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITL

Query:  SEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFG
        ++E G+C+  G E E E+  +IN+NP  FF +L+    R+ F+     DS   I    FG
Subjt:  SEELGQCSFRG-EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTGTATCAATGTTGAGAACATTGTTCCTTTAACGAAGGCAGCCTCCATGTTGAGTGAGATTGGTAAGGAAGTATGCCTCAAATTTTCACCCACCATGTTC
ACCATAACTACTCTTCCATCTCCATCTCCATCTCCATCTCCATCTCCATCTCCATCTCCTCACAAATTCTCAGCAATCCTAATTTTGCCAAGCTCCTCATTCGCC
AACTATTCCGTCGATCGCAATCACAGTTCAAAGGTTTCCCTCCGATCCTTCCAGAATGCTTTATTGCAAGGCCGGAGGTTTTCTTCAATCACCATCCATCTTCTC
GAACCCCTAAATCGCTTACTCCTTAGATTTCACCTTCAGCCTTCAGAGTCTTATTCGGCACAACCATTGCATTATCATGAATTGGAATTATTTCCTTTGGAAGAT
TGGAAAGATCCTGCCCGAATTAGGTTAAATCAAAAGCATTTTTCAATTGATTCACAATGCTTCAAACGTATTATTCAAACATTCTCTCTCTACGATAATCAAACT
TCTTCAATTTCTATTATTTCAAAGGGTACTTCTGTCACCTTCTCCATTTTGCCATTTGCGGAGATTACTCTTAGTGAAGAGTTGGGACAATGCTCCTTCCGAGGC
GAGGAAGAAGATGAAGTTGATATAAAAATCAACATCAATCCCGCTTCGTTTTTTATTAGTTTGGCAAATCAAGTACAAAGGGTATCATTCTTCCACAAAAATAAT
GATGATTCTCATATTGCAATAATGTATTTCCCTTTTGGAGGGGAATCTCATGGAATGGTTCATGGGATGCAAGACAGTGGGAAGAGTTTTGGAGAAGAACCTCAA
TTTTGTTACACTGCAGCCAGCGGACGAGAAAGAGATTGTCAAATAAGAGATTGTGTAAGGTAA
mRNA sequenceShow/hide mRNA sequence
TTTGGATTGGGAAGCAGAGAAATTAAAGCACCATGATTTGTATCAATGTTGAGAACATTGTTCCTTTAACGAAGGCAGCCTCCATGTTGAGTGAGATTGGTAAGG
AAGTATGCCTCAAATTTTCACCCACCATGTTCACCATAACTACTCTTCCATCTCCATCTCCATCTCCATCTCCATCTCCATCTCCATCTCCTCACAAATTCTCAG
CAATCCTAATTTTGCCAAGCTCCTCATTCGCCAACTATTCCGTCGATCGCAATCACAGTTCAAAGGTTTCCCTCCGATCCTTCCAGAATGCTTTATTGCAAGGCC
GGAGGTTTTCTTCAATCACCATCCATCTTCTCGAACCCCTAAATCGCTTACTCCTTAGATTTCACCTTCAGCCTTCAGAGTCTTATTCGGCACAACCATTGCATT
ATCATGAATTGGAATTATTTCCTTTGGAAGATTGGAAAGATCCTGCCCGAATTAGGTTAAATCAAAAGCATTTTTCAATTGATTCACAATGCTTCAAACGTATTA
TTCAAACATTCTCTCTCTACGATAATCAAACTTCTTCAATTTCTATTATTTCAAAGGGTACTTCTGTCACCTTCTCCATTTTGCCATTTGCGGAGATTACTCTTA
GTGAAGAGTTGGGACAATGCTCCTTCCGAGGCGAGGAAGAAGATGAAGTTGATATAAAAATCAACATCAATCCCGCTTCGTTTTTTATTAGTTTGGCAAATCAAG
TACAAAGGGTATCATTCTTCCACAAAAATAATGATGATTCTCATATTGCAATAATGTATTTCCCTTTTGGAGGGGAATCTCATGGAATGGTTCATGGGATGCAAG
ACAGTGGGAAGAGTTTTGGAGAAGAACCTCAATTTTGTTACACTGCAGCCAGCGGACGAGAAAGAGATTGTCAAATAAGAGATTGTGTAAGGTAAAGTAAAATTG
TTG
Protein sequenceShow/hide protein sequence
MICINVENIVPLTKAASMLSEIGKEVCLKFSPTMFTITTLPSPSPSPSPSPSPSPHKFSAILILPSSSFANYSVDRNHSSKVSLRSFQNALLQGRRFSSITIHLL
EPLNRLLLRFHLQPSESYSAQPLHYHELELFPLEDWKDPARIRLNQKHFSIDSQCFKRIIQTFSLYDNQTSSISIISKGTSVTFSILPFAEITLSEELGQCSFRG
EEEDEVDIKININPASFFISLANQVQRVSFFHKNNDDSHIAIMYFPFGGESHGMVHGMQDSGKSFGEEPQFCYTAASGRERDCQIRDCVR