; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G014500 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G014500
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCG_Chr05:24529419..24536463
RNA-Seq ExpressionClCG05G014500
SyntenyClCG05G014500
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575164.1 hypothetical protein SDJN03_25803, partial [Cucurbita argyrosperma subsp. sororia]1.9e-2438.81Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQ--
        M  V LE+  PL+DA FL + +S EADVK + +  S+  S+    ++  L IW+  F +Y V  +   RISL TF+++L   G    SS+T     T+  
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQ--

Query:  ------NTQTP-----------LPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR
              N+ +P           LPS                    F+HIIREV  FPN  + +T+T+ QVKFS ASKEIILTKQ    QI GYE     +
Subjt:  ------NTQTP-----------LPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR

Query:  FRIFLHPTSFFLTLSIQAQ
        FRI LHP  FFL LS Q++
Subjt:  FRIFLHPTSFFLTLSIQAQ

XP_008458264.1 PREDICTED: uncharacterized protein LOC103497732 [Cucumis melo]1.3e-2844.27Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQN-
        + FVSL NF PL DAA L S+I +EADVK S ++LS+IT+D  R +VA LHIWEPFF DYY+   I+SRISL TF   + +A ++HCSS+ I  ++  N 
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQN-

Query:  --------------------------TQTPLPS-------ISKFKHIIREVVHFPN-YSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGY
                                  TQT + +        S F+ II EV H PN   + VT+TNSQVKFS +SKEIILT++  G ++  Y
Subjt:  --------------------------TQTPLPS-------ISKFKHIIREVVHFPN-YSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGY

XP_022958858.1 uncharacterized protein LOC111460013 [Cucurbita moschata]1.4e-2439.27Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQ--
        M  V LE+  PL+DA FL + +S EADVK + +  S+  S+    ++  L IW+  F +Y V  +   RISL TF+++L   G    SS+T     T+  
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQ--

Query:  ------NTQTP-----------LPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR
              N+ +P           LPS                    F+HIIREV  FPN  + VT+T+ QVKFS ASKEIILTKQ    QI GYE     +
Subjt:  ------NTQTP-----------LPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR

Query:  FRIFLHPTSFFLTLSIQAQ
        FRI LHP  FFL LS Q++
Subjt:  FRIFLHPTSFFLTLSIQAQ

XP_023006008.1 uncharacterized protein LOC111498886 [Cucurbita maxima]2.7e-2338.36Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNT
        M  V LE+  PL+DA FL + +S E DVK + +  S+  S+    ++  L IW+  F +Y V  +   RISL TF++ L   GE++ SS+T     T++ 
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNT

Query:  Q-------------------TPLPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR
                            T LPS                    F+HIIREV  FPN ++ VT+T+ QVKFS A+KEIILTKQ    QI GYE     +
Subjt:  Q-------------------TPLPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR

Query:  FRIFLHPTSFFLTLSIQAQ
        FRI LHP +FFL LS Q++
Subjt:  FRIFLHPTSFFLTLSIQAQ

XP_023548334.1 uncharacterized protein LOC111807002 [Cucurbita pepo subsp. pepo]1.6e-2336.57Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAG----------ENHCSSI
        M  V L +F PL +A  +L+ ISHEAD+K S S+ S+ITS     +VA   I   FF +Y+V  + SSR+SLQ+F   +  AG              S +
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAG----------ENHCSSI

Query:  TIPYERTQNTQTPLPSISK------------------------FKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRFR
         + +E + +T+  +  + K                        F+ I+  +  FPN S++V++T+SQVKF  AS+E ILTK+GG C IVGYEG  +  F+
Subjt:  TIPYERTQNTQTPLPSISK------------------------FKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRFR

Query:  IFLHPTSFFLTLSIQA
        I L+P  FF  LS  A
Subjt:  IFLHPTSFFLTLSIQA

TrEMBL top hitse value%identityAlignment
A0A1S3C7K5 uncharacterized protein LOC1034977326.1e-2944.27Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQN-
        + FVSL NF PL DAA L S+I +EADVK S ++LS+IT+D  R +VA LHIWEPFF DYY+   I+SRISL TF   + +A ++HCSS+ I  ++  N 
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQN-

Query:  --------------------------TQTPLPS-------ISKFKHIIREVVHFPN-YSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGY
                                  TQT + +        S F+ II EV H PN   + VT+TNSQVKFS +SKEIILT++  G ++  Y
Subjt:  --------------------------TQTPLPS-------ISKFKHIIREVVHFPN-YSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGY

A0A1S3C8J1 uncharacterized protein LOC1034980103.8e-2336.87Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNT
        M  V LE F PL DA  LL+ ++ +ADVK +   L II S+    +VA L +    F ++ V H+ SS++SLQ F   +   G    SS+TI    T N 
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNT

Query:  -----QTPLPSISKFKH------------------------------IIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRF
             +TP   +    H                              II+E+  F   +V VT+T SQVKFS  SKEIILTK+GG C+IVGYEG+V+T+ 
Subjt:  -----QTPLPSISKFKH------------------------------IIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRF

Query:  RIFLHPTSFFLTLSIQA
        ++ L P  FFL  + +A
Subjt:  RIFLHPTSFFLTLSIQA

A0A6J1H2Z8 uncharacterized protein LOC1114600112.3e-2336.11Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAG----------ENHCSSI
        M  V L +F PL +A  +L+ IS+EAD+K S S+ S+ITS     +VA   I   FF +Y+V  + SSR+SLQ+F   +  AG              S +
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAG----------ENHCSSI

Query:  TIPYERTQNTQTPLPSISK------------------------FKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRFR
         + +E + +T+  +  + K                        F+ II  +  FPN S++V++T+S+VKF +AS+E ILTK+GG C IVGYEG  +  F+
Subjt:  TIPYERTQNTQTPLPSISK------------------------FKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRFR

Query:  IFLHPTSFFLTLSIQA
        I L+P  FF  LS  A
Subjt:  IFLHPTSFFLTLSIQA

A0A6J1H490 uncharacterized protein LOC1114600137.0e-2539.27Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQ--
        M  V LE+  PL+DA FL + +S EADVK + +  S+  S+    ++  L IW+  F +Y V  +   RISL TF+++L   G    SS+T     T+  
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQ--

Query:  ------NTQTP-----------LPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR
              N+ +P           LPS                    F+HIIREV  FPN  + VT+T+ QVKFS ASKEIILTKQ    QI GYE     +
Subjt:  ------NTQTP-----------LPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR

Query:  FRIFLHPTSFFLTLSIQAQ
        FRI LHP  FFL LS Q++
Subjt:  FRIFLHPTSFFLTLSIQAQ

A0A6J1L3R4 uncharacterized protein LOC1114988861.3e-2338.36Show/hide
Query:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNT
        M  V LE+  PL+DA FL + +S E DVK + +  S+  S+    ++  L IW+  F +Y V  +   RISL TF++ L   GE++ SS+T     T++ 
Subjt:  MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNT

Query:  Q-------------------TPLPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR
                            T LPS                    F+HIIREV  FPN ++ VT+T+ QVKFS A+KEIILTKQ    QI GYE     +
Subjt:  Q-------------------TPLPS-----------------ISKFKHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTR

Query:  FRIFLHPTSFFLTLSIQAQ
        FRI LHP +FFL LS Q++
Subjt:  FRIFLHPTSFFLTLSIQAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTTGTTAGCCTTGAGAACTTTGGTCCTCTTACAGACGCAGCCTTTCTACTTTCTCATATTTCTCATGAAGCTGACGTGAAAGCCTCAGGGTCGCGGTTGTCGAT
AATAACCTCCGATGGTTACCGTCACTACGTGGCTGCACTGCATATTTGGGAACCCTTTTTCGTCGACTATTATGTCCGTCATCATATCAGTTCAAGGATTTCTCTCCAAA
CTTTCCGTCGTACTTTGCGGCAAGCTGGAGAAAATCATTGTTCTTCAATCACCATTCCATATGAAAGAACTCAGAACACACAAACGCCTCTTCCTTCAATTTCAAAATTT
AAACACATAATAAGAGAAGTAGTTCACTTCCCAAATTATTCAGTTTGGGTTACTATAACCAATTCACAAGTCAAGTTCTCTTTTGCATCTAAGGAGATTATTCTTACCAA
ACAGGGTGGAGGATGCCAAATTGTAGGCTATGAAGGAGACGTCCAAACTCGATTTCGAATCTTTCTCCATCCGACGTCGTTTTTCCTTACATTGTCAATTCAAGCACAGG
GCAGTGCTCTACGCTTCTCAACGTCGAATTTCCTTAATCACATATTCAAATGCCTTCACACTTCGTCCCTCGGCATTTTCAATCCCTCAAGTGCCTTCACATATAGTCCC
TTAGCACTGAGTTTTACTATCAACCACTGCTCAAGTGCCTTCACTCAGGGTCCCTCAGCACTGGAATCCGTTTCATCCAACTACTCAAGTGCCTTCACACGTGGTCCCTC
AGCACTGAGTATCATATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTTTGTTAGCCTTGAGAACTTTGGTCCTCTTACAGACGCAGCCTTTCTACTTTCTCATATTTCTCATGAAGCTGACGTGAAAGCCTCAGGGTCGCGGTTGTCGAT
AATAACCTCCGATGGTTACCGTCACTACGTGGCTGCACTGCATATTTGGGAACCCTTTTTCGTCGACTATTATGTCCGTCATCATATCAGTTCAAGGATTTCTCTCCAAA
CTTTCCGTCGTACTTTGCGGCAAGCTGGAGAAAATCATTGTTCTTCAATCACCATTCCATATGAAAGAACTCAGAACACACAAACGCCTCTTCCTTCAATTTCAAAATTT
AAACACATAATAAGAGAAGTAGTTCACTTCCCAAATTATTCAGTTTGGGTTACTATAACCAATTCACAAGTCAAGTTCTCTTTTGCATCTAAGGAGATTATTCTTACCAA
ACAGGGTGGAGGATGCCAAATTGTAGGCTATGAAGGAGACGTCCAAACTCGATTTCGAATCTTTCTCCATCCGACGTCGTTTTTCCTTACATTGTCAATTCAAGCACAGG
GCAGTGCTCTACGCTTCTCAACGTCGAATTTCCTTAATCACATATTCAAATGCCTTCACACTTCGTCCCTCGGCATTTTCAATCCCTCAAGTGCCTTCACATATAGTCCC
TTAGCACTGAGTTTTACTATCAACCACTGCTCAAGTGCCTTCACTCAGGGTCCCTCAGCACTGGAATCCGTTTCATCCAACTACTCAAGTGCCTTCACACGTGGTCCCTC
AGCACTGAGTATCATATAA
Protein sequenceShow/hide protein sequence
MLFVSLENFGPLTDAAFLLSHISHEADVKASGSRLSIITSDGYRHYVAALHIWEPFFVDYYVRHHISSRISLQTFRRTLRQAGENHCSSITIPYERTQNTQTPLPSISKF
KHIIREVVHFPNYSVWVTITNSQVKFSFASKEIILTKQGGGCQIVGYEGDVQTRFRIFLHPTSFFLTLSIQAQGSALRFSTSNFLNHIFKCLHTSSLGIFNPSSAFTYSP
LALSFTINHCSSAFTQGPSALESVSSNYSSAFTRGPSALSII