; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04270 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04270
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr06:4439102..4441270
RNA-Seq ExpressionClc06G04270
SyntenyClc06G04270
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]5.9e-4451.17Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENKSL---------
        +I S+ SP FVATLQ+S RLFTN+SVDH  SSKVSLE FHDAMLDGGSFSSM+IH+L+   QMILR+E P      +++ L +   + +SL         
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENKSL---------

Query:  --SGYSLKLAIEGMYKFQNNF-------------LFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAF
          +   L+  I+ +  F ++              +      +    G CRIVGYEGE ET+L I  RPMMFFLNFTYKAN++WFYKT TNS SV+S PAF
Subjt:  --SGYSLKLAIEGMYKFQNNF-------------LFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAF

Query:  GTYGQYVIYFPQL
        G YGQYV+YFP+L
Subjt:  GTYGQYVIYFPQL

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]2.1e-4157.14Show/hide
Query:  MIIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIY--MYVYIFLFVVELENK--------
        MIIVSNRSP FVATLQLSRRLFTNFSVDHNKSSKVSL+PFHDAMLDGGSFSSMTIHLLD TNQM+LRFETP      ++  + L   + EN         
Subjt:  MIIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIY--MYVYIFLFVVELENK--------

Query:  -SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANK
         +++   L+  I+ +  F  + +                  I+   GG C+IVGYEGEVET+LQ++LRPMMFFLNFTY+ANK
Subjt:  -SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANK

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo]9.1e-4551.63Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVV------------ELEN
        II S+RSP F+ATLQLS + FT FSVD++ SSKVSLE FHDA+LDGGSF+SMTIHLLDKTNQMILRF+TP      ++  L +             EL+ 
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVV------------ELEN

Query:  KS---LSGYSLKLAIEGMYKFQNN-------------FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMS
        +    +   +L+  I+ +  FQN+             F      II   G  C+I G+E EVETQ QIIL PMMFFLNFTYKAN++WFYKT  N++++M 
Subjt:  KS---LSGYSLKLAIEGMYKFQNN-------------FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMS

Query:  TPAFGTYGQYVIYFP
         PA+G +GQYVIYFP
Subjt:  TPAFGTYGQYVIYFP

XP_031743957.1 uncharacterized protein LOC116404737 [Cucumis sativus]8.3e-4651.16Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENKS----------
        +IVSNR P+F+ATLQLS R FTNFSVDH  SSKVSLE FHDAMLDGG FS+M+IHLLDKTNQMILRF+TP      ++  L +   +++           
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENKS----------

Query:  -----LSGYSLKLAIEGMYKFQNN-------------FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMS
             ++  SL+  I+ +  FQN+             F    + I+ + G  CRI G+E EVETQ QI L PM+FFLNFTY+A+++WFYKT  N+++VM 
Subjt:  -----LSGYSLKLAIEGMYKFQNN-------------FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMS

Query:  TPAFGTYGQYVIYFP
         PAFG YGQY IYFP
Subjt:  TPAFGTYGQYVIYFP

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida]2.9e-4350.7Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIY--MYVYIFLFVVELENKSLSGY----
        +I    SP FVATLQLS++ FTN+SVDH  +SKV LE FHDA+LDGGSF+SMTIHLL+K NQMILRF+TP      ++  +     +L + ++ G     
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIY--MYVYIFLFVVELENKSLSGY----

Query:  --------SLKLAIEGMYKFQNNFLF--------FFYYIINN----NGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTP
                +L+  I+ +  FQ++ +           + I +         CRIVG+E EVETQ QIILRPM+FFLNFTYKANK+WFYKT  NS+SVM+ P
Subjt:  --------SLKLAIEGMYKFQNNFLF--------FFYYIINN----NGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTP

Query:  AFGTYGQYVIYFP
        AFG  GQYVIYFP
Subjt:  AFGTYGQYVIYFP

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980101.0e-4157.14Show/hide
Query:  MIIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIY--MYVYIFLFVVELENK--------
        MIIVSNRSP FVATLQLSRRLFTNFSVDHNKSSKVSL+PFHDAMLDGGSFSSMTIHLLD TNQM+LRFETP      ++  + L   + EN         
Subjt:  MIIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIY--MYVYIFLFVVELENK--------

Query:  -SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANK
         +++   L+  I+ +  F  + +                  I+   GG C+IVGYEGEVET+LQ++LRPMMFFLNFTY+ANK
Subjt:  -SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANK

A0A1S3CL88 uncharacterized protein LOC1035022504.4e-4551.63Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVV------------ELEN
        II S+RSP F+ATLQLS + FT FSVD++ SSKVSLE FHDA+LDGGSF+SMTIHLLDKTNQMILRF+TP      ++  L +             EL+ 
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVV------------ELEN

Query:  KS---LSGYSLKLAIEGMYKFQNN-------------FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMS
        +    +   +L+  I+ +  FQN+             F      II   G  C+I G+E EVETQ QIIL PMMFFLNFTYKAN++WFYKT  N++++M 
Subjt:  KS---LSGYSLKLAIEGMYKFQNN-------------FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMS

Query:  TPAFGTYGQYVIYFP
         PA+G +GQYVIYFP
Subjt:  TPAFGTYGQYVIYFP

A0A6J1H2Z8 uncharacterized protein LOC1114600119.3e-2740Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENK-----------
        +I S  S  FVAT Q+S R F N+ VD N SS+VSL+ F++AM  G  FSSMTIH  + T++M+L+FE+     M ++  L +   + +           
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENK-----------

Query:  SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAF
        S+     +  I G+  F NN +F                 I+   GG+C IVGYEG+ E   QI L P  FF N +Y A +IWFYKTI +S  V+  PAF
Subjt:  SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAF

Query:  GTYGQYVIYF
        G   QYVIYF
Subjt:  GTYGQYVIYF

A0A6J1KIW5 uncharacterized protein LOC1114948742.4e-3038.68Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLF-------VVELE---NKS
        +  S  SP F+ATLQ+  + FT +SV+ +  S++SLE  HDA+LD GS S+MTIHLL+ TN M+LRFETP       + F+        + E+E   + +
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLF-------VVELE---NKS

Query:  LSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAFG
        L    L+  I+ +  F  + +                  I     G+C I+G++G+  T+ +I+L PM+FFLN TY    +WF+KTITN+H VM  P F 
Subjt:  LSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAFG

Query:  TYGQYVIYFPQL
         + QYVIYFP++
Subjt:  TYGQYVIYFPQL

A0A6J1KZ05 uncharacterized protein LOC1114988872.0e-2942.38Show/hide
Query:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETP--IKIYMYVYIFLFVVELENK---------
        +I S  S  FVAT Q+S R F N+SVD N SS+VSL+ F+DAM DG  FSSMTIH  + T++M+L+FE+    K+ M+  + L   + E           
Subjt:  IIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETP--IKIYMYVYIFLFVVELENK---------

Query:  SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAF
        S+     +  I G+  F NN +F                 I+   GG+C I+GYEGE E   QI L P  FF N +Y A +IWFYKTI +S  V+  PAF
Subjt:  SLSGYSLKLAIEGMYKFQNNFLFFFY-------------YIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAF

Query:  GTYGQYVIYF
        G   QYVIYF
Subjt:  GTYGQYVIYF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATTGTCTCGAACCGTTCCCCTCACTTCGTCGCAACGCTGCAATTATCGCGACGATTGTTCACCAATTTTTCCGTCGATCATAATAAAAGTTCCAAGGTTTCCCT
TGAACCCTTCCACGATGCTATGTTGGATGGCGGAAGTTTTTCTTCAATGACAATCCATCTTTTGGATAAAACAAACCAAATGATCCTTAGATTTGAGACTCCAATCAAAA
TATATATGTATGTATACATATTTCTTTTCGTTGTTGAATTGGAGAACAAAAGTCTGAGTGGATATTCTTTGAAGTTGGCTATAGAAGGGATGTACAAGTTTCAAAATAAT
TTTTTGTTTTTTTTTTATTACATAATAAACAACAATGGTGGACAATGTAGAATCGTAGGTTATGAAGGAGAAGTTGAAACACAACTCCAAATCATTCTCCGTCCCATGAT
GTTTTTCCTAAATTTTACATATAAAGCCAATAAGATATGGTTTTATAAGACAATAACTAATTCTCATAGTGTAATGAGTACCCCAGCTTTTGGAACATATGGTCAATATG
TGATCTATTTTCCCCAATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGATAATTGTCTCGAACCGTTCCCCTCACTTCGTCGCAACGCTGCAATTATCGCGACGATTGTTCACCAATTTTTCCGTCGATCATAATAAAAGTTCCAAGGTTTCCCT
TGAACCCTTCCACGATGCTATGTTGGATGGCGGAAGTTTTTCTTCAATGACAATCCATCTTTTGGATAAAACAAACCAAATGATCCTTAGATTTGAGACTCCAATCAAAA
TATATATGTATGTATACATATTTCTTTTCGTTGTTGAATTGGAGAACAAAAGTCTGAGTGGATATTCTTTGAAGTTGGCTATAGAAGGGATGTACAAGTTTCAAAATAAT
TTTTTGTTTTTTTTTTATTACATAATAAACAACAATGGTGGACAATGTAGAATCGTAGGTTATGAAGGAGAAGTTGAAACACAACTCCAAATCATTCTCCGTCCCATGAT
GTTTTTCCTAAATTTTACATATAAAGCCAATAAGATATGGTTTTATAAGACAATAACTAATTCTCATAGTGTAATGAGTACCCCAGCTTTTGGAACATATGGTCAATATG
TGATCTATTTTCCCCAATTATGA
Protein sequenceShow/hide protein sequence
MIIVSNRSPHFVATLQLSRRLFTNFSVDHNKSSKVSLEPFHDAMLDGGSFSSMTIHLLDKTNQMILRFETPIKIYMYVYIFLFVVELENKSLSGYSLKLAIEGMYKFQNN
FLFFFYYIINNNGGQCRIVGYEGEVETQLQIILRPMMFFLNFTYKANKIWFYKTITNSHSVMSTPAFGTYGQYVIYFPQL