; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04150 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04150
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr06:4340233..4345306
RNA-Seq ExpressionClc06G04150
SyntenyClc06G04150
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016903187.1 PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo]5.0e-5271.34Show/hide
Query:  PSFFNLISWPKAKRVWLFKSNDNFHAIQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQE
        P FF++I+  ++ R   +        +QM+HHCFI+Y VD+DHTSRISLESFHDALLDGGAS SMTIHLL NI Q++LRFESSSH P V HEL+LTPSQE
Subjt:  PSFFNLISWPKAKRVWLFKSNDNFHAIQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQE

Query:  EDLGEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE
        EDLGEVDYAK FSIDS++LRRVI  LPIFH DSICVTAT SQVKFSIAS+ I+LTKE
Subjt:  EDLGEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE

XP_022959354.1 uncharacterized protein LOC111460352 isoform X1 [Cucurbita moschata]3.0e-4149.07Show/hide
Query:  MNFKYSWKSFSIIVSSSSPT-CIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLL
        M+FK S    ++IVS  SPT  IIELQ MPQFF  F CN     +S++S+  +Y  LF MK   F ++  S  E  +  L  IF + S  +  + +LPLL
Subjt:  MNFKYSWKSFSIIVSSSSPT-CIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLL

Query:  LSYQEFDVGVINYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAK
        L+++E DVGVINYG FVSID  +F  F   L+D T+V VT+ NS  K +GGGE+   F L Q++ ECIIGGV  GD TQF +I N    F NL  W KAK
Subjt:  LSYQEFDVGVINYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAK

Query:  RVWLFKSNDNFHAI
        RVW FKSNDN H +
Subjt:  RVWLFKSNDNFHAI

XP_023000630.1 uncharacterized protein LOC111494874 [Cucurbita maxima]1.9e-3552.6Show/hide
Query:  NLISWPKAKRVWLFKSNDNFHA-IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDL
        ++I  P + ++   + +  F A +Q+ H CF  Y+V+ DH SRISLES HDALLD G+SS+MTIHLL+N N M+LRFE+ +H P +RH+  L P QE+ +
Subjt:  NLISWPKAKRVWLFKSNDNFHA-IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDL

Query:  GEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE
         E++Y+K  ++DSR+LR+VI ELP+FH DS+CVT TSS+V+FSIAS  +I  KE
Subjt:  GEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE

XP_023549339.1 uncharacterized protein LOC111807722 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-4350.93Show/hide
Query:  MNFKYSWKSFSIIVSSSSPT-CIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLL
        M+FK S    ++IVS  SPT  IIELQ MPQFF  FFCN     +S++S+  +Y  LF MK   F ++  S  E  +  L  IF + S  +  +  LPLL
Subjt:  MNFKYSWKSFSIIVSSSSPT-CIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLL

Query:  LSYQEFDVGVINYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAK
        L+++EFDVGVINYG FVSID  +F  F + L+D T+VTVT+ NS VK + GGE+   F L Q+R ECIIGGV  GD TQF +I N    F NL  W KAK
Subjt:  LSYQEFDVGVINYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAK

Query:  RVWLFKSNDNFHAI
        RVW FKSNDN H +
Subjt:  RVWLFKSNDNFHAI

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus]8.8e-4980.62Show/hide
Query:  MSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITELPI
        M+++CFI+Y VD+DHTSRISLESFHDALLDGG S SMTIHLL NINQM+LRFESSSH P VRHEL+L PSQEEDLGE+DYAK FSIDS+ LRRVI  LPI
Subjt:  MSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITELPI

Query:  FHEDSICVTATSSQVKFSIASEAIILTKE
        FH DSICVTAT SQVKFSIAS+ I+LTKE
Subjt:  FHEDSICVTATSSQVKFSIASEAIILTKE

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980106.6e-3458.33Show/hide
Query:  IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSH-VPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITE
        +Q+S   F +++VD + +S++SL+ FHDA+LDGG+ SSMTIHLL   NQM+LRFE+ SH VP + HEL L+P Q E+LG+V+Y   F++ SRELRR+I E
Subjt:  IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSH-VPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITE

Query:  LPIFHEDSICVTATSSQVKFSIASEAIILTKE
        LP+FH+D++ VT T SQVKFSI S+ IILTKE
Subjt:  LPIFHEDSICVTATSSQVKFSIASEAIILTKE

A0A1S4E4N8 uncharacterized protein LOC1035022632.4e-5271.34Show/hide
Query:  PSFFNLISWPKAKRVWLFKSNDNFHAIQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQE
        P FF++I+  ++ R   +        +QM+HHCFI+Y VD+DHTSRISLESFHDALLDGGAS SMTIHLL NI Q++LRFESSSH P V HEL+LTPSQE
Subjt:  PSFFNLISWPKAKRVWLFKSNDNFHAIQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQE

Query:  EDLGEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE
        EDLGEVDYAK FSIDS++LRRVI  LPIFH DSICVTAT SQVKFSIAS+ I+LTKE
Subjt:  EDLGEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE

A0A6J1H7T7 uncharacterized protein LOC111460352 isoform X11.5e-4149.07Show/hide
Query:  MNFKYSWKSFSIIVSSSSPT-CIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLL
        M+FK S    ++IVS  SPT  IIELQ MPQFF  F CN     +S++S+  +Y  LF MK   F ++  S  E  +  L  IF + S  +  + +LPLL
Subjt:  MNFKYSWKSFSIIVSSSSPT-CIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLL

Query:  LSYQEFDVGVINYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAK
        L+++E DVGVINYG FVSID  +F  F   L+D T+V VT+ NS  K +GGGE+   F L Q++ ECIIGGV  GD TQF +I N    F NL  W KAK
Subjt:  LSYQEFDVGVINYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAK

Query:  RVWLFKSNDNFHAI
        RVW FKSNDN H +
Subjt:  RVWLFKSNDNFHAI

A0A6J1HGU2 uncharacterized protein LOC1114642095.0e-3458.02Show/hide
Query:  IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITEL
        +Q+ H CF  Y+V+ DH SRISLES HDALLD G+SSSMTIHLL+N N M LRFE+ +H P +RH++ L P QE+ + E++Y+K  ++D R+LR+VI EL
Subjt:  IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITEL

Query:  PIFHEDSICVTATSSQVKFSIASEAIILTKE
        P+F  DS+CVT TSS+V+FSIAS  +I  KE
Subjt:  PIFHEDSICVTATSSQVKFSIASEAIILTKE

A0A6J1KIW5 uncharacterized protein LOC1114948749.2e-3652.6Show/hide
Query:  NLISWPKAKRVWLFKSNDNFHA-IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDL
        ++I  P + ++   + +  F A +Q+ H CF  Y+V+ DH SRISLES HDALLD G+SS+MTIHLL+N N M+LRFE+ +H P +RH+  L P QE+ +
Subjt:  NLISWPKAKRVWLFKSNDNFHA-IQMSHHCFIHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDL

Query:  GEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE
         E++Y+K  ++DSR+LR+VI ELP+FH DS+CVT TSS+V+FSIAS  +I  KE
Subjt:  GEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVKFSIASEAIILTKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTCAAATACTCATGGAAGAGTTTCTCCATAATAGTTTCATCTTCTTCCCCAACTTGCATTATAGAGCTTCAAATCATGCCTCAATTCTTCCCTTTTTTCTTTTG
CAATAATCACATCTTTAAATATTCCACCATTTCCCTTACCCAATTTTACCCTACCTTGTTTCTTATGAAACACAATGGTTTTTCTGTCATGATTTTCAGTCTTTGTGAAA
GCCTTGATCATCTCCTTGACCTTATATTTCTCACTTCTTCTGGTGATAATCAATGGCAAACTTACTTGCCTTTATTGCTGTCTTATCAAGAATTTGATGTAGGAGTAATT
AACTATGGAAGCTTTGTCTCAATTGATTCAGTACAATTCTTAGCATTTTTAATCATGTTATCGGATACTACATACGTTACGGTCACTGTATGCAATTCACACGTCAAGTT
GACTGGAGGGGGAGAGGATCCGGAGTACTTTATTCTTAGCCAACAGAGAGGAGAATGCATAATTGGAGGTGTTGCAGCAGGAGATGTAACTCAATTTGTCCTCATTTTCA
ATCCAATTCCATCTTTCTTCAATTTGATAAGTTGGCCAAAAGCAAAAAGAGTTTGGTTATTCAAATCAAATGATAATTTTCACGCCATTCAAATGTCCCACCATTGCTTC
ATCCACTACAATGTCGATGATGATCACACTTCAAGAATTTCCCTTGAATCCTTCCATGACGCTCTCTTGGATGGTGGAGCTTCTTCTTCAATGACCATTCATCTTCTCCA
AAACATAAACCAAATGATGCTTAGATTCGAATCTTCAAGTCATGTGCCACACGTGCGTCATGAATTGACATTGACACCGTCACAAGAAGAGGATCTTGGAGAAGTTGATT
ATGCAAAATGTTTCTCAATTGATTCAAGGGAATTAAGACGTGTTATAACAGAATTACCTATCTTCCATGAGGACTCAATATGTGTTACTGCAACCAGTTCACAAGTCAAA
TTCTCTATTGCTTCTGAAGCGATTATTCTTACCAAAGAGGTTATGAGGAAGAAGATGAAATTGAAATCCGAATCAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTCAAATACTCATGGAAGAGTTTCTCCATAATAGTTTCATCTTCTTCCCCAACTTGCATTATAGAGCTTCAAATCATGCCTCAATTCTTCCCTTTTTTCTTTTG
CAATAATCACATCTTTAAATATTCCACCATTTCCCTTACCCAATTTTACCCTACCTTGTTTCTTATGAAACACAATGGTTTTTCTGTCATGATTTTCAGTCTTTGTGAAA
GCCTTGATCATCTCCTTGACCTTATATTTCTCACTTCTTCTGGTGATAATCAATGGCAAACTTACTTGCCTTTATTGCTGTCTTATCAAGAATTTGATGTAGGAGTAATT
AACTATGGAAGCTTTGTCTCAATTGATTCAGTACAATTCTTAGCATTTTTAATCATGTTATCGGATACTACATACGTTACGGTCACTGTATGCAATTCACACGTCAAGTT
GACTGGAGGGGGAGAGGATCCGGAGTACTTTATTCTTAGCCAACAGAGAGGAGAATGCATAATTGGAGGTGTTGCAGCAGGAGATGTAACTCAATTTGTCCTCATTTTCA
ATCCAATTCCATCTTTCTTCAATTTGATAAGTTGGCCAAAAGCAAAAAGAGTTTGGTTATTCAAATCAAATGATAATTTTCACGCCATTCAAATGTCCCACCATTGCTTC
ATCCACTACAATGTCGATGATGATCACACTTCAAGAATTTCCCTTGAATCCTTCCATGACGCTCTCTTGGATGGTGGAGCTTCTTCTTCAATGACCATTCATCTTCTCCA
AAACATAAACCAAATGATGCTTAGATTCGAATCTTCAAGTCATGTGCCACACGTGCGTCATGAATTGACATTGACACCGTCACAAGAAGAGGATCTTGGAGAAGTTGATT
ATGCAAAATGTTTCTCAATTGATTCAAGGGAATTAAGACGTGTTATAACAGAATTACCTATCTTCCATGAGGACTCAATATGTGTTACTGCAACCAGTTCACAAGTCAAA
TTCTCTATTGCTTCTGAAGCGATTATTCTTACCAAAGAGGTTATGAGGAAGAAGATGAAATTGAAATCCGAATCAATCTGA
Protein sequenceShow/hide protein sequence
MNFKYSWKSFSIIVSSSSPTCIIELQIMPQFFPFFFCNNHIFKYSTISLTQFYPTLFLMKHNGFSVMIFSLCESLDHLLDLIFLTSSGDNQWQTYLPLLLSYQEFDVGVI
NYGSFVSIDSVQFLAFLIMLSDTTYVTVTVCNSHVKLTGGGEDPEYFILSQQRGECIIGGVAAGDVTQFVLIFNPIPSFFNLISWPKAKRVWLFKSNDNFHAIQMSHHCF
IHYNVDDDHTSRISLESFHDALLDGGASSSMTIHLLQNINQMMLRFESSSHVPHVRHELTLTPSQEEDLGEVDYAKCFSIDSRELRRVITELPIFHEDSICVTATSSQVK
FSIASEAIILTKEVMRKKMKLKSESI