; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G11640 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G11640
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr06:21894935..21895692
RNA-Seq ExpressionClc06G11640
SyntenyClc06G11640
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida]8.6e-4753.88Show/hide
Query:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDL-------------SYLGKDRAIGQLSEDPY----------
        +RSLKKQY+A+ EMLSQSGF WNEEFKCV            SHP+ KG+WNK FPHYDDL             S L +D    ++ E+P           
Subjt:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDL-------------SYLGKDRAIGQLSEDPY----------

Query:  VMATNAFKEFEDEIRLESQDCHTP----ENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLR
        V ++   K      ++E  D         +THMGRLA+ QKKKY+LEFG +K+VVNAIYNIDGL++D QVTLIDL++TDIQKTDCFLAVPEHA KRY LR
Subjt:  VMATNAFKEFEDEIRLESQDCHTP----ENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLR

Query:  LLGRNM
        LLGRNM
Subjt:  LLGRNM

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]2.0e-4351.67Show/hide
Query:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLS-YLGKDRA---------------IGQLSEDPYVMAT---
        +RSLKKQY+A+ EMLSQSGF WNEEFKCV            SHP+ KG+W K FPHYDDLS   GKDRA                 ++ E+P   +T   
Subjt:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLS-YLGKDRA---------------IGQLSEDPYVMAT---

Query:  -------NAFKEFEDEIRLESQDCHTP----ENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRY
                  K      ++E  D        ++THMGRLA+ Q +KY+LE    K+VVNAIYNID L ++DQVTLIDL++TDIQKTDCFLAVPEHA+KRY
Subjt:  -------NAFKEFEDEIRLESQDCHTP----ENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRY

Query:  YLRLLGRNM
         LRLLGRNM
Subjt:  YLRLLGRNM

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]3.2e-7075Show/hide
Query:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLS-YLGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCH
        +RSLKKQ +A+ EMLSQSGF WNEEFKCV            SHP+ KG+WNK FPHYDDLS   GK +A+GQ SEDPYVM TNAF+EFEDEIRL SQDCH
Subjt:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLS-YLGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCH

Query:  TPENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLRLLGRNM
        TPE+THMGRLA+ QK+KY+LEFG RK+VVNAIYNIDGL++DDQVTLIDLL+TDIQKT+CFLAVPEHA+KRY LRLLGRNM
Subjt:  TPENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLRLLGRNM

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]5.6e-4652.63Show/hide
Query:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLS-YLGKDRA---------------IGQLSEDPYVMATNAF
        +RSLKKQY+A+ EMLSQSGFGWNEEFKCV            SH + KG+WNK F HYDDLS   GKDRA                 ++ E+P   +T   
Subjt:  MRSLKKQYDALLEMLSQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLS-YLGKDRA---------------IGQLSEDPYVMATNAF

Query:  K--------------EFEDEIRLESQDCHTPENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRY
                        F+ E+    +     ++THMGRLA+ QK+KY+LEFG RK+VVNAIY+IDGL++DDQVT IDLL+TDIQKTDCFLAVPEHA+KRY
Subjt:  K--------------EFEDEIRLESQDCHTPENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRY

Query:  YLRLLGRNM
         L LL RNM
Subjt:  YLRLLGRNM

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida]8.0e-3749.73Show/hide
Query:  MLSQSGFGWNEEFKCV---------SHPSVKGIWNKLFPHYDDLS-YLGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCHTPE-------------
        MLSQSGFGWNEEFKCV         SHP+ KG+WNK FPHYDDLS   GKDRA+GQ SEDPYVMA NAF+EFEDEIRL SQDC T E             
Subjt:  MLSQSGFGWNEEFKCV---------SHPSVKGIWNKLFPHYDDLS-YLGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCHTPE-------------

Query:  --------------------------------------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQV
                                                    +THMGRLA+ QK+KY+LEF H K+VVNAIY+IDGL++DD++
Subjt:  --------------------------------------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQV

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859531.0e-1328.63Show/hide
Query:  MRSLKKQYDALLEML--SQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKEFEDEIRL----
        ++SLKK Y A+ EM   S SGFGWNEEF+C+            SHP+ KG+ +K FP+YDDLSY+ GKDRA G  SE    + +N    F D I L    
Subjt:  MRSLKKQYDALLEML--SQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKEFEDEIRL----

Query:  ----------------------------ESQDCHTPE----------------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQV
                                    E ++C +                        N  +  +A   K+K  +E   R +VV  + +I  L   D+ 
Subjt:  ----------------------------ESQDCHTPE----------------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQV

Query:  TLIDLLITDIQKTDCFLAVPEHAQKRY
         L+ +L   ++  + FL++P   +  Y
Subjt:  TLIDLLITDIQKTDCFLAVPEHAQKRY

A0A2N9G0Q3 Myb_DNA-bind_3 domain-containing protein1.4e-1045.98Show/hide
Query:  RSLKKQYDALLEMLSQ-SGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKE
        R LKKQY A+ +ML + SGFGW++ +KC+             HP   G+ NK FPHYDDL+++ GKDRAIG  +E P  MA    +E
Subjt:  RSLKKQYDALLEMLSQ-SGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKE

A0A5A7U0H7 Retrotransposon protein1.0e-1328.63Show/hide
Query:  MRSLKKQYDALLEML--SQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKEFEDEIRL----
        ++SLKK Y A+ EM   S SGFGWNEEF+C+            SHP+ KG+ +K FP+YDDLSY+ GKDRA G  SE    + +N    F D I L    
Subjt:  MRSLKKQYDALLEML--SQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKEFEDEIRL----

Query:  ----------------------------ESQDCHTPE----------------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQV
                                    E ++C +                        N  +  +A   K+K  +E   R +VV  + +I  L   D+ 
Subjt:  ----------------------------ESQDCHTPE----------------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQV

Query:  TLIDLLITDIQKTDCFLAVPEHAQKRY
         L+ +L   ++  + FL++P   +  Y
Subjt:  TLIDLLITDIQKTDCFLAVPEHAQKRY

A0A5D3DG22 Retrotransposon protein8.7e-1331.44Show/hide
Query:  SQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKEFEDEIRL-----------ESQDCH-TPE
        S SGFGWNEEF+C+            SHP+ KG+ +K FP+YDDLSY+ GKDRA G  SE    + +N    F D I L            SQ  H +P+
Subjt:  SQSGFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSYL-GKDRAIGQLSEDPYVMATNAFKEFEDEIRL-----------ESQDCH-TPE

Query:  -----------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLRLLGRNM
                         N  +  +A   K+K  +E   R +VV  + +I  L    +  L+ +L   ++    FL++P   +  Y   LL  N+
Subjt:  -----------------NTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLRLLGRNM

A0A6J1DW73 uncharacterized protein LOC1110250186.7e-1326.98Show/hide
Query:  GFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSY-LGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCHTP----------------
        GFGWN++ KC+            SHP+ KG+ NK  PHYDDL+   GKDRA G   + P  MA++A     ++   E+QD + P                
Subjt:  GFGWNEEFKCV------------SHPSVKGIWNKLFPHYDDLSY-LGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCHTP----------------

Query:  --------------------------------------ENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVP
                                              +  H+ ++AT   KK + +   RK V + +  I  L  +D V L+ +L+T+++K+  FL VP
Subjt:  --------------------------------------ENTHMGRLATCQKKKYKLEFGHRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVP

Query:  EHAQKRYYLRLLGRN
           +K + ++LLG++
Subjt:  EHAQKRYYLRLLGRN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGTTTAAAGAAACAGTACGACGCATTATTAGAGATGTTAAGTCAGTCGGGGTTTGGGTGGAACGAGGAGTTCAAATGTGTCAGTCATCCCAGTGTGAAAGGGAT
ATGGAACAAGCTATTCCCCCATTATGATGACCTCTCGTATCTGGGAAAAGACAGAGCAATAGGACAATTAAGTGAGGACCCATACGTGATGGCAACGAATGCATTCAAAG
AGTTTGAAGATGAGATTCGACTTGAATCACAGGATTGTCACACACCTGAGAACACACACATGGGTAGACTTGCAACATGCCAGAAGAAAAAGTATAAGTTGGAGTTTGGG
CATCGGAAGAAAGTAGTGAACGCCATATACAACATTGATGGCTTGAATAAGGATGATCAGGTCACCCTTATTGACCTCCTTATCACAGACATTCAGAAGACAGATTGCTT
CCTTGCAGTACCAGAACACGCACAGAAGAGGTACTATCTTCGTCTACTAGGACGAAACATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGTTTAAAGAAACAGTACGACGCATTATTAGAGATGTTAAGTCAGTCGGGGTTTGGGTGGAACGAGGAGTTCAAATGTGTCAGTCATCCCAGTGTGAAAGGGAT
ATGGAACAAGCTATTCCCCCATTATGATGACCTCTCGTATCTGGGAAAAGACAGAGCAATAGGACAATTAAGTGAGGACCCATACGTGATGGCAACGAATGCATTCAAAG
AGTTTGAAGATGAGATTCGACTTGAATCACAGGATTGTCACACACCTGAGAACACACACATGGGTAGACTTGCAACATGCCAGAAGAAAAAGTATAAGTTGGAGTTTGGG
CATCGGAAGAAAGTAGTGAACGCCATATACAACATTGATGGCTTGAATAAGGATGATCAGGTCACCCTTATTGACCTCCTTATCACAGACATTCAGAAGACAGATTGCTT
CCTTGCAGTACCAGAACACGCACAGAAGAGGTACTATCTTCGTCTACTAGGACGAAACATGTAG
Protein sequenceShow/hide protein sequence
MRSLKKQYDALLEMLSQSGFGWNEEFKCVSHPSVKGIWNKLFPHYDDLSYLGKDRAIGQLSEDPYVMATNAFKEFEDEIRLESQDCHTPENTHMGRLATCQKKKYKLEFG
HRKKVVNAIYNIDGLNKDDQVTLIDLLITDIQKTDCFLAVPEHAQKRYYLRLLGRNM