; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G00930 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G00930
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationClcChr04:2695984..2697813
RNA-Seq ExpressionClc04G00930
SyntenyClc04G00930
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051398.1 Integrase, catalytic core [Cucumis melo var. makuwa]3.5e-3553.29Show/hide
Query:  VSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLLEWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCK
        V++  +S+FIN RPRG IQ ++G+RQG+PLS FLFLLVS+VLS L+  LH K  +EE        +W S +K+NWEKSA+C VN++  +  S  S LNCK
Subjt:  VSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLLEWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCK

Query:  ATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFS
           L  +Y GL LGGYPK   F Q V++K Q KL  WKR N SRGGR TL S
Subjt:  ATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFS

TYK01565.1 hypothetical protein E5676_scaffold451G001420 [Cucumis melo var. makuwa]1.2e-3547.22Show/hide
Query:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE-------------------------------ENVR-TIDLL
        MGCV + ++S+FIN  PRG +  +RGI+QG PLS FLFLL+SKVL  L+  LH  G +E                               +N+R T+D  
Subjt:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE-------------------------------ENVR-TIDLL

Query:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRG
        EWCSGQKVNW+KSALCG+NV   EL S+ + L  K  HL  +YLGL LGGY K+ +  Q V +K+  KLDKWKR+N SRG
Subjt:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRG

TYK31266.1 hypothetical protein E5676_scaffold455G005560 [Cucumis melo var. makuwa]3.2e-2845.22Show/hide
Query:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE--------------------------------ENVRTIDLL
        MGC+ +PKFSIF+N +PRG I  +RGIRQG+P S FLFLLVS+VL  ++ +LH  G+FE                                +    I L 
Subjt:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE--------------------------------ENVRTIDLL

Query:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETF
        EWCSGQKVNWEKSA+ GVNV+  +L    + L CK   L  +YLGL LGGYP ++ F
Subjt:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETF

XP_038880332.1 uncharacterized protein LOC120071973 [Benincasa hispida]5.4e-5257.38Show/hide
Query:  GCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE-------------------------------ENVRTIDLLEW
        GCVS+PKFSIFI+ RPRG I+  RGIRQG+P S FLFLLVS+VLS L+ARLHEKGK+E                               EN+RTI++ EW
Subjt:  GCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE-------------------------------ENVRTIDLLEW

Query:  CSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATL
        CS QKVNWEKSA+CG+N++  ++ SV ++LNCK  HL  MYLGL LGGYPK  +F Q VI+K+QGKLDKW+R+N SRGG+ATL
Subjt:  CSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATL

XP_038889274.1 uncharacterized protein LOC120079176 [Benincasa hispida]8.2e-4049.73Show/hide
Query:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHE----------KGKFEENV----------------------RTIDLL
        MGCV +P FS+FIN RPRG +   RGIRQG+PLS  LFL+VS+VLS L+ +L +          K K + ++                      +T++  
Subjt:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHE----------KGKFEENV----------------------RTIDLL

Query:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATL
        +WCSGQKVNWEKSAL G+NVD  EL S  + LNCKA HL F YLGL LGGY KK +F Q V+   Q KLDKWK +N SRG R TL
Subjt:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATL

TrEMBL top hitse value%identityAlignment
A0A438ESM4 Putative ribonuclease H protein4.7e-2542.95Show/hide
Query:  GCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLL----EWCSGQKVNWEKSALCGVNVDGIELYSVV
        GC+SS  F+I +N   +G ++ +RG+RQG+PLS FLF LV+ VLS ++ R  E+   EE ++T+  L       SG KVN  KS++ G+N+D   L  + 
Subjt:  GCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLL----EWCSGQKVNWEKSALCGVNVDGIELYSVV

Query:  SKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATL
          L+CKA+    +YLGL LGG PK   F   V+E+I  +LD W++   S GGR TL
Subjt:  SKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATL

A0A438FW30 Transposon TX1 uncharacterized 149 kDa protein1.3e-2729.41Show/hide
Query:  GCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLL----EWCSGQKVNWEKSALCGVNVDGIELYSVV
        GC+SS  F+I +N   +G ++ +RG+RQG+PLS FLF LV+ VLS ++ R  E+    E  RT+  L       SG KVN  KS++ G+N+D   L  + 
Subjt:  GCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLL----EWCSGQKVNWEKSALCGVNVDGIELYSVV

Query:  SKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFSGF----------------------------------------
          L+CKA+    +YLGL LGG PK   F   V+E+I  +LD W++   S GGR TL  G+                                        
Subjt:  SKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFSGF----------------------------------------

Query:  -------WDS---------------SLSS--WSINFRRLLKEEEITDFQILLGLVTAKAISVN-PDKRVWDLEANGNFSDNILQLVI----GPKFKSKPR
               W +               S+S+  W++NFRR L + EI D + L+  +    +S + PD R+W L ++  FS     L +    G       +
Subjt:  -------WDS---------------SLSS--WSINFRRLLKEEEITDFQILLGLVTAKAISVN-PDKRVWDLEANGNFSDNILQLVI----GPKFKSKPR

Query:  LIWSNEVKAMLVE-IWFKRNQRV
         +W+++V   +   +W   +++V
Subjt:  LIWSNEVKAMLVE-IWFKRNQRV

A0A5A7U808 Integrase, catalytic core1.7e-3553.29Show/hide
Query:  VSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLLEWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCK
        V++  +S+FIN RPRG IQ ++G+RQG+PLS FLFLLVS+VLS L+  LH K  +EE        +W S +K+NWEKSA+C VN++  +  S  S LNCK
Subjt:  VSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLLEWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCK

Query:  ATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFS
           L  +Y GL LGGYPK   F Q V++K Q KL  WKR N SRGGR TL S
Subjt:  ATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFS

A0A5D3BQV1 Reverse transcriptase domain-containing protein5.9e-3647.22Show/hide
Query:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE-------------------------------ENVR-TIDLL
        MGCV + ++S+FIN  PRG +  +RGI+QG PLS FLFLL+SKVL  L+  LH  G +E                               +N+R T+D  
Subjt:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE-------------------------------ENVR-TIDLL

Query:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRG
        EWCSGQKVNW+KSALCG+NV   EL S+ + L  K  HL  +YLGL LGGY K+ +  Q V +K+  KLDKWKR+N SRG
Subjt:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRG

A0A5D3E6J9 Reverse transcriptase domain-containing protein1.6e-2845.22Show/hide
Query:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE--------------------------------ENVRTIDLL
        MGC+ +PKFSIF+N +PRG I  +RGIRQG+P S FLFLLVS+VL  ++ +LH  G+FE                                +    I L 
Subjt:  MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFE--------------------------------ENVRTIDLL

Query:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETF
        EWCSGQKVNWEKSA+ GVNV+  +L    + L CK   L  +YLGL LGGYP ++ F
Subjt:  EWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFMYLGLSLGGYPKKETF

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012503.0e-0548.89Show/hide
Query:  INVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGK
        IN  P+G++  +RG+RQG+PLS +LF+L ++VLS L  R  E+G+
Subjt:  INVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGK

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.2e-0648.89Show/hide
Query:  INVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGK
        IN  P+G++  +RG+RQG+PLS +LF+L ++VLS L  R  E+G+
Subjt:  INVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTGTGTTTCAAGTCCAAAATTCTCAATTTTCATTAATGTAAGACCAAGGGGAATAATTCAAGTTGCAAGAGGCATTAGACAAGGCGAGCCTCTTTCGGCTTTTCT
ATTTCTATTAGTTAGCAAGGTGCTAAGCCCTCTAATGGCTAGGCTCCATGAAAAGGGCAAATTTGAAGAAAATGTAAGGACCATTGACCTACTTGAGTGGTGCTCGGGGC
AAAAAGTCAATTGGGAAAAATCTGCTTTATGTGGTGTTAATGTTGATGGCATTGAGCTGTATTCGGTGGTGTCCAAGTTGAATTGCAAGGCGACCCACCTTTCTTTTATG
TACCTTGGACTGTCTTTAGGAGGCTATCCCAAAAAAGAAACATTTCGGCAATCGGTAATTGAAAAAATTCAAGGGAAATTGGATAAATGGAAGAGATATAACTTCTCTAG
AGGTGGCCGAGCCACACTTTTTTCAGGTTTTTGGGACTCCTCACTCTCCTCCTGGTCCATCAACTTTCGAAGGTTGCTAAAAGAGGAAGAGATTACAGATTTTCAAATCC
TCCTTGGTCTTGTTACAGCCAAAGCAATCTCGGTAAATCCTGATAAACGAGTTTGGGACCTAGAAGCTAATGGAAATTTCTCGGACAATATTTTGCAGCTAGTGATTGGT
CCTAAATTTAAATCAAAACCAAGGCTGATTTGGTCCAATGAAGTTAAAGCCATGCTAGTAGAAATTTGGTTTAAAAGAAATCAACGAGTATTTCATGACAAAGCTTCCCC
TTGGACGATACGATTTGAGATAGCTCGACTTAATGCATCCTTGCGGTGTTCTCTATCCAAGCTGTTTGAAGATTTTTCCATCCAAGATATCAGCCTTAACTGGCAATCCT
TCATTTTCCCTCCCTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCTGTGTTTCAAGTCCAAAATTCTCAATTTTCATTAATGTAAGACCAAGGGGAATAATTCAAGTTGCAAGAGGCATTAGACAAGGCGAGCCTCTTTCGGCTTTTCT
ATTTCTATTAGTTAGCAAGGTGCTAAGCCCTCTAATGGCTAGGCTCCATGAAAAGGGCAAATTTGAAGAAAATGTAAGGACCATTGACCTACTTGAGTGGTGCTCGGGGC
AAAAAGTCAATTGGGAAAAATCTGCTTTATGTGGTGTTAATGTTGATGGCATTGAGCTGTATTCGGTGGTGTCCAAGTTGAATTGCAAGGCGACCCACCTTTCTTTTATG
TACCTTGGACTGTCTTTAGGAGGCTATCCCAAAAAAGAAACATTTCGGCAATCGGTAATTGAAAAAATTCAAGGGAAATTGGATAAATGGAAGAGATATAACTTCTCTAG
AGGTGGCCGAGCCACACTTTTTTCAGGTTTTTGGGACTCCTCACTCTCCTCCTGGTCCATCAACTTTCGAAGGTTGCTAAAAGAGGAAGAGATTACAGATTTTCAAATCC
TCCTTGGTCTTGTTACAGCCAAAGCAATCTCGGTAAATCCTGATAAACGAGTTTGGGACCTAGAAGCTAATGGAAATTTCTCGGACAATATTTTGCAGCTAGTGATTGGT
CCTAAATTTAAATCAAAACCAAGGCTGATTTGGTCCAATGAAGTTAAAGCCATGCTAGTAGAAATTTGGTTTAAAAGAAATCAACGAGTATTTCATGACAAAGCTTCCCC
TTGGACGATACGATTTGAGATAGCTCGACTTAATGCATCCTTGCGGTGTTCTCTATCCAAGCTGTTTGAAGATTTTTCCATCCAAGATATCAGCCTTAACTGGCAATCCT
TCATTTTCCCTCCCTTGTAG
Protein sequenceShow/hide protein sequence
MGCVSSPKFSIFINVRPRGIIQVARGIRQGEPLSAFLFLLVSKVLSPLMARLHEKGKFEENVRTIDLLEWCSGQKVNWEKSALCGVNVDGIELYSVVSKLNCKATHLSFM
YLGLSLGGYPKKETFRQSVIEKIQGKLDKWKRYNFSRGGRATLFSGFWDSSLSSWSINFRRLLKEEEITDFQILLGLVTAKAISVNPDKRVWDLEANGNFSDNILQLVIG
PKFKSKPRLIWSNEVKAMLVEIWFKRNQRVFHDKASPWTIRFEIARLNASLRCSLSKLFEDFSIQDISLNWQSFIFPPL