; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G14200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G14200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationClcChr02:26548442..26549435
RNA-Seq ExpressionClc02G14200
SyntenyClc02G14200
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056799.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-2432.8Show/hide
Query:  VGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSP----SPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTD
        VGLK  M  AQL E+     +++W  G PN          ++    TT+ SSP    SPT           N  S SNT  F+   G      ++RR T+
Subjt:  VGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSP----SPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTD

Query:  REMQSRREKGLCYRCDEKYAVGHRCK-KELNLFI-----------QQETDGEDIE----EELGTDAAV--------------------------------
         E+Q+R+EKGLCYRCDE ++ G RCK +EL L +           +++T G  +E     EL  ++ V                                
Subjt:  REMQSRREKGLCYRCDEKYAVGHRCK-KELNLFI-----------QQETDGEDIE----EELGTDAAV--------------------------------

Query:  ------EGEAGFTENDE--VTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDVKLEGDRSVM
              E +   TE  +  V     ++V+  G+C GVV+ LPGLT+V DF PL LG+ D++LG+QWL   G +  DW    M F VG+  V L+GD S+ 
Subjt:  ------EGEAGFTENDE--VTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDVKLEGDRSVM

Query:  KSQISLKSMMK
        + +ISLK ++K
Subjt:  KSQISLKSMMK

TXG69438.1 hypothetical protein EZV62_004373 [Acer yangbiense]7.2e-2431.9Show/hide
Query:  EMRMFSLVGLKSKMCMAQLIEDMEEAR-QLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFR
        E+R+   + L   M +AQ IE    A   LK  GG     +KGSG      +P ++   +P PT   T+   P+ N                  P G  R
Subjt:  EMRMFSLVGLKSKMCMAQLIEDMEEAR-QLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFR

Query:  RLTDREMQSRREKGLCYRCDEKYAVGHRC-KKELNLFIQQETDGEDIEEEL-------------------GTDAAVEGEAGF------------------
        RLTD E+Q +R  GLCYRCDEK++ GH+C KKELN+ I  + + E+  EE                      + ++    G                   
Subjt:  RLTDREMQSRREKGLCYRCDEKYAVGHRC-KKELNLFIQQETDGEDIEEEL-------------------GTDAAVEGEAGF------------------

Query:  --------------------------TENDEVTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGE
                                  TE   VT     SVR  GIC+GV L+L G+ IV +F PL LGS+D+ILG+QWL  LG    +W    M+F +G 
Subjt:  --------------------------TENDEVTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGE

Query:  WDVKLEGDRSVMKSQISLKSMMKQVK
          V L GD S+ K+ +SLK+MM+  K
Subjt:  WDVKLEGDRSVMKSQISLKSMMKQVK

XP_031737572.1 uncharacterized protein LOC116402461 [Cucumis sativus]1.7e-2537.61Show/hide
Query:  RDTNGRNFPTGTFRRLTDREMQSRREKGLCYRCDEKYAVGHRCK-KELNLFIQQ-------ETD--GEDIE----EELGTDAA-----------------
        +++  ++  +  +R++TD EM+ ++EKG C+RCD+K++  HRCK +ELN+ + Q       ETD  GE+IE    +E+ T+ A                 
Subjt:  RDTNGRNFPTGTFRRLTDREMQSRREKGLCYRCDEKYAVGHRCK-KELNLFIQQ-------ETD--GEDIE----EELGTDAA-----------------

Query:  VEGE-----------AGFTEN---DEVTRDCSQS----------------VRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDW
        ++GE            G T N   +EV ++   S                VR TG+C+ V L +  L+I ++F PL LGS D+ILGV WL  LGKV  D+
Subjt:  VEGE-----------AGFTEN---DEVTRDCSQS----------------VRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDW

Query:  STSEMQF-VGEWDVKLEGDRSVMKSQISLKSMMK
          SEM+F  GEW V L+GDRS+++SQ+SLKSMMK
Subjt:  STSEMQF-VGEWDVKLEGDRSVMKSQISLKSMMK

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida]8.1e-6044.18Show/hide
Query:  MEMRMFSLVGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSP-----TAARTITLNPTCNSVSPSNTVSFRDTNGRNFP
        +EMRMF L+GLK KM MAQ+IEDMEEAR+ KWGGG+PN  SK +G++       TT    P+P     + ARTI+LNPT    +PSN+V+ +D NGR+F 
Subjt:  MEMRMFSLVGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSP-----TAARTITLNPTCNSVSPSNTVSFRDTNGRNFP

Query:  TGTFRRLTDREMQSRREKGLCYRCDEKYAVGHRCK-KELNL-----------------------------FIQQETDGEDIEEELGTDAAVEG-------
         G F+RL+D +MQ+RR+KGLCY+C+EKY  GHRCK KEL++                              + + TD E  E  L + A ++        
Subjt:  TGTFRRLTDREMQSRREKGLCYRCDEKYAVGHRCK-KELNL-----------------------------FIQQETDGEDIEEELGTDAAVEG-------

Query:  ------------EAGFTEN--DE-----------------VTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWST
                    ++G + N  D+                 +      SVRTTG+C+GV+LNL  LTI+ND FPL LG+ D++LGVQWLM LG+VECDW T
Subjt:  ------------EAGFTEN--DE-----------------VTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWST

Query:  SEMQF-VGEWDVKLEGDRSVMKSQISLKSMMKQVK
        SEM+F +G+W V L+G+R++MK+QISLKSMMK V+
Subjt:  SEMQF-VGEWDVKLEGDRSVMKSQISLKSMMKQVK

XP_038907170.1 uncharacterized protein LOC120092972 [Benincasa hispida]3.0e-5442.13Show/hide
Query:  MRMFSLVGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGS--KGSGSNLTQTKPATTIASSPS--PTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGT
        MRMF LVG+K K+  AQLIED EEA + KWG  SPN     + +GS +T    +T+   + +  PT ARTI+LNP+ +S++ SN+V+ RD NG+    G 
Subjt:  MRMFSLVGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGS--KGSGSNLTQTKPATTIASSPS--PTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGT

Query:  FRRLTDREMQSRREKGLCYRCDEKYAVGHRCKKELNLFIQQETDGEDIEEELGTDAA-VEG---------------------------------------
        ++RL+D +MQSR +KGLCYRCDE+Y+ GHRCKKELNL I  + + E+   E G +   VEG                                       
Subjt:  FRRLTDREMQSRREKGLCYRCDEKYAVGHRCKKELNLFIQQETDGEDIEEELGTDAA-VEG---------------------------------------

Query:  ----------------------------------EAGFTEN---DEV----------TRDC------SQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGS
                                          ++G T N   D++          T  C       +SVRT GIC+GVVLNLP LTI+NDFFP+ LGS
Subjt:  ----------------------------------EAGFTEN---DEV----------TRDC------SQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGS

Query:  ADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDVKLEGDRSVMKSQISLKSMMKQV
        AD+++GVQWLM LG+VECDWSTS+M F VGE  V L+ DRS++KSQISLKSMMK +
Subjt:  ADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDVKLEGDRSVMKSQISLKSMMKQV

TrEMBL top hitse value%identityAlignment
A0A5A7UDR7 Ty3/gypsy retrotransposon protein3.8e-2332.85Show/hide
Query:  LKSKMCMAQLIEDMEEAR-QLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDT-NGRNFPTGTFRRLTDREMQ
        L   M +AQ++E+ E AR + K  G S   G K +G N    K +T   +  +             N+V P  T++ R +    N   G+++RL D E Q
Subjt:  LKSKMCMAQLIEDMEEAR-QLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDT-NGRNFPTGTFRRLTDREMQ

Query:  SRREKGLCYRCDEKYAVGHRCK----KELNLFIQQE-------TDGEDIEEELG-----------TDAAVEGEAGFTENDEV-TRDCSQSVRTTGICRGV
        +R+EKGLC+RC+EKY+  H+C+    +EL +F+  E        + E  E+ELG            + ++    G  +   +  R    +V+  G+C  +
Subjt:  SRREKGLCYRCDEKYAVGHRCK----KELNLFIQQE-------TDGEDIEEELG-----------TDAAVEGEAGFTENDEV-TRDCSQSVRTTGICRGV

Query:  VLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQFVGEW-DVKLEGDRSVMKSQISLKSMMK
         + L G  IV DF PL LG  D+ILG++WL  LG    DW    + FV E  +V ++GD S+ K++ISLK+M+K
Subjt:  VLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQFVGEW-DVKLEGDRSVMKSQISLKSMMK

A0A5C7IJS7 Uncharacterized protein3.5e-2431.9Show/hide
Query:  EMRMFSLVGLKSKMCMAQLIEDMEEAR-QLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFR
        E+R+   + L   M +AQ IE    A   LK  GG     +KGSG      +P ++   +P PT   T+   P+ N                  P G  R
Subjt:  EMRMFSLVGLKSKMCMAQLIEDMEEAR-QLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFR

Query:  RLTDREMQSRREKGLCYRCDEKYAVGHRC-KKELNLFIQQETDGEDIEEEL-------------------GTDAAVEGEAGF------------------
        RLTD E+Q +R  GLCYRCDEK++ GH+C KKELN+ I  + + E+  EE                      + ++    G                   
Subjt:  RLTDREMQSRREKGLCYRCDEKYAVGHRC-KKELNLFIQQETDGEDIEEEL-------------------GTDAAVEGEAGF------------------

Query:  --------------------------TENDEVTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGE
                                  TE   VT     SVR  GIC+GV L+L G+ IV +F PL LGS+D+ILG+QWL  LG    +W    M+F +G 
Subjt:  --------------------------TENDEVTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGE

Query:  WDVKLEGDRSVMKSQISLKSMMKQVK
          V L GD S+ K+ +SLK+MM+  K
Subjt:  WDVKLEGDRSVMKSQISLKSMMKQVK

A0A5D3BKW3 Ty3-gypsy retrotransposon protein7.0e-2532.8Show/hide
Query:  VGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSP----SPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTD
        VGLK  M  AQL E+     +++W  G PN          ++    TT+ SSP    SPT           N  S SNT  F+   G      ++RR T+
Subjt:  VGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSP----SPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTD

Query:  REMQSRREKGLCYRCDEKYAVGHRCK-KELNLFI-----------QQETDGEDIE----EELGTDAAV--------------------------------
         E+Q+R+EKGLCYRCDE ++ G RCK +EL L +           +++T G  +E     EL  ++ V                                
Subjt:  REMQSRREKGLCYRCDEKYAVGHRCK-KELNLFI-----------QQETDGEDIE----EELGTDAAV--------------------------------

Query:  ------EGEAGFTENDE--VTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDVKLEGDRSVM
              E +   TE  +  V     ++V+  G+C GVV+ LPGLT+V DF PL LG+ D++LG+QWL   G +  DW    M F VG+  V L+GD S+ 
Subjt:  ------EGEAGFTENDE--VTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDVKLEGDRSVM

Query:  KSQISLKSMMK
        + +ISLK ++K
Subjt:  KSQISLKSMMK

A0A5D3CRH2 Ty3/gypsy retrotransposon protein2.1e-2132.34Show/hide
Query:  GLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPT-AARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTDREMQ
        GL   M +AQL E+ E+ R      G  + G K      + +KP  +++   + T   RTITL         SN +      G     GT +RL++ E Q
Subjt:  GLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPT-AARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTDREMQ

Query:  SRREKGLCYRCDEKYAVGHRCK----KELNLFIQQETDGE-DIEEELGTDAAVEGEAGFTENDEVTRDCS-------------QSVRTTGICRGVVLNLP
        +R+EKGLC+RC+EKY+  H+CK    +EL +++ +  D + +I EE   +          E D+V  + S              +++  GIC  V + L 
Subjt:  SRREKGLCYRCDEKYAVGHRCK----KELNLFIQQETDGE-DIEEELGTDAAVEGEAGFTENDEVTRDCS-------------QSVRTTGICRGVVLNLP

Query:  GLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQFVGEW-DVKLEGDRSVMKSQISLKSMMK
        G  ++ D  PL LG  D++LG+QWL  LG  E DW    M F+ +   + ++GD S+ K+++SLK+MMK
Subjt:  GLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQFVGEW-DVKLEGDRSVMKSQISLKSMMK

J3SDF5 Ty3/gypsy retrotransposon protein1.2e-2130.03Show/hide
Query:  EMRMFSLVGLKSKMCMAQLIED---MEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGT
        E+R+ +   L   M +A  +E+   +  AR+     GS ++ ++G  SN +      +   S + T +  I  N +  SV   N       + R F  G 
Subjt:  EMRMFSLVGLKSKMCMAQLIED---MEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGT

Query:  FRRLTDREMQSRREKGLCYRCDEKYAVGHRC-KKELNLFIQQETDGEDIE-------------EELGTDAAVEGEAGF----------------------
         RRLT++E+Q +R KGLC++CDEK+ VGH+C +KEL++   ++ + +++E             EE+  + ++    G                       
Subjt:  FRRLTDREMQSRREKGLCYRCDEKYAVGHRC-KKELNLFIQQETDGEDIE-------------EELGTDAAVEGEAGF----------------------

Query:  --------------------TENDE--VTRDCSQSVRTTGICRGVVLNLP-GLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDV
                            TE++E  V+    Q+VR TGICR V L L  GL +V DF PL LG++D+ILGVQWL  LG V  +W T +M F +G    
Subjt:  --------------------TENDE--VTRDCSQSVRTTGICRGVVLNLP-GLTIVNDFFPLSLGSADIILGVQWLMMLGKVECDWSTSEMQF-VGEWDV

Query:  KLEGDRSVMKSQISLKSMMKQVK
         L GD ++ +S++SLK+M++ ++
Subjt:  KLEGDRSVMKSQISLKSMMKQVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATGCGGATGTTCAGTTTGGTGGGCCTCAAGTCCAAAATGTGTATGGCCCAGCTGATCGAAGATATGGAGGAAGCCCGACAACTTAAGTGGGGCGGTGGGAGTCC
AAATCTGGGAAGCAAAGGAAGCGGGTCAAACCTGACTCAAACGAAACCGGCAACCACCATCGCCTCCAGCCCGAGTCCAACGGCAGCCCGAACAATAACCCTCAACCCTA
CCTGTAATTCAGTTTCTCCTTCGAATACCGTGAGTTTCCGTGACACTAACGGTAGGAATTTTCCGACGGGGACTTTTCGCAGGTTGACGGATAGAGAGATGCAAAGCCGG
AGGGAGAAAGGACTGTGCTACCGCTGTGACGAAAAATATGCCGTCGGCCATCGCTGTAAGAAGGAGTTAAACTTGTTCATTCAACAGGAAACCGACGGTGAGGACATTGA
AGAAGAACTCGGGACCGACGCCGCTGTTGAGGGCGAGGCTGGATTCACCGAAAACGATGAAGTGACAAGGGACTGTAGTCAATCCGTTCGAACAACCGGCATTTGCAGAG
GTGTAGTCCTCAACCTCCCAGGCTTAACAATTGTTAATGATTTTTTTCCATTATCATTGGGTAGTGCTGATATAATTTTGGGAGTACAGTGGCTGATGATGTTGGGAAAG
GTTGAATGTGATTGGAGTACATCGGAGATGCAATTTGTAGGTGAATGGGATGTCAAACTTGAGGGGGATAGGAGTGTAATGAAGTCACAAATTTCTCTGAAATCTATGAT
GAAACAGGTGAAAGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATGCGGATGTTCAGTTTGGTGGGCCTCAAGTCCAAAATGTGTATGGCCCAGCTGATCGAAGATATGGAGGAAGCCCGACAACTTAAGTGGGGCGGTGGGAGTCC
AAATCTGGGAAGCAAAGGAAGCGGGTCAAACCTGACTCAAACGAAACCGGCAACCACCATCGCCTCCAGCCCGAGTCCAACGGCAGCCCGAACAATAACCCTCAACCCTA
CCTGTAATTCAGTTTCTCCTTCGAATACCGTGAGTTTCCGTGACACTAACGGTAGGAATTTTCCGACGGGGACTTTTCGCAGGTTGACGGATAGAGAGATGCAAAGCCGG
AGGGAGAAAGGACTGTGCTACCGCTGTGACGAAAAATATGCCGTCGGCCATCGCTGTAAGAAGGAGTTAAACTTGTTCATTCAACAGGAAACCGACGGTGAGGACATTGA
AGAAGAACTCGGGACCGACGCCGCTGTTGAGGGCGAGGCTGGATTCACCGAAAACGATGAAGTGACAAGGGACTGTAGTCAATCCGTTCGAACAACCGGCATTTGCAGAG
GTGTAGTCCTCAACCTCCCAGGCTTAACAATTGTTAATGATTTTTTTCCATTATCATTGGGTAGTGCTGATATAATTTTGGGAGTACAGTGGCTGATGATGTTGGGAAAG
GTTGAATGTGATTGGAGTACATCGGAGATGCAATTTGTAGGTGAATGGGATGTCAAACTTGAGGGGGATAGGAGTGTAATGAAGTCACAAATTTCTCTGAAATCTATGAT
GAAACAGGTGAAAGGATAA
Protein sequenceShow/hide protein sequence
MEMRMFSLVGLKSKMCMAQLIEDMEEARQLKWGGGSPNLGSKGSGSNLTQTKPATTIASSPSPTAARTITLNPTCNSVSPSNTVSFRDTNGRNFPTGTFRRLTDREMQSR
REKGLCYRCDEKYAVGHRCKKELNLFIQQETDGEDIEEELGTDAAVEGEAGFTENDEVTRDCSQSVRTTGICRGVVLNLPGLTIVNDFFPLSLGSADIILGVQWLMMLGK
VECDWSTSEMQFVGEWDVKLEGDRSVMKSQISLKSMMKQVKG