; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G11340 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G11340
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr06:21206275..21208213
RNA-Seq ExpressionClc06G11340
SyntenyClc06G11340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida]2.0e-6267.55Show/hide
Query:  VSPSGFEWNEEFKCGQAEREIFDLWVR---------------------DCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKRPSFQAEM
        +S SGF+WNEEFKC Q EREIF+LWVR                     DCHTP+VCQ +S L+Q+ IDEEPT+QSTGRTS+P ESSRGSKRKR SFQ EM
Subjt:  VSPSGFEWNEEFKCGQAEREIFDLWVR---------------------DCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKRPSFQAEM

Query:  MDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM
        +D MR  VE+ +TH+ RLASWQK+KYELEFG +KEVVN IYNIDGL+ED QVTLIDL+V +IQ TDCFLAVP+HA KRY LRLLGRNM
Subjt:  MDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]7.8e-5964.62Show/hide
Query:  VSPSGFEWNEEFKCGQAEREIFDLWVR----------------------------DCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKR
        +S SGF WNEEFKC Q EREIFDLWVR                            DCHTP+V QT+SPL+Q+ IDEEP +QSTGR SVPTESSRGSKRKR
Subjt:  VSPSGFEWNEEFKCGQAEREIFDLWVR----------------------------DCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKR

Query:  PSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM
         SFQ EM+D ++  VE+Q+TH+ RLASWQ EKYELE    KEVVN IYNID L E+DQVTLIDL+V +IQ TDCFLAVP+HARKRY LRLLGRNM
Subjt:  PSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]1.4e-3655.56Show/hide
Query:  VSPSGFEWNEEFKCGQAEREIFDLWVRDCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVE----IQNTHLSR
        +S SGF+WNEEFKC Q EREIFD WVR  H         P        +      G+     +SS        +   E  D +RL  +     ++TH+ R
Subjt:  VSPSGFEWNEEFKCGQAEREIFDLWVRDCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVE----IQNTHLSR

Query:  LASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM
        LASWQKEKYELEFG RKEVVN IYNIDGL+EDDQVTLIDLLV +IQ T+CFLAVP+HARKRY LRLLGRNM
Subjt:  LASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]2.6e-6266.15Show/hide
Query:  VSPSGFEWNEEFKCGQAEREIFDLWVR----------------------------DCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKR
        +S SGF WNEEFKC Q E+EIFDLWVR                            +CHTP+VCQ +SPL+Q+ IDEEP +QSTGR SV  ESSRGSKRKR
Subjt:  VSPSGFEWNEEFKCGQAEREIFDLWVR----------------------------DCHTPDVCQTKSPLDQNGIDEEPTKQSTGRTSVPTESSRGSKRKR

Query:  PSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM
        PSFQAEM+D MR  VE+Q+TH+ RLASWQKEKYELEFG RKEVVN IY+IDGL+EDDQVT IDLLV +IQ TDCFLAVP+HARKRY L LL RNM
Subjt:  PSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRNM

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida]2.5e-3650Show/hide
Query:  VSPSGFEWNEEFKCGQAEREIFD-----------------------------------------------------LWVRDCHTPDVCQTKSPLDQNGID
        +S SGF WNEEFKC Q E+EIF+                                                     L  +DC T +V QT+SPL+Q+ ID
Subjt:  VSPSGFEWNEEFKCGQAEREIFD-----------------------------------------------------LWVRDCHTPDVCQTKSPLDQNGID

Query:  EEPTKQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQV
        EEP +QSTGR SVP E+S+GSKRKRPSFQAEM+D MR  VE+Q+TH+ RLASWQKEKYELEF H KEVVN IY+IDGL+EDD++
Subjt:  EEPTKQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQV

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859531.6e-0925.7Show/hide
Query:  IRDVSPSGFEWNEEFKCGQAEREIFDLWVRD---------------------------------------CHTPDVCQTKSPLDQNGIDEEPTKQSTGRT
        +R  S SGF WNEEF+C  AER++FD W++                                         +  ++     PL  +  ++ PT  S G  
Subjt:  IRDVSPSGFEWNEEFKCGQAEREIFDLWVRD---------------------------------------CHTPDVCQTKSPLDQNGIDEEPTKQSTGRT

Query:  SVPTE--------------SSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNT
          P E               S  SKRKR S + E ++ +R ++E  N  L  +A W KEK  +E   R +VV  + +I  L   D+  L+ +L ++++  
Subjt:  SVPTE--------------SSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNT

Query:  DCFLAVPKHARKRY
        + FL++P   +  Y
Subjt:  DCFLAVPKHARKRY

A0A5A7U0H7 Retrotransposon protein1.6e-0925.7Show/hide
Query:  IRDVSPSGFEWNEEFKCGQAEREIFDLWVRD---------------------------------------CHTPDVCQTKSPLDQNGIDEEPTKQSTGRT
        +R  S SGF WNEEF+C  AER++FD W++                                         +  ++     PL  +  ++ PT  S G  
Subjt:  IRDVSPSGFEWNEEFKCGQAEREIFDLWVRD---------------------------------------CHTPDVCQTKSPLDQNGIDEEPTKQSTGRT

Query:  SVPTE--------------SSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNT
          P E               S  SKRKR S + E ++ +R ++E  N  L  +A W KEK  +E   R +VV  + +I  L   D+  L+ +L ++++  
Subjt:  SVPTE--------------SSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNT

Query:  DCFLAVPKHARKRY
        + FL++P   +  Y
Subjt:  DCFLAVPKHARKRY

A0A5A7U7F7 Retrotransposon protein2.3e-1131.58Show/hide
Query:  SGFEWNEEFKCGQAEREIFDLWV---RDCHTPDVCQTKSPLD---QNGIDEEPTKQSTGRTSVPTESSRGSKR---KRPSFQAEMMDTMRLIVEIQNTHL
        SGF WNEE KC +AE+ +FD WV   RD    D+       D    +G+ E P+ +    T        GS R   KR S+  ++MDT R  +   +  +
Subjt:  SGFEWNEEFKCGQAEREIFDLWV---RDCHTPDVCQTKSPLD---QNGIDEEPTKQSTGRTSVPTESSRGSKR---KRPSFQAEMMDTMRLIVEIQNTHL

Query:  SRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGR
         ++A+WQ+EK E+E    K +   +  I G++ DD + + + L+ +      FL  P   + R  +R+LGR
Subjt:  SRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGR

A0A5A7UXH8 Retrotransposon protein1.5e-1029.63Show/hide
Query:  IRDVSPSGFEWNEEFKCGQAEREIFDLWVRDC-----------------HTPDVCQTKSPLDQNGIDEEP---TKQSTGRTSVPTESSRGSKRKRPSFQA
        +R  + SGF WN+E KC  AE+E+FD W                      TPD      P+   G++  P    +  T R S     S GSKRKRP    
Subjt:  IRDVSPSGFEWNEEFKCGQAEREIFDLWVRDC-----------------HTPDVCQTKSPLDQNGIDEEP---TKQSTGRTSVPTESSRGSKRKRPSFQA

Query:  EMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRN
        +  D +R  +E +N  L R+A W   +++     R+E+V  +  I  L   D+  L+ +L++N+ +   FL V  + +  YY  +L  N
Subjt:  EMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRYYLRLLGRN

A0A6J1DW73 uncharacterized protein LOC1110250185.4e-1326.98Show/hide
Query:  GFEWNEEFKCGQAEREIFDLWVRDCHTPDVCQTKSP------------------LD-----------------------------------QNGIDEEPT
        GF WN++ KC +AE+E+FD WV+        + K P                  LD                                   ++ I+E+  
Subjt:  GFEWNEEFKCGQAEREIFDLWVRDCHTPDVCQTKSP------------------LD-----------------------------------QNGIDEEPT

Query:  KQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVP
           T + ++ T SS GSKRKR  + +EM+D +R  + +Q  HL ++A+W  +K E +   RK V + +  I  L  +D V L+ +L+ N++ +  FL VP
Subjt:  KQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVP

Query:  KHARKRYYLRLLGRN
           +K + ++LLG++
Subjt:  KHARKRYYLRLLGRN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCTGAAAATTCTCAGAATCCTTCTCTCAGAAGAGCTCGAAATGGAGGCTGGTTATGCAGTTTCGTCTCTTAGTACTTGGGTTATTTGTGCTTCTTTTCATCTTAA
CTCTGCGTTAAGAACAGAAAAGCTCATAATTTCAGGTGAGGATTCTGAAGAAATAGTACAACGCAGTATCAGAGATGTAAGTCCATCAGGGTTCGAGTGGAACGAGGAGT
TTAAATGTGGCCAGGCCGAGAGGGAGATCTTTGATCTTTGGGTTCGGGACTGTCACACACCTGACGTTTGCCAGACAAAATCACCATTAGATCAAAATGGAATAGATGAA
GAGCCAACAAAGCAATCTACTGGTAGAACGAGTGTACCTACCGAGTCATCTCGAGGCAGCAAAAGGAAGAGGCCATCATTTCAAGCTGAAATGATGGACACCATGAGATT
GATTGTTGAAATACAGAACACACACTTGAGCAGACTTGCATCGTGGCAAAAAGAAAAGTATGAGTTGGAGTTTGGGCATCGGAAGGAAGTAGTGAATGTCATATACAACA
TTGATGGCTTGAATGAGGATGATCAGGTCACCCTTATTGACCTCCTTGTAAAAAATATTCAGAATACAGATTGCTTCCTTGCAGTACCAAAACACGCACGGAAGAGGTAC
TACCTCCGTCTACTAGGACGAAACATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTCTGAAAATTCTCAGAATCCTTCTCTCAGAAGAGCTCGAAATGGAGGCTGGTTATGCAGTTTCGTCTCTTAGTACTTGGGTTATTTGTGCTTCTTTTCATCTTAA
CTCTGCGTTAAGAACAGAAAAGCTCATAATTTCAGGTGAGGATTCTGAAGAAATAGTACAACGCAGTATCAGAGATGTAAGTCCATCAGGGTTCGAGTGGAACGAGGAGT
TTAAATGTGGCCAGGCCGAGAGGGAGATCTTTGATCTTTGGGTTCGGGACTGTCACACACCTGACGTTTGCCAGACAAAATCACCATTAGATCAAAATGGAATAGATGAA
GAGCCAACAAAGCAATCTACTGGTAGAACGAGTGTACCTACCGAGTCATCTCGAGGCAGCAAAAGGAAGAGGCCATCATTTCAAGCTGAAATGATGGACACCATGAGATT
GATTGTTGAAATACAGAACACACACTTGAGCAGACTTGCATCGTGGCAAAAAGAAAAGTATGAGTTGGAGTTTGGGCATCGGAAGGAAGTAGTGAATGTCATATACAACA
TTGATGGCTTGAATGAGGATGATCAGGTCACCCTTATTGACCTCCTTGTAAAAAATATTCAGAATACAGATTGCTTCCTTGCAGTACCAAAACACGCACGGAAGAGGTAC
TACCTCCGTCTACTAGGACGAAACATGTAG
Protein sequenceShow/hide protein sequence
MTLKILRILLSEELEMEAGYAVSSLSTWVICASFHLNSALRTEKLIISGEDSEEIVQRSIRDVSPSGFEWNEEFKCGQAEREIFDLWVRDCHTPDVCQTKSPLDQNGIDE
EPTKQSTGRTSVPTESSRGSKRKRPSFQAEMMDTMRLIVEIQNTHLSRLASWQKEKYELEFGHRKEVVNVIYNIDGLNEDDQVTLIDLLVKNIQNTDCFLAVPKHARKRY
YLRLLGRNM