; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G012797 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G012797
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCG_Chr04:27625234..27625977
RNA-Seq ExpressionClCG04G012797
SyntenyClCG04G012797
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057564.1 Transposon TX1 uncharacterized [Cucumis melo var. makuwa]7.4e-3957.55Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MRKF  F   ++L+EIPLSNG+FTWSREG   +RSLL RF ++++WD+SFA++RA+R+ R  S H PILLEAG+FEWGPSPFRFCNSWL  K C  ++  
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR
        SL    HQ   GFVI+SKLR+LK  LK W +N ++ +K+
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida]9.4e-5857.29Show/hide
Query:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE
        +S NWD+ F +SR +RQ RTIS HFP+L EAG+FEWGPSPFRFCNSWL  KECC ++E S  ++  Q  AGF ++S+LR +K+++KRW    +K++K  E
Subjt:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE

Query:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL
        E LL EI+ +DL+ +  E      ++ +SLKA LLSLY+ EER+L+QKSKLN L LGDEN+SFFHRFL+AK+R+NLI+EL NE G+ T SF+EIE +IL
Subjt:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]9.4e-5857.29Show/hide
Query:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE
        +S NWD+ F +SR +RQ RTIS HFP+L EAG+FEWGPSPFRFCNSWL  KECC ++E S  ++  Q  AGF ++S+LR +K+++KRW    +K++K  E
Subjt:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE

Query:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL
        E LL EI+ +DL+ +  E      ++ +SLKA LLSLY+ EER+L+QKSKLN L LGDEN+SFFHRFL+AK+R+NLI+EL NE G+ T SF+EIE +IL
Subjt:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]9.4e-5857.29Show/hide
Query:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE
        +S NWD+ F +SR +RQ RTIS HFP+L EAG+FEWGPSPFRFCNSWL  KECC ++E S  ++  Q  AGF ++S+LR +K+++KRW    +K++K  E
Subjt:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE

Query:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL
        E LL EI+ +DL+ +  E      ++ +SLKA LLSLY+ EER+L+QKSKLN L LGDEN+SFFHRFL+AK+R+NLI+EL NE G+ T SF+EIE +IL
Subjt:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida]4.6e-5751.24Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        M  FN F  +  L+E PLSNG+FTWSREG V ++SLL  F +SS W+D F NSR  RQ RT+S HFP+ LEAG+FEWGPS FRFCNSWL+ KE C ++E 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEE-HQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRD-LEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLG
        SL  +E HQ  A   + + LR  K ALK+W     KE K  EE LL E++ +D L  + S ++    +   SLKA LL+LY++EE++L+QK KL  L  G
Subjt:  SLDMEE-HQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRD-LEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLG

Query:  DENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL
        DEN+SFFHRFLS +KR+NL ++L N+  + T   ++IE++IL
Subjt:  DENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELIL

TrEMBL top hitse value%identityAlignment
A0A2N9HFZ0 Uncharacterized protein1.6e-2634.68Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        M  F++F     L+++PL  GRFTWS     PA S L RF +S +WD  F+ +  +   RT+S H PILL+ G    G  PFRF N WL  +     ++ 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDEN---SEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSL
          +    +G  G ++  KL+ LK  L++W  ++  +    + EL+ EI+  D  +E+   S E  T RE     + +L  +  ++E +  QKS++  L  
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDEN---SEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSL

Query:  GDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELILKCTTH
        GD N+ FFHR  ++ +R N I  L N +G VTS  KE+EE I++   H
Subjt:  GDENSSFFHRFLSAKKRRNLISELTNEDGVVTSSFKEIEELILKCTTH

A0A5A7UR38 Transposon TX1 uncharacterized3.6e-3957.55Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MRKF  F   ++L+EIPLSNG+FTWSREG   +RSLL RF ++++WD+SFA++RA+R+ R  S H PILLEAG+FEWGPSPFRFCNSWL  K C  ++  
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR
        SL    HQ   GFVI+SKLR+LK  LK W +N ++ +K+
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR

A0A5D3BHE3 Uncharacterized protein3.2e-2753.91Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MR+FNN  D   + E+PL NGR TWSREGS  +RSLL  FFI   WD+   NSR  R+  TIS HFP+LLEAGS +WGPSPFRF NSWL   EC  +++ 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVI
          ++      AGFV+
Subjt:  SLDMEEHQGRAGFVI

A0A5D3BXH7 Exodeoxyribonuclease-like5.8e-2938.16Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        M  FN   +   LMEIP+ NGR+TW REG   ++SL  RFFI+  WDD F NSR + + R  S H P+ L+AG+  WGPSPFRFCNSWL   +C  V+  
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDE
        +L    +Q   G                  LN+        EEL+                        +L+A++LS+YRIE  N +QKSKLN L++ DE
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDE

Query:  NSSFFHR
        N++ + R
Subjt:  NSSFFHR

A0A5D3CIL8 Uncharacterized protein2.6e-2959.09Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MRKFNNF D   L+E+PLSNG FTWS  GS+ ARSL+ RFFIS +WDD+F NSR  RQ    S HFP++LEAGS  WGPSPFRF NSW   K C  V++ 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGR
        S       GR
Subjt:  SLDMEEHQGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAATTCAACAATTTCAGTGACATGACACAACTAATGGAGATCCCACTTAGTAACGGAAGATTCACTTGGTCTAGAGAAGGAAGCGTTCCAGCTCGGTCCCTCTT
GGGTAGATTCTTTATTTCTAGCAACTGGGATGATTCATTCGCTAATTCAAGAGCTACAAGACAGATGCGCACTATTTCTGGTCACTTTCCTATCTTGCTTGAAGCCGGAT
CTTTTGAATGGGGTCCTTCCCCATTCAGATTTTGCAACAGCTGGTTAGATTTGAAGGAGTGTTGCCATGTCATGGAAATTTCACTCGATATGGAAGAACATCAAGGGAGG
GCTGGTTTTGTAATTTTTTCCAAACTCAGAAACTTAAAGGAGGCTTTAAAGAGATGGAAGCTGAACCTTGACAAAGAAAAGAAGAGGTTCGAAGAAGAGCTCCTAATGGA
AATTGAGAACCGAGACCTGGAGGACGAAAATTCAGAAGAAGTTGGCACGGGAAGGGAGATCTGTCTATCACTTAAAGCAAAATTATTATCATTATACCGTATTGAAGAAA
GAAACCTAATGCAGAAAAGTAAGTTGAACTGTTTGAGTTTGGGTGATGAAAATTCTAGTTTTTTCCATCGGTTTTTGTCAGCTAAGAAGAGGAGAAATTTGATTTCAGAA
TTGACCAATGAGGATGGAGTAGTCACATCATCCTTCAAGGAGATAGAAGAGCTAATCTTGAAATGTACGACACATTATACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAATTCAACAATTTCAGTGACATGACACAACTAATGGAGATCCCACTTAGTAACGGAAGATTCACTTGGTCTAGAGAAGGAAGCGTTCCAGCTCGGTCCCTCTT
GGGTAGATTCTTTATTTCTAGCAACTGGGATGATTCATTCGCTAATTCAAGAGCTACAAGACAGATGCGCACTATTTCTGGTCACTTTCCTATCTTGCTTGAAGCCGGAT
CTTTTGAATGGGGTCCTTCCCCATTCAGATTTTGCAACAGCTGGTTAGATTTGAAGGAGTGTTGCCATGTCATGGAAATTTCACTCGATATGGAAGAACATCAAGGGAGG
GCTGGTTTTGTAATTTTTTCCAAACTCAGAAACTTAAAGGAGGCTTTAAAGAGATGGAAGCTGAACCTTGACAAAGAAAAGAAGAGGTTCGAAGAAGAGCTCCTAATGGA
AATTGAGAACCGAGACCTGGAGGACGAAAATTCAGAAGAAGTTGGCACGGGAAGGGAGATCTGTCTATCACTTAAAGCAAAATTATTATCATTATACCGTATTGAAGAAA
GAAACCTAATGCAGAAAAGTAAGTTGAACTGTTTGAGTTTGGGTGATGAAAATTCTAGTTTTTTCCATCGGTTTTTGTCAGCTAAGAAGAGGAGAAATTTGATTTCAGAA
TTGACCAATGAGGATGGAGTAGTCACATCATCCTTCAAGGAGATAGAAGAGCTAATCTTGAAATGTACGACACATTATACTTAA
Protein sequenceShow/hide protein sequence
MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGR
AGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLSAKKRRNLISE
LTNEDGVVTSSFKEIEELILKCTTHYT