; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC04G080100 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC04G080100
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCmU531Chr04:26290188..26290931
RNA-Seq ExpressionCmUC04G080100
SyntenyCmUC04G080100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057564.1 Transposon TX1 uncharacterized [Cucumis melo var. makuwa]7.4e-3957.55Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MRKF  F   ++L+EIPLSNG+FTWSREG   +RSLL RF ++++WD+SFA++RA+R+ R  S H PILLEAG+FEWGPSPFRFCNSWL  K C  ++  
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR
        SL    HQ   GFVI+SKLR+LK  LK W +N ++ +K+
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida]1.2e-5757.29Show/hide
Query:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE
        +S NWD+ F +SR +RQ RTIS HFP+L EAG+FEWGPSPFRFCNSWL  KECC ++E S  ++  Q  AGF ++S+LR +K+++KRW    +K++K  E
Subjt:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE

Query:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL
        E LL EI+ +DL+ +  E      ++ +SLKA LLSLY+ EER+L+QKSKLN L LGDEN+SFFHRFL AK+R+NLI+EL NE G+ T SF+EIE +IL
Subjt:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]1.2e-5757.29Show/hide
Query:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE
        +S NWD+ F +SR +RQ RTIS HFP+L EAG+FEWGPSPFRFCNSWL  KECC ++E S  ++  Q  AGF ++S+LR +K+++KRW    +K++K  E
Subjt:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE

Query:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL
        E LL EI+ +DL+ +  E      ++ +SLKA LLSLY+ EER+L+QKSKLN L LGDEN+SFFHRFL AK+R+NLI+EL NE G+ T SF+EIE +IL
Subjt:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]1.2e-5757.29Show/hide
Query:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE
        +S NWD+ F +SR +RQ RTIS HFP+L EAG+FEWGPSPFRFCNSWL  KECC ++E S  ++  Q  AGF ++S+LR +K+++KRW    +K++K  E
Subjt:  ISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFE

Query:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL
        E LL EI+ +DL+ +  E      ++ +SLKA LLSLY+ EER+L+QKSKLN L LGDEN+SFFHRFL AK+R+NLI+EL NE G+ T SF+EIE +IL
Subjt:  EELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida]1.4e-5650.83Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        M  FN F  +  L+E PLSNG+FTWSREG V ++SLL  F +SS W+D F NSR  RQ RT+S HFP+ LEAG+FEWGPS FRFCNSWL+ KE C ++E 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEE-HQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRD-LEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLG
        SL  +E HQ  A   + + LR  K ALK+W     KE K  EE LL E++ +D L  + S ++    +   SLKA LL+LY++EE++L+QK KL  L  G
Subjt:  SLDMEE-HQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRD-LEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLG

Query:  DENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL
        DEN+SFFHRFL  +KR+NL ++L N+  + T   ++IE++IL
Subjt:  DENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELIL

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein3.5e-2633.19Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MR+FN+F     L++ PLSN ++TWS   +    S L RF  +S W++ F    +    RT S HFPI+LE+ +  WGPSPFRF N++L   +    +E 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDE
                G AG+    +L+ L   +K W  +   + +  ++  + EI+  D  +         RE   +LKA L  +   E +   QK K   +  GDE
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDE

Query:  NSSFFHRFLLAKKRRNLISELTNEDG
        NSSFFH+   A++++ LIS++ N  G
Subjt:  NSSFFHRFLLAKKRRNLISELTNEDG

A0A5A7UR38 Transposon TX1 uncharacterized3.6e-3957.55Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MRKF  F   ++L+EIPLSNG+FTWSREG   +RSLL RF ++++WD+SFA++RA+R+ R  S H PILLEAG+FEWGPSPFRFCNSWL  K C  ++  
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR
        SL    HQ   GFVI+SKLR+LK  LK W +N ++ +K+
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKR

A0A5D3BHE3 Uncharacterized protein3.2e-2753.91Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MR+FNN  D   + E+PL NGR TWSREGS  +RSLL  FFI   WD+   NSR  R+  TIS HFP+LLEAGS +WGPSPFRF NSWL   EC  +++ 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVI
          ++      AGFV+
Subjt:  SLDMEEHQGRAGFVI

A0A5D3BXH7 Exodeoxyribonuclease-like5.8e-2938.16Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        M  FN   +   LMEIP+ NGR+TW REG   ++SL  RFFI+  WDD F NSR + + R  S H P+ L+AG+  WGPSPFRFCNSWL   +C  V+  
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDE
        +L    +Q   G                  LN+        EEL+                        +L+A++LS+YRIE  N +QKSKLN L++ DE
Subjt:  SLDMEEHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDE

Query:  NSSFFHR
        N++ + R
Subjt:  NSSFFHR

A0A5D3CIL8 Uncharacterized protein2.6e-2959.09Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI
        MRKFNNF D   L+E+PLSNG FTWS  GS+ ARSL+ RFFIS +WDD+F NSR  RQ    S HFP++LEAGS  WGPSPFRF NSW   K C  V++ 
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEI

Query:  SLDMEEHQGR
        S       GR
Subjt:  SLDMEEHQGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.5e-1026.46Show/hide
Query:  MRKFNNFSDMTQLMEIPLSNGRFTWS-REGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVME
        + +F N    + L++IP     +TWS  +   P    L R   + +W  SF ++ A  ++  +S H P ++     E  P   + C  +           
Subjt:  MRKFNNFSDMTQLMEIPLSNGRFTWS-REGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVME

Query:  ISLDME-EHQGRAGFVIFSKLRNLKEALKRWKL-------NLDKEKKRFEEELLMEIENRDL---EDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQ
        +SL +  E Q   G  +FS   +LK A K  KL       N+  + K   +  L  I+++ L    D         R+      A L S YR       Q
Subjt:  ISLDME-EHQGRAGFVIFSKLRNLKEALKRWKL-------NLDKEKKRFEEELLMEIENRDL---EDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQ

Query:  KSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELILKCTTH
        KS++  L  GD N+ FFH+ +LA + +NLI  L  +D V   +  +++E+I+   TH
Subjt:  KSKLNCLSLGDENSSFFHRFLLAKKRRNLISELTNEDGVVTSSFKEIEELILKCTTH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAATTCAACAATTTCAGTGACATGACACAACTAATGGAGATCCCACTTAGTAACGGAAGATTCACTTGGTCTAGAGAAGGAAGCGTTCCAGCTCGGTCC
CTCTTGGGTAGATTCTTTATTTCTAGCAACTGGGATGATTCATTCGCTAATTCAAGAGCTACAAGACAGATGCGCACTATTTCTGGTCACTTTCCTATCTTGCTT
GAAGCCGGATCTTTTGAATGGGGTCCTTCCCCATTCAGATTTTGCAACAGCTGGTTAGATTTGAAGGAGTGTTGCCATGTCATGGAAATTTCACTCGATATGGAA
GAACATCAAGGGAGGGCTGGTTTTGTAATTTTTTCCAAACTCAGAAACTTAAAGGAGGCTTTAAAGAGATGGAAGCTGAACCTTGACAAAGAAAAGAAGAGGTTC
GAAGAAGAGCTCCTAATGGAAATTGAGAACCGAGACCTGGAGGACGAAAATTCAGAAGAAGTTGGCACGGGAAGGGAGATCTGTCTATCACTTAAAGCAAAATTA
TTATCATTATACCGTATTGAAGAAAGAAACCTAATGCAGAAAAGTAAGTTGAACTGTTTGAGTTTGGGTGATGAAAATTCTAGTTTTTTCCATCGGTTTTTGTTA
GCTAAGAAGAGGAGAAATTTGATTTCAGAATTGACCAATGAGGATGGAGTAGTCACATCATCCTTCAAGGAGATAGAAGAGCTAATCTTGAAATGTACGACACAT
TATACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAATTCAACAATTTCAGTGACATGACACAACTAATGGAGATCCCACTTAGTAACGGAAGATTCACTTGGTCTAGAGAAGGAAGCGTTCCAGCTCGGTCC
CTCTTGGGTAGATTCTTTATTTCTAGCAACTGGGATGATTCATTCGCTAATTCAAGAGCTACAAGACAGATGCGCACTATTTCTGGTCACTTTCCTATCTTGCTT
GAAGCCGGATCTTTTGAATGGGGTCCTTCCCCATTCAGATTTTGCAACAGCTGGTTAGATTTGAAGGAGTGTTGCCATGTCATGGAAATTTCACTCGATATGGAA
GAACATCAAGGGAGGGCTGGTTTTGTAATTTTTTCCAAACTCAGAAACTTAAAGGAGGCTTTAAAGAGATGGAAGCTGAACCTTGACAAAGAAAAGAAGAGGTTC
GAAGAAGAGCTCCTAATGGAAATTGAGAACCGAGACCTGGAGGACGAAAATTCAGAAGAAGTTGGCACGGGAAGGGAGATCTGTCTATCACTTAAAGCAAAATTA
TTATCATTATACCGTATTGAAGAAAGAAACCTAATGCAGAAAAGTAAGTTGAACTGTTTGAGTTTGGGTGATGAAAATTCTAGTTTTTTCCATCGGTTTTTGTTA
GCTAAGAAGAGGAGAAATTTGATTTCAGAATTGACCAATGAGGATGGAGTAGTCACATCATCCTTCAAGGAGATAGAAGAGCTAATCTTGAAATGTACGACACAT
TATACTTAA
Protein sequenceShow/hide protein sequence
MRKFNNFSDMTQLMEIPLSNGRFTWSREGSVPARSLLGRFFISSNWDDSFANSRATRQMRTISGHFPILLEAGSFEWGPSPFRFCNSWLDLKECCHVMEISLDME
EHQGRAGFVIFSKLRNLKEALKRWKLNLDKEKKRFEEELLMEIENRDLEDENSEEVGTGREICLSLKAKLLSLYRIEERNLMQKSKLNCLSLGDENSSFFHRFLL
AKKRRNLISELTNEDGVVTSSFKEIEELILKCTTHYT