CuGenDBv2

Cla97C10G193700 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G193700
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrotransposon protein
Genome location: Cla97Chr10:22445856..22446681
RNA-Seq Expression: Cla97C10G193700
Synteny: Cla97C10G193700
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] | e value: 2.6e-70 | % identity: 59.76
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK
        M GN +RSKHVWSKVEDA+LVEALLYLVETGWR                        CALN+NTIECKV+ L K            GF WN+EFKCVQV+
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK

Query:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI
         +IFD+WVRSHP+AKGMW KPFPHYDDLS +FGKDRA                            DCHT EV QTE PLNQD IDEE  +QS GRASVP 
Subjt:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI

Query:  ESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELE
        ESSRGSKR R SFQ EMIDI+ STVEMQ+THM RLASWQ EKYELE
Subjt:  ESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELE

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida] | e value: 1.1e-65 | % identity: 60.89
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK
        M GNG++SKHVWSKVEDAKLVEALLYLVETGWR                        CALN NTIECKV+ L K            G GWN+EFKCV V+
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK

Query:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI
         +IFD+WV SHP+AK MWNKPFPHYDDLST+FGKDRAVGQSS++P+VM                 DCHT EV QTE PLNQD IDEE  +QS GRASVP 
Subjt:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI

Query:  ESSRGSKRNRPSFQAEMIDIMGSTV
        ESSR +KRNR SFQ EMIDIM STV
Subjt:  ESSRGSKRNRPSFQAEMIDIMGSTV

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] | e value: 1.9e-73 | % identity: 60.47
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK
        M G+G+RSKHVWSKVED KLVEALLYLVETGWR                        CALNQNTIECKV+ L K            GFGWN+EFKCVQV+
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK

Query:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI
         +IFD+WVRSH +AKGMWNK F HYDDLST+FGKDRA                            +CHT EVCQ E PLNQD IDEE  +QS GRASV  
Subjt:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI

Query:  ESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEI
        ESSRGSKR RPSFQAEMIDIM STVEMQ+THM RLASWQKEKYELEFG + E+
Subjt:  ESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEI

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida] | e value: 2.2e-69 | % identity: 77.78
Query:  LSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQD
        LS+ GFGWN+EFKCVQV+ +IF+   RSHP+AKGMWNK FPHYDDLST+FGKDRAVGQSS+DP+VM  NAFREFEDEI+LGSQDC T EV QTE PLNQD
Subjt:  LSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQD

Query:  GIDEELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEI
         IDEE  +QS GRASVP+E+S+GSKR RPSFQAEMIDIM STVEMQ+THM RLASWQKEKYELEF H  E+
Subjt:  GIDEELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEI

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida] | e value: 1.0e-71 | % identity: 65.73
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK
        M  NG+RSKH+WSKVEDAKLVEALLYLVETGWR                        C LNQNTIECKV+ L K            GF WN+EFKCVQV+
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWR------------------------CALNQNTIECKVKGLSKW-----------GFGWNKEFKCVQVK

Query:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI
         +IFD+WV SHP+AK MWNKPFPHYDD ST+FGKDR VG+SS+DP+VM TNAFREFEDEI+LGSQDC T EV QTE PLNQD IDEE  +QS GRASVP 
Subjt:  SKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPI

Query:  ESSRGSKRNRPSF
        +SSRGSKR RPSF
Subjt:  ESSRGSKRNRPSF

TrEMBL top hits | e value | % identity | Alignment
A0A1S3B4L3 uncharacterized protein LOC103485953 | e value: 2.6e-28 | % identity: 33.08
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVET-GWRC-------------------------ALNQNTIECKVKGLSKW-------------GFGWNKEFKC
        M    R  KH W+K E+ K VE L+ LV + GWR                              +TI+C VK L K              GFGWN+EF+C
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVET-GWRC-------------------------ALNQNTIECKVKGLSKW-------------GFGWNKEFKC

Query:  VQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRA
        +  +  +FD W++SHP+AKG+ +K FP+YDDLS +FGKDRA G  S+    + +N    F D I LG  D H  ++  T          +E+     G+A
Subjt:  VQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRA

Query:  SVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISKR
        S     S  SKR R S + E ++++ S +E  N  +  +A W KEK  +E   + ++ K+
Subjt:  SVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISKR

A0A5A7U098 Retrotransposon protein | e value: 2.7e-25 | % identity: 31.54
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVET-GWR------------------------CALNQNTIECKVKGLSKW-------------GFGWNKEFKCV
        M  + R  KH W+K E+A LVE L+ LV   GWR                        C ++ +TI+ ++K + +              GFGWN E KC+
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVET-GWR------------------------CALNQNTIECKVKGLSKW-------------GFGWNKEFKCV

Query:  QVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRAS
         V+ ++FD WV+SHP AKG+ NK F HYD+LS +FGKDRA G  ++    + +N    ++      + D +   +  + L ++ D    +L++    R S
Subjt:  QVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRAS

Query:  VPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASW
             S GSKR RP    +  DI+ + +E +N  + R+A W
Subjt:  VPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASW

A0A5A7U0H7 Retrotransposon protein | e value: 2.6e-28 | % identity: 33.08
Query:  MVGNGRRSKHVWSKVEDAKLVEALLYLVET-GWRC-------------------------ALNQNTIECKVKGLSKW-------------GFGWNKEFKC
        M    R  KH W+K E+ K VE L+ LV + GWR                              +TI+C VK L K              GFGWN+EF+C
Subjt:  MVGNGRRSKHVWSKVEDAKLVEALLYLVET-GWRC-------------------------ALNQNTIECKVKGLSKW-------------GFGWNKEFKC

Query:  VQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRA
        +  +  +FD W++SHP+AKG+ +K FP+YDDLS +FGKDRA G  S+    + +N    F D I LG  D H  ++  T          +E+     G+A
Subjt:  VQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRA

Query:  SVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISKR
        S     S  SKR R S + E ++++ S +E  N  +  +A W KEK  +E   + ++ K+
Subjt:  SVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISKR

A0A5D3C7T4 Uncharacterized protein | e value: 1.3e-27 | % identity: 34.4
Query:  NGRRSKHVWSKVEDAKLVEALLYLVET-GWRC--------ALNQNTIECKVKGLSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLS
        N + +KH W+ +ED  LVE LL LVE  GWR          L Q T   ++ G +  GFGWN+  KC++V+  +FD WV+ HP+A+G+ NKPFP++ DL 
Subjt:  NGRRSKHVWSKVEDAKLVEALLYLVET-GWRC--------ALNQNTIECKVKGLSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLS

Query:  TLFGKDRAVGQSSKDPHVMLTNAFREF-EDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQ
         +FG+DRA G   K P  M +   R+  ED++ +  +D         E P  +D      +  +    +    SSR SK+ R S+  +++D   +++   
Subjt:  TLFGKDRAVGQSSKDPHVMLTNAFREF-EDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQ

Query:  NTHMDRLASWQKEKYELE
        +  + ++A+WQ+EK E+E
Subjt:  NTHMDRLASWQKEKYELE

A0A6J1DW73 uncharacterized protein LOC111025018 | e value: 3.5e-25 | % identity: 36.21
Query:  GFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDE
        GFGWN + KC++ + ++FD WV+SHP+AKG+ NKP PHYDDL+  FGKDRA G +   P  M ++A     ++    +QD +  +         +D I+E
Subjt:  GFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDPHVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDE

Query:  ELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISKRHIQH
        +L +    + ++   SS GSKR R  + +EM+D++ + + MQ  H++++A+W  +K E       +I++R I H
Subjt:  ELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISKRHIQH

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT2G24960.1 unknown protein | e value: 4.4e-04 | % identity: 27.42
Query:  LSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKD
        L + GF W++    +     ++D +++ HP A+    K  P Y+DL T+F      G   +D
Subjt:  LSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKD

AT2G24960.2 unknown protein | e value: 4.4e-04 | % identity: 27.42
Query:  LSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKD
        L + GF W++    +     ++D +++ HP A+    K  P Y+DL T+F      G   +D
Subjt:  LSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKD


Sequences
CDS sequence
ATGGTAGGTAATGGTAGGAGGTCTAAACACGTATGGTCGAAAGTGGAGGATGCTAAGTTGGTGGAAGCCCTATTGTATTTGGTGGAGACTGGTTGGAGGTGCGCACTGAA
TCAGAACACTATTGAGTGTAAGGTGAAGGGTCTAAGTAAGTGGGGGTTCGGCTGGAACAAAGAGTTCAAATGTGTCCAGGTCAAGAGTAAGATTTTCGATGTTTGGGTTC
GGAGTCATCCTAGTGCTAAAGGGATGTGGAACAAGCCATTCCCCCATTATGATGACCTCTCCACCTTATTTGGGAAAGACAGAGCAGTAGGACAATCAAGTAAGGACCCA
CACGTGATGTTGACGAATGCATTCAGAGAGTTTGAAGATGAGATTCAACTTGGATCACAAGACTGTCACACACTTGAGGTTTGCCAGACAGAGTTACCATTAAATCAAGA
TGGAATAGATGAAGAGCTAGTAGATCAATCTATAGGTAGAGCGAGTGTACCTATCGAGTCATCTCGAGGCAGCAAGAGGAATAGGCCATCATTCCAAGCTGAAATGATCG
ACATCATGGGATCGACTGTTGAAATGCAAAACACACACATGGATAGACTTGCATCGTGGCAGAAGGAAAAGTATGAGCTAGAGTTCGGGCATCAGAACGAAATTAGTAAA
CGCCATATACAACATTGA
mRNA sequence
ATGGTAGGTAATGGTAGGAGGTCTAAACACGTATGGTCGAAAGTGGAGGATGCTAAGTTGGTGGAAGCCCTATTGTATTTGGTGGAGACTGGTTGGAGGTGCGCACTGAA
TCAGAACACTATTGAGTGTAAGGTGAAGGGTCTAAGTAAGTGGGGGTTCGGCTGGAACAAAGAGTTCAAATGTGTCCAGGTCAAGAGTAAGATTTTCGATGTTTGGGTTC
GGAGTCATCCTAGTGCTAAAGGGATGTGGAACAAGCCATTCCCCCATTATGATGACCTCTCCACCTTATTTGGGAAAGACAGAGCAGTAGGACAATCAAGTAAGGACCCA
CACGTGATGTTGACGAATGCATTCAGAGAGTTTGAAGATGAGATTCAACTTGGATCACAAGACTGTCACACACTTGAGGTTTGCCAGACAGAGTTACCATTAAATCAAGA
TGGAATAGATGAAGAGCTAGTAGATCAATCTATAGGTAGAGCGAGTGTACCTATCGAGTCATCTCGAGGCAGCAAGAGGAATAGGCCATCATTCCAAGCTGAAATGATCG
ACATCATGGGATCGACTGTTGAAATGCAAAACACACACATGGATAGACTTGCATCGTGGCAGAAGGAAAAGTATGAGCTAGAGTTCGGGCATCAGAACGAAATTAGTAAA
CGCCATATACAACATTGA
Protein sequence
MVGNGRRSKHVWSKVEDAKLVEALLYLVETGWRCALNQNTIECKVKGLSKWGFGWNKEFKCVQVKSKIFDVWVRSHPSAKGMWNKPFPHYDDLSTLFGKDRAVGQSSKDP
HVMLTNAFREFEDEIQLGSQDCHTLEVCQTELPLNQDGIDEELVDQSIGRASVPIESSRGSKRNRPSFQAEMIDIMGSTVEMQNTHMDRLASWQKEKYELEFGHQNEISK
RHIQH