; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G018217 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G018217
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionRetrotransposon protein
Genome locationGy14Chr5:24357619..24358400
RNA-Seq ExpressionCsGy5G018217
SyntenyCsGy5G018217
Gene Ontology termsNA
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]7.89e-7761.17Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATT-GGDHINYIEASNEWTQWRDDLAQSMF
        MKHSS RNVIER  G+LKGRW ILR KSYYP+QVQCRTI+AC LLHNLIN EMT  + + E+EDEGDS YATT   + I YIE +NEW+QWRDDLA SMF
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATT-GGDHINYIEASNEWTQWRDDLAQSMF

Query:  NEWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSG
         +WQ R                        SC ++LVS+ GW+SDN TFRP YLAQL+RM+AE++ GC++ +TTV++ RIK LKR+FQAIAEM GP CSG
Subjt:  NEWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSG

Query:  FGWNDD
        FGWND+
Subjt:  FGWNDD

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]5.90e-8162.93Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN
        MKH S RNVIER  G+LKGRW ILR KSYYP++VQCRTI+ACCLLHNLIN EMTN D +++N DE DS +ATT  D I+YIE SNEW+QWRD+LA+ +  
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN

Query:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF
                M SSSR PKH WT+ EE+ LV CLV+LV+  GWRSDN TFRP YL QL RM+A ++PG  + ++T+ +SRIKL+KR F A+AEM GP CSGF
Subjt:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF

Query:  GWNDD
        GWND+
Subjt:  GWNDD

KAA0035413.1 retrotransposon protein [Cucumis melo var. makuwa]4.10e-7660Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN
        MKHSS RNVIE+  G+LKGRW IL+ KSYYPI+VQCRTI+ACCLLHNLIN EM   D +D+  DE DS +AT+  D I+YIE SNEWT+WRDDL + MF+
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN

Query:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF
        EW+ RND MASS R  KH WT+ EE+ LV CLV+LV+  GWRSDN TFRP                           RIKLLKR F AIAEMCGP CS F
Subjt:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF

Query:  GWNDD
        GWND+
Subjt:  GWNDD

KAA0043564.1 retrotransposon protein [Cucumis melo var. makuwa]6.65e-7159.68Show/hide
Query:  GRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMASSSRNPKH
        GRW ILR KSYYP+ VQCRTIMACCLLHNLIN EMTN + +D+  DEGDS YATTGGD INYIE SNEW++ RD LA +MF++W+ R D M SSSR  KH
Subjt:  GRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMASSSRNPKH

Query:  AWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWND
         WT+ EE+ LV CLV+LVS  GWRS+N TFR  YLAQL RM+ +++    +  +  ++ R+K LK+   +   M GP CSGFGWN+
Subjt:  AWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWND

XP_031742024.1 uncharacterized protein LOC105435527 [Cucumis sativus]9.48e-153100Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN
        MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN

Query:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF
        EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF
Subjt:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF

Query:  GWNDDL
        GWNDDL
Subjt:  GWNDDL

TrEMBL top hitse value%identityAlignment
A0A5A7SXX8 Retrotransposon protein1.98e-7660Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN
        MKHSS RNVIE+  G+LKGRW IL+ KSYYPI+VQCRTI+ACCLLHNLIN EM   D +D+  DE DS +AT+  D I+YIE SNEWT+WRDDL + MF+
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN

Query:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF
        EW+ RND MASS R  KH WT+ EE+ LV CLV+LV+  GWRSDN TFRP                           RIKLLKR F AIAEMCGP CS F
Subjt:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF

Query:  GWNDD
        GWND+
Subjt:  GWNDD

A0A5A7TJS2 Retrotransposon protein3.22e-7159.68Show/hide
Query:  GRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMASSSRNPKH
        GRW ILR KSYYP+ VQCRTIMACCLLHNLIN EMTN + +D+  DEGDS YATTGGD INYIE SNEW++ RD LA +MF++W+ R D M SSSR  KH
Subjt:  GRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMASSSRNPKH

Query:  AWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWND
         WT+ EE+ LV CLV+LVS  GWRS+N TFR  YLAQL RM+ +++    +  +  ++ R+K LK+   +   M GP CSGFGWN+
Subjt:  AWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWND

A0A5D3D8J6 Retrotransposon protein2.33e-6859.14Show/hide
Query:  GRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMASSSRNPKH
        GRW ILR KSYYP+ VQCRTIMACCLLHNLIN EMTN + +D+  DEGDS YATTGGD INYIE SNEW++ RD LA +MF++W+ R D M SSSR  KH
Subjt:  GRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMASSSRNPKH

Query:  AWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWND
         WT+ EE+ LV CLV+LVS  GWRS+N TFR  YLAQL RM+ +++    +  +  ++ R+K LK+   +   + GP CSGFGWN+
Subjt:  AWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWND

E5GBB2 Retrotransposon protein3.82e-7761.17Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATT-GGDHINYIEASNEWTQWRDDLAQSMF
        MKHSS RNVIER  G+LKGRW ILR KSYYP+QVQCRTI+AC LLHNLIN EMT  + + E+EDEGDS YATT   + I YIE +NEW+QWRDDLA SMF
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATT-GGDHINYIEASNEWTQWRDDLAQSMF

Query:  NEWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSG
         +WQ R                        SC ++LVS+ GW+SDN TFRP YLAQL+RM+AE++ GC++ +TTV++ RIK LKR+FQAIAEM GP CSG
Subjt:  NEWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSG

Query:  FGWNDD
        FGWND+
Subjt:  FGWNDD

E5GCB5 Retrotransposon protein2.86e-8162.93Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN
        MKH S RNVIER  G+LKGRW ILR KSYYP++VQCRTI+ACCLLHNLIN EMTN D +++N DE DS +ATT  D I+YIE SNEW+QWRD+LA+ +  
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFN

Query:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF
                M SSSR PKH WT+ EE+ LV CLV+LV+  GWRSDN TFRP YL QL RM+A ++PG  + ++T+ +SRIKL+KR F A+AEM GP CSGF
Subjt:  EWQSRNDYMASSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGF

Query:  GWNDD
        GWND+
Subjt:  GWNDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)5.1e-0728.7Show/hide
Query:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLI--NCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNE------------
        ++H S RNVIER+ G+ K R+ I +    +  + Q   ++ C  LHN +   C     D  DE  +EGD       G+ +N  E  NE            
Subjt:  MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLI--NCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNE------------

Query:  WTQWRDDLAQSMFNE
           WR  +A+ M+ +
Subjt:  WTQWRDDLAQSMFNE

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)5.3e-0428.07Show/hide
Query:  KHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCE--------MTNVDTLDE-NEDE----GDSNYATTGGDHINYIEASNEWT
        +H      I R  G LK R+ IL     YP+Q Q + ++A C LHN +  E        M   +TL E  ED      +      G +H    E   +  
Subjt:  KHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCE--------MTNVDTLDE-NEDE----GDSNYATTGGDHINYIEASNEWT

Query:  QWRDDLAQSMFNEW
        + RD++A  ++N +
Subjt:  QWRDDLAQSMFNEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACACTCGTCTACACGTAACGTTATCGAACGAGTGATTGGTCTTCTGAAGGGTCGTTGGGTAATACTTCGCGAAAAGTCGTACTACCCCATTCAGGTTCAGTGTCG
CACTATTATGGCTTGTTGTCTACTCCACAATCTAATAAATTGTGAGATGACAAATGTCGACACGCTAGACGAGAACGAGGACGAGGGTGACTCGAACTATGCAACGACTG
GAGGTGACCACATCAACTACATTGAGGCGTCAAACGAATGGACTCAATGGAGGGATGACCTCGCACAGTCGATGTTCAACGAATGGCAGTCGCGAAACGACTACATGGCC
AGTTCATCGAGAAACCCAAAGCACGCATGGACGAGAGCAGAGGAGTCATGCCTCGTCAGTTGTCTTGTCGATCTTGTCTCCGTAGAAGGGTGGAGATCAGACAACGACAC
CTTTCGACCTAGCTACCTCGCGCAGTTGCTGAGAATGCTAGCTGAGAGGATGCCAGGGTGCAAATTGACATCCACTACCGTTGTAGAAAGCAGAATAAAGTTATTGAAAC
GATCATTCCAGGCAATTGCAGAGATGTGTGGTCCTGTATGCAGTGGGTTCGGGTGGAATGACGACCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACACTCGTCTACACGTAACGTTATCGAACGAGTGATTGGTCTTCTGAAGGGTCGTTGGGTAATACTTCGCGAAAAGTCGTACTACCCCATTCAGGTTCAGTGTCG
CACTATTATGGCTTGTTGTCTACTCCACAATCTAATAAATTGTGAGATGACAAATGTCGACACGCTAGACGAGAACGAGGACGAGGGTGACTCGAACTATGCAACGACTG
GAGGTGACCACATCAACTACATTGAGGCGTCAAACGAATGGACTCAATGGAGGGATGACCTCGCACAGTCGATGTTCAACGAATGGCAGTCGCGAAACGACTACATGGCC
AGTTCATCGAGAAACCCAAAGCACGCATGGACGAGAGCAGAGGAGTCATGCCTCGTCAGTTGTCTTGTCGATCTTGTCTCCGTAGAAGGGTGGAGATCAGACAACGACAC
CTTTCGACCTAGCTACCTCGCGCAGTTGCTGAGAATGCTAGCTGAGAGGATGCCAGGGTGCAAATTGACATCCACTACCGTTGTAGAAAGCAGAATAAAGTTATTGAAAC
GATCATTCCAGGCAATTGCAGAGATGTGTGGTCCTGTATGCAGTGGGTTCGGGTGGAATGACGACCTTTAG
Protein sequenceShow/hide protein sequence
MKHSSTRNVIERVIGLLKGRWVILREKSYYPIQVQCRTIMACCLLHNLINCEMTNVDTLDENEDEGDSNYATTGGDHINYIEASNEWTQWRDDLAQSMFNEWQSRNDYMA
SSSRNPKHAWTRAEESCLVSCLVDLVSVEGWRSDNDTFRPSYLAQLLRMLAERMPGCKLTSTTVVESRIKLLKRSFQAIAEMCGPVCSGFGWNDDL