; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy4G078540 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy4G078540
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionRetrotransposon protein
Genome locationchrH04:17521185..17522421
RNA-Seq ExpressionChy4G078540
SyntenyChy4G078540
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038929.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]6.05e-2840.3Show/hide
Query:  EEISSMYLLVGKDQLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGF---SGTI*RPKLPHAGMA-----WRG--------KCTFNVQRVLQYEAFV
        +EI   YL VGKDQLRTH+      +D ++ R  RL +P     L   G+    G +     P+ G       WRG        K  FN++         
Subjt:  EEISSMYLLVGKDQLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGF---SGTI*RPKLPHAGMA-----WRG--------KCTFNVQRVLQYEAFV

Query:  CV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHN
             +R+   L+        K   P            LLHNLINREMTNFDI  +IDE +STH TT  DDIHYI+TSNEWSQWRD LA EMF+ WEL N
Subjt:  CV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHN

Query:  K
        +
Subjt:  K

KAA0048364.1 retrotransposon protein [Cucumis melo var. makuwa]2.71e-3031.61Show/hide
Query:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD-N
        MLELL N+ KRI HI ++TRHRIR L  F  I       D V   +  +   C   L  LL         R +   T T+    +   +  +H++  D  
Subjt:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD-N

Query:  FLSFQHGVVGNYSTACRALEKTTTSD*QLYRSKIE----VV*GVAKLSW-CIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVGKDQL
            Q   + +  T  R       +  +L+   ++    V        W CIR HV++S R +         + EVATNVLG+ +     +Y+L G +  
Subjt:  FLSFQHGVVGNYSTACRALEKTTTSD*QLYRSKIE----VV*GVAKLSW-CIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVGKDQL

Query:  RTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGF---SGTI*RPKLPHAGMA-----WRG--------KCTFNVQRVLQYEAFVCV*CNQRSIWCLEES
                  +D ++ R  RL +P     L   G+    G +     P+ G       WRG        K  FN++              +R+   L+  
Subjt:  RTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGF---SGTI*RPKLPHAGMA-----WRG--------KCTFNVQRVLQYEAFVCV*CNQRSIWCLEES

Query:  VGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
              K   P            LLHNLINR+MTNFDI  +IDE +STH TT  DDIHYI+TSNEWSQWRD LA EMF+ WEL N+
Subjt:  VGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

KAA0067470.1 retrotransposon protein [Cucumis melo var. makuwa]8.61e-3232Show/hide
Query:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD-N
        MLELL NN KRI HILH+TRHRIR L  F    R    +D V   +  +   C T L  LL         R +   T T+    +   +  +H++  D  
Subjt:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD-N

Query:  FLSFQHGVVGNYSTACRALEKT------TTSD*QLYRSKIEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVGKD-
            Q   + +  T  RAL+ T        SD   YR++                                  + EV TNVLG+ +     +Y+L G + 
Subjt:  FLSFQHGVVGNYSTACRALEKT------TTSD*QLYRSKIEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVGKD-

Query:  -QLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*
          + +HI      +D ++        P+  +V K   +   +  P        +RG+C +++Q   ++          +  + ++ S    A+  +    
Subjt:  -QLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*

Query:  SPMSHN--ACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
            H   AC LL HNLINREMTNFDI  +IDE +STH TT  DDIHYI+TSNEWSQWRD+LA EMFS WEL N+
Subjt:  SPMSHN--ACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

TYK04902.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.59e-2660Show/hide
Query:  RSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
        RSI  +E S+G+ + KV LP  SPM H+  +LLLHNLINREMTN D+  +  EG+ST+  T+GDDIHYI+TSNE SQW D+LA EM S WEL N+
Subjt:  RSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

XP_008442271.1 PREDICTED: uncharacterized protein LOC103486178 [Cucumis melo]2.47e-2745.89Show/hide
Query:  RGFSGTI*RPKLPHAGMAWRG--------KCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPG
        RGFS T+ RP LP A   WRG        K  FN++               +S W + +       KV               LLHNLI+REMTNF+I  
Subjt:  RGFSGTI*RPKLPHAGMAWRG--------KCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPG

Query:  DIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
        DIDE +STH TT+GDDIHYI+TSNEW+ WRDEL  EMFS WEL N+
Subjt:  DIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

TrEMBL top hitse value%identityAlignment
A0A1S3B5B3 uncharacterized protein LOC1034861782.0e-2245.89Show/hide
Query:  RGFSGTI*RPKLPHAGMAWRG--------KCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPG
        RGFS T+ RP LP A   WRG        K  FN++               +S W + +       KV               LLHNLI+REMTNF+I  
Subjt:  RGFSGTI*RPKLPHAGMAWRG--------KCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPG

Query:  DIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
        DIDE +STH TT+GDDIHYI+TSNEW+ WRDEL  EMFS WEL N+
Subjt:  DIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

A0A5A7TCB9 Putative nuclease HARBI16.9e-2340.3Show/hide
Query:  EEISSMYLLVGKDQLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGF---SGTI*RPKLPHAG-----MAWRG--------KCTFNVQRVLQYEAFV
        +EI   YL VGKDQLRTH+      +D ++ R  RL +P     L   G+    G +     P+ G       WRG        K  FN++         
Subjt:  EEISSMYLLVGKDQLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGF---SGTI*RPKLPHAG-----MAWRG--------KCTFNVQRVLQYEAFV

Query:  CV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHN
             +R+   L+        K   P            LLHNLINREMTNFDI  +IDE +STH TT  DDIHYI+TSNEWSQWRD LA EMF+ WEL N
Subjt:  CV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHN

Query:  K
        +
Subjt:  K

A0A5A7TZ93 Retrotransposon protein7.2e-2830.85Show/hide
Query:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD--
        MLELL N+ KRI HI ++TRHRIR L  F  I       D V   +  +   C   L  LL         R +   T T+    +   +  +H++  D  
Subjt:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD--

Query:  ------NFLSFQHGVVGNYSTACRALEKTTTSD*QLYRSKIEVV*GVAKLSW-CIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVG-
               F+     +  +++    A+ +      +L +    V        W CIR HV++S R +         + EVATNVLG+ +     +Y+L G 
Subjt:  ------NFLSFQHGVVGNYSTACRALEKTTTSD*QLYRSKIEVV*GVAKLSW-CIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVG-

Query:  ----------KDQLRTHISFEMP--FQDLMA*RYPR---LLLPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCV*CNQRSIWCL
                  +D L      ++P  +  L+   YP     L P R Q    + + G    P           K  FN++              +R+   L
Subjt:  ----------KDQLRTHISFEMP--FQDLMA*RYPR---LLLPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCV*CNQRSIWCL

Query:  EESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
        +        K   P            LLHNLINR+MTNFDI  +IDE +STH TT  DDIHYI+TSNEWSQWRD LA EMF+ WEL N+
Subjt:  EESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

A0A5A7VJX3 Retrotransposon protein1.9e-2832Show/hide
Query:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD-N
        MLELL NN KRI HILH+TRHRIR L  F    R    +D V   +  +   C T L  LL         R +   T T+    +   +  +H++  D  
Subjt:  MLELLHNNIKRILHILHDTRHRIRNLPIFE*IMRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWD-N

Query:  FLSFQHGVVGNYSTACRALEKT------TTSD*QLYRSKIEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVGKD-
            Q   + +  T  RAL+ T        SD   YR++                                  + EV TNVLG+ +     +Y+L G + 
Subjt:  FLSFQHGVVGNYSTACRALEKT------TTSD*QLYRSKIEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVGKD-

Query:  -QLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*
          + +HI  +             L  P+  +V K   +   +  P        +RG+C +++Q   ++          +  + ++ S    A+  +    
Subjt:  -QLRTHISFEMPFQDLMA*RYPRLLLPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*

Query:  SPMSHN--ACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
            H   AC  LLHNLINREMTNFDI  +IDE +STH TT  DDIHYI+TSNEWSQWRD+LA EMFS WEL N+
Subjt:  SPMSHN--ACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

A0A5D3DT98 Putative nuclease HARBI13.4e-2230.09Show/hide
Query:  MRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWDNFLSFQHGVVGNYSTACRALEKTTTSD*QLYRSK
        M  T  V KVREWTEDV PFC TYL               +   T T+    +   +  +H++  DN L   HG     +           SD   YR++
Subjt:  MRQT*CVDKVREWTEDVLPFCATYLGPLLD*RRC*GDGRNVPLYTCTQ--REKSCHSTRVHVVGWDNFLSFQHGVVGNYSTACRALEKTTTSD*QLYRSK

Query:  IEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVG-----------KDQLRTHISFEMP--FQDLMA*RYPR---LL
        +                                   +VATNVLG+ +     +Y+L              D L      ++P  +  L+   YP     L
Subjt:  IEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISS-MYLLVG-----------KDQLRTHISFEMP--FQDLMA*RYPR---LL

Query:  LPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNV-QRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFD
         P R Q    + +      P         +   T NV +R              +S + +E        + IL         AC   LHNLINR+MTNFD
Subjt:  LPSRCQVLKCRGFSGTI*RPKLPHAGMAWRGKCTFNV-QRVLQYEAFVCV*CNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFD

Query:  IPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK
        I  +IDE +STH TT+ DDIHYI+TSNEWSQWRD+LA EMF+ W L N+
Subjt:  IPGDIDEGNSTHVTTIGDDIHYIKTSNEWSQWRDELANEMFSVWELHNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGAGTTGTTACATAACAACATAAAGAGGATATTGCACATCCTCCATGACACTAGGCATAGGATTAGAAATTTGCCTATTTTCGAATGAATCATGCGTCAGACCTA
GTGTGTCGACAAAGTACGAGAATGGACCGAAGATGTTTTGCCATTTTGTGCCACCTACTTAGGACCATTGCTTGACTAACGTCGATGTTGAGGAGATGGCCGTAATGTTC
CTCTATATACTTGCACACAACGTGAAAAATCGTGTCATTCAACGAGAGTTCATGTGGTCGGATGGGACAATTTCCTATCATTTCAACATGGTGTTGTTGGCAATTATTCG
ACTGCATGTCGAGCTCTTGAAAAAACCACAACTAGTGACTAACAGTTGTACAGATCAAAGATAGAGGTGGTTTGAGGCGTTGCAAAATTGTCTTGGTGCATTAGATGACA
TGTATATCAAAGTTAACGTTCTAGCAAGTGTAGTCTAAGTATAGAACACGCAAGGAGAGAGGTGGCCACAAATGTCCTCGGCATGTACGAGGAAATTTCGTCTATGTACT
TGTTGGTTGGGAAGGATCAGCTGCGGACTCACATATCCTTCGAGATGCCATTTCAAGACCTAATGGCCTGAAGGTACCCAAGGCTATTACTACCTAGTCGATGTCAAGTA
CTCAAATGTAGAGGGTTTTCTGGCACCATATAGAGGCCAAAACTACCACATGCAGGAATGGCGTGGCGCGGAAAATGCACATTCAACGTCCAAAGAGTTCTTCAATATGA
AGCATTCGTCTGCGTGTAATGTAATCAAAGGAGCATTTGGTGTCTTGAAGAGTCGGTGGGTGATACTGCGTGGAAAGTCATACTACCCTATTGAAGTCCAATGTCGCACA
ATGCTTGCCTACTGCTACTACACAACCTCATCAATAGGGAGATGACGAACTTTGATATACCAGGCGACATAGATGAGGGCAATTCGACCCACGTGACTACTATCGGGGAT
GACATACATTATATAAAGACCTCCAACGAGTGGAGTCAGTGGAGGGACGAGCTTGCAAATGAAATGTTTAGTGTCTGGGAGTTGCATAACAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGAGTTGTTACATAACAACATAAAGAGGATATTGCACATCCTCCATGACACTAGGCATAGGATTAGAAATTTGCCTATTTTCGAATGAATCATGCGTCAGACCTA
GTGTGTCGACAAAGTACGAGAATGGACCGAAGATGTTTTGCCATTTTGTGCCACCTACTTAGGACCATTGCTTGACTAACGTCGATGTTGAGGAGATGGCCGTAATGTTC
CTCTATATACTTGCACACAACGTGAAAAATCGTGTCATTCAACGAGAGTTCATGTGGTCGGATGGGACAATTTCCTATCATTTCAACATGGTGTTGTTGGCAATTATTCG
ACTGCATGTCGAGCTCTTGAAAAAACCACAACTAGTGACTAACAGTTGTACAGATCAAAGATAGAGGTGGTTTGAGGCGTTGCAAAATTGTCTTGGTGCATTAGATGACA
TGTATATCAAAGTTAACGTTCTAGCAAGTGTAGTCTAAGTATAGAACACGCAAGGAGAGAGGTGGCCACAAATGTCCTCGGCATGTACGAGGAAATTTCGTCTATGTACT
TGTTGGTTGGGAAGGATCAGCTGCGGACTCACATATCCTTCGAGATGCCATTTCAAGACCTAATGGCCTGAAGGTACCCAAGGCTATTACTACCTAGTCGATGTCAAGTA
CTCAAATGTAGAGGGTTTTCTGGCACCATATAGAGGCCAAAACTACCACATGCAGGAATGGCGTGGCGCGGAAAATGCACATTCAACGTCCAAAGAGTTCTTCAATATGA
AGCATTCGTCTGCGTGTAATGTAATCAAAGGAGCATTTGGTGTCTTGAAGAGTCGGTGGGTGATACTGCGTGGAAAGTCATACTACCCTATTGAAGTCCAATGTCGCACA
ATGCTTGCCTACTGCTACTACACAACCTCATCAATAGGGAGATGACGAACTTTGATATACCAGGCGACATAGATGAGGGCAATTCGACCCACGTGACTACTATCGGGGAT
GACATACATTATATAAAGACCTCCAACGAGTGGAGTCAGTGGAGGGACGAGCTTGCAAATGAAATGTTTAGTGTCTGGGAGTTGCATAACAAGTAG
Protein sequenceShow/hide protein sequence
MLELLHNNIKRILHILHDTRHRIRNLPIFEIMRQT*CVDKVREWTEDVLPFCATYLGPLLDRRC*GDGRNVPLYTCTQREKSCHSTRVHVVGWDNFLSFQHGVVGNYSTA
CRALEKTTTSDQLYRSKIEVV*GVAKLSWCIR*HVYQS*RSSKCSLSIEHARREVATNVLGMYEEISSMYLLVGKDQLRTHISFEMPFQDLMARYPRLLLPSRCQVLKCR
GFSGTI*RPKLPHAGMAWRGKCTFNVQRVLQYEAFVCVCNQRSIWCLEESVGDTAWKVILPY*SPMSHNACLLLLHNLINREMTNFDIPGDIDEGNSTHVTTIGDDIHYI
KTSNEWSQWRDELANEMFSVWELHNK