; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G10130 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G10130
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDUF659 domain-containing protein
Genome locationChr7:8203751..8205047
RNA-Seq ExpressionCSPI07G10130
SyntenyCSPI07G10130
Gene Ontology termsGO:0016853 - isomerase activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RWR74797.1 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein [Cinnamomum micranthum f. kanehirae]1.0e-13370.17Show/hide
Query:  DEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRK----GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANN
        +E K +M+  APK VPLP+ S ++S   + + +      ++KKRK    GN++ +EK+FN  + DQL A IA+MFYSAGLPFHLARNPHF+ AFT+AAN+
Subjt:  DEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRK----GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANN

Query:  PLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHEN
        PL+ Y PPGYNMLRTSLLQREK NIE LL PIKG WR+KGV+IVSDGWSDSQRR LI+FM ++ G  MFLK+V+CSGE KDKYFIANLMKEVIN+VGHEN
Subjt:  PLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHEN

Query:  VVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIAET
        VVQVITDN PNCKG GQ+IE+QFP IIWT CVVHTLNLAL NICA+KNVENNQ+ YGECSWI DIVGDV+ +K FIMNHSMRL MFNEFV LKLLS+A+T
Subjt:  VVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIAET

Query:  CFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
         F S I+MLKRFKLIK GLQAMVISDKW  YRE DV  AR VKE + L  IW
Subjt:  CFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

XP_022156304.1 uncharacterized protein LOC111023231 isoform X1 [Momordica charantia]1.2e-12665.83Show/hide
Query:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG
        KD+A+MQRLEDEAK + E+ APK V  P P  + + S G +   + +    + KKRK ++S LEKSFN T+ DQL + IA+MFYS+GLPF LARNPHF+ 
Subjt:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG

Query:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV
        AFT+AANN LS Y PPGYNMLRT+LLQREKTNIE LL PIK  W  KGV+IVSDGWSDSQRR  INFM I++G  +FLK V+CSGE+KDKYFI NL+KEV
Subjt:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV

Query:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL
        INEVGH+N++Q+ITDN PNC+  GQ+IE+QF  I+WT CVV TLNLALKNIC+SKN+E N+ V+ EC WI    GDV++VK FIMNH MRL MF EFV L
Subjt:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL

Query:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        KLLSIAET F   I MLKRFKLIK GLQAM ISDKW  YREDDV KA+H+K+L+ L  IW
Subjt:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

XP_022156306.1 uncharacterized protein LOC111023231 isoform X2 [Momordica charantia]1.2e-12665.83Show/hide
Query:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG
        KD+A+MQRLEDEAK + E+ APK V  P P  + + S G +   + +    + KKRK ++S LEKSFN T+ DQL + IA+MFYS+GLPF LARNPHF+ 
Subjt:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG

Query:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV
        AFT+AANN LS Y PPGYNMLRT+LLQREKTNIE LL PIK  W  KGV+IVSDGWSDSQRR  INFM I++G  +FLK V+CSGE+KDKYFI NL+KEV
Subjt:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV

Query:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL
        INEVGH+N++Q+ITDN PNC+  GQ+IE+QF  I+WT CVV TLNLALKNIC+SKN+E N+ V+ EC WI    GDV++VK FIMNH MRL MF EFV L
Subjt:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL

Query:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        KLLSIAET F   I MLKRFKLIK GLQAM ISDKW  YREDDV KA+H+K+L+ L  IW
Subjt:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

XP_031743157.1 uncharacterized protein LOC116404561 [Cucumis sativus]1.4e-13588.57Show/hide
Query:  MFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTS-LLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKS
        MFYSAGLPFHLARNPHFI AFTYAANNPLS Y+PPGYNMLRTS LLQREK NIE LLMPIKGEWRKKGV+I+SDGWSDSQRR LINFM ISNGK MFLKS
Subjt:  MFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTS-LLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKS

Query:  VNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVV
        V+CSGEIKDKYFIAN MKEVINEVGHENVVQVITDN PNCKG GQLIEAQFP IIWT CVVHTLNLALKNICA KNVENNQIVYGECSWIF I GD+VVV
Subjt:  VNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVV

Query:  KFFIMNHSMRLPMFNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        KFFIMNHSM L MFNEFVPLKLLSIAET F SVIIMLKRFKLIK GLQAMVISDKWESYREDDVVKA HVKEL+  V  W
Subjt:  KFFIMNHSMRLPMFNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

XP_038891577.1 uncharacterized protein LOC120080967 [Benincasa hispida]3.8e-12565.12Show/hide
Query:  KDIADMQRLEDEAKTQMERKAPKNVPLPMT------SSSISIGGVDVTN--SNRFE-IENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLA
        KD+A+MQ+LEDEAKT++ R APK VPLP +      S S   G  ++    SN +  +E KKRKGN SALEKSFN   RDQ+D+ IA+MF S+GL FHLA
Subjt:  KDIADMQRLEDEAKTQMERKAPKNVPLPMT------SSSISIGGVDVTN--SNRFE-IENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLA

Query:  RNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFI
        RNPH++ AF+   NN LS Y  P YN L T LLQ+EKTNIE LL  +K  W +KGV+I SDGWSDSQRR LINFM I+ G  MFLK+V+CSGE KDKYFI
Subjt:  RNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFI

Query:  ANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPM
        ANLMKEVINEVGHENV+Q+ITDN  NCKG GQ+IE+QFP+I+WT CVVHTLNLALKNICA++N+ +NQ V+ E SWI +I  DV+ VK FIMNHSMRL M
Subjt:  ANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPM

Query:  FNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        FNEFV LKLL++AET F S II+L+RFKLIK GLQ +VIS+KW  YREDD+VKAR VK+L+ L  IW
Subjt:  FNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

TrEMBL top hitse value%identityAlignment
A0A443N8D6 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein4.8e-13470.17Show/hide
Query:  DEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRK----GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANN
        +E K +M+  APK VPLP+ S ++S   + + +      ++KKRK    GN++ +EK+FN  + DQL A IA+MFYSAGLPFHLARNPHF+ AFT+AAN+
Subjt:  DEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRK----GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANN

Query:  PLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHEN
        PL+ Y PPGYNMLRTSLLQREK NIE LL PIKG WR+KGV+IVSDGWSDSQRR LI+FM ++ G  MFLK+V+CSGE KDKYFIANLMKEVIN+VGHEN
Subjt:  PLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHEN

Query:  VVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIAET
        VVQVITDN PNCKG GQ+IE+QFP IIWT CVVHTLNLAL NICA+KNVENNQ+ YGECSWI DIVGDV+ +K FIMNHSMRL MFNEFV LKLLS+A+T
Subjt:  VVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIAET

Query:  CFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
         F S I+MLKRFKLIK GLQAMVISDKW  YRE DV  AR VKE + L  IW
Subjt:  CFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

A0A5B7AFB0 Uncharacterized protein6.8e-12865.4Show/hide
Query:  SKGYSKDIADMQRLEDEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRK----GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLA
        SK  +KDI +MQ+LEDE K +++  A K VPLP   S IS+ G   T+ ++   ++KKRK    G+ + LEK+FN  + +QL A IA+MFYS+GLPFHLA
Subjt:  SKGYSKDIADMQRLEDEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRK----GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLA

Query:  RNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFI
        RNP+++ +FT+AANNP+  Y PPGYN+LRT+LLQ EK NIE LL PIKG W++KGV+IVSDGWS+SQRR LINFM ++    MFLK V+CSGE KDKYFI
Subjt:  RNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFI

Query:  ANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPM
        ANLM+EVINEVGHENV+Q+ITDN PNCKG GQ+IE+QF  I WT CVVHTLNLALKNICA+KNVENNQ+ Y ECSWI DI GDV+ +K FIMNHS+RL M
Subjt:  ANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPM

Query:  FNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        FNEFV LKLLS+A+T F SVI+M +RFKLIK GLQAMVISDKW  Y+EDDV + R VKE + L  IW
Subjt:  FNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

A0A6J1DT13 uncharacterized protein LOC111023231 isoform X15.7e-12765.83Show/hide
Query:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG
        KD+A+MQRLEDEAK + E+ APK V  P P  + + S G +   + +    + KKRK ++S LEKSFN T+ DQL + IA+MFYS+GLPF LARNPHF+ 
Subjt:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG

Query:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV
        AFT+AANN LS Y PPGYNMLRT+LLQREKTNIE LL PIK  W  KGV+IVSDGWSDSQRR  INFM I++G  +FLK V+CSGE+KDKYFI NL+KEV
Subjt:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV

Query:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL
        INEVGH+N++Q+ITDN PNC+  GQ+IE+QF  I+WT CVV TLNLALKNIC+SKN+E N+ V+ EC WI    GDV++VK FIMNH MRL MF EFV L
Subjt:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL

Query:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        KLLSIAET F   I MLKRFKLIK GLQAM ISDKW  YREDDV KA+H+K+L+ L  IW
Subjt:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

A0A6J1DUJ6 uncharacterized protein LOC111023231 isoform X25.7e-12765.83Show/hide
Query:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG
        KD+A+MQRLEDEAK + E+ APK V  P P  + + S G +   + +    + KKRK ++S LEKSFN T+ DQL + IA+MFYS+GLPF LARNPHF+ 
Subjt:  KDIADMQRLEDEAKTQMERKAPKNV--PLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIG

Query:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV
        AFT+AANN LS Y PPGYNMLRT+LLQREKTNIE LL PIK  W  KGV+IVSDGWSDSQRR  INFM I++G  +FLK V+CSGE+KDKYFI NL+KEV
Subjt:  AFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEV

Query:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL
        INEVGH+N++Q+ITDN PNC+  GQ+IE+QF  I+WT CVV TLNLALKNIC+SKN+E N+ V+ EC WI    GDV++VK FIMNH MRL MF EFV L
Subjt:  INEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPL

Query:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW
        KLLSIAET F   I MLKRFKLIK GLQAM ISDKW  YREDDV KA+H+K+L+ L  IW
Subjt:  KLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIW

A0A7J0FQA7 DUF659 domain-containing protein4.4e-11969.03Show/hide
Query:  IENKKRK------GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEW
        +  KKR+      G+ +AL K+FN  +R+QL + IA+MFYSAGLPFHLARNP++  ++T+AAN+ +S Y PPGYN+LRT+LL+RE++NI+ LL PI+G W
Subjt:  IENKKRK------GNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEW

Query:  RKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTL
         +KGV+IVSDGWSDSQRR LINFM ++ G  MFLK+V+CSGE KDKYFIANLMKEVI EVG +NVVQVITDN PNCKG GQLIEAQFP I+WT CVVHTL
Subjt:  RKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTL

Query:  NLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDV
        NLALKNICA+KNVENN + YGECSWI D+VGDV++VK FI NHSMRL M+NEFVPLKLLS+A+T F S ++MLKRFKLIK GLQ MVISDKW SY+E+DV
Subjt:  NLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDV

Query:  VKARHVKELI
         KA+ VK+ +
Subjt:  VKARHVKELI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43260.1 hAT transposon superfamily protein2.9e-2229.55Show/hide
Query:  IAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFL
        +A+  YS G+PF+   N         A      +  PP    LR  LL+ E   ++GL+   + EWR  G ++ +D WSD +RRS++N  +     +MFL
Subjt:  IAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFL

Query:  KSVNCSGEI-KDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDV
         S +C  +    +Y  A + +  I  +G ++VVQV+T+N  N     +L++   PTI WT C  HT+NL ++ I  SK   +++IV  + +  F I    
Subjt:  KSVNCSGEI-KDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDV

Query:  VVVKFFIMNHSMRLPMFNEFVPLKLLSIAETCFPSVIIMLKRFKLIK
             FI  H   L M   F   +     +   P +I     F+ +K
Subjt:  VVVKFFIMNHSMRLPMFNEFVPLKLLSIAETCFPSVIIMLKRFKLIK

AT1G79740.1 hAT transposon superfamily5.1e-1926.11Show/hide
Query:  SRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVI
        ++D  +  I+  F+   + F +AR+P +       A        P      +T  L R K++I   L   + EW   G TI+++ W+D++ R+LINF V 
Subjt:  SRDQLDALIAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVI

Query:  SNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWI
        S  +  F KSV+ S   K+   +A+L   VI ++G E++VQ+I DN+    G+   +   + TI  + C    LN+ L+              + +  W+
Subjt:  SNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWI

Query:  FDIVGDVVVVKFFIMNHSMRLPMFNE
           +    V+  F+ N+S  L +  +
Subjt:  FDIVGDVVVVKFFIMNHSMRLPMFNE

AT3G17450.1 hAT dimerisation domain-containing protein1.4e-1623.66Show/hide
Query:  NKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHF------IGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRK
        +K+RK  +S    S    SR  + + I++  +  G+P   A + +F      IG +          +  P   +    LLQ E + I+  L   +  W  
Subjt:  NKKRKGNTSALEKSFNKTSRDQLDALIAQMFYSAGLPFHLARNPHF------IGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRK

Query:  KGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNL
         G +I++D W++++ + +I+F+V       F  S++ +  ++D   +   + ++++++G ENVVQVIT NT   +  G+L+E +   + WT C +H   L
Subjt:  KGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNL

Query:  ALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLP-MFNEFVP-LKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKW-ESYREDD
         L++             + +  ++ + +     +  FI N +  L  M NEF   L LL  A     S    L+     K  L+ +  SD W  S     
Subjt:  ALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLP-MFNEFVP-LKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKW-ESYREDD

Query:  VVKARHVKELIHLVYIW
          + R V++++     W
Subjt:  VVKARHVKELIHLVYIW

AT4G08267.1 hAT transposon superfamily protein1.8e-1953.12Show/hide
Query:  VITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICA-SKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIA
        V+T+N  N    G LI A+F TI WT CVVHTLNLALKN CA S +  NN++VY  C WI  I  +V  +K  IMN+ +RL MF E   LKLL+I+
Subjt:  VITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICA-SKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFNEFVPLKLLSIA

AT5G31412.1 hAT transposon superfamily protein2.4e-1636.8Show/hide
Query:  QREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQL
        + E+   + LL   K  W++ GV  ++D WSD +RRS++N  V S G   FL S + S       +I   +   I +VG +NVVQV+TDN  N     ++
Subjt:  QREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSGEIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQL

Query:  IEAQFPTIIWTSCVVHTLNLALKNI
        ++ + P I WT CV HT++L L+ I
Subjt:  IEAQFPTIIWTSCVVHTLNLALKNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGCAATTTTTGTCACATTATAAAGAAGAGCTCATATACAAGAGTGAGAGCTCACTTGCTGAAGATAGTTGGTCAAGGAATTGGGTCATGTCTAAAGGTTACTC
TAAAGACATAGCAGATATGCAAAGGTTAGAGGATGAGGCAAAAACTCAAATGGAAAGAAAAGCTCCAAAAAATGTTCCGTTGCCAATGACATCTTCATCCATATCAATTG
GGGGAGTGGATGTGACCAATTCAAATAGATTTGAGATTGAGAATAAGAAAAGGAAGGGCAATACAAGTGCACTAGAAAAATCATTCAACAAGACATCACGGGACCAATTG
GATGCACTTATTGCTCAAATGTTTTATTCTGCGGGCTTGCCATTTCATTTGGCAAGGAACCCCCATTTTATCGGTGCCTTTACTTATGCTGCAAACAATCCGTTGTCCAT
ATACAGGCCTCCAGGATATAATATGTTGAGGACTAGTCTACTGCAAAGGGAGAAAACTAATATTGAAGGACTACTTATGCCTATTAAAGGTGAATGGCGAAAAAAGGGAG
TGACCATTGTAAGTGATGGTTGGAGTGACTCACAAAGAAGGTCATTGATAAACTTCATGGTCATTTCTAATGGTAAGTCAATGTTTCTAAAATCGGTCAATTGCTCAGGT
GAGATTAAGGATAAGTACTTCATTGCAAATTTGATGAAGGAAGTGATTAATGAAGTCGGTCATGAGAATGTTGTTCAAGTGATAACTGATAATACTCCCAATTGCAAAGG
GCTAGGACAACTTATTGAAGCACAATTTCCGACAATTATATGGACATCATGCGTAGTTCATACTCTAAATCTTGCCTTGAAAAACATTTGTGCTTCCAAGAATGTTGAAA
ATAATCAAATTGTTTATGGAGAGTGCAGCTGGATTTTTGATATTGTTGGGGACGTCGTGGTAGTGAAATTTTTTATAATGAACCATTCCATGAGACTTCCCATGTTTAAT
GAATTTGTGCCTCTTAAATTACTTTCAATAGCTGAAACTTGTTTTCCATCGGTTATTATTATGCTCAAAAGGTTTAAGCTTATTAAAGACGGCTTGCAAGCTATGGTTAT
CAGTGATAAATGGGAAAGTTACAGAGAAGATGATGTGGTCAAGGCAAGACATGTAAAGGAGTTGATACACCTTGTTTACATTTGGTGTATGACATGTGGGATACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAATGCAATTTTTGTCACATTATAAAGAAGAGCTCATATACAAGAGTGAGAGCTCACTTGCTGAAGATAGTTGGTCAAGGAATTGGGTCATGTCTAAAGGTTACTC
TAAAGACATAGCAGATATGCAAAGGTTAGAGGATGAGGCAAAAACTCAAATGGAAAGAAAAGCTCCAAAAAATGTTCCGTTGCCAATGACATCTTCATCCATATCAATTG
GGGGAGTGGATGTGACCAATTCAAATAGATTTGAGATTGAGAATAAGAAAAGGAAGGGCAATACAAGTGCACTAGAAAAATCATTCAACAAGACATCACGGGACCAATTG
GATGCACTTATTGCTCAAATGTTTTATTCTGCGGGCTTGCCATTTCATTTGGCAAGGAACCCCCATTTTATCGGTGCCTTTACTTATGCTGCAAACAATCCGTTGTCCAT
ATACAGGCCTCCAGGATATAATATGTTGAGGACTAGTCTACTGCAAAGGGAGAAAACTAATATTGAAGGACTACTTATGCCTATTAAAGGTGAATGGCGAAAAAAGGGAG
TGACCATTGTAAGTGATGGTTGGAGTGACTCACAAAGAAGGTCATTGATAAACTTCATGGTCATTTCTAATGGTAAGTCAATGTTTCTAAAATCGGTCAATTGCTCAGGT
GAGATTAAGGATAAGTACTTCATTGCAAATTTGATGAAGGAAGTGATTAATGAAGTCGGTCATGAGAATGTTGTTCAAGTGATAACTGATAATACTCCCAATTGCAAAGG
GCTAGGACAACTTATTGAAGCACAATTTCCGACAATTATATGGACATCATGCGTAGTTCATACTCTAAATCTTGCCTTGAAAAACATTTGTGCTTCCAAGAATGTTGAAA
ATAATCAAATTGTTTATGGAGAGTGCAGCTGGATTTTTGATATTGTTGGGGACGTCGTGGTAGTGAAATTTTTTATAATGAACCATTCCATGAGACTTCCCATGTTTAAT
GAATTTGTGCCTCTTAAATTACTTTCAATAGCTGAAACTTGTTTTCCATCGGTTATTATTATGCTCAAAAGGTTTAAGCTTATTAAAGACGGCTTGCAAGCTATGGTTAT
CAGTGATAAATGGGAAAGTTACAGAGAAGATGATGTGGTCAAGGCAAGACATGTAAAGGAGTTGATACACCTTGTTTACATTTGGTGTATGACATGTGGGATACAATGA
Protein sequenceShow/hide protein sequence
MAMQFLSHYKEELIYKSESSLAEDSWSRNWVMSKGYSKDIADMQRLEDEAKTQMERKAPKNVPLPMTSSSISIGGVDVTNSNRFEIENKKRKGNTSALEKSFNKTSRDQL
DALIAQMFYSAGLPFHLARNPHFIGAFTYAANNPLSIYRPPGYNMLRTSLLQREKTNIEGLLMPIKGEWRKKGVTIVSDGWSDSQRRSLINFMVISNGKSMFLKSVNCSG
EIKDKYFIANLMKEVINEVGHENVVQVITDNTPNCKGLGQLIEAQFPTIIWTSCVVHTLNLALKNICASKNVENNQIVYGECSWIFDIVGDVVVVKFFIMNHSMRLPMFN
EFVPLKLLSIAETCFPSVIIMLKRFKLIKDGLQAMVISDKWESYREDDVVKARHVKELIHLVYIWCMTCGIQ