; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004098 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004098
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionBED-type domain-containing protein
Genome locationChr08:13675122..13677406
RNA-Seq ExpressionHG10004098
SyntenyHG10004098
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031743736.1 uncharacterized protein LOC101215032 [Cucumis sativus]1.4e-15276.03Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        M E KEVVGKAKR+VQFIYNNVWVLNQIKKR+GGREII LASTRYFSIFLTLQNILSLKDHLHQTFTS AWMQS+LS+ GAGLEVTKITADP FWSKCDH
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
        ITMGTKPLLSVLQFLESEEKPS GFIYDAFEK K+SVMLAFNQK SVYLPYLKAIDHVL KEFQS LHVAAYYLNPSIFY PTFLSSKVIQKGLLDCIEA
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------
        LEPDITSQVMITNNINFYEEA+GDFGRPVALHGRDSLAP    SLY               S TC+ +                                
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------

Query:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTD
              RRLETCKARCSIDAVDPVF EAID NMEDWV      +DEHKRWVDVKVTNQETLVEHKLSN DSCI STDER TE+TR TD
Subjt:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTD

XP_038884678.1 uncharacterized protein LOC120075395 isoform X1 [Benincasa hispida]1.4e-15277.72Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        MEE KEVVGKAKRIVQFIYNN WVLNQIKKRSGGREIIQLASTRYFS FLTL+NILSLK+HLHQTFTSGAWMQSNLSK GAGLEVTKI ADPLFWSKCDH
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
        ITMGTKPLLSVLQFLESEEKP+ GFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDHVL KEFQS LHVAAYYLNPSIFY PTFLSSKVIQKGLLDCIEA
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------
        LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+ +                                
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------

Query:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD
              RRLETCKARCSIDAVDPVFLEAIDVNM+DWV      EDEHK WVDVKVTNQET VEHKLSNMDSCID TD
Subjt:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD

XP_038884679.1 uncharacterized protein LOC120075395 isoform X2 [Benincasa hispida]1.4e-15277.72Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        MEE KEVVGKAKRIVQFIYNN WVLNQIKKRSGGREIIQLASTRYFS FLTL+NILSLK+HLHQTFTSGAWMQSNLSK GAGLEVTKI ADPLFWSKCDH
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
        ITMGTKPLLSVLQFLESEEKP+ GFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDHVL KEFQS LHVAAYYLNPSIFY PTFLSSKVIQKGLLDCIEA
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------
        LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+ +                                
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------

Query:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD
              RRLETCKARCSIDAVDPVFLEAIDVNM+DWV      EDEHK WVDVKVTNQET VEHKLSNMDSCID TD
Subjt:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD

XP_038884682.1 uncharacterized protein LOC120075395 isoform X3 [Benincasa hispida]1.4e-15277.72Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        MEE KEVVGKAKRIVQFIYNN WVLNQIKKRSGGREIIQLASTRYFS FLTL+NILSLK+HLHQTFTSGAWMQSNLSK GAGLEVTKI ADPLFWSKCDH
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
        ITMGTKPLLSVLQFLESEEKP+ GFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDHVL KEFQS LHVAAYYLNPSIFY PTFLSSKVIQKGLLDCIEA
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------
        LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+ +                                
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------

Query:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD
              RRLETCKARCSIDAVDPVFLEAIDVNM+DWV      EDEHK WVDVKVTNQET VEHKLSNMDSCID TD
Subjt:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD

XP_038884685.1 uncharacterized protein LOC120075395 isoform X4 [Benincasa hispida]1.4e-15277.72Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        MEE KEVVGKAKRIVQFIYNN WVLNQIKKRSGGREIIQLASTRYFS FLTL+NILSLK+HLHQTFTSGAWMQSNLSK GAGLEVTKI ADPLFWSKCDH
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
        ITMGTKPLLSVLQFLESEEKP+ GFIYDAFEKAKNSVMLAFNQKES+YLPYLKAIDHVL KEFQS LHVAAYYLNPSIFY PTFLSSKVIQKGLLDCIEA
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------
        LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+ +                                
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCATS--------------------------------

Query:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD
              RRLETCKARCSIDAVDPVFLEAIDVNM+DWV      EDEHK WVDVKVTNQET VEHKLSNMDSCID TD
Subjt:  ------RRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTD

TrEMBL top hitse value%identityAlignment
A0A1S3C3D6 uncharacterized protein LOC103496546 isoform X15.1e-14574.61Show/hide
Query:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT
        E KE+VGKAKRIVQFIYNNVWVLNQIKKRSGGREII LASTRYFSIFLTLQNILSLKDHLHQTFTS AWMQS+LS+ GAGLEVTKITADP FWSKCDHIT
Subjt:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT

Query:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE
        MGTKPLLSVLQFLESEEKPS GFI+DAFEK K+SVMLAFNQKESVYLPYLKAIDHVLLKEFQS LHVAA YLNPSIFY PTFLSSKVIQKGLLDCIEALE
Subjt:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE

Query:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------
        PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+                                    
Subjt:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------

Query:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTD
            RR ETCKARCSIDAVDPV LEAID NMEDWV DV+V   E        VT QETLVEHKLSN DSCI STDER TE+TR TD
Subjt:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTD

A0A1S3C3S8 uncharacterized protein LOC103496546 isoform X25.1e-14574.61Show/hide
Query:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT
        E KE+VGKAKRIVQFIYNNVWVLNQIKKRSGGREII LASTRYFSIFLTLQNILSLKDHLHQTFTS AWMQS+LS+ GAGLEVTKITADP FWSKCDHIT
Subjt:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT

Query:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE
        MGTKPLLSVLQFLESEEKPS GFI+DAFEK K+SVMLAFNQKESVYLPYLKAIDHVLLKEFQS LHVAA YLNPSIFY PTFLSSKVIQKGLLDCIEALE
Subjt:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE

Query:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------
        PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+                                    
Subjt:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------

Query:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTD
            RR ETCKARCSIDAVDPV LEAID NMEDWV DV+V   E        VT QETLVEHKLSN DSCI STDER TE+TR TD
Subjt:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTD

A0A1S3C533 uncharacterized protein LOC103496546 isoform X31.3e-14574.29Show/hide
Query:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT
        E KE+VGKAKRIVQFIYNNVWVLNQIKKRSGGREII LASTRYFSIFLTLQNILSLKDHLHQTFTS AWMQS+LS+ GAGLEVTKITADP FWSKCDHIT
Subjt:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT

Query:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE
        MGTKPLLSVLQFLESEEKPS GFI+DAFEK K+SVMLAFNQKESVYLPYLKAIDHVLLKEFQS LHVAA YLNPSIFY PTFLSSKVIQKGLLDCIEALE
Subjt:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE

Query:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------
        PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+                                    
Subjt:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------

Query:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTDGND
            RR ETCKARCSIDAVDPV LEAID NMEDWV DV+V   E        VT QETLVEHKLSN DSCI STDER TE+TR TDG +
Subjt:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTDGND

A0A5D3BZ70 BED-type domain-containing protein1.9e-14774.87Show/hide
Query:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT
        E KE+VGKAKRIVQFIYNNVWVLNQIKKRSGGREII LASTRYFSIFLTLQNILSLKDHLHQTFTS AWMQS+LS+ GAGLEVTKITADP FWSKCDHIT
Subjt:  EAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHIT

Query:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE
        MGTKPLLSVLQFLESEEKPS GFI+DAFEK K+SVMLAFNQKESVYLPYLKAIDHVLLKEFQS LHVAA YLNPSIFY PTFLSSKVIQKGLLDCIEALE
Subjt:  MGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALE

Query:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------
        PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAP    SLY               S TC+                                    
Subjt:  PDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCAT-----------------------------------

Query:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTDGNDL
            RR ETCKARCSIDAVDPV LEAID NMEDWV DV+V   E        VT QETLVEHKLSN DSCI STDER TE+TR TDGNDL
Subjt:  ---SRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTDGNDL

A0A6J1BVZ0 uncharacterized protein LOC111006240 isoform X27.9e-14672.7Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        MEE +EVVGKAKRIVQFIYNNVWVLN IKKR GGREIIQLASTR FSIFLTL NILSLKDHLHQTFTSG WMQSN SK GAGLEV KITADPLFWSKCDH
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
        +T GTKPLLSVLQFLESEEKPS GFIYDAFEKAKNSVMLAFN+KES Y P+LKAIDHVL KEFQSPLHVAAYYLNPSIFY PTFLSSKVIQKGLLDCIEA
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCA----------------------------------
        LEPDITSQVM  +NINFYEEAVGDFGR VALHGR+SLAP    SLY               S TC+                                  
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPEL--SLY---------------SSTCA----------------------------------

Query:  ----TSRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTDGNDL
              RRLE  K RCSI A+DPV LEAIDV MEDW+ DVEV+EDEHKRW++VKVT+QET VEHK SN++SCID+TDER +EDT  TDGNDL
Subjt:  ----TSRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTRGTDGNDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22220.1 hAT transposon superfamily4.5e-3734.51Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        M+  +E++ +A+ + + IYN+  VLN ++K + G +I+Q   T   + F T+  I  LK +L    TS  W   + SK   GL +T+   D  FW     
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
            T P+L VL+ + SE KP+ G++Y A  +AK ++      +E  Y+ Y K ID   L   Q PL+ A +YLNP  FY         I   ++DCIE 
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRR
        L PD+  Q ++  +IN Y+ AVG FGR +A+  RD++ P    S Y  +C    R
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRR

AT3G22220.2 hAT transposon superfamily4.5e-3734.51Show/hide
Query:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH
        M+  +E++ +A+ + + IYN+  VLN ++K + G +I+Q   T   + F T+  I  LK +L    TS  W   + SK   GL +T+   D  FW     
Subjt:  MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDH

Query:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA
            T P+L VL+ + SE KP+ G++Y A  +AK ++      +E  Y+ Y K ID   L   Q PL+ A +YLNP  FY         I   ++DCIE 
Subjt:  ITMGTKPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEA

Query:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRR
        L PD+  Q ++  +IN Y+ AVG FGR +A+  RD++ P    S Y  +C    R
Subjt:  LEPDITSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRR

AT4G15020.1 hAT transposon superfamily1.1e-3031.7Show/hide
Query:  EVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGT
        E + +A+ I +F+YN+  VLN + K + G +I+  A +   + F TL  I  LK +L    TS  W + + S+  +GL +  +T D  FW     +   T
Subjt:  EVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGT

Query:  KPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALEPDI
         PLL  L+ + SE++P+ G++Y A  +AK+++      +E  Y+ Y K ID    ++   PL  A ++LNP +FY         +   +LDCIE L PD 
Subjt:  KPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALEPDI

Query:  TSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRRL------ETCKARCS
          Q  I   +  Y+ A G FGR +A+  RD++ P    S Y  +C    R       +TC +  S
Subjt:  TSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRRL------ETCKARCS

AT4G15020.2 hAT transposon superfamily1.1e-3031.7Show/hide
Query:  EVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGT
        E + +A+ I +F+YN+  VLN + K + G +I+  A +   + F TL  I  LK +L    TS  W + + S+  +GL +  +T D  FW     +   T
Subjt:  EVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGT

Query:  KPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALEPDI
         PLL  L+ + SE++P+ G++Y A  +AK+++      +E  Y+ Y K ID    ++   PL  A ++LNP +FY         +   +LDCIE L PD 
Subjt:  KPLLSVLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALEPDI

Query:  TSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRRL------ETCKARCS
          Q  I   +  Y+ A G FGR +A+  RD++ P    S Y  +C    R       +TC +  S
Subjt:  TSQVMITNNINFYEEAVGDFGRPVALHGRDSLAPE--LSLYSSTCATSRRL------ETCKARCS

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related1.0e-2832.39Show/hide
Query:  IKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSTGFIY
        ++K +GGR + + A TR  + F+TL     LKD+L +   S  W  S  +K   G+++        FW    H      PL+ VL+ ++ E KP  G+IY
Subjt:  IKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSTGFIY

Query:  DAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFY-CPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFG
         A ++AK ++M +F  KE  Y    + ID     +   PLH A YYLNP   Y  P  +  + +  G L C+  L P I +Q  I   ++ +++A G FG
Subjt:  DAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFY-CPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEEAVGDFG

Query:  RPVALHGRDSLAP
         P+A+  R  ++P
Subjt:  RPVALHGRDSLAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGCAAAAGAGGTGGTTGGGAAAGCAAAAAGAATAGTTCAATTCATATACAACAATGTCTGGGTCCTAAACCAAATAAAGAAGAGAAGTGGTGGAAGAGAAAT
TATTCAGCTTGCATCTACAAGATATTTCTCCATCTTCTTGACTCTGCAAAACATTTTGTCTTTGAAAGACCATCTTCATCAGACATTCACCAGTGGTGCTTGGATGCAGT
CAAATTTGTCCAAGTCCGGGGCTGGACTTGAGGTGACAAAGATCACTGCTGATCCACTATTCTGGTCAAAGTGTGATCATATTACAATGGGAACAAAACCATTACTTTCT
GTGTTGCAATTTCTTGAATCAGAGGAGAAGCCATCTACTGGATTTATATATGATGCATTTGAAAAAGCAAAGAATAGCGTCATGCTTGCTTTCAACCAGAAGGAATCTGT
GTACTTGCCATATTTAAAAGCAATTGACCATGTTTTACTGAAAGAATTTCAGAGTCCTCTTCATGTGGCTGCATACTACCTAAATCCGTCGATATTCTATTGTCCTACAT
TCTTATCCAGCAAAGTTATTCAAAAGGGTTTACTTGATTGCATTGAAGCCTTAGAGCCAGATATAACATCCCAGGTCATGATTACAAACAATATAAATTTCTATGAGGAA
GCTGTTGGAGATTTTGGGCGGCCAGTGGCATTACATGGTCGAGATTCATTGGCCCCAGAATTATCTTTATATTCTTCCACCTGTGCCACAAGCAGGAGACTGGAGACTTG
TAAAGCAAGGTGCTCAATAGATGCAGTAGATCCTGTTTTTTTGGAAGCCATTGATGTGAACATGGAAGATTGGGTGGAGGATGTTGAGGTATTGGAGGATGAGCACAAGA
GGTGGGTGGATGTGAAGGTCACTAATCAGGAGACCTTGGTGGAACATAAATTGTCCAACATGGATAGTTGTATTGACAGCACAGATGAGAGAGGCACTGAGGATACTAGA
GGTACAGATGGTAATGATTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGCAAAAGAGGTGGTTGGGAAAGCAAAAAGAATAGTTCAATTCATATACAACAATGTCTGGGTCCTAAACCAAATAAAGAAGAGAAGTGGTGGAAGAGAAAT
TATTCAGCTTGCATCTACAAGATATTTCTCCATCTTCTTGACTCTGCAAAACATTTTGTCTTTGAAAGACCATCTTCATCAGACATTCACCAGTGGTGCTTGGATGCAGT
CAAATTTGTCCAAGTCCGGGGCTGGACTTGAGGTGACAAAGATCACTGCTGATCCACTATTCTGGTCAAAGTGTGATCATATTACAATGGGAACAAAACCATTACTTTCT
GTGTTGCAATTTCTTGAATCAGAGGAGAAGCCATCTACTGGATTTATATATGATGCATTTGAAAAAGCAAAGAATAGCGTCATGCTTGCTTTCAACCAGAAGGAATCTGT
GTACTTGCCATATTTAAAAGCAATTGACCATGTTTTACTGAAAGAATTTCAGAGTCCTCTTCATGTGGCTGCATACTACCTAAATCCGTCGATATTCTATTGTCCTACAT
TCTTATCCAGCAAAGTTATTCAAAAGGGTTTACTTGATTGCATTGAAGCCTTAGAGCCAGATATAACATCCCAGGTCATGATTACAAACAATATAAATTTCTATGAGGAA
GCTGTTGGAGATTTTGGGCGGCCAGTGGCATTACATGGTCGAGATTCATTGGCCCCAGAATTATCTTTATATTCTTCCACCTGTGCCACAAGCAGGAGACTGGAGACTTG
TAAAGCAAGGTGCTCAATAGATGCAGTAGATCCTGTTTTTTTGGAAGCCATTGATGTGAACATGGAAGATTGGGTGGAGGATGTTGAGGTATTGGAGGATGAGCACAAGA
GGTGGGTGGATGTGAAGGTCACTAATCAGGAGACCTTGGTGGAACATAAATTGTCCAACATGGATAGTTGTATTGACAGCACAGATGAGAGAGGCACTGAGGATACTAGA
GGTACAGATGGTAATGATTTGTAG
Protein sequenceShow/hide protein sequence
MEEAKEVVGKAKRIVQFIYNNVWVLNQIKKRSGGREIIQLASTRYFSIFLTLQNILSLKDHLHQTFTSGAWMQSNLSKSGAGLEVTKITADPLFWSKCDHITMGTKPLLS
VLQFLESEEKPSTGFIYDAFEKAKNSVMLAFNQKESVYLPYLKAIDHVLLKEFQSPLHVAAYYLNPSIFYCPTFLSSKVIQKGLLDCIEALEPDITSQVMITNNINFYEE
AVGDFGRPVALHGRDSLAPELSLYSSTCATSRRLETCKARCSIDAVDPVFLEAIDVNMEDWVEDVEVLEDEHKRWVDVKVTNQETLVEHKLSNMDSCIDSTDERGTEDTR
GTDGNDL