; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C11G213975 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C11G213975
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionDUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein
Genome locationCla97Chr11:7572685..7578751
RNA-Seq ExpressionCla97C11G213975
SyntenyCla97C11G213975
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5516348.1 hypothetical protein RHGRI_037159 [Rhododendron griersonianum]2.5e-14844.89Show/hide
Query:  SIGATISSSSVEDESKQLWQYVTK-NQILNEGG--------------------------LGGYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTPRNIPL
        S GA+ +  +VE+  K LWQYVTK ++   +GG                          + G G+ VC  VTS D A  +RL +E + +  +  P+ +PL
Subjt:  SIGATISSSSVEDESKQLWQYVTK-NQILNEGG--------------------------LGGYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTPRNIPL

Query:  PPS------------------------------------------------------------------FVPS---------------------------
        PPS                                                                  +V S                           
Subjt:  PPS------------------------------------------------------------------FVPS---------------------------

Query:  ----EIIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVE
             +I F A+T G PMFLKA+ C GE+KDK+FIA+L KEV+IEVGP+NV+Q++TDNA NCK A QIIE+Q+P I WTPCVVHTLNLALKNICA KN+E
Subjt:  ----EIIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVE

Query:  NNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-----------------VKGSILDDVWW
        NN LVY++CSWI+ +  D   +KNFIMNHSMR+AIFNEFV L+LLSVAETRFAS I+MLKRFKLIK  LQ+M                 V+ ++L++ WW
Subjt:  NNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-----------------VKGSILDDVWW

Query:  DKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPN
        DK+DYILS T+P Y+++  CDT+  T HL+YDMWDTMIEKVK AIY+HE K  +++S FYDVVH IL+DRWNKNNTPLHCLAH+LNP   S+ WL++DP 
Subjt:  DKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPN

Query:  RVTPHKDIEVTRER---------------------MKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV-------------YVSLPHL
        RV PHKD EVT ER                      KFST    F+D+ SI  RYN D+  WW +HG  A +LQ++A K+             + +   +
Subjt:  RVTPHKDIEVTRER---------------------MKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV-------------YVSLPHL

Query:  HAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFF---EEAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVME
        H+++        A DLVYIHSNL LLSR+SP+Y  G TK+WDIAGD F   ++ GML+V NLSLDEP++E VVFTDDG G++AN+    DE+VME
Subjt:  HAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFF---EEAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVME

KAG5532188.1 hypothetical protein RHGRI_026721 [Rhododendron griersonianum]7.4e-14552.14Show/hide
Query:  YVTKNQILNEGGLGGY---GIGVC-SKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIAN
        YVT  Q      L GY   G  +  + +  Q+K N++RL   ++    +K    +     +  S+   +I F A+T+G PMFLKA++CSGE KDKYFI  
Subjt:  YVTKNQILNEGGLGGY---GIGVC-SKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIAN

Query:  LPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFN
        L +EVI EVGP NV+Q+ITDNA NC GA  +IE  +P I WTPCVVHTLNLAL+NICA KNVENN + Y++CSWI+ IA DV  +KNFIMNHSMR+AIFN
Subjt:  LPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFN

Query:  EFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWD
        +FVPL+LLSVA TRFAS+++MLKRFKL+K  LQ M                   VK  +LDD+WWD IDYILS TSP Y+M+ ICDT+    HLVYDMWD
Subjt:  EFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWD

Query:  TMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK-------------------
        TMIEKVK AIYRHEGKR ++ S+FYDVVH IL+DRWNKN+TPLHCLAHSLNP   S++WL + P+RV PHKD+EV RERMK                   
Subjt:  TMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK-------------------

Query:  --FSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYV------------SLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHG
          FS+   +F D  SI  RY  D K WW  +G  A +LQSIA K+ V            +   +H+ K        A DLV+IHSNL LLSR++ +Y  G
Subjt:  --FSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYV------------SLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHG

Query:  ETKLWDIAGDFFE---EAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVMED
        +TK+WDIAGD F+   + G L++  LSLDEP+LE VVF DD  G+E+++ E  DE  MED
Subjt:  ETKLWDIAGDFFE---EAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVMED

RWR74797.1 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein [Cinnamomum micranthum f. kanehirae]2.4e-15156.29Show/hide
Query:  SKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNC
        + +  ++KAN++RL   ++    +K    +     +  S+   +I F A+T+G PMFLKA++CSGE KDKYFIANL KEVI +VG +NV+Q+ITDNAPNC
Subjt:  SKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNC

Query:  KGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRF
        KGA QIIESQFP I+WTPCVVHTLNLAL NICA KNVENN L Y +CSWI +I  DVM +K+FIMNHSMR+A+FNEFV L+LLSVA+TRFAS I+MLKRF
Subjt:  KGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRF

Query:  KLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFY
        KLIK GLQAM                   VK  +LDD+WWD IDYILS TSP Y+M+ +CDT+    HLVYDMWDTMIEKVKT I+RHEGKR DE S FY
Subjt:  KLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFY

Query:  DVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAK
        DVVH IL+D WNKNNTPLHCLAHSLNP   S++WL++DP+RV P+KD+EV+RER K                     FS    +F +  S+  RY+ D  
Subjt:  DVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAK

Query:  DWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVT
         WWA+HG  A  LQS+AFK+             + +   +H+V+        A DLV+IHSNL LLSRK+P+Y  GETK+WDIAG   D FE+ G+L+V 
Subjt:  DWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVT

Query:  NLSLDEPDLEEVVFTDD
        NLSLDEP+LE VVFTDD
Subjt:  NLSLDEPDLEEVVFTDD

XP_031743157.1 uncharacterized protein LOC116404561 [Cucumis sativus]1.2e-15055.35Show/hide
Query:  GYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLI
        GY +   S +  ++KAN++RL   ++    KK    I     +  S+   +I F AI++G PMFLK+++CSGE+KDKYFIAN  KEVI EVG +NV+Q+I
Subjt:  GYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLI

Query:  TDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASI
        TDNAPNCKGA Q+IE+QFP I+WTPCVVHTLNLALKNICA KNVENN +VY +CSWI  IA D++ +K FIMNHSM +A+FNEFVPL+LLS+AETRFAS+
Subjt:  TDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASI

Query:  IIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRG
        IIMLKRFKLIKGGLQAM                   VK  +LD +WWDKI+YILS TSP Y+M+  CD +T   HLVYDMWDTMIEKVK +IY+HEG R 
Subjt:  IIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRG

Query:  DEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIA
         E SSFYDV++NILID WNKN+T LHCLAHSLNP   SE+WL +DPNRV PH+D+E+TRERMK                     FST   DF+D  SI  
Subjt:  DEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIA

Query:  RYNEDAKDWWAMHGVYAVMLQSIAFKV------------------YVSLPHLHAVKGVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEE
        RY  D+  WWA HG YA MLQ IAFKV                  +++    + +    A DLV+IHSNLHLLSRK+PEYS GETK WDIAG   D  E+
Subjt:  RYNEDAKDWWAMHGVYAVMLQSIAFKV------------------YVSLPHLHAVKGVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEE

Query:  AGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVME
          ML+V NLSLD+P+LE  +  +DG          +DEDV+E
Subjt:  AGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVME

XP_038721054.1 uncharacterized protein LOC120013346 isoform X2 [Tripterygium wilfordii]7.2e-14849.68Show/hide
Query:  SSSTSSNSGQSSTIPSIGATISSSSVEDESKQLWQYVTKNQILNEGGLGG-----------------------------YGIGVCSKVTSQDKANMQRLE
        S +TS ++  S++ PS     SS  VED  K LW+YV + +   +GG GG                              GI  C  VTS+D A MQRLE
Subjt:  SSSTSSNSGQSSTIPSIGATISSSSVEDESKQLWQYVTKNQILNEGGLGG-----------------------------YGIGVCSKVTSQDKANMQRLE

Query:  DEMQDRMAKKTPRNI-PLPPSF------VPSE---------IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKG
        +E ++R      R + P+  ++      + S+         +I F A+++  PMFLKA++CSGE KDK+FI NL KEVI EVGPQNV+Q+ITDNA NC G
Subjt:  DEMQDRMAKKTPRNI-PLPPSF------VPSE---------IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKG

Query:  AEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKL
        A  ++E  +P IVWTPCVVHTLNLAL+NICA KN+ENN +VY++CSWI+ ++ DV  +KNFIMNHSMR+AIFNEFVPL+LLS+A TRFAS+++MLKRF L
Subjt:  AEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKL

Query:  IKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDV
        IK  L +M                   VK  +LDDVWWD IDYIL  T+P Y+M+  CDT+    HLVYDMWD+MIEKV+ AIYR EGKR +E S FYDV
Subjt:  IKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDV

Query:  VHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAKDW
        VH IL+ RWNKNNTPLHCLAHSLNP   SE+WL +DP RV PHKD+EV RERMK                     FS+M  DF D  SI  R + D K W
Subjt:  VHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAKDW

Query:  WAMHGVYAVMLQSIAFKVYV-------------SLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGD---FFEEAGMLQVTNL
        W   G  A +LQ++A K+ V             +   +H+V+        A DLV+IHSNL LLSR+ P+YS GE+K+WDIAGD    FE+ G L+V  L
Subjt:  WAMHGVYAVMLQSIAFKVYV-------------SLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGD---FFEEAGMLQVTNL

Query:  SLDEPDLEEVVFTDDG
        SLDEPD E VVF DDG
Subjt:  SLDEPDLEEVVFTDDG

TrEMBL top hitse value%identityAlignment
A0A1Q3C897 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein2.1e-13755.49Show/hide
Query:  IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLV
        +I F A++DGD MF+KA++CSG+ KD +FI NL KEVI EVG   V+Q+ITDNA NCKG  Q+IE +FP+IVWTPCVVHTLNLALKNI A KNVE+N + 
Subjt:  IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLV

Query:  YEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKI
        YE CSWIS+I  DVM ++NFIMNHSMR+A+F+EFV L+LL VAETRFAS++IMLKRFKLIK GLQ M                   VK  +LDD+WWD I
Subjt:  YEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKI

Query:  DYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVT
        DYILS T+P Y+MI  CDT+  + HLVYDMWD MIEKVK AIY HE K   E+S F+ VVH ILIDRWNKNNTPLHCLAH+LNP   SEQWL +DP RV 
Subjt:  DYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVT

Query:  PHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAV
        PHKD+E+ +ER K                     F     DF D  SI  RY  D K WW  HG  A MLQ+IA ++             + +   +H++
Subjt:  PHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAV

Query:  K-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFF---EEAGMLQVTNLSLDEPDLEEVVFTDD
        K        A DLVYIHSNL LLSRKSP+Y+ G+TK+WDI GD F   ++ G+ +V +LSLDEP++E +VF D+
Subjt:  K-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFF---EEAGMLQVTNLSLDEPDLEEVVFTDD

A0A3S3NPR5 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein2.3e-14456.37Show/hide
Query:  IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLV
        +I F A+T+  PMFLKA++C+GE+KDKYFI  L KEVI EVGPQNVIQ+ITDNA NC GA  +IE+QF  IVWTP VVHTLNLAL+NIC  KN+ENN L 
Subjt:  IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLV

Query:  YEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKI
        YE+C WI+++A DV+ +KNFIMNHSMR+AIFNEFVPL+LLSVA TRFAS+++MLKRFKLIKG LQAM                   VK  +LDD+WWDK+
Subjt:  YEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM-------------------VKGSILDDVWWDKI

Query:  DYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVT
         YILS T+P Y+M+ +CDT+    +LVYDMWDTMIEKVK AIYRHEGK  DE S+FY+ VH IL+DRWNKNNTPLHCLAHSLNP   S  WL +DP+RV 
Subjt:  DYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVT

Query:  PHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYV-------------SLPHLHAV
        PH+D+EV RERMK                     FS+   +F D+ SI  RY  D K WW  +G  A +LQ++AF + V             +   +H+V
Subjt:  PHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYV-------------SLPHLHAV

Query:  K-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVTNLSLDEPDLEEVVFTDDGSGNE
        +        A DL++IHSNL LLSR++P+Y  GETK+WDIAG   D FE+AG L++  LSLDEP LE V+F++DG G +
Subjt:  K-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVTNLSLDEPDLEEVVFTDDGSGNE

A0A443N8D6 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein1.2e-15156.29Show/hide
Query:  SKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNC
        + +  ++KAN++RL   ++    +K    +     +  S+   +I F A+T+G PMFLKA++CSGE KDKYFIANL KEVI +VG +NV+Q+ITDNAPNC
Subjt:  SKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSE---IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNC

Query:  KGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRF
        KGA QIIESQFP I+WTPCVVHTLNLAL NICA KNVENN L Y +CSWI +I  DVM +K+FIMNHSMR+A+FNEFV L+LLSVA+TRFAS I+MLKRF
Subjt:  KGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRF

Query:  KLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFY
        KLIK GLQAM                   VK  +LDD+WWD IDYILS TSP Y+M+ +CDT+    HLVYDMWDTMIEKVKT I+RHEGKR DE S FY
Subjt:  KLIKGGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFY

Query:  DVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAK
        DVVH IL+D WNKNNTPLHCLAHSLNP   S++WL++DP+RV P+KD+EV+RER K                     FS    +F +  S+  RY+ D  
Subjt:  DVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERMK---------------------FSTMRKDFNDVYSIIARYNEDAK

Query:  DWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVT
         WWA+HG  A  LQS+AFK+             + +   +H+V+        A DLV+IHSNL LLSRK+P+Y  GETK+WDIAG   D FE+ G+L+V 
Subjt:  DWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVT

Query:  NLSLDEPDLEEVVFTDD
        NLSLDEP+LE VVFTDD
Subjt:  NLSLDEPDLEEVVFTDD

A0A5B7AFB0 Uncharacterized protein1.3e-14744.51Show/hide
Query:  SSTIPSIGATISSSSVEDESKQLWQYVTKNQILNEGG---------------------------LGGYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTP
        +ST PS     SSS  ED +K LW+YV K   L++GG                           L G GI  CSKVT++D   MQ+LEDE++ R+     
Subjt:  SSTIPSIGATISSSSVEDESKQLWQYVTKNQILNEGG---------------------------LGGYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTP

Query:  RNIPLPPSFV---------------------------PSE------------------------------------------------------------
        + +PLP S +                           P E                                                            
Subjt:  RNIPLPPSFV---------------------------PSE------------------------------------------------------------

Query:  ---------------------------------------IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAE
                                               +I F A+T+  PMFLK ++CSGE KDKYFIANL +EVI EVG +NVIQ+ITDNAPNCKGA 
Subjt:  ---------------------------------------IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAE

Query:  QIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIK
        Q+IESQF  I WTPCVVHTLNLALKNICA KNVENN L Y +CSWIS+IA DVM +K+FIMNHS+R+ +FNEFV L+LLSVA+TRFAS+I+M +RFKLIK
Subjt:  QIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIK

Query:  GGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVH
         GLQAM                   VK  +L+D+WWD IDYILS T+P YEM+  CDT+    HLVYDMWD+M+EKVK AIYRHE KR +E S+FYDVVH
Subjt:  GGLQAM-------------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVH

Query:  NILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERM---------------------KFSTMRKDFNDVYSIIARYNEDAKDWWA
        NIL+DRWNKNNTPLHCLAHSLNP   S +WL ++PNRV P+K+ E+++ER+                     KFST   DF  V SI  RY  D K WW 
Subjt:  NILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRERM---------------------KFSTMRKDFNDVYSIIARYNEDAKDWWA

Query:  MHGVYAVMLQSIAFKVYV-------------SLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVTNLSL
        +HG  A MLQS+A K+ V             +   +H+V+        A DLV++HSNL LLSR++P+Y  GETK+WDIAG   D FE+ G+L+V NLSL
Subjt:  MHGVYAVMLQSIAFKVYV-------------SLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAG---DFFEEAGMLQVTNLSL

Query:  DEPDLEEVVFTDDGSGNEANNCEAEDED
        DEP+LE VVF DDG     N  E E+ED
Subjt:  DEPDLEEVVFTDDGSGNEANNCEAEDED

A0A7J0H150 Uncharacterized protein1.8e-14452.7Show/hide
Query:  GYGIGVCSKVTSQDKANMQRLEDEMQDRMAKK--TPRNIPLPPSFVPSEIIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLIT
        GY + + +K+  +++AN++RL + ++    +K  T  +     S     +I F A++   PMFLKA++CSGE KDK+FIA+L KEV+IEVGP+NV+Q+IT
Subjt:  GYGIGVCSKVTSQDKANMQRLEDEMQDRMAKK--TPRNIPLPPSFVPSEIIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLIT

Query:  DNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASII
        DNAPNCK A QIIE+Q+P I WTPCVVHTLNLALKNICA KN ENN LVY++CSWI+ +  D   +KN IMNHSMR+AIFNEFVPL+LLSVAETRF S I
Subjt:  DNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVAETRFASII

Query:  IMLKRFKLIKGGLQAM-----------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDED
        ++L+RF LIK  LQ+M                 V+ ++LD+ WWDK+DYILS T+P Y+M+  CD++  T HL+YDMWD+MI KVK AIY+HEGK  D++
Subjt:  IMLKRFKLIKGGLQAM-----------------VKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDED

Query:  SSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRER---------------------MKFSTMRKDFNDVYSIIARYN
          FY+VVH IL+DRWNKNNTPLHCLAH+LNP   S++WL++DPNRV PH+D EVT ER                      KFST    F+DV SI  R+N
Subjt:  SSFYDVVHNILIDRWNKNNTPLHCLAHSLNP---SEQWLKQDPNRVTPHKDIEVTRER---------------------MKFSTMRKDFNDVYSIIARYN

Query:  EDAKDWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFF---EEAGM
         D   WW +HG  A  LQ+IA K+             + +   +H++K        A DLV++HSNL LLSR+SP+Y  GETK+WDIAGD F   E+ GM
Subjt:  EDAKDWWAMHGVYAVMLQSIAFKV-------------YVSLPHLHAVK-----GVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFF---EEAGM

Query:  LQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDV
        L+V NLSLDEP+LE VVFTDDG   E  N E E  +V
Subjt:  LQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-1125.11Show/hide
Query:  PGLDVMTVYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEW-KVAGKKGALLKLDLEKAYGKME----------
        PG D      +RPISL+    KI+ K+L  R+++ +  +IH  Q+ F  G Q    I  +   ++   +   K   ++ +D EKA+ K++          
Subjt:  PGLDVMTVYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEW-KVAGKKGALLKLDLEKAYGKME----------

Query:  --------KVDWRCLSSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLIFCSWSDGN
                K+          +II+NG+       K G +QG PLSP LF IV +  +  +R   + + I+G   G    K++   + DD +++      +
Subjt:  --------KVDWRCLSSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLIFCSWSDGN

Query:  LMAWWSIVNLFLLGSGLSLNVSK
              +++ F   SG  +NV K
Subjt:  LMAWWSIVNLFLLGSGLSLNVSK

P08548 LINE-1 reverse transcriptase homolog1.8e-1325.65Show/hide
Query:  NLGNLKLPGLDVMTVYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEW-KVAGKKGALLKLDLEKAY-------
        N+  +  PG D      YRPISL+    KI+ K+LT R+++ +  IIH  Q+ F  G Q    I  +   ++   K+  K   +L +D EKA+       
Subjt:  NLGNLKLPGLDVMTVYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEW-KVAGKKGALLKLDLEKAY-------

Query:  -----------GKMEKVDWRCLSSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLIF
                   G   K+     S    +II+NG        + G +QG PLSP LF IV +  +  +R   E + I+G   GS   K++   + DD +++
Subjt:  -----------GKMEKVDWRCLSSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLIF

Query:  CSWSDGNLMAWWSIVNLFLLGSGLSLNVSK-MSFIGVNLSFEETDFNASILGCKVNNFPFNYLGFPI--GGKHATKVACEALEEK---------------
           +  +      ++  +   SG  +N  K ++FI  N +  E     SI    V      YLG  +    K   K   E L ++               
Subjt:  CSWSDGNLMAWWSIVNLFLLGSGLSLNVSK-MSFIGVNLSFEETDFNASILGCKVNNFPFNYLGFPI--GGKHATKVACEALEEK---------------

Query:  -GRLTLAQSVLNSFSCYFFSL--MQALKSYINHLER-LLETMWGMKK
         GR+ + +  +   + Y F+   ++A  SY   LE+ +L  +W  KK
Subjt:  -GRLTLAQSVLNSFSCYFFSL--MQALKSYINHLER-LLETMWGMKK

P11369 LINE-1 retrotransposable element ORF2 protein5.4e-1323.41Show/hide
Query:  QSKIVISMLENSEGRILLRDDEIVDEIVGFY-----TSLHKKDDSTHFTFDGLDWRPLVIQASLALEAPFEEAEIWAVVQNLGNLKLPGLDVMT------
        + KI+I+ + N +G I    +EI + I  FY     T L   D+   F  D      L       L +P    EI AV+ +L   K PG D  +      
Subjt:  QSKIVISMLENSEGRILLRDDEIVDEIVGFY-----TSLHKKDDSTHFTFDGLDWRPLVIQASLALEAPFEEAEIWAVVQNLGNLKLPGLDVMT------

Query:  -------------------------------------------VYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSL
                                                   +  +RPISL+    KI+ K+L  R++E + +IIH  Q+ F  G Q    I  +   +
Subjt:  -------------------------------------------VYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSL

Query:  EEW-KVAGKKGALLKLDLEKAYGKME-----KVDWR-------------CLSSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCN
            K+  K   ++ LD EKA+ K++     KV  R               S    +I +NG     I  K G +QG PLSP+LF IV +  +  +R   
Subjt:  EEW-KVAGKKGALLKLDLEKAYGKME-----KVDWR-------------CLSSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCN

Query:  ERRVIEGFSFGSLSTKVTHLQYVDDTLIFCSWSDGNLMAWWSIVNLFLLGSGLSLNVSK-MSFIGVNLSFEETDFNASILGCKVNNFPFNYLGFPIGGKH
        +++ I+G   G    K++ L   DD +++ S    +     +++N F    G  +N +K M+F+       E +   +     V N    YLG       
Subjt:  ERRVIEGFSFGSLSTKVTHLQYVDDTLIFCSWSDGNLMAWWSIVNLFLLGSGLSLNVSK-MSFIGVNLSFEETDFNASILGCKVNNFPFNYLGFPIGGKH

Query:  ATKVACEALEEKGRLTLAQSVLNSFSCYFFSLMQALKSYINHLERLLETMWGMKKN---SALLMKWLWRFFHGENALWRRVTSAIYGEDDHG-----WSS
                      +TL + V + +   F SL + +K  +   +  L   W  + N    A+L K ++RF    NA+  ++ +  + E +       W++
Subjt:  ATKVACEALEEKGRLTLAQSVLNSFSCYFFSLMQALKSYINHLERLLETMWGMKKN---SALLMKWLWRFFHGENALWRRVTSAIYGEDDHG-----WSS

Query:  LMPR
          PR
Subjt:  LMPR

P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-0927.23Show/hide
Query:  DVMTVYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEWKVAGKKGALLKLDLEKAYGKMEKVDWRCL-------
        D+  +  +RP+SL+++ YKI+ K ++ RLK VL  +IH  Q     GR I D +      L   +  G   A L LD EKA+   ++VD + L       
Subjt:  DVMTVYQYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEWKVAGKKGALLKLDLEKAYGKMEKVDWRCL-------

Query:  --------------SSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLI
                      +S    + IN      +   RG++QG PLS  L+++  + F      C  R+ + G        +V    Y DD ++
Subjt:  --------------SSTNFSIIINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLI

P92555 Uncharacterized mitochondrial protein AtMg012502.3e-0841.18Show/hide
Query:  IINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDT
        IING   G +   RGL+QGDPLSP+LF +  +  S L R   E+  + G    + S ++ HL + DDT
Subjt:  IINGRSGGKIMAKRGLQQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDT

Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily1.9e-1320.38Show/hide
Query:  IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLV
        +I F   +     F K+++ S   K+   +A+L   VI ++G ++++Q+I DN+    G    +   + TI  +PC    LN+ L+              
Subjt:  IIIFKAITDGDPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLV

Query:  YEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLR-LLSVAETRFASIIIMLKRFKLIKGGLQAMVK-----------------GSILDDVWWDKID
        + K  W++   S    +  F+ N+S  + +  +    + ++    TR  S  + L+     K  L+ M                     + D+ +W  ++
Subjt:  YEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLR-LLSVAETRFASIIIMLKRFKLIKGGLQAMVK-----------------GSILDDVWWDKID

Query:  YILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKN-NTPLHCLAHSLNPSEQW---------LKQ
          ++I+ P  +++    T       +Y++     E ++T     E K        + V  +I+   W ++ ++PLH  A  LNPS Q+         LK+
Subjt:  YILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNKN-NTPLHCLAHSLNPSEQW---------LKQ

Query:  D----PNRVTPHKDI--EVTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV
        D      ++ P  D+  ++T +   F+  +  F    ++ AR +     WW   G  A +LQ +A ++
Subjt:  D----PNRVTPHKDI--EVTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV

AT3G17450.1 hAT dimerisation domain-containing protein1.5e-1320.04Show/hide
Query:  FLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASD
        F  +++ +  V+D   +     +++ ++G +NV+Q+IT N    + A +++E +   + WTPC +H   L L++             + K  ++S     
Subjt:  FLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASD

Query:  VMTLKNFIMNHSMRVAIF-NEFVP-LRLLSVAETRFASIIIMLKRFKLIKGGLQAM--------------------VKGSILDDVWWDKIDYILSITSPT
           +  FI N +  + +  NEF   L LL  A  R AS    L+     K  L+ +                    V+  +L  V+W K+ Y+L    P 
Subjt:  VMTLKNFIMNHSMRVAIF-NEFVP-LRLLSVAETRFASIIIMLKRFKLIKGGLQAM--------------------VKGSILDDVWWDKIDYILSITSPT

Query:  YEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNK-NNTPLHCLAHSLNPSEQW-------------LKQDPNRV
         ++I + +       + Y        K+             +D+  Y     ++  RWN   + PL+  A+  NP+ ++             + +   R+
Subjt:  YEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRWNK-NNTPLHCLAHSLNPSEQW-------------LKQDPNRV

Query:  TPHKDIEVT--RERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYVSLPHLHAVKGVGAH-----------------------DL
         P     +T   +   ++  + DF    +I  R   D   WW  HG+  + LQ +A ++       H    VG                         DL
Subjt:  TPHKDIEVT--RERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYVSLPHLHAVKGVGAH-----------------------DL

Query:  VYIHSNLHLLSRKSPEYSHGETKLWDIAGDFFEEAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVMEDFLEEGGAFSEEE
         Y+H NL L  ++  +  H E +          +   L    L   E + EE +  +D +  E +  + E+E+  + ++E G    E E
Subjt:  VYIHSNLHLLSRKSPEYSHGETKLWDIAGDFFEEAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVMEDFLEEGGAFSEEE

AT4G08267.1 hAT transposon superfamily protein2.0e-1846.88Show/hide
Query:  LITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVK-NVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVA
        ++T+NA N   +  +I ++F TI WTPCVVHTLNLALKN CA   +  NN +VY+ C WI  I+ +V  +KN IMN+ +R+ +F E   L+LL+++
Subjt:  LITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVK-NVENNLLVYEKCSWISNIASDVMTLKNFIMNHSMRVAIFNEFVPLRLLSVA

AT4G15020.1 hAT transposon superfamily3.2e-1322.31Show/hide
Query:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIAS
        +FLK+++ S  +     +  L  E++ EVG  NV+Q+IT        A + +   +P++ W PC  H ++  L+              + K  WIS    
Subjt:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIAS

Query:  DVMTLKNFIMNHSMRVAIFNEF-----VPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM------------------VKGSILDDVWWDKIDYILSITS
            +  F+ NHS  + +  +F     + L   S + T FA+    L R   +K  LQAM                  V  ++ D+ +W  +  +  +TS
Subjt:  DVMTLKNFIMNHSMRVAIFNEF-----VPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM------------------VKGSILDDVWWDKIDYILSITS

Query:  PTYEMIWI-CDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRW--NKNNTPLHCLAHSLNPSEQWLKQDP------------
        P    + I C         VY      + + K AI  H   R D       +++  +IDRW   + + PL      LNP   +   +             
Subjt:  PTYEMIWI-CDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRW--NKNNTPLHCLAHSLNPSEQWLKQDP------------

Query:  -NRVTPHKDIE--VTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV
          R+ P   I+  + +E   + T    F    +I AR      +WW+ +G   + L   A ++
Subjt:  -NRVTPHKDIE--VTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV

AT4G15020.2 hAT transposon superfamily3.2e-1322.31Show/hide
Query:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIAS
        +FLK+++ S  +     +  L  E++ EVG  NV+Q+IT        A + +   +P++ W PC  H ++  L+              + K  WIS    
Subjt:  MFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIAS

Query:  DVMTLKNFIMNHSMRVAIFNEF-----VPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM------------------VKGSILDDVWWDKIDYILSITS
            +  F+ NHS  + +  +F     + L   S + T FA+    L R   +K  LQAM                  V  ++ D+ +W  +  +  +TS
Subjt:  DVMTLKNFIMNHSMRVAIFNEF-----VPLRLLSVAETRFASIIIMLKRFKLIKGGLQAM------------------VKGSILDDVWWDKIDYILSITS

Query:  PTYEMIWI-CDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRW--NKNNTPLHCLAHSLNPSEQWLKQDP------------
        P    + I C         VY      + + K AI  H   R D       +++  +IDRW   + + PL      LNP   +   +             
Subjt:  PTYEMIWI-CDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKRGDEDSSFYDVVHNILIDRW--NKNNTPLHCLAHSLNPSEQWLKQDP------------

Query:  -NRVTPHKDIE--VTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV
          R+ P   I+  + +E   + T    F    +I AR      +WW+ +G   + L   A ++
Subjt:  -NRVTPHKDIE--VTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTTCCTCTTCATCTACATCTAGTAATAGTGGTCAATCATCAACAATTCCTTCAATAGGAGCTACAATCTCATCCTCTAGTGTTGAAGATGAATCAAAACAACT
TTGGCAATATGTGACCAAGAATCAAATATTAAACGAAGGAGGCTTAGGTGGTTATGGAATTGGAGTGTGTAGTAAAGTTACCTCTCAAGATAAAGCCAACATGCAAAGAT
TAGAAGATGAGATGCAAGATCGTATGGCTAAAAAGACCCCTAGAAACATTCCTTTACCACCTTCGTTTGTACCTTCTGAGATCATAATTTTTAAGGCAATCACAGATGGT
GATCCAATGTTTCTAAAAGCGATGGAATGCTCAGGTGAAGTCAAAGATAAATATTTTATTGCAAATCTGCCGAAGGAAGTGATTATTGAAGTTGGCCCTCAAAATGTAAT
TCAATTGATTACTGATAATGCTCCTAATTGCAAGGGTGCAGAGCAAATTATTGAATCACAATTTCCGACAATTGTATGGACACCATGTGTAGTACATACTCTTAATCTTG
CCTTGAAGAATATATGTGCAGTGAAAAATGTTGAAAACAATTTGCTTGTCTATGAGAAATGTAGTTGGATTTCTAATATTGCTAGTGATGTGATGACGCTGAAGAATTTT
ATTATGAATCATTCAATGAGGGTTGCTATTTTTAATGAGTTTGTACCTTTGAGATTACTTTCTGTGGCAGAAACACGTTTTGCATCAATCATTATCATGCTTAAAAGGTT
CAAGCTTATTAAAGGTGGGTTGCAAGCTATGGTGAAAGGATCGATCTTGGATGATGTTTGGTGGGATAAAATTGATTATATCCTTTCCATCACTTCACCTACATATGAAA
TGATTTGGATTTGTGATACGAATACAATTACCTTTCATTTGGTTTATGACATGTGGGATACTATGATTGAAAAGGTGAAGACTGCAATATATAGGCATGAAGGAAAACGT
GGAGATGAAGACTCTTCTTTTTATGATGTTGTCCATAATATTCTCATTGATCGTTGGAACAAGAATAATACTCCACTTCATTGTTTGGCCCATTCTTTGAACCCAAGTGA
GCAATGGCTTAAGCAAGATCCTAATCGAGTAACTCCACATAAAGACATTGAAGTAACTCGTGAGAGAATGAAATTTTCAACGATGAGAAAAGATTTTAATGATGTTTATT
CTATAATAGCTAGGTACAATGAGGATGCAAAGGATTGGTGGGCTATGCATGGTGTCTATGCAGTAATGCTCCAATCAATTGCTTTTAAGGTATATGTCAGTCTTCCTCAT
CTTCATGCTGTGAAAGGAGTTGGCGCACATGATCTAGTATATATCCATAGTAATTTACATCTTTTGTCAAGAAAAAGTCCAGAATATTCACATGGAGAGACAAAATTATG
GGATATTGCTGGAGATTTTTTTGAAGAAGCTGGGATGCTTCAAGTGACTAATTTATCATTAGATGAACCAGATTTAGAGGAAGTAGTTTTCACTGATGATGGGAGTGGCA
ATGAGGCTAACAATTGTGAAGCTGAAGATGAAGATGTTATGGAAGATTTTCTAGAAGAGGGGGGTGCTTTCTCTGAAGAAGAAAGAAAGGAAAGGGAAGCTTGTAAGGCT
GCTTTAGCTGAGACTGTTTTCAAAGAGCAGCGTGCTTGGGTTCAAAAAAGTAAAATTCAGAGCAAAATTGTTATTTCTATGTTGGAAAACAGTGAGGGGCGTATTCTCCT
TAGAGATGATGAAATTGTTGATGAAATTGTTGGCTTTTACACCAGCTTACACAAGAAAGATGATAGTACGCATTTTACCTTTGATGGTCTGGATTGGAGACCTCTGGTCA
TTCAAGCTAGCTTAGCTTTGGAAGCCCCTTTTGAAGAGGCTGAAATTTGGGCTGTAGTCCAAAACTTGGGGAATCTAAAGTTGCCTGGGCTGGATGTCATGACTGTCTAT
CAGTATAGACCTATTAGCTTGGTGACTTCTCTTTACAAAATCATTCCCAAGGTGCTAACCAAGAGATTGAAAGAAGTCCTTCCTTCAATTATTCATGACTCACAAATGGC
ATTTGGGGAAGGGAGGCAGATTCTTGATGCTATTTTGACAGCTTCAGGATCTTTAGAAGAATGGAAGGTGGCTGGCAAGAAGGGTGCTCTCCTTAAGCTTGACCTTGAAA
AGGCATATGGTAAGATGGAGAAAGTGGATTGGCGATGCCTATCTTCTACCAACTTCTCAATTATTATAAATGGTAGATCAGGAGGGAAAATTATGGCTAAAAGAGGCCTT
CAACAAGGTGACCCCCTCTCACCCTTTCTTTTCACAATCGTTGGTGATGCGTTCAGCAATCTTGTTCGGTATTGTAATGAAAGAAGAGTGATTGAAGGCTTCTCATTTGG
CAGCCTGTCGACTAAGGTTACACATCTTCAATATGTGGATGATACTTTAATCTTCTGCTCTTGGAGTGATGGGAATTTGATGGCGTGGTGGTCTATTGTCAATTTATTCT
TGTTGGGATCGGGCCTTTCCCTAAATGTCTCCAAAATGTCTTTTATTGGGGTCAATCTTAGCTTTGAAGAGACTGATTTTAATGCGTCCATTTTGGGTTGCAAAGTTAAT
AACTTTCCCTTTAATTACTTGGGGTTTCCCATTGGTGGAAAGCATGCTACTAAGGTGGCATGTGAAGCTCTTGAGGAGAAGGGGAGATTAACTCTTGCTCAGTCAGTTCT
CAATAGCTTCTCATGTTATTTTTTCTCCTTGATGCAAGCCCTGAAGAGTTATATTAATCATTTGGAGAGATTGTTAGAGACTATGTGGGGAATGAAGAAAAATAGCGCTC
TTTTAATGAAATGGTTGTGGAGATTTTTCCATGGAGAGAACGCCCTTTGGAGGCGTGTCACTAGTGCTATTTATGGGGAGGATGATCACGGTTGGAGTTCCCTCATGCCT
AGAGACAAAAGGAAGTTCAGGTTATGCAGTGGAGATGGTAGAAACATAAGATTCTGGGAAGATCGTTGGTGTGCTACTCAACCTCTTGCGACTTTGTACCATGATATTTA
CGTTATTTCCACTAAAAGGGAAGTTGTTGTCGCGGATTGTTGGAATGATCTGACCAAGCTTGGGATCTGGGTTGCTAAATTCATGGGTGGTAAACAGAAACAGCGACTGT
CCCTTGCTTATAGAAGTCTAAATACTCATGAGAAGCTTCAAAGGAAGTTCCTAATTTGGTCTCTCTCACCCTCCATTTGTTGTCTTCATCTTAGAGAGATGGAAACTGTG
GATCACTTGTTTTTACCTGAGTTCGCTTCTAAAGGGTGGAGCATTCTTCTTAGTACTTTTGGGTTGGCTAGTTGTCTCCCTAAGAAGATTGACAATTGGATGATGGATGG
TCTTGATGGTGGGAGCTTTTGTGGAAGAGGAAAAATTATTTGGAAATGTGCTATTCGTGCGCTTTTGGAGTGTCTGTGGAAAGAAAAGAACTGCAGGATGTTTGAAGATA
AGCTTGCTTCTTTTGATTATTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTTCCTCTTCATCTACATCTAGTAATAGTGGTCAATCATCAACAATTCCTTCAATAGGAGCTACAATCTCATCCTCTAGTGTTGAAGATGAATCAAAACAACT
TTGGCAATATGTGACCAAGAATCAAATATTAAACGAAGGAGGCTTAGGTGGTTATGGAATTGGAGTGTGTAGTAAAGTTACCTCTCAAGATAAAGCCAACATGCAAAGAT
TAGAAGATGAGATGCAAGATCGTATGGCTAAAAAGACCCCTAGAAACATTCCTTTACCACCTTCGTTTGTACCTTCTGAGATCATAATTTTTAAGGCAATCACAGATGGT
GATCCAATGTTTCTAAAAGCGATGGAATGCTCAGGTGAAGTCAAAGATAAATATTTTATTGCAAATCTGCCGAAGGAAGTGATTATTGAAGTTGGCCCTCAAAATGTAAT
TCAATTGATTACTGATAATGCTCCTAATTGCAAGGGTGCAGAGCAAATTATTGAATCACAATTTCCGACAATTGTATGGACACCATGTGTAGTACATACTCTTAATCTTG
CCTTGAAGAATATATGTGCAGTGAAAAATGTTGAAAACAATTTGCTTGTCTATGAGAAATGTAGTTGGATTTCTAATATTGCTAGTGATGTGATGACGCTGAAGAATTTT
ATTATGAATCATTCAATGAGGGTTGCTATTTTTAATGAGTTTGTACCTTTGAGATTACTTTCTGTGGCAGAAACACGTTTTGCATCAATCATTATCATGCTTAAAAGGTT
CAAGCTTATTAAAGGTGGGTTGCAAGCTATGGTGAAAGGATCGATCTTGGATGATGTTTGGTGGGATAAAATTGATTATATCCTTTCCATCACTTCACCTACATATGAAA
TGATTTGGATTTGTGATACGAATACAATTACCTTTCATTTGGTTTATGACATGTGGGATACTATGATTGAAAAGGTGAAGACTGCAATATATAGGCATGAAGGAAAACGT
GGAGATGAAGACTCTTCTTTTTATGATGTTGTCCATAATATTCTCATTGATCGTTGGAACAAGAATAATACTCCACTTCATTGTTTGGCCCATTCTTTGAACCCAAGTGA
GCAATGGCTTAAGCAAGATCCTAATCGAGTAACTCCACATAAAGACATTGAAGTAACTCGTGAGAGAATGAAATTTTCAACGATGAGAAAAGATTTTAATGATGTTTATT
CTATAATAGCTAGGTACAATGAGGATGCAAAGGATTGGTGGGCTATGCATGGTGTCTATGCAGTAATGCTCCAATCAATTGCTTTTAAGGTATATGTCAGTCTTCCTCAT
CTTCATGCTGTGAAAGGAGTTGGCGCACATGATCTAGTATATATCCATAGTAATTTACATCTTTTGTCAAGAAAAAGTCCAGAATATTCACATGGAGAGACAAAATTATG
GGATATTGCTGGAGATTTTTTTGAAGAAGCTGGGATGCTTCAAGTGACTAATTTATCATTAGATGAACCAGATTTAGAGGAAGTAGTTTTCACTGATGATGGGAGTGGCA
ATGAGGCTAACAATTGTGAAGCTGAAGATGAAGATGTTATGGAAGATTTTCTAGAAGAGGGGGGTGCTTTCTCTGAAGAAGAAAGAAAGGAAAGGGAAGCTTGTAAGGCT
GCTTTAGCTGAGACTGTTTTCAAAGAGCAGCGTGCTTGGGTTCAAAAAAGTAAAATTCAGAGCAAAATTGTTATTTCTATGTTGGAAAACAGTGAGGGGCGTATTCTCCT
TAGAGATGATGAAATTGTTGATGAAATTGTTGGCTTTTACACCAGCTTACACAAGAAAGATGATAGTACGCATTTTACCTTTGATGGTCTGGATTGGAGACCTCTGGTCA
TTCAAGCTAGCTTAGCTTTGGAAGCCCCTTTTGAAGAGGCTGAAATTTGGGCTGTAGTCCAAAACTTGGGGAATCTAAAGTTGCCTGGGCTGGATGTCATGACTGTCTAT
CAGTATAGACCTATTAGCTTGGTGACTTCTCTTTACAAAATCATTCCCAAGGTGCTAACCAAGAGATTGAAAGAAGTCCTTCCTTCAATTATTCATGACTCACAAATGGC
ATTTGGGGAAGGGAGGCAGATTCTTGATGCTATTTTGACAGCTTCAGGATCTTTAGAAGAATGGAAGGTGGCTGGCAAGAAGGGTGCTCTCCTTAAGCTTGACCTTGAAA
AGGCATATGGTAAGATGGAGAAAGTGGATTGGCGATGCCTATCTTCTACCAACTTCTCAATTATTATAAATGGTAGATCAGGAGGGAAAATTATGGCTAAAAGAGGCCTT
CAACAAGGTGACCCCCTCTCACCCTTTCTTTTCACAATCGTTGGTGATGCGTTCAGCAATCTTGTTCGGTATTGTAATGAAAGAAGAGTGATTGAAGGCTTCTCATTTGG
CAGCCTGTCGACTAAGGTTACACATCTTCAATATGTGGATGATACTTTAATCTTCTGCTCTTGGAGTGATGGGAATTTGATGGCGTGGTGGTCTATTGTCAATTTATTCT
TGTTGGGATCGGGCCTTTCCCTAAATGTCTCCAAAATGTCTTTTATTGGGGTCAATCTTAGCTTTGAAGAGACTGATTTTAATGCGTCCATTTTGGGTTGCAAAGTTAAT
AACTTTCCCTTTAATTACTTGGGGTTTCCCATTGGTGGAAAGCATGCTACTAAGGTGGCATGTGAAGCTCTTGAGGAGAAGGGGAGATTAACTCTTGCTCAGTCAGTTCT
CAATAGCTTCTCATGTTATTTTTTCTCCTTGATGCAAGCCCTGAAGAGTTATATTAATCATTTGGAGAGATTGTTAGAGACTATGTGGGGAATGAAGAAAAATAGCGCTC
TTTTAATGAAATGGTTGTGGAGATTTTTCCATGGAGAGAACGCCCTTTGGAGGCGTGTCACTAGTGCTATTTATGGGGAGGATGATCACGGTTGGAGTTCCCTCATGCCT
AGAGACAAAAGGAAGTTCAGGTTATGCAGTGGAGATGGTAGAAACATAAGATTCTGGGAAGATCGTTGGTGTGCTACTCAACCTCTTGCGACTTTGTACCATGATATTTA
CGTTATTTCCACTAAAAGGGAAGTTGTTGTCGCGGATTGTTGGAATGATCTGACCAAGCTTGGGATCTGGGTTGCTAAATTCATGGGTGGTAAACAGAAACAGCGACTGT
CCCTTGCTTATAGAAGTCTAAATACTCATGAGAAGCTTCAAAGGAAGTTCCTAATTTGGTCTCTCTCACCCTCCATTTGTTGTCTTCATCTTAGAGAGATGGAAACTGTG
GATCACTTGTTTTTACCTGAGTTCGCTTCTAAAGGGTGGAGCATTCTTCTTAGTACTTTTGGGTTGGCTAGTTGTCTCCCTAAGAAGATTGACAATTGGATGATGGATGG
TCTTGATGGTGGGAGCTTTTGTGGAAGAGGAAAAATTATTTGGAAATGTGCTATTCGTGCGCTTTTGGAGTGTCTGTGGAAAGAAAAGAACTGCAGGATGTTTGAAGATA
AGCTTGCTTCTTTTGATTATTTTTGA
Protein sequenceShow/hide protein sequence
MASSSSSTSSNSGQSSTIPSIGATISSSSVEDESKQLWQYVTKNQILNEGGLGGYGIGVCSKVTSQDKANMQRLEDEMQDRMAKKTPRNIPLPPSFVPSEIIIFKAITDG
DPMFLKAMECSGEVKDKYFIANLPKEVIIEVGPQNVIQLITDNAPNCKGAEQIIESQFPTIVWTPCVVHTLNLALKNICAVKNVENNLLVYEKCSWISNIASDVMTLKNF
IMNHSMRVAIFNEFVPLRLLSVAETRFASIIIMLKRFKLIKGGLQAMVKGSILDDVWWDKIDYILSITSPTYEMIWICDTNTITFHLVYDMWDTMIEKVKTAIYRHEGKR
GDEDSSFYDVVHNILIDRWNKNNTPLHCLAHSLNPSEQWLKQDPNRVTPHKDIEVTRERMKFSTMRKDFNDVYSIIARYNEDAKDWWAMHGVYAVMLQSIAFKVYVSLPH
LHAVKGVGAHDLVYIHSNLHLLSRKSPEYSHGETKLWDIAGDFFEEAGMLQVTNLSLDEPDLEEVVFTDDGSGNEANNCEAEDEDVMEDFLEEGGAFSEEERKEREACKA
ALAETVFKEQRAWVQKSKIQSKIVISMLENSEGRILLRDDEIVDEIVGFYTSLHKKDDSTHFTFDGLDWRPLVIQASLALEAPFEEAEIWAVVQNLGNLKLPGLDVMTVY
QYRPISLVTSLYKIIPKVLTKRLKEVLPSIIHDSQMAFGEGRQILDAILTASGSLEEWKVAGKKGALLKLDLEKAYGKMEKVDWRCLSSTNFSIIINGRSGGKIMAKRGL
QQGDPLSPFLFTIVGDAFSNLVRYCNERRVIEGFSFGSLSTKVTHLQYVDDTLIFCSWSDGNLMAWWSIVNLFLLGSGLSLNVSKMSFIGVNLSFEETDFNASILGCKVN
NFPFNYLGFPIGGKHATKVACEALEEKGRLTLAQSVLNSFSCYFFSLMQALKSYINHLERLLETMWGMKKNSALLMKWLWRFFHGENALWRRVTSAIYGEDDHGWSSLMP
RDKRKFRLCSGDGRNIRFWEDRWCATQPLATLYHDIYVISTKREVVVADCWNDLTKLGIWVAKFMGGKQKQRLSLAYRSLNTHEKLQRKFLIWSLSPSICCLHLREMETV
DHLFLPEFASKGWSILLSTFGLASCLPKKIDNWMMDGLDGGSFCGRGKIIWKCAIRALLECLWKEKNCRMFEDKLASFDYF