; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G10230 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G10230
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr7:8321345..8331293
RNA-Seq ExpressionCSPI07G10230
SyntenyCSPI07G10230
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]0.0e+0062.42Show/hide
Query:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK
        D ++E +R ++ +    + E + ++KMK+DLPSY GKR+IE+FLDWLK+TENFF+YM T + KKV LVALKLKGGASAWW+Q+  NRQ+  K P+RSWEK
Subjt:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK

Query:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM
        MKKL+K+RF+PPNYEQTLY QYQNCRQG R   EYIEEFHRL  RTNL E E+H I+ FVGGLRFD+KEKV+LQP + LSEAI+ AETVEEMI  +  + 
Subjt:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM

Query:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
         +R WEP+++KKT   N      ++++P+      GK+   +              K +N Y RP  G C+RCGQ GH SN CPQRKTIA+A + D+   
Subjt:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
            E +EETE IEADEGD +SC++ RVLI+PKEE   QRHSLFKTRCTI GKVC+VIID GSSENF++KK+V+ LNLK QPH  PYKIGW+KKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL
        +E+C VPLSIG+SYKDQ+VCDVI+MDVCH+LLGRPWQ D Q++H+GRENTYEF WM K++ LLPL K+ ++N   ++ +G LF T SGK  L+ER+ +IL
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL

Query:  AL-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK
         +                             KEP  LPPLRDI H+I+L+ GAS P+L HY M+P EY  LH+ IE+LLKKGHIKPS S C VPALLTPK
Subjt:  AL-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK

Query:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFL
        KDG+WRMCVDSRAIN+ITVKYRFPIPRV DLLDQLG A IFSKIDLRS YHQIRIRPGDEWKTAFKTNEGLFEW+                   VLHPFL
Subjt:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFL

Query:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK
        NKFI+VYFDDILV+S   D+HL H+ +LFQVL   ELY+N KKC F  +EI FLGFII+K  V M+ +KVEAI+ W  PTTV +VQAFLGLASFYRKFI+
Subjt:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK

Query:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRF----------FSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRAL
        N SSI AP+TDCLKKG F+W   QQ+SF  +K+            F   FEVAVD C TGIGAVLSQ+ HPIEY SE+LS +RQ+WSTYEQELYALVRAL
Subjt:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRF----------FSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRAL

Query:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK
        KQWEHYLLS+EF+LLTDHFSLKYL SQK+IS MHARWISY QRFDFVIKHQAGKENKVADALSRK +LL +LSSEI  F H+ ELY++D DF  IW K
Subjt:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]1.8e-28646.71Show/hide
Query:  EGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKVDLPSYSGKRDIESFLDWLKSTENFFSY
        +G +  H  N +P  +      +   +      +A DE     + G   RR   R   +H        +++K ++P++ G  DIE+ LDWL   + FF  
Subjt:  EGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKVDLPSYSGKRDIESFLDWLKSTENFFSY

Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        MD P+ ++V++VA KL+GGA AWW++ + NR+ + +RPV +W  MK+++K RFLPP+ EQ LY QY NC QG RTV EY  EF RL AR NL E ++   
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMA-PAYGKGKETDSQTATYEKKAEMINRSKNQN
        AR+V GL   I+EK+ L  +  + +A +LA   E M         R T E TS    +Y  K N    A P+      T+S+ +          +    N
Subjt:  ARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMA-PAYGKGKETDSQTATYEKKAEMINRSKNQN

Query:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLPGEGESEPEEETEEIEADEG--DRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDV
         Y +P   KCFRCG+PGH SN CP+R T+  ++   D + G+   + E++ E  E  +G  ++I+CVI R L +PK   S QR+ +F+T+C +  K+C +
Subjt:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLPGEGESEPEEETEEIEADEG--DRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDV

Query:  IIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG
        IIDGGS EN ++K +V    L  +PHPNPY+IGW+KKG    V E+C VPL+IG  Y + + CDV+DM+ CHVLLGRPWQHD    H+G+ N Y F W G
Subjt:  IIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG

Query:  KRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALKEPDGLPPLRDIQHHIDLIPGASLP
        K I +LPL   +         L T  S  K    ERK+                                 ++A   PD LPPLR+IQH IDL+PGASLP
Subjt:  KRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALKEPDGLPPLRDIQHHIDLIPGASLP

Query:  NLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIR
        NL HYRM+PKE   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKIDLRSGYHQIRI+
Subjt:  NLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIR

Query:  PGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGF
        PGDEWKTAFKT +GL+EW+                   VL PF+ KF+VVYFDDILVYS    EHL HL+K+ + LTE EL++N KKC FL +++ FLG+
Subjt:  PGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGF

Query:  IIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDA
        I+    + ++  KV+A+R+W +P T+ EV++F GLA+FYR+F++NFSSI AP+T+C+KKG FKWT   +ESF+ IK+R  ++P          FE+  DA
Subjt:  IIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDA

Query:  CCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKEN
        C TGIGAVLSQ G P+ + SEKL+ ARQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q++++++HARW S+L++F++VIKH++G  N
Subjt:  CCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKEN

Query:  KVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKT
        KVADALSRK +LL  +S++++ FE +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  LIKE H   +S+HLG  KT
Subjt:  KVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKT

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.7e-30956.24Show/hide
Query:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE
        W+   E +  Q  R    R ET+HD+KMK+DLP+Y+GKRDIESFLDW+K+TENFF YM  P++KKV LVALKLKGGASAW                    
Subjt:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE

Query:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK
                   P +Y Q +   Y+QYQNCRQG++ V EYIEEFHRL AR NLSENEQHQIARF+GGLRFDIKEKV+L   R LSEAISLAETVEEM+ ++
Subjt:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK

Query:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
            NRRT WE   +KK +Y  K+++QP      KGK  D Q     KK E + R K QNNYTRPSLGKCFRCG+PGHLSN+C QRKTIALA++ED    
Subjt:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
          + E EEETE IEAD+GDRISC++ RVLI PKEE +PQ HSLFKTRCTING                          KV PHP+PYKIGWVKKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA
        NE+CT+PLSIG+SYKDQIVCDVI+MDVCH+LLGRPWQHDTQTLH+GRENTYEF WMGK++ LLPL KKN E+   K + QLF T SGK LLKER+QD+L 
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA

Query:  L-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK
        L                             KEP GLPPLRDIQH IDL+P ASLPNL HYRM+P+EY  LH+HIEDLLKKGHIKPSLSPCAVPALLTP K
Subjt:  L-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK

Query:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPFLNKFIVVYFDDILVYSSGNDE
        DGSWRMCVDSRAINR+T KYRFPIPR+GDLLDQLGKA IFSKIDLR+GYHQI+IRPGDEWKTAFKTNEGLFE                       S  ++
Subjt:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPFLNKFIVVYFDDILVYSSGNDE

Query:  HLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKW
        HL +L+KLF+VLTE ELYIN KKC +L  EI FLGF+IK+G++ MEP+K+EAI+    PT++KEVQAFLGLASFYR+FI+NFS I APLTD         
Subjt:  HLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKW

Query:  TSFQQESFEEIKKRFFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
                      +F+SPFEVAV+AC TGIGAVLSQ+GHPIEY SEKLST+RQ+WSTYEQELYALVRALKQWEHYLLS +F                  
Subjt:  TSFQQESFEEIKKRFFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
                                                                                      HI+EG LFKG+QLCIPHTSLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHMLLISSHLGTIKTRDPINK
        AL+KEAH   ++ H    KT + I+K
Subjt:  ALIKEAHMLLISSHLGTIKTRDPINK

XP_024932991.1 uncharacterized protein LOC112492819 [Ziziphus jujuba]2.6e-28549.03Show/hide
Query:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ
        D+++KVD+P++ G  +IE FLDW+++ E+FF YM  PE K+VRLVA K +GGASAWWEQ+ +NR++  K P++SW +++++L+ RFLP ++EQ LY QY 
Subjt:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ

Query:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQP
        +C QG R+++E  EEF+RLSAR NL+ENE   +AR+V GL   I+E++ L P+  LSEA++LA  +E+ I  +  T     W+P S     Y  K     
Subjt:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQP

Query:  MAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRV
         A  Y K    D+      K     NR  N   Y R    KCF+CGQ GH SN CP RK + + + +D+  GE  +   +ETE ++ D+G+ + C+I ++
Subjt:  MAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRV

Query:  LIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVC
        L +PK     QRHS+FKT+CTIN KVC+VIID GSSEN ++K +V  L L    HPNPYK+GW+KKG E+ V E+C V  SIG  Y D++VCDV++MD C
Subjt:  LIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVC

Query:  HVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE-----------ENSKTRGQLFTTCSGKTLLKERK---------------------QDI
        H+LLGRPWQ D      GR+NT+ F W GK+I LLP   +N+           E    +G +  T SGK    + K                     Q I
Subjt:  HVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE-----------ENSKTRGQLFTTCSGKTLLKERK---------------------QDI

Query:  LALKE----------PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRIT
        L L +          PD LPP+RDIQH IDL+PGA LPNL HYRM PKE   L + +EDLLKK  I+ SLSPCAVPALL PKK+G WRMC+DSRAIN+IT
Subjt:  LALKE----------PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRIT

Query:  VKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEW-------------------MVLHPFLNKFIVVYFDDILVYSSGN
         KYRFPIPR+ D+LD+L  A +FSK+DLRSGYHQIRIRP DEWKTAFKT  GL+EW                    VL P +   +VVYFDDILVYS   
Subjt:  VKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEW-------------------MVLHPFLNKFIVVYFDDILVYSSGN

Query:  DEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNF
        + H+ H++++F +L E  LY+N KKC F+  E+ FLGFI+ K  +  +P KV AIR+W  P+TV EV++F GLA+FYR+F++NFSSI  PLTDCLKKG F
Subjt:  DEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNF

Query:  KWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDH
        +W   QQ+SFE +KK    +P          FEV  DA  TGIGAVL Q    IE+ SEKL  A Q WS YEQELYA+VRA K WEHYL+ +EFVL  DH
Subjt:  KWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDH

Query:  FSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKG
         +L++  SQK+I++MH +W  +LQ+F +   H+AGK N+VADALSRK  LL  L  E++ F+ L + Y  D D   IW KC  H  A  +HI +G+LFKG
Subjt:  FSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKG

Query:  DQLCIPHTSLREALIKEAHMLLISSHLGTIKT
         QLCIP TSLR  LI++ H   +++H G  KT
Subjt:  DQLCIPHTSLREALIKEAHMLLISSHLGTIKT

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia]2.0e-29049.9Show/hide
Query:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ
        +FK+K+DLP ++G   +ESFLDWL   ENFF YM  PE ++V+LVA KL+GGASAWWEQ + NR+R  K+PVR W KMK+L++ RFLPP+YEQ LY QYQ
Subjt:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ

Query:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMI------ALKTNTMNRRTWEPTSNKKTNYAN
        NCRQG RT+ EY EEF+RL++R NL+E E  Q+AR++GGLR  I++KV L  +  LSEA++LA  +E  +      A       +    PT +   ++ +
Subjt:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMI------ALKTNTMNRRTWEPTSNKKTNYAN

Query:  KSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRIS
         S DQ     Y   K     T T            +QN Y +P  GKCFRC QPGH SN CP RK++ L D +D    E + E EE+ E +E DEGD ++
Subjt:  KSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRIS

Query:  CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDV
        CVI R+L+APK+E   QRH +FKTRCT+N KVC++IID GS EN +++ +VS L L  + HP PYKI W+KKG E+ V   C +P SIG  Y D + CDV
Subjt:  CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDV

Query:  IDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLP-----------------LTKKNEE---NSKTRGQLFT-TCSGKT------------LL
        ++MD CHVLLGRPWQ+D    +KGR+NTY F W  +++ LLP                 LT   ++   ++K  G++ T    G+              L
Subjt:  IDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLP-----------------LTKKNEE---NSKTRGQLFT-TCSGKT------------LL

Query:  KERKQDILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKY
         E   DI   + P GLPPLRDIQH IDL+PG SLPNL HYRM+P E+  L + +EDL++KG I+ S+SPCAVPALL PKKDGSWRMCVDSRAIN+ITVKY
Subjt:  KERKQDILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKY

Query:  RFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMV-------------------LHPFLNKFIVVYFDDILVYSSGNDEH
        RFPIPR+ D+LD L  + +FSK+DLRSGYHQIR+RPGDEWKTAFKT EGL+EW+V                   L PF+ KF+VVYFDDIL+YS    +H
Subjt:  RFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMV-------------------LHPFLNKFIVVYFDDILVYSSGNDEH

Query:  LLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT
        + HL+++  VL E +LYIN KKC FL S + FLGF++ +  V ++  KV  IREW  P+T+ +V++F GLA+FYR+FI+NFS++ AP+T+C+KKG   W 
Subjt:  LLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT

Query:  SFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSL
          Q  SF  IK++  ++P          FEV  DA   GIGAVLSQ G P+E+ SEKL+ AR+ W+ YE E YA++RALK WEHYL+ +EFVL +DH +L
Subjt:  SFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSL

Query:  KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFK
        K++ +Q +++RMHARW++++Q+F  V+K+++G+ NKVADALSR+G LL  L +EI  FE L +LY  D DFA+ W +C     A  +HI + +LF+
Subjt:  KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFK

TrEMBL top hitse value%identityAlignment
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein8.6e-28746.71Show/hide
Query:  EGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKVDLPSYSGKRDIESFLDWLKSTENFFSY
        +G +  H  N +P  +      +   +      +A DE     + G   RR   R   +H        +++K ++P++ G  DIE+ LDWL   + FF  
Subjt:  EGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKVDLPSYSGKRDIESFLDWLKSTENFFSY

Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        MD P+ ++V++VA KL+GGA AWW++ + NR+ + +RPV +W  MK+++K RFLPP+ EQ LY QY NC QG RTV EY  EF RL AR NL E ++   
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMA-PAYGKGKETDSQTATYEKKAEMINRSKNQN
        AR+V GL   I+EK+ L  +  + +A +LA   E M         R T E TS    +Y  K N    A P+      T+S+ +          +    N
Subjt:  ARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMA-PAYGKGKETDSQTATYEKKAEMINRSKNQN

Query:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLPGEGESEPEEETEEIEADEG--DRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDV
         Y +P   KCFRCG+PGH SN CP+R T+  ++   D + G+   + E++ E  E  +G  ++I+CVI R L +PK   S QR+ +F+T+C +  K+C +
Subjt:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLPGEGESEPEEETEEIEADEG--DRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDV

Query:  IIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG
        IIDGGS EN ++K +V    L  +PHPNPY+IGW+KKG    V E+C VPL+IG  Y + + CDV+DM+ CHVLLGRPWQHD    H+G+ N Y F W G
Subjt:  IIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG

Query:  KRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALKEPDGLPPLRDIQHHIDLIPGASLP
        K I +LPL   +         L T  S  K    ERK+                                 ++A   PD LPPLR+IQH IDL+PGASLP
Subjt:  KRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALKEPDGLPPLRDIQHHIDLIPGASLP

Query:  NLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIR
        NL HYRM+PKE   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKIDLRSGYHQIRI+
Subjt:  NLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIR

Query:  PGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGF
        PGDEWKTAFKT +GL+EW+                   VL PF+ KF+VVYFDDILVYS    EHL HL+K+ + LTE EL++N KKC FL +++ FLG+
Subjt:  PGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGF

Query:  IIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDA
        I+    + ++  KV+A+R+W +P T+ EV++F GLA+FYR+F++NFSSI AP+T+C+KKG FKWT   +ESF+ IK+R  ++P          FE+  DA
Subjt:  IIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDA

Query:  CCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKEN
        C TGIGAVLSQ G P+ + SEKL+ ARQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q++++++HARW S+L++F++VIKH++G  N
Subjt:  CCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKEN

Query:  KVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKT
        KVADALSRK +LL  +S++++ FE +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  LIKE H   +S+HLG  KT
Subjt:  KVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKT

A0A5B7BER3 Uncharacterized protein0.0e+0051.73Show/hide
Query:  RAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQT
        R +   +++MK+DLPS++G   IESFLDW+   E FF  M+  + K+V+LVA KLKGGASAWW+Q++ NR+R  K+PVR+W+KM++LL+ERFLP +YEQ 
Subjt:  RAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQT

Query:  LYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVE-EMIALKTNTMNRRTWEPTSNKKTNYA
        LY QYQNCRQG R+V+EY +EF+ LS+R NL+E E  Q+AR+VGGLR  I++++ L+ +  L+EA SLA  VE +       + N     P S++     
Subjt:  LYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVE-EMIALKTNTMNRRTWEPTSNKKTNYA

Query:  NKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLP---GEGESEPEEE---TEEIE
        +K   + + P   K    D  +++  +   +    K+ N Y RP  GKCFRC QPGH SN CP R+ + +    EDN P    E E+E ++E    E  E
Subjt:  NKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLP---GEGESEPEEE---TEEIE

Query:  ADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSY
         DEG+ +SCV+ R+L+ PK+E  PQRH++F+TRCTIN KVCDVIID GSSEN ++K +V  L LK + HPNPYKIGW+KKG E+ V E+C VP SIG  Y
Subjt:  ADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSY

Query:  KDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE--ENSKTRGQLFTTCSGKTLLKERK-------------------
        KD++ CD++DMD CHVLLGRPWQ D    HKG++NTY F W  K++ L+P  K +   + SK  G+   T +G   +++ K                   
Subjt:  KDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE--ENSKTRGQLFTTCSGKTLLKERK-------------------

Query:  -------------QDILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSR
                     QDI   + PD LPP+RDIQHHIDL+PGASLPNL HYRM+PKE   L + +EDL+ KG I+ S+SPCAVPALLTPKKDGSWRMCVDSR
Subjt:  -------------QDILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSR

Query:  AINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDIL
        AIN+ITVKYRFPIPR+ D+LD L  + IFSKIDLRSGYHQIRIRPGDEWKTAFKT EGL+EW+                   VL PF+ KF+VVYFDDIL
Subjt:  AINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDIL

Query:  VYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDC
        +YS    EHL H++++   L E +LYIN KKC FL + + FLGFII    + ++  KV AIR+W  P TV ++++F GLA+FYR+FI+NFSSI AP+TDC
Subjt:  VYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDC

Query:  LKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEF
        +KKG F+W   Q+ SF  IK++  ++P          F+V  DA  TGIGAVLSQ G P+E+ SEKL+ ARQ W+TYE EL+A+VRALK WEHYL+ +EF
Subjt:  LKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEF

Query:  VLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILE
        V+ +DH +LK++ +Q ++SRMH RWI++LQRF FV+KH+AG++NKVADALSR+ +LL ++SSEI +FE L ELY+ D DF   W KC     +  +HI +
Subjt:  VLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILE

Query:  GFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKT
        G+LFKG+QLCIP TSLRE ++++ H   +  HLG  KT
Subjt:  GFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKT

A0A5D3DGR0 Reverse transcriptase0.0e+0062.42Show/hide
Query:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK
        D ++E +R ++ +    + E + ++KMK+DLPSY GKR+IE+FLDWLK+TENFF+YM T + KKV LVALKLKGGASAWW+Q+  NRQ+  K P+RSWEK
Subjt:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK

Query:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM
        MKKL+K+RF+PPNYEQTLY QYQNCRQG R   EYIEEFHRL  RTNL E E+H I+ FVGGLRFD+KEKV+LQP + LSEAI+ AETVEEMI  +  + 
Subjt:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM

Query:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
         +R WEP+++KKT   N      ++++P+      GK+   +              K +N Y RP  G C+RCGQ GH SN CPQRKTIA+A + D+   
Subjt:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
            E +EETE IEADEGD +SC++ RVLI+PKEE   QRHSLFKTRCTI GKVC+VIID GSSENF++KK+V+ LNLK QPH  PYKIGW+KKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL
        +E+C VPLSIG+SYKDQ+VCDVI+MDVCH+LLGRPWQ D Q++H+GRENTYEF WM K++ LLPL K+ ++N   ++ +G LF T SGK  L+ER+ +IL
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL

Query:  AL-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK
         +                             KEP  LPPLRDI H+I+L+ GAS P+L HY M+P EY  LH+ IE+LLKKGHIKPS S C VPALLTPK
Subjt:  AL-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK

Query:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFL
        KDG+WRMCVDSRAIN+ITVKYRFPIPRV DLLDQLG A IFSKIDLRS YHQIRIRPGDEWKTAFKTNEGLFEW+                   VLHPFL
Subjt:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFL

Query:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK
        NKFI+VYFDDILV+S   D+HL H+ +LFQVL   ELY+N KKC F  +EI FLGFII+K  V M+ +KVEAI+ W  PTTV +VQAFLGLASFYRKFI+
Subjt:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK

Query:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRF----------FSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRAL
        N SSI AP+TDCLKKG F+W   QQ+SF  +K+            F   FEVAVD C TGIGAVLSQ+ HPIEY SE+LS +RQ+WSTYEQELYALVRAL
Subjt:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRF----------FSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRAL

Query:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK
        KQWEHYLLS+EF+LLTDHFSLKYL SQK+IS MHARWISY QRFDFVIKHQAGKENKVADALSRK +LL +LSSEI  F H+ ELY++D DF  IW K
Subjt:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X18.0e-31056.24Show/hide
Query:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE
        W+   E +  Q  R    R ET+HD+KMK+DLP+Y+GKRDIESFLDW+K+TENFF YM  P++KKV LVALKLKGGASAW                    
Subjt:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE

Query:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK
                   P +Y Q +   Y+QYQNCRQG++ V EYIEEFHRL AR NLSENEQHQIARF+GGLRFDIKEKV+L   R LSEAISLAETVEEM+ ++
Subjt:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK

Query:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
            NRRT WE   +KK +Y  K+++QP      KGK  D Q     KK E + R K QNNYTRPSLGKCFRCG+PGHLSN+C QRKTIALA++ED    
Subjt:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
          + E EEETE IEAD+GDRISC++ RVLI PKEE +PQ HSLFKTRCTING                          KV PHP+PYKIGWVKKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA
        NE+CT+PLSIG+SYKDQIVCDVI+MDVCH+LLGRPWQHDTQTLH+GRENTYEF WMGK++ LLPL KKN E+   K + QLF T SGK LLKER+QD+L 
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA

Query:  L-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK
        L                             KEP GLPPLRDIQH IDL+P ASLPNL HYRM+P+EY  LH+HIEDLLKKGHIKPSLSPCAVPALLTP K
Subjt:  L-----------------------------KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK

Query:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPFLNKFIVVYFDDILVYSSGNDE
        DGSWRMCVDSRAINR+T KYRFPIPR+GDLLDQLGKA IFSKIDLR+GYHQI+IRPGDEWKTAFKTNEGLFE                       S  ++
Subjt:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPFLNKFIVVYFDDILVYSSGNDE

Query:  HLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKW
        HL +L+KLF+VLTE ELYIN KKC +L  EI FLGF+IK+G++ MEP+K+EAI+    PT++KEVQAFLGLASFYR+FI+NFS I APLTD         
Subjt:  HLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKW

Query:  TSFQQESFEEIKKRFFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
                      +F+SPFEVAV+AC TGIGAVLSQ+GHPIEY SEKLST+RQ+WSTYEQELYALVRALKQWEHYLLS +F                  
Subjt:  TSFQQESFEEIKKRFFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
                                                                                      HI+EG LFKG+QLCIPHTSLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHMLLISSHLGTIKTRDPINK
        AL+KEAH   ++ H    KT + I+K
Subjt:  ALIKEAHMLLISSHLGTIKTRDPINK

A0A6N2LVR1 Uncharacterized protein3.1e-28948.98Show/hide
Query:  FKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQN
        F+MK+DLPS++G+  IE FLDWL   E FF YM+ PE KKV+LVA +L GGASAWWEQL+  R R  K  V+SW KM++LL+ R+LPP+YEQ L+ QYQN
Subjt:  FKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQN

Query:  CRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPM
        C+QG R V  Y+EEFHRLS+R NL E +  Q+ARFVGGLR++I+++V +  +  L+EAI+LA   E  +   TNT  R  ++PT N         N    
Subjt:  CRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPM

Query:  APAYGKGKETDSQTATYEKKAEM-INRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD--EEDNLPGEGESEPEE----ETEEIEADEGDRIS
        + +Y     + S+T   +    + +     +N Y+RP+  KC+RCGQ GH SN+CP+R  + L +  EE ++ GEG    ++    E E    DEG+ +S
Subjt:  APAYGKGKETDSQTATYEKKAEM-INRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD--EEDNLPGEGESEPEE----ETEEIEADEGDRIS

Query:  --CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVC
           V+ R+++APK E   QR+++F+TRCT+N KVCDVIID GSSEN I+K +V+ L LK + H  PYKIGW+KKG E+ V E C    SIG +Y D+IVC
Subjt:  --CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVC

Query:  DVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKK-NEENSKTRGQLFTTCSGKTLLKERKQD------------------------
        DV++MD CHV+LGRPWQ+D    +KG++N Y F   G+++ L PL +       + + +      G+T L +  +D                        
Subjt:  DVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKK-NEENSKTRGQLFTTCSGKTLLKERKQD------------------------

Query:  --------ILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITV
                I+  + P+GLPP+RDIQHHIDLIPGASLPN  HYRM+P+E A L   +E+L+KKG ++ S+SPCAVPALL PKKDGSWRMC+DSRAIN+IT+
Subjt:  --------ILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITV

Query:  KYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGND
        KYRFPIPR+ D+LD L  + IFSKIDLRSGYHQIRIRPGDEWKTAFKT EGL+EW+                   VL PF   F+VVYFDDIL+YS    
Subjt:  KYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGND

Query:  EHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFK
        +H+ HL+++F VL   +L++N  KC F+ S + FLGF++    + ++  KV AIR+W  P  + EV++F GLA+FYR+F+++FS I AP+T+C+KKG F 
Subjt:  EHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFK

Query:  WTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHF
        W    + SF  IK++  S+P          FEV  DA   GIGAVLSQ   P+ + SEKLS AR+ WSTYE ELYA+ RA+K WEHYL+ +EF+L +DH 
Subjt:  WTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHF

Query:  SLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGD
        +LK++ +Q N++RMHARW++++QRF+F +KH++G+ NKVADALSRK SLLT L +E+I FE + +LY  D DF + W KC   L  EG H  +G+LF+G+
Subjt:  SLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGD

Query:  QLCIPHTSLREALIKEAHMLLISSHLGTIKT
        QLCIP +SLRE +I E H   +  HLG  KT
Subjt:  QLCIPHTSLREALIKEAHMLLISSHLGTIKT

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.0e-7438.6Show/hide
Query:  SLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGS-----WRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRS
        +LP  + Y         +   I+D+L +G I+ S SP   P  + PKK  +     +R+ +D R +N ITV  R PIP + ++L +LG+   F+ IDL  
Subjt:  SLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGS-----WRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRS

Query:  GYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLK
        G+HQI + P    KTAF T  G +E++                   +L P LNK  +VY DDI+V+S+  DEHL  L  +F+ L +  L +   KCEFLK
Subjt:  GYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLK

Query:  SEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQE---SFEEIKKRF----------
         E TFLG ++    +   P K+EAI+++  PT  KE++AFLGL  +YRKFI NF+ I  P+T CLKK N K  +   E   +F+++K             
Subjt:  SEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQE---SFEEIKKRF----------

Query:  FSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFD
        F+  F +  DA    +GAVLSQ GHP+ Y+S  L+     +ST E+EL A+V A K + HYLL + F + +DH  L +L   K+ +    RW   L  FD
Subjt:  FSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFD

Query:  FVIKHQAGKENKVADALSRKGSLLTLLSSE
        F IK+  GKEN VADALSR     T LS +
Subjt:  FVIKHQAGKENKVADALSRKGSLLTLLSSE

P0CT41 Transposon Tf2-12 polyprotein3.5e-6733.78Show/hide
Query:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP
        +DI A    + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + +N+      +P
Subjt:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP

Query:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVL-----------HPFLN--------KFIVVYFDDILVYSSGNDEHLLH
        +P +  LL ++  +TIF+K+DL+S YH IR+R GDE K AF+   G+FE++V+             F+N          +V Y DDIL++S    EH+ H
Subjt:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVL-----------HPFLN--------KFIVVYFDDILVYSSGNDEHLLH

Query:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF
        +K + Q L    L INQ KCEF +S++ F+G+ I +   +     ++ + +W  P   KE++ FLG  ++ RKFI   S +  PL + LKK   +KWT  
Subjt:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF

Query:  QQESFEEIKKRF----------FSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLS--KEFVLLT
        Q ++ E IK+            FS    +  DA    +GAVLSQ+      +P+ Y S K+S A+  +S  ++E+ A++++LK W HYL S  + F +LT
Subjt:  QQESFEEIKKRF----------FSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLS--KEFVLLT

Query:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR
        DH +L  +     +  ++  ARW  +LQ F+F I ++ G  N +ADALSR
Subjt:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.7e-6935.21Show/hide
Query:  LALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKD-----GSWRMCVDSRAINRITVKYRF
        L  KE + L     I+H ++    + + +   Y +       +   ++++L +G I+ S SP   P  + PKK        +R+ +D R +N IT+  R+
Subjt:  LALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKD-----GSWRMCVDSRAINRITVKYRF

Query:  PIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLL
        PIP + ++L +LGK   F+ IDL  G+HQI +      KTAF T  G +E++                   +L P LNK  +VY DDI+++S+   EHL 
Subjt:  PIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------VLHPFLNKFIVVYFDDILVYSSGNDEHLL

Query:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSF
         ++ +F  L +  L +   KCEFLK E  FLG I+    +   P KV+AI  +  PT  KE++AFLGL  +YRKFI N++ I  P+T CLKK     T  
Subjt:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSF

Query:  QQ--ESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSL
         +  E+FE++K      P          F +  DA    +GAVLSQ GHPI ++S  L+     +S  E+EL A+V A K + HYLL ++F++ +DH  L
Subjt:  QQ--ESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSL

Query:  KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR
        ++L + K       RW   L  + F I +  GKEN VADALSR
Subjt:  KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.7e-7834.51Show/hide
Query:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP
        GK   ++   +  E N+       T C+    L+++ ++I+    P     + +I  +H I++ PGA LP L  Y +T K    +++ ++ LL    I P
Subjt:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP

Query:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPF--LN--
        S SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +DL SGYHQI + P D +KTAF T  G +E+ V+ PF  +N  
Subjt:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPF--LN--

Query:  --------------KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQA
                      +F+ VY DDIL++S   +EH  HL  + + L  + L + +KKC+F   E  FLG+ I   +++    K  AIR++  P TVK+ Q 
Subjt:  --------------KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQA

Query:  FLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHP------IEYLSEKL
        FLG+ ++YR+FI N S I  P  L  C K    +WT  Q ++ E++K    +SP          + +  DA   GIGAVL +  +       + Y S+ L
Subjt:  FLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHP------IEYLSEKL

Query:  STARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAF
         +A++ +   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ AG +N VADA+SR    +T  +S  I  
Subjt:  STARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAF

Query:  EHLPELYERD
        E     Y+ D
Subjt:  EHLPELYERD

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.7e-7734.31Show/hide
Query:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP
        GK   ++   +  E N+       T C+    L+++ ++I+    P     + +I  +H I++ PGA LP L  Y +T K    +++ ++ LL    I P
Subjt:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP

Query:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPF--LN--
        S SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +DL SGYHQI + P D +KTAF T  G +E+ V+ PF  +N  
Subjt:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPF--LN--

Query:  --------------KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQA
                      +F+ VY DDIL++S   +EH  HL  + + L  + L + +KKC+F   E  FLG+ I   +++    K  AIR++  P TVK+ Q 
Subjt:  --------------KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQA

Query:  FLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHP------IEYLSEKL
        FLG+ ++YR+FI N S I  P  L  C K    +WT  Q ++ +++K    +SP          + +  DA   GIGAVL +  +       + Y S+ L
Subjt:  FLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRFFSSP----------FEVAVDACCTGIGAVLSQRGHP------IEYLSEKL

Query:  STARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAF
         +A++ +   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ AG +N VADA+SR    +T  +S  I  
Subjt:  STARQAWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAF

Query:  EHLPELYERD
        E     Y+ D
Subjt:  EHLPELYERD

Arabidopsis top hitse value%identityAlignment
AT1G40129.1 unknown protein4.3e-0425.17Show/hide
Query:  STHHHNSTPTSIMSKKTNLRAVTTRTTPW--DALDETQRGQHGRRFGTRAETHHDFKMKVDLPS---YSGKRDIESFLDWLKSTENFFSYMDTPEQKKVR
        S+H+HN         K  + A+TT       + LD+ ++G H      RA    + ++K   P     S +   E +L+W K+ + +FSY +   + +  
Subjt:  STHHHNSTPTSIMSKKTNLRAVTTRTTPW--DALDETQRGQHGRRFGTRAETHHDFKMKVDLPS---YSGKRDIESFLDWLKSTENFFSYMDTPEQKKVR

Query:  LVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERF
             L G A  WW Q   +R  Y + P+  W  +K+ L+ ++
Subjt:  LVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERF

AT2G15180.1 Zinc knuckle (CCHC-type) family protein1.0e-0530.43Show/hide
Query:  FLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLP
        +L W  +   +F +  T ++ K+ +   +LKG A  WW+Q E NR    + P+R+WE++K  +  ++ P
Subjt:  FLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLP

AT4G13320.1 unknown protein2.3e-1331.97Show/hide
Query:  LFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDM--DVCHVLLGRPWQHDT
        +F+T+C IN + C +++ GG+  N I+K +V  L LK        ++    +  +    E C VP+SIG  YKD++ C V++M  +   +L G PW +  
Subjt:  LFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDM--DVCHVLLGRPWQHDT

Query:  QTLHKGRENTYEFHWMGKRITL
        Q  H GR+++    W    I L
Subjt:  QTLHKGRENTYEFHWMGKRITL

ATMG00860.1 DNA/RNA polymerases superfamily protein4.0e-1839.64Show/hide
Query:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLG--FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT
        HL  + Q+  + + Y N+KKC F + +I +LG   II    VS +P K+EA+  W  P    E++ FLGL  +YR+F+KN+  I  PLT+ LKK + KWT
Subjt:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLG--FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT

Query:  SFQQESFEEIK
             +F+ +K
Subjt:  SFQQESFEEIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGATCAACAACAGACGAAGGCAGCGTGGACGGGGAAATTTCAGAAACCTCCACAATCCACGATACGCTAATCGAGGAAGGGCTGATCTCAACCCACCACCACAA
TTCTACGCCGACGTCCATAATGAGCAAGAAGACGAATCTTCGAGCAGTGACGACCAGAACAACCCCCTGGGACGCCTTAGACGAAACACAGAGAGGACAACACGGCAGGA
GATTCGGCACGCGGGCAGAAACTCATCATGACTTCAAGATGAAGGTTGACTTACCTTCATATAGTGGCAAGAGAGATATCGAATCCTTTTTGGATTGGCTAAAAAGTACT
GAGAACTTTTTCAGTTACATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCAACAG
GCAAAGATACAACAAACGACCCGTACGCTCTTGGGAAAAAATGAAGAAGCTATTAAAAGAACGATTCCTGCCTCCGAATTATGAGCAGACCTTATACAACCAATATCAGA
ATTGTCGTCAAGGTACCAGGACAGTTACCGAATACATAGAAGAGTTCCACCGATTAAGTGCCAGGACAAATCTAAGTGAGAACGAGCAACATCAAATAGCCAGATTTGTG
GGGGGTCTGCGTTTTGATATTAAAGAAAAAGTAAGATTACAGCCTCTCCGTTTCCTATCTGAAGCAATTTCATTGGCCGAAACAGTAGAAGAGATGATAGCTCTAAAGAC
CAACACCATGAATCGAAGAACATGGGAACCAACATCAAACAAGAAGACAAACTATGCGAACAAGTCGAATGATCAGCCGATGGCACCAGCTTATGGAAAAGGAAAAGAAA
CCGACTCTCAGACTGCAACATATGAAAAGAAGGCAGAGATGATCAACAGAAGCAAGAACCAAAATAATTACACTCGCCCATCATTGGGCAAGTGCTTCAGATGCGGACAA
CCAGGTCACCTATCAAATTCTTGCCCTCAAAGAAAAACAATTGCCTTAGCCGATGAAGAAGATAACTTACCTGGCGAGGGTGAGTCAGAACCAGAAGAGGAAACGGAAGA
AATTGAAGCCGATGAAGGAGATAGGATCTCTTGTGTTATCCATAGAGTACTCATTGCTCCCAAGGAAGAAAAGAGCCCACAACGCCACAGTCTTTTCAAGACAAGGTGCA
CCATTAATGGGAAGGTATGCGATGTTATTATCGATGGAGGTAGCAGTGAAAACTTTATAACCAAGAAGATAGTCTCCAACCTGAACTTGAAGGTTCAGCCACATCCAAAC
CCCTACAAGATAGGATGGGTCAAGAAAGGGAATGAATCCACAGTTAACGAGGTATGTACGGTTCCCCTTTCAATCGGCAGCAGCTACAAGGATCAGATCGTATGCGATGT
AATCGACATGGATGTATGCCATGTCCTCCTAGGCAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTCCACTGGATGGGTAAAC
GGATAACTCTACTGCCCTTGACCAAAAAGAACGAGGAGAACAGTAAGACAAGGGGCCAGCTATTCACAACGTGCAGTGGCAAAACTCTTTTAAAAGAAAGAAAGCAGGAT
ATTTTAGCCCTTAAAGAACCTGACGGACTGCCACCTCTTCGAGACATTCAGCACCATATAGATCTGATCCCCGGAGCATCACTGCCAAACCTGGCTCATTACAGAATGAC
CCCTAAAGAGTATGCAGCGCTCCATGAACATATCGAAGACCTACTCAAGAAAGGGCATATTAAGCCAAGCCTCAGTCCTTGTGCTGTCCCAGCTCTCCTCACCCCCAAGA
AGGATGGAAGTTGGAGAATGTGTGTAGATAGCCGAGCCATTAACCGAATCACAGTAAAATACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAG
GCCACCATCTTCTCGAAAATTGACCTAAGGAGTGGATACCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACTGCCTTCAAAACGAATGAAGGGCTGTTTGAATG
GATGGTACTGCACCCCTTCCTCAACAAATTCATTGTGGTTTATTTCGACGATATATTGGTGTACAGCAGTGGGAACGACGAACACTTGCTCCACCTTAAAAAGTTGTTTC
AAGTATTGACAGAAAAGGAACTATACATCAATCAAAAGAAGTGTGAATTCTTGAAGTCTGAAATTACATTTCTTGGTTTTATAATCAAGAAAGGAGAAGTAAGCATGGAA
CCAAGAAAGGTTGAAGCAATACGAGAGTGGTTGGCTCCAACCACTGTCAAAGAAGTTCAAGCCTTCCTAGGACTGGCTTCTTTTTACAGAAAGTTTATAAAGAACTTCAG
CTCCATCTGCGCACCACTAACCGACTGCTTAAAGAAGGGAAACTTTAAGTGGACTTCATTCCAACAAGAGAGCTTCGAGGAAATAAAGAAAAGATTTTTCTCTTCTCCTT
TCGAAGTAGCAGTAGACGCCTGTTGCACAGGAATTGGGGCAGTCTTATCCCAACGAGGTCACCCGATTGAATACCTCAGTGAGAAATTGAGCACCGCACGACAAGCATGG
AGCACATACGAGCAAGAATTATATGCCTTAGTTCGAGCTCTCAAGCAGTGGGAACATTACTTGCTCTCCAAAGAGTTTGTTCTCTTAACAGATCATTTCTCCCTGAAATA
TCTTCAATCTCAAAAGAATATCAGCCGGATGCATGCCCGTTGGATATCTTATTTACAGAGATTCGATTTTGTTATCAAACACCAGGCTGGAAAAGAGAACAAGGTGGCTG
ACGCACTGAGTAGAAAAGGCTCCCTGCTCACACTTCTCTCCTCAGAAATAATTGCTTTCGAGCACCTGCCAGAACTATACGAAAGAGATGCTGACTTCGCAGACATCTGG
CATAAATGCTCCAATCACCTAAGGGCTGAAGGATACCACATCCTAGAAGGATTCCTCTTCAAGGGAGACCAATTATGCATACCCCACACTTCTCTGCGGGAAGCCTTAAT
AAAGGAAGCTCATATGTTATTAATCTCAAGTCATCTTGGTACAATTAAGACCCGAGATCCAATAAATAAAGGCAGAAGTCTTTCATCCCGAAAAAATCTCAGGGCAATGA
TGTGCCTCATCTGTCACCAACATAAGGTCGAGAAAGAAAAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGATCAACAACAGACGAAGGCAGCGTGGACGGGGAAATTTCAGAAACCTCCACAATCCACGATACGCTAATCGAGGAAGGGCTGATCTCAACCCACCACCACAA
TTCTACGCCGACGTCCATAATGAGCAAGAAGACGAATCTTCGAGCAGTGACGACCAGAACAACCCCCTGGGACGCCTTAGACGAAACACAGAGAGGACAACACGGCAGGA
GATTCGGCACGCGGGCAGAAACTCATCATGACTTCAAGATGAAGGTTGACTTACCTTCATATAGTGGCAAGAGAGATATCGAATCCTTTTTGGATTGGCTAAAAAGTACT
GAGAACTTTTTCAGTTACATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCAACAG
GCAAAGATACAACAAACGACCCGTACGCTCTTGGGAAAAAATGAAGAAGCTATTAAAAGAACGATTCCTGCCTCCGAATTATGAGCAGACCTTATACAACCAATATCAGA
ATTGTCGTCAAGGTACCAGGACAGTTACCGAATACATAGAAGAGTTCCACCGATTAAGTGCCAGGACAAATCTAAGTGAGAACGAGCAACATCAAATAGCCAGATTTGTG
GGGGGTCTGCGTTTTGATATTAAAGAAAAAGTAAGATTACAGCCTCTCCGTTTCCTATCTGAAGCAATTTCATTGGCCGAAACAGTAGAAGAGATGATAGCTCTAAAGAC
CAACACCATGAATCGAAGAACATGGGAACCAACATCAAACAAGAAGACAAACTATGCGAACAAGTCGAATGATCAGCCGATGGCACCAGCTTATGGAAAAGGAAAAGAAA
CCGACTCTCAGACTGCAACATATGAAAAGAAGGCAGAGATGATCAACAGAAGCAAGAACCAAAATAATTACACTCGCCCATCATTGGGCAAGTGCTTCAGATGCGGACAA
CCAGGTCACCTATCAAATTCTTGCCCTCAAAGAAAAACAATTGCCTTAGCCGATGAAGAAGATAACTTACCTGGCGAGGGTGAGTCAGAACCAGAAGAGGAAACGGAAGA
AATTGAAGCCGATGAAGGAGATAGGATCTCTTGTGTTATCCATAGAGTACTCATTGCTCCCAAGGAAGAAAAGAGCCCACAACGCCACAGTCTTTTCAAGACAAGGTGCA
CCATTAATGGGAAGGTATGCGATGTTATTATCGATGGAGGTAGCAGTGAAAACTTTATAACCAAGAAGATAGTCTCCAACCTGAACTTGAAGGTTCAGCCACATCCAAAC
CCCTACAAGATAGGATGGGTCAAGAAAGGGAATGAATCCACAGTTAACGAGGTATGTACGGTTCCCCTTTCAATCGGCAGCAGCTACAAGGATCAGATCGTATGCGATGT
AATCGACATGGATGTATGCCATGTCCTCCTAGGCAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTCCACTGGATGGGTAAAC
GGATAACTCTACTGCCCTTGACCAAAAAGAACGAGGAGAACAGTAAGACAAGGGGCCAGCTATTCACAACGTGCAGTGGCAAAACTCTTTTAAAAGAAAGAAAGCAGGAT
ATTTTAGCCCTTAAAGAACCTGACGGACTGCCACCTCTTCGAGACATTCAGCACCATATAGATCTGATCCCCGGAGCATCACTGCCAAACCTGGCTCATTACAGAATGAC
CCCTAAAGAGTATGCAGCGCTCCATGAACATATCGAAGACCTACTCAAGAAAGGGCATATTAAGCCAAGCCTCAGTCCTTGTGCTGTCCCAGCTCTCCTCACCCCCAAGA
AGGATGGAAGTTGGAGAATGTGTGTAGATAGCCGAGCCATTAACCGAATCACAGTAAAATACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAG
GCCACCATCTTCTCGAAAATTGACCTAAGGAGTGGATACCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACTGCCTTCAAAACGAATGAAGGGCTGTTTGAATG
GATGGTACTGCACCCCTTCCTCAACAAATTCATTGTGGTTTATTTCGACGATATATTGGTGTACAGCAGTGGGAACGACGAACACTTGCTCCACCTTAAAAAGTTGTTTC
AAGTATTGACAGAAAAGGAACTATACATCAATCAAAAGAAGTGTGAATTCTTGAAGTCTGAAATTACATTTCTTGGTTTTATAATCAAGAAAGGAGAAGTAAGCATGGAA
CCAAGAAAGGTTGAAGCAATACGAGAGTGGTTGGCTCCAACCACTGTCAAAGAAGTTCAAGCCTTCCTAGGACTGGCTTCTTTTTACAGAAAGTTTATAAAGAACTTCAG
CTCCATCTGCGCACCACTAACCGACTGCTTAAAGAAGGGAAACTTTAAGTGGACTTCATTCCAACAAGAGAGCTTCGAGGAAATAAAGAAAAGATTTTTCTCTTCTCCTT
TCGAAGTAGCAGTAGACGCCTGTTGCACAGGAATTGGGGCAGTCTTATCCCAACGAGGTCACCCGATTGAATACCTCAGTGAGAAATTGAGCACCGCACGACAAGCATGG
AGCACATACGAGCAAGAATTATATGCCTTAGTTCGAGCTCTCAAGCAGTGGGAACATTACTTGCTCTCCAAAGAGTTTGTTCTCTTAACAGATCATTTCTCCCTGAAATA
TCTTCAATCTCAAAAGAATATCAGCCGGATGCATGCCCGTTGGATATCTTATTTACAGAGATTCGATTTTGTTATCAAACACCAGGCTGGAAAAGAGAACAAGGTGGCTG
ACGCACTGAGTAGAAAAGGCTCCCTGCTCACACTTCTCTCCTCAGAAATAATTGCTTTCGAGCACCTGCCAGAACTATACGAAAGAGATGCTGACTTCGCAGACATCTGG
CATAAATGCTCCAATCACCTAAGGGCTGAAGGATACCACATCCTAGAAGGATTCCTCTTCAAGGGAGACCAATTATGCATACCCCACACTTCTCTGCGGGAAGCCTTAAT
AAAGGAAGCTCATATGTTATTAATCTCAAGTCATCTTGGTACAATTAAGACCCGAGATCCAATAAATAAAGGCAGAAGTCTTTCATCCCGAAAAAATCTCAGGGCAATGA
TGTGCCTCATCTGTCACCAACATAAGGTCGAGAAAGAAAAA
Protein sequenceShow/hide protein sequence
MGRSTTDEGSVDGEISETSTIHDTLIEEGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKST
ENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFV
GGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQ
PGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPN
PYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQD
ILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGK
ATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSME
PRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRFFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQAW
STYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIW
HKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHMLLISSHLGTIKTRDPINKGRSLSSRKNLRAMMCLICHQHKVEKEK