; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G17630 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G17630
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr3:13212788..13216090
RNA-Seq ExpressionCSPI03G17630
SyntenyCSPI03G17630
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]0.0e+0064.53Show/hide
Query:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK
        D ++E +R ++ +    + E + ++KMK+DLPSY GKR+IE+FLDWLK+TENFF+YM T + KKV LVALKLKGGASAWW+Q+  NRQ+  K P+RSWEK
Subjt:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK

Query:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM
        MKKL+K+RF+PPNYEQTLY QYQNCRQG R   EYIEEFHRL  RTNL E E+H I+ FVGGLRFD+KEKV+LQP + LSEAI+ AETVEEMI  +  + 
Subjt:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM

Query:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
         +R WEP+++KKT   N      ++++P+      GK+   +              K +N Y RP  G C+RCGQ GH SN CPQRKTIA+A + D+   
Subjt:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
            E +EETE IEADEGD +SC++ RVLI+PKEE   QRHSLFKTRCTI GKVC+VIID GSSENF++KK+V+ LNLK QPH  PYKIGW+KKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL
        +E+C VPLSIG+SYKDQ+VCDVI+MDVCH+LLGRPWQ D Q++H+GRENTYEF WM K++ LLPL K+ ++N   ++ +G LF T SGK  L+ER+ +IL
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL

Query:  AL-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK
         +                             +EP  LPPLRDI H+I+L+ GAS P+L HY M+P EY  LH+ IE+LLKKGHIKPS S C VPALLTPK
Subjt:  AL-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK

Query:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFL
        KDG+WRMCVDSRAIN+ITVKYRFPIPRV DLLDQLG A IFSKIDLRS YHQIRIRPGDEWKTAFKTNEGLFEW+VMPF LSNAPSTFMRLMN+VLHPFL
Subjt:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFL

Query:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK
        NKFI+VYFDDILV+S   D+HL H+ +LFQVL   ELY+N KKC F  +EI FLGFII+K  V M+ +KVEAI+ W  PTTV +VQAFLGLASFYRKFI+
Subjt:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK

Query:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRAL
        N SSI AP+TDCLKKG F+W   QQ+SF  +K+ L +  V +LPDF   FEVAVD C TGIGAVLSQ+ HPIEY SE+LS +RQ+WSTYEQELYALVRAL
Subjt:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRAL

Query:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK
        KQWEHYLLS+EF+LLTDHFSLKYL SQK+IS MHARWISY QRFDFVIKHQAGKENKVADALSRK +LL +LSSEI  F H+ ELY++D DF  IW K
Subjt:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]4.5e-30648.63Show/hide
Query:  EGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKVDLPSYSGKRDIESFLDWLKSTENFFSY
        +G +  H  N +P  +      +   +      +A DE     + G   RR   R   +H        +++K ++P++ G  DIE+ LDWL   + FF  
Subjt:  EGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKVDLPSYSGKRDIESFLDWLKSTENFFSY

Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        MD P+ ++V++VA KL+GGA AWW++ + NR+ + +RPV +W  MK+++K RFLPP+ EQ LY QY NC QG RTV EY  EF RL AR NL E ++   
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMA-PAYGKGKETDSQTATYEKKAEMINRSKNQN
        AR+V GL   I+EK+ L  +  + +A +LA   E M         R T E TS    +Y  K N    A P+      T+S+ +          +    N
Subjt:  ARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMA-PAYGKGKETDSQTATYEKKAEMINRSKNQN

Query:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLPGEGESEPEEETEEIEADEG--DRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDV
         Y +P   KCFRCG+PGH SN CP+R T+  ++   D + G+   + E++ E  E  +G  ++I+CVI R L +PK   S QR+ +F+T+C +  K+C +
Subjt:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLPGEGESEPEEETEEIEADEG--DRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDV

Query:  IIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG
        IIDGGS EN ++K +V    L  +PHPNPY+IGW+KKG    V E+C VPL+IG  Y + + CDV+DM+ CHVLLGRPWQHD    H+G+ N Y F W G
Subjt:  IIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG

Query:  KRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALQEPDGLPPLRDIQHHIDLIPGASLP
        K I +LPL   +         L T  S  K    ERK+                                 ++A   PD LPPLR+IQH IDL+PGASLP
Subjt:  KRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALQEPDGLPPLRDIQHHIDLIPGASLP

Query:  NLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIR
        NL HYRM+PKE   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKIDLRSGYHQIRI+
Subjt:  NLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIR

Query:  PGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGF
        PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMRLM QVL PF+ KF+VVYFDDILVYS    EHL HL+K+ + LTE EL++N KKC FL +++ FLG+
Subjt:  PGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGF

Query:  IIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDA
        I+    + ++  KV+A+R+W +P T+ EV++F GLA+FYR+F++NFSSI AP+T+C+KKG FKWT   +ESF+ IK+RL + PV  LP+F + FE+  DA
Subjt:  IIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDA

Query:  CCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKEN
        C TGIGAVLSQ G P+ + SEKL+ ARQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q++++++HARW S+L++F++VIKH++G  N
Subjt:  CCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKEN

Query:  KVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE
        KVADALSRK +LL  +S++++ FE +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  LIKE H+ GL+ H  ++
Subjt:  KVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]8.2e-31055.4Show/hide
Query:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE
        W+   E +  Q  R    R ET+HD+KMK+DLP+Y+GKRDIESFLDW+K+TENFF YM  P++KKV LVALKLKGGASAW                    
Subjt:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE

Query:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK
                   P +Y Q +   Y+QYQNCRQG++ V EYIEEFHRL AR NLSENEQHQIARF+GGLRFDIKEKV+L   R LSEAISLAETVEEM+ ++
Subjt:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK

Query:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
            NRRT WE   +KK +Y  K+++QP      KGK  D Q     KK E + R K QNNYTRPSLGKCFRCG+PGHLSN+C QRKTIALA++ED    
Subjt:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
          + E EEETE IEAD+GDRISC++ RVLI PKEE +PQ HSLFKTRCTING                          KV PHP+PYKIGWVKKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA
        NE+CT+PLSIG+SYKDQIVCDVI+MDVCH+LLGRPWQHDTQTLH+GRENTYEF WMGK++ LLPL KKN E+   K + QLF T SGK LLKER+QD+L 
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA

Query:  L-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK
        L                             +EP GLPPLRDIQH IDL+P ASLPNL HYRM+P+EY  LH+HIEDLLKKGHIKPSLSPCAVPALLTP K
Subjt:  L-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK

Query:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLN
        DGSWRMCVDSRAINR+T KYRFPIPR+GDLLDQLGKA IFSKIDLR+GYHQI+IRPGDEWKTAFKTNEGLFE                            
Subjt:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLN

Query:  KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKN
                      S  ++HL +L+KLF+VLTE ELYIN KKC +L  EI FLGF+IK+G++ MEP+K+EAI+    PT++KEVQAFLGLASFYR+FI+N
Subjt:  KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKN

Query:  FSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALK
        FS I APLTD                                  F+SPFEVAV+AC TGIGAVLSQ+GHPIEY SEKLST+RQ+WSTYEQELYALVRALK
Subjt:  FSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALK

Query:  QWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSN
        QWEHYLLS +F                                                                                         
Subjt:  QWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSN

Query:  HLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE
               HI+EG LFKG+QLCIPHTSLREAL+KEAHS GLAGHFRQ+
Subjt:  HLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE

XP_024932991.1 uncharacterized protein LOC112492819 [Ziziphus jujuba]2.0e-30651.66Show/hide
Query:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ
        D+++KVD+P++ G  +IE FLDW+++ E+FF YM  PE K+VRLVA K +GGASAWWEQ+ +NR++  K P++SW +++++L+ RFLP ++EQ LY QY 
Subjt:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ

Query:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQP
        +C QG R+++E  EEF+RLSAR NL+ENE   +AR+V GL   I+E++ L P+  LSEA++LA  +E+ I  +  T     W+P S     Y  K     
Subjt:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQP

Query:  MAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRV
         A  Y K    D+      K     NR  N   Y R    KCF+CGQ GH SN CP RK + + + +D+  GE  +   +ETE ++ D+G+ + C+I ++
Subjt:  MAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRV

Query:  LIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVC
        L +PK     QRHS+FKT+CTIN KVC+VIID GSSEN ++K +V  L L    HPNPYK+GW+KKG E+ V E+C V  SIG  Y D++VCDV++MD C
Subjt:  LIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVC

Query:  HVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE-----------ENSKTRGQLFTTCSGKTLLKERK---------------------QDI
        H+LLGRPWQ D      GR+NT+ F W GK+I LLP   +N+           E    +G +  T SGK    + K                     Q I
Subjt:  HVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE-----------ENSKTRGQLFTTCSGKTLLKERK---------------------QDI

Query:  LALQE----------PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRIT
        L L +          PD LPP+RDIQH IDL+PGA LPNL HYRM PKE   L + +EDLLKK  I+ SLSPCAVPALL PKK+G WRMC+DSRAIN+IT
Subjt:  LALQE----------PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRIT

Query:  VKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGN
         KYRFPIPR+ D+LD+L  A +FSK+DLRSGYHQIRIRP DEWKTAFKT  GL+EW VMPFGL NAPSTFMRLMNQVL P +   +VVYFDDILVYS   
Subjt:  VKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGN

Query:  DEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNF
        + H+ H++++F +L E  LY+N KKC F+  E+ FLGFI+ K  +  +P KV AIR+W  P+TV EV++F GLA+FYR+F++NFSSI  PLTDCLKKG F
Subjt:  DEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNF

Query:  KWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDH
        +W   QQ+SFE +KK L   PV  LPDF   FEV  DA  TGIGAVL Q    IE+ SEKL  A Q WS YEQELYA+VRA K WEHYL+ +EFVL  DH
Subjt:  KWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDH

Query:  FSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKG
         +L++  SQK+I++MH +W  +LQ+F +   H+AGK N+VADALSRK  LL  L  E++ F+ L + Y  D D   IW KC  H  A  +HI +G+LFKG
Subjt:  FSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKG

Query:  DQLCIPHTSLREALIKEAHSNGLAGH
         QLCIP TSLR  LI++ H  GLA H
Subjt:  DQLCIPHTSLREALIKEAHSNGLAGH

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia]1.2e-30852.01Show/hide
Query:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ
        +FK+K+DLP ++G   +ESFLDWL   ENFF YM  PE ++V+LVA KL+GGASAWWEQ + NR+R  K+PVR W KMK+L++ RFLPP+YEQ LY QYQ
Subjt:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ

Query:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMI------ALKTNTMNRRTWEPTSNKKTNYAN
        NCRQG RT+ EY EEF+RL++R NL+E E  Q+AR++GGLR  I++KV L  +  LSEA++LA  +E  +      A       +    PT +   ++ +
Subjt:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMI------ALKTNTMNRRTWEPTSNKKTNYAN

Query:  KSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRIS
         S DQ     Y   K     T T            +QN Y +P  GKCFRC QPGH SN CP RK++ L D +D    E + E EE+ E +E DEGD ++
Subjt:  KSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRIS

Query:  CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDV
        CVI R+L+APK+E   QRH +FKTRCT+N KVC++IID GS EN +++ +VS L L  + HP PYKI W+KKG E+ V   C +P SIG  Y D + CDV
Subjt:  CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDV

Query:  IDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLP-----------------LTKKNEE---NSKTRGQLFT-TCSGKT------------LL
        ++MD CHVLLGRPWQ+D    +KGR+NTY F W  +++ LLP                 LT   ++   ++K  G++ T    G+              L
Subjt:  IDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLP-----------------LTKKNEE---NSKTRGQLFT-TCSGKT------------LL

Query:  KERKQDILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKY
         E   DI   + P GLPPLRDIQH IDL+PG SLPNL HYRM+P E+  L + +EDL++KG I+ S+SPCAVPALL PKKDGSWRMCVDSRAIN+ITVKY
Subjt:  KERKQDILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKY

Query:  RFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEH
        RFPIPR+ D+LD L  + +FSK+DLRSGYHQIR+RPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMR+M+Q L PF+ KF+VVYFDDIL+YS    +H
Subjt:  RFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEH

Query:  LLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT
        + HL+++  VL E +LYIN KKC FL S + FLGF++ +  V ++  KV  IREW  P+T+ +V++F GLA+FYR+FI+NFS++ AP+T+C+KKG   W 
Subjt:  LLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT

Query:  SFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSL
          Q  SF  IK++L++ PV  LP F   FEV  DA   GIGAVLSQ G P+E+ SEKL+ AR+ W+ YE E YA++RALK WEHYL+ +EFVL +DH +L
Subjt:  SFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSL

Query:  KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFK
        K++ +Q +++RMHARW++++Q+F  V+K+++G+ NKVADALSR+G LL  L +EI  FE L +LY  D DFA+ W +C     A  +HI + +LF+
Subjt:  KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFK

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein0.0e+0054.05Show/hide
Query:  RAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQT
        R +   +++MK+DLPS++G   IESFLDW+   E FF  M+  + K+V+LVA KLKGGASAWW+Q++ NR+R  K+PVR+W+KM++LL+ERFLP +YEQ 
Subjt:  RAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQT

Query:  LYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVE-EMIALKTNTMNRRTWEPTSNKKTNYA
        LY QYQNCRQG R+V+EY +EF+ LS+R NL+E E  Q+AR+VGGLR  I++++ L+ +  L+EA SLA  VE +       + N     P S++     
Subjt:  LYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVE-EMIALKTNTMNRRTWEPTSNKKTNYA

Query:  NKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLP---GEGESEPEEE---TEEIE
        +K   + + P   K    D  +++  +   +    K+ N Y RP  GKCFRC QPGH SN CP R+ + +    EDN P    E E+E ++E    E  E
Subjt:  NKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD-EEDNLP---GEGESEPEEE---TEEIE

Query:  ADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSY
         DEG+ +SCV+ R+L+ PK+E  PQRH++F+TRCTIN KVCDVIID GSSEN ++K +V  L LK + HPNPYKIGW+KKG E+ V E+C VP SIG  Y
Subjt:  ADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSY

Query:  KDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE--ENSKTRGQLFTTCSGKTLLKERK-------------------
        KD++ CD++DMD CHVLLGRPWQ D    HKG++NTY F W  K++ L+P  K +   + SK  G+   T +G   +++ K                   
Subjt:  KDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE--ENSKTRGQLFTTCSGKTLLKERK-------------------

Query:  -------------QDILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSR
                     QDI   + PD LPP+RDIQHHIDL+PGASLPNL HYRM+PKE   L + +EDL+ KG I+ S+SPCAVPALLTPKKDGSWRMCVDSR
Subjt:  -------------QDILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSR

Query:  AINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDIL
        AIN+ITVKYRFPIPR+ D+LD L  + IFSKIDLRSGYHQIRIRPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMR+MNQVL PF+ KF+VVYFDDIL
Subjt:  AINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDIL

Query:  VYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDC
        +YS    EHL H++++   L E +LYIN KKC FL + + FLGFII    + ++  KV AIR+W  P TV ++++F GLA+FYR+FI+NFSSI AP+TDC
Subjt:  VYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDC

Query:  LKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEF
        +KKG F+W   Q+ SF  IK++L++ PV  LP F   F+V  DA  TGIGAVLSQ G P+E+ SEKL+ ARQ W+TYE EL+A+VRALK WEHYL+ +EF
Subjt:  LKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEF

Query:  VLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILE
        V+ +DH +LK++ +Q ++SRMH RWI++LQRF FV+KH+AG++NKVADALSR+ +LL ++SSEI +FE L ELY+ D DF   W KC     +  +HI +
Subjt:  VLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILE

Query:  GFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE
        G+LFKG+QLCIP TSLRE ++++ HS GL GH  ++
Subjt:  GFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE

A0A5D3DGR0 Reverse transcriptase0.0e+0064.53Show/hide
Query:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK
        D ++E +R ++ +    + E + ++KMK+DLPSY GKR+IE+FLDWLK+TENFF+YM T + KKV LVALKLKGGASAWW+Q+  NRQ+  K P+RSWEK
Subjt:  DALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEK

Query:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM
        MKKL+K+RF+PPNYEQTLY QYQNCRQG R   EYIEEFHRL  RTNL E E+H I+ FVGGLRFD+KEKV+LQP + LSEAI+ AETVEEMI  +  + 
Subjt:  MKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTM

Query:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
         +R WEP+++KKT   N      ++++P+      GK+   +              K +N Y RP  G C+RCGQ GH SN CPQRKTIA+A + D+   
Subjt:  NRRTWEPTSNKKTNYANK-----SNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
            E +EETE IEADEGD +SC++ RVLI+PKEE   QRHSLFKTRCTI GKVC+VIID GSSENF++KK+V+ LNLK QPH  PYKIGW+KKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL
        +E+C VPLSIG+SYKDQ+VCDVI+MDVCH+LLGRPWQ D Q++H+GRENTYEF WM K++ LLPL K+ ++N   ++ +G LF T SGK  L+ER+ +IL
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN---SKTRGQLFTTCSGKTLLKERKQDIL

Query:  AL-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK
         +                             +EP  LPPLRDI H+I+L+ GAS P+L HY M+P EY  LH+ IE+LLKKGHIKPS S C VPALLTPK
Subjt:  AL-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPK

Query:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFL
        KDG+WRMCVDSRAIN+ITVKYRFPIPRV DLLDQLG A IFSKIDLRS YHQIRIRPGDEWKTAFKTNEGLFEW+VMPF LSNAPSTFMRLMN+VLHPFL
Subjt:  KDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFL

Query:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK
        NKFI+VYFDDILV+S   D+HL H+ +LFQVL   ELY+N KKC F  +EI FLGFII+K  V M+ +KVEAI+ W  PTTV +VQAFLGLASFYRKFI+
Subjt:  NKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIK

Query:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRAL
        N SSI AP+TDCLKKG F+W   QQ+SF  +K+ L +  V +LPDF   FEVAVD C TGIGAVLSQ+ HPIEY SE+LS +RQ+WSTYEQELYALVRAL
Subjt:  NFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRAL

Query:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK
        KQWEHYLLS+EF+LLTDHFSLKYL SQK+IS MHARWISY QRFDFVIKHQAGKENKVADALSRK +LL +LSSEI  F H+ ELY++D DF  IW K
Subjt:  KQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHK

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X14.0e-31055.4Show/hide
Query:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE
        W+   E +  Q  R    R ET+HD+KMK+DLP+Y+GKRDIESFLDW+K+TENFF YM  P++KKV LVALKLKGGASAW                    
Subjt:  WDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWE

Query:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK
                   P +Y Q +   Y+QYQNCRQG++ V EYIEEFHRL AR NLSENEQHQIARF+GGLRFDIKEKV+L   R LSEAISLAETVEEM+ ++
Subjt:  KMKKLLKERFLPPNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALK

Query:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG
            NRRT WE   +KK +Y  K+++QP      KGK  D Q     KK E + R K QNNYTRPSLGKCFRCG+PGHLSN+C QRKTIALA++ED    
Subjt:  TNTMNRRT-WEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPG

Query:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV
          + E EEETE IEAD+GDRISC++ RVLI PKEE +PQ HSLFKTRCTING                          KV PHP+PYKIGWVKKG E+ +
Subjt:  EGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTV

Query:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA
        NE+CT+PLSIG+SYKDQIVCDVI+MDVCH+LLGRPWQHDTQTLH+GRENTYEF WMGK++ LLPL KKN E+   K + QLF T SGK LLKER+QD+L 
Subjt:  NEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEEN--SKTRGQLFTTCSGKTLLKERKQDILA

Query:  L-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK
        L                             +EP GLPPLRDIQH IDL+P ASLPNL HYRM+P+EY  LH+HIEDLLKKGHIKPSLSPCAVPALLTP K
Subjt:  L-----------------------------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKK

Query:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLN
        DGSWRMCVDSRAINR+T KYRFPIPR+GDLLDQLGKA IFSKIDLR+GYHQI+IRPGDEWKTAFKTNEGLFE                            
Subjt:  DGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLN

Query:  KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKN
                      S  ++HL +L+KLF+VLTE ELYIN KKC +L  EI FLGF+IK+G++ MEP+K+EAI+    PT++KEVQAFLGLASFYR+FI+N
Subjt:  KFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKN

Query:  FSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALK
        FS I APLTD                                  F+SPFEVAV+AC TGIGAVLSQ+GHPIEY SEKLST+RQ+WSTYEQELYALVRALK
Subjt:  FSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALK

Query:  QWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSN
        QWEHYLLS +F                                                                                         
Subjt:  QWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSN

Query:  HLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE
               HI+EG LFKG+QLCIPHTSLREAL+KEAHS GLAGHFRQ+
Subjt:  HLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE

A0A6N2LVR1 Uncharacterized protein0.0e+0051.51Show/hide
Query:  FKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQN
        F+MK+DLPS++G+  IE FLDWL   E FF YM+ PE KKV+LVA +L GGASAWWEQL+  R R  K  V+SW KM++LL+ R+LPP+YEQ L+ QYQN
Subjt:  FKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQN

Query:  CRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPM
        C+QG R V  Y+EEFHRLS+R NL E +  Q+ARFVGGLR++I+++V +  +  L+EAI+LA   E  +   TNT  R  ++PT N         N    
Subjt:  CRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPM

Query:  APAYGKGKETDSQTATYEKKAEM-INRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD--EEDNLPGEGESEPEE----ETEEIEADEGDRIS
        + +Y     + S+T   +    + +     +N Y+RP+  KC+RCGQ GH SN+CP+R  + L +  EE ++ GEG    ++    E E    DEG+ +S
Subjt:  APAYGKGKETDSQTATYEKKAEM-INRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAD--EEDNLPGEGESEPEE----ETEEIEADEGDRIS

Query:  --CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVC
           V+ R+++APK E   QR+++F+TRCT+N KVCDVIID GSSEN I+K +V+ L LK + H  PYKIGW+KKG E+ V E C    SIG +Y D+IVC
Subjt:  --CVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVC

Query:  DVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKK-NEENSKTRGQLFTTCSGKTLLKERKQD------------------------
        DV++MD CHV+LGRPWQ+D    +KG++N Y F   G+++ L PL +       + + +      G+T L +  +D                        
Subjt:  DVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKK-NEENSKTRGQLFTTCSGKTLLKERKQD------------------------

Query:  --------ILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITV
                I+  + P+GLPP+RDIQHHIDLIPGASLPN  HYRM+P+E A L   +E+L+KKG ++ S+SPCAVPALL PKKDGSWRMC+DSRAIN+IT+
Subjt:  --------ILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITV

Query:  KYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGND
        KYRFPIPR+ D+LD L  + IFSKIDLRSGYHQIRIRPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMRLMNQVL PF   F+VVYFDDIL+YS    
Subjt:  KYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGND

Query:  EHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFK
        +H+ HL+++F VL   +L++N  KC F+ S + FLGF++    + ++  KV AIR+W  P  + EV++F GLA+FYR+F+++FS I AP+T+C+KKG F 
Subjt:  EHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFK

Query:  WTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHF
        W    + SF  IK++LAS PV  LPDF   FEV  DA   GIGAVLSQ   P+ + SEKLS AR+ WSTYE ELYA+ RA+K WEHYL+ +EF+L +DH 
Subjt:  WTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHF

Query:  SLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGD
        +LK++ +Q N++RMHARW++++QRF+F +KH++G+ NKVADALSRK SLLT L +E+I FE + +LY  D DF + W KC   L  EG H  +G+LF+G+
Subjt:  SLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGD

Query:  QLCIPHTSLREALIKEAHSNGLAGHFRQE
        QLCIP +SLRE +I E H  GL GH  ++
Subjt:  QLCIPHTSLREALIKEAHSNGLAGHFRQE

A0A6P6GFU0 uncharacterized protein LOC1124928199.7e-30751.66Show/hide
Query:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ
        D+++KVD+P++ G  +IE FLDW+++ E+FF YM  PE K+VRLVA K +GGASAWWEQ+ +NR++  K P++SW +++++L+ RFLP ++EQ LY QY 
Subjt:  DFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQ

Query:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQP
        +C QG R+++E  EEF+RLSAR NL+ENE   +AR+V GL   I+E++ L P+  LSEA++LA  +E+ I  +  T     W+P S     Y  K     
Subjt:  NCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQP

Query:  MAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRV
         A  Y K    D+      K     NR  N   Y R    KCF+CGQ GH SN CP RK + + + +D+  GE  +   +ETE ++ D+G+ + C+I ++
Subjt:  MAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRV

Query:  LIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVC
        L +PK     QRHS+FKT+CTIN KVC+VIID GSSEN ++K +V  L L    HPNPYK+GW+KKG E+ V E+C V  SIG  Y D++VCDV++MD C
Subjt:  LIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVC

Query:  HVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE-----------ENSKTRGQLFTTCSGKTLLKERK---------------------QDI
        H+LLGRPWQ D      GR+NT+ F W GK+I LLP   +N+           E    +G +  T SGK    + K                     Q I
Subjt:  HVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE-----------ENSKTRGQLFTTCSGKTLLKERK---------------------QDI

Query:  LALQE----------PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRIT
        L L +          PD LPP+RDIQH IDL+PGA LPNL HYRM PKE   L + +EDLLKK  I+ SLSPCAVPALL PKK+G WRMC+DSRAIN+IT
Subjt:  LALQE----------PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRIT

Query:  VKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGN
         KYRFPIPR+ D+LD+L  A +FSK+DLRSGYHQIRIRP DEWKTAFKT  GL+EW VMPFGL NAPSTFMRLMNQVL P +   +VVYFDDILVYS   
Subjt:  VKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGN

Query:  DEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNF
        + H+ H++++F +L E  LY+N KKC F+  E+ FLGFI+ K  +  +P KV AIR+W  P+TV EV++F GLA+FYR+F++NFSSI  PLTDCLKKG F
Subjt:  DEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNF

Query:  KWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDH
        +W   QQ+SFE +KK L   PV  LPDF   FEV  DA  TGIGAVL Q    IE+ SEKL  A Q WS YEQELYA+VRA K WEHYL+ +EFVL  DH
Subjt:  KWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDH

Query:  FSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKG
         +L++  SQK+I++MH +W  +LQ+F +   H+AGK N+VADALSRK  LL  L  E++ F+ L + Y  D D   IW KC  H  A  +HI +G+LFKG
Subjt:  FSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKG

Query:  DQLCIPHTSLREALIKEAHSNGLAGH
         QLCIP TSLR  LI++ H  GLA H
Subjt:  DQLCIPHTSLREALIKEAHSNGLAGH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.0e-9140.8Show/hide
Query:  LLKERKQDILAL---------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGS-----W
        L  E KQ + AL          E D L      +H I+     +LP  + Y         +   I+D+L +G I+ S SP   P  + PKK  +     +
Subjt:  LLKERKQDILAL---------QEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGS-----W

Query:  RMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIV
        R+ +D R +N ITV  R PIP + ++L +LG+   F+ IDL  G+HQI + P    KTAF T  G +E++ MPFGL NAP+TF R MN +L P LNK  +
Subjt:  RMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIV

Query:  VYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSI
        VY DDI+V+S+  DEHL  L  +F+ L +  L +   KCEFLK E TFLG ++    +   P K+EAI+++  PT  KE++AFLGL  +YRKFI NF+ I
Subjt:  VYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSI

Query:  CAPLTDCLKKGNFKWTSFQQE---SFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQ
          P+T CLKK N K  +   E   +F+++K  ++ +P+ ++PDF+  F +  DA    +GAVLSQ GHP+ Y+S  L+     +ST E+EL A+V A K 
Subjt:  CAPLTDCLKKGNFKWTSFQQE---SFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQ

Query:  WEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSE
        + HYLL + F + +DH  L +L   K+ +    RW   L  FDF IK+  GKEN VADALSR     T LS +
Subjt:  WEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSE

P0CT41 Transposon Tf2-12 polyprotein4.9e-8232.91Show/hide
Query:  QDILALQEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP
        +DI A    + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + +N+      +P
Subjt:  QDILALQEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP

Query:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH
        +P +  LL ++  +TIF+K+DL+S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S    EH+ H
Subjt:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH

Query:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF
        +K + Q L    L INQ KCEF +S++ F+G+ I +   +     ++ + +W  P   KE++ FLG  ++ RKFI   S +  PL + LKK   +KWT  
Subjt:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF

Query:  QQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT
        Q ++ E IK+ L S PV +  DFS    +  DA    +GAVLSQ+      +P+ Y S K+S A+  +S  ++E+ A++++LK W HYL S  + F +LT
Subjt:  QQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT

Query:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC
        DH +L  +     +  ++  ARW  +LQ F+F I ++ G  N +ADALSR            + + +  ++   I  +   ++   Y  D    ++ +  
Subjt:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC

Query:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGH
           +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H
Subjt:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGH

P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-8640.88Show/hide
Query:  YRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKD-----GSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRI
        Y +       +   ++++L +G I+ S SP   P  + PKK        +R+ +D R +N IT+  R+PIP + ++L +LGK   F+ IDL  G+HQI +
Subjt:  YRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKD-----GSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRI

Query:  RPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLG
              KTAF T  G +E++ MPFGL NAP+TF R MN +L P LNK  +VY DDI+++S+   EHL  ++ +F  L +  L +   KCEFLK E  FLG
Subjt:  RPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLG

Query:  FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQ--ESFEEIKKRLASNPVQQLPDFSSPFEVA
         I+    +   P KV+AI  +  PT  KE++AFLGL  +YRKFI N++ I  P+T CLKK     T   +  E+FE++K  +  +P+ QLPDF   F + 
Subjt:  FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQ--ESFEEIKKRLASNPVQQLPDFSSPFEVA

Query:  VDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAG
         DA    +GAVLSQ GHPI ++S  L+     +S  E+EL A+V A K + HYLL ++F++ +DH  L++L + K       RW   L  + F I +  G
Subjt:  VDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAG

Query:  KENKVADALSR
        KEN VADALSR
Subjt:  KENKVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.2e-8936.59Show/hide
Query:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALQEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP
        GK   ++   +  E N+       T C+    L+++ ++I+    P     + +I  +H I++ PGA LP L  Y +T K    +++ ++ LL    I P
Subjt:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALQEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP

Query:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS
        S SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPS
Subjt:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS

Query:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ
        TF R M         +F+ VY DDIL++S   +EH  HL  + + L  + L + +KKC+F   E  FLG+ I   +++    K  AIR++  P TVK+ Q
Subjt:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ

Query:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK
         FLG+ ++YR+FI N S I  P  L  C K    +WT  Q ++ E++K  L ++PV    +  + + +  DA   GIGAVL +  +       + Y S+ 
Subjt:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK

Query:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA
        L +A++ +   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ AG +N VADA+SR    +T  +S  I 
Subjt:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA

Query:  FEHLPELYERD
         E     Y+ D
Subjt:  FEHLPELYERD

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.1e-8936.4Show/hide
Query:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALQEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP
        GK   ++   +  E N+       T C+    L+++ ++I+    P     + +I  +H I++ PGA LP L  Y +T K    +++ ++ LL    I P
Subjt:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALQEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP

Query:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS
        S SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPS
Subjt:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS

Query:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ
        TF R M         +F+ VY DDIL++S   +EH  HL  + + L  + L + +KKC+F   E  FLG+ I   +++    K  AIR++  P TVK+ Q
Subjt:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ

Query:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK
         FLG+ ++YR+FI N S I  P  L  C K    +WT  Q ++ +++K  L ++PV    +  + + +  DA   GIGAVL +  +       + Y S+ 
Subjt:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK

Query:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA
        L +A++ +   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ AG +N VADA+SR    +T  +S  I 
Subjt:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA

Query:  FEHLPELYERD
         E     Y+ D
Subjt:  FEHLPELYERD

Arabidopsis top hitse value%identityAlignment
AT1G40129.1 unknown protein4.2e-0425.17Show/hide
Query:  STHHHNSTPTSIMSKKTNLRAVTTRTTPW--DALDETQRGQHGRRFGTRAETHHDFKMKVDLPS---YSGKRDIESFLDWLKSTENFFSYMDTPEQKKVR
        S+H+HN         K  + A+TT       + LD+ ++G H      RA    + ++K   P     S +   E +L+W K+ + +FSY +   + +  
Subjt:  STHHHNSTPTSIMSKKTNLRAVTTRTTPW--DALDETQRGQHGRRFGTRAETHHDFKMKVDLPS---YSGKRDIESFLDWLKSTENFFSYMDTPEQKKVR

Query:  LVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERF
             L G A  WW Q   +R  Y + P+  W  +K+ L+ ++
Subjt:  LVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERF

AT2G15180.1 Zinc knuckle (CCHC-type) family protein1.0e-0530.43Show/hide
Query:  FLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLP
        +L W  +   +F +  T ++ K+ +   +LKG A  WW+Q E NR    + P+R+WE++K  +  ++ P
Subjt:  FLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLP

AT4G13320.1 unknown protein2.2e-1331.97Show/hide
Query:  LFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDM--DVCHVLLGRPWQHDT
        +F+T+C IN + C +++ GG+  N I+K +V  L LK        ++    +  +    E C VP+SIG  YKD++ C V++M  +   +L G PW +  
Subjt:  LFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPNPYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDM--DVCHVLLGRPWQHDT

Query:  QTLHKGRENTYEFHWMGKRITL
        Q  H GR+++    W    I L
Subjt:  QTLHKGRENTYEFHWMGKRITL

ATMG00860.1 DNA/RNA polymerases superfamily protein4.5e-2239.1Show/hide
Query:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLG--FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT
        HL  + Q+  + + Y N+KKC F + +I +LG   II    VS +P K+EA+  W  P    E++ FLGL  +YR+F+KN+  I  PLT+ LKK + KWT
Subjt:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLG--FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT

Query:  SFQQESFEEIKKRLASNPVQQLPDFSSPFEVAV
             +F+ +K  + + PV  LPD   PF   V
Subjt:  SFQQESFEEIKKRLASNPVQQLPDFSSPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTTCAACAACAGACGTAGGCAGCGTGGACGGGGAAATTTCAGAAACCTCCACAATCCACGATACGCTAATCGAGGAAGGGCTGATCTCAACCCACCACCACAA
TTCTACGCCGACGTCCATAATGAGCAAGAAGACGAATCTTCGAGCAGTGACGACCAGAACAACCCCCTGGGACGCCTTAGACGAAACACAGAGAGGACAACACGGCAGGA
GATTCGGCACGCGGGCAGAAACTCATCATGACTTCAAGATGAAGGTTGACTTACCTTCATATAGTGGCAAGAGAGATATCGAATCCTTTTTGGATTGGCTAAAAAGTACT
GAGAACTTTTTCAGTTACATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCAACAG
GCAAAGATACAACAAACGACCCGTACGCTCTTGGGAAAAAATGAAGAAGCTATTAAAAGAACGATTCCTGCCTCCGAATTATGAGCAGACCTTATACAACCAATATCAGA
ATTGTCGTCAAGGTACCAGGACAGTTACCGAATACATAGAAGAGTTCCACCGATTAAGTGCCAGGACAAATCTAAGTGAGAACGAGCAACATCAAATAGCCAGATTTGTG
GGGGGTCTGCGTTTTGATATTAAAGAAAAAGTAAGATTACAGCCTCTCCGTTTCCTATCTGAAGCAATTTCATTGGCCGAAACAGTAGAAGAGATGATAGCTCTAAAGAC
CAACACCATGAATCGAAGAACATGGGAACCAACATCAAACAAGAAGACAAACTATGCGAACAAGTCGAATGATCAGCCGATGGCACCAGCTTATGGAAAAGGAAAAGAAA
CCGACTCTCAGACTGCAACATATGAAAAGAAGGCAGAGATGATCAACAGAAGCAAGAACCAAAATAATTACACTCGCCCATCATTGGGCAAGTGCTTCAGATGCGGACAA
CCAGGTCACCTATCAAATTCTTGCCCTCAAAGAAAAACAATTGCCTTAGCCGATGAAGAAGATAACTTACCTGGCGAGGGTGAGTCAGAACCAGAAGAGGAAACGGAAGA
AATTGAAGCCGATGAAGGAGATAGGATCTCTTGTGTTATCCATAGAGTACTCATTGCTCCCAAGGAAGAAAAGAGCCCACAACGCCACAGTCTTTTCAAGACAAGGTGCA
CCATTAATGGGAAGGTATGCGATGTTATTATCGATGGAGGTAGCAGTGAAAACTTTATAACCAAGAAGATAGTCTCCAACCTGAACTTGAAGGTTCAGCCACATCCAAAC
CCCTACAAGATAGGATGGGTCAAGAAAGGGAATGAATCCACAGTTAACGAGGTATGTACGGTTCCCCTTTCTATCGGCAGCAGCTACAAGGATCAGATCGTATGCGATGT
AATCGACATGGATGTATGCCATGTCCTCCTAGGCAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTCCACTGGATGGGTAAAC
GGATAACTCTACTGCCCTTGACCAAAAAGAACGAGGAGAACAGTAAGACAAGGGGCCAGCTATTCACAACGTGCAGTGGCAAAACTCTTTTAAAAGAAAGAAAGCAGGAT
ATTTTAGCCCTTCAAGAACCTGACGGACTGCCACCTCTTCGAGACATTCAGCACCATATAGATCTGATCCCCGGAGCATCACTGCCAAACTTGGCTCATTACAGAATGAC
CCCTAAAGAGTATGCAGCGCTCCATGAACATATCGAAGACCTACTCAAGAAAGGGCATATTAAGCCAAGCCTCAGTCCTTGTGCTGTCCCAGCTCTCCTCACCCCCAAGA
AGGATGGAAGTTGGAGAATGTGTGTAGATAGCCGAGCCATTAACCGAATCACAGTAAAATACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAG
GCCACCATCTTCTCGAAAATTGACCTAAGGAGTGGATACCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACCGCCTTCAAGACGAATGAAGGACTGTTTGAATG
GATGGTAATGCCCTTCGGTCTATCCAATGCTCCCAGTACCTTCATGAGGCTAATGAATCAGGTACTGCACCCCTTCCTCAACAAATTCATTGTGGTCTATTTCGATGATA
TATTGGTATACAGCAGTGGAAACGACGAACACCTGCTCCACCTTAAGAAGTTGTTTCAAGTTTTGACAGAAAAGGAACTATACATCAATCAAAAGAAGTGTGAATTCTTG
AAGTCTGAAATTACATTTCTTGGTTTTATAATTAAGAAAGGAGAAGTAAGCATGGAACCAAGAAAGGTTGAAGCAATACGAGAGTGGTTGGCTCCAACCACTGTCAAAGA
AGTTCAAGCCTTCCTAGGACTGGCTTCTTTTTACAGAAAGTTTATAAAGAACTTCAGCTCCATCTGCGCACCACTAACCGACTGCTTAAAGAAGGGAAACTTTAAGTGGA
CTTCATTCCAACAAGAGAGCTTCGAGGAAATAAAGAAAAGGTTAGCTTCTAACCCTGTTCAGCAACTGCCAGATTTCTCTTCTCCTTTCGAAGTAGCAGTAGACGCCTGT
TGCACAGGAATTGGGGCAGTCTTATCCCAACGAGGTCACCCGATTGAATACCTCAGTGAGAAATTGAGCACCGCACGACAAACATGGAGCACATACGAGCAAGAATTATA
TGCCTTAGTTCGAGCTCTCAAGCAGTGGGAACATTACTTGCTCTCCAAAGAGTTTGTTCTCTTAACAGATCATTTCTCCCTGAAATATCTTCAATCTCAAAAGAATATCA
GCCGGATGCATGCCCGTTGGATATCTTATTTACAGAGATTCGATTTTGTTATCAAACACCAGGCTGGAAAAGAGAACAAGGTGGCTGACGCACTGAGTAGAAAAGGCTCC
CTGCTCACACTTCTCTCCTCAGAAATAATTGCTTTCGAGCACCTGCCAGAACTATACGAAAGAGATGCTGACTTCGCAGACATCTGGCATAAATGCTCCAATCACCTAAG
GGCTGAAGGATACCACATCCTAGAAGGATTCCTCTTCAAGGGAGACCAATTATGCATACCACACACTTCCCTGCGGGAAGCCTTAATAAAGGAAGCTCATTCCAACGGAT
TAGCGGGCCATTTCCGGCAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGTTCAACAACAGACGTAGGCAGCGTGGACGGGGAAATTTCAGAAACCTCCACAATCCACGATACGCTAATCGAGGAAGGGCTGATCTCAACCCACCACCACAA
TTCTACGCCGACGTCCATAATGAGCAAGAAGACGAATCTTCGAGCAGTGACGACCAGAACAACCCCCTGGGACGCCTTAGACGAAACACAGAGAGGACAACACGGCAGGA
GATTCGGCACGCGGGCAGAAACTCATCATGACTTCAAGATGAAGGTTGACTTACCTTCATATAGTGGCAAGAGAGATATCGAATCCTTTTTGGATTGGCTAAAAAGTACT
GAGAACTTTTTCAGTTACATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCAACAG
GCAAAGATACAACAAACGACCCGTACGCTCTTGGGAAAAAATGAAGAAGCTATTAAAAGAACGATTCCTGCCTCCGAATTATGAGCAGACCTTATACAACCAATATCAGA
ATTGTCGTCAAGGTACCAGGACAGTTACCGAATACATAGAAGAGTTCCACCGATTAAGTGCCAGGACAAATCTAAGTGAGAACGAGCAACATCAAATAGCCAGATTTGTG
GGGGGTCTGCGTTTTGATATTAAAGAAAAAGTAAGATTACAGCCTCTCCGTTTCCTATCTGAAGCAATTTCATTGGCCGAAACAGTAGAAGAGATGATAGCTCTAAAGAC
CAACACCATGAATCGAAGAACATGGGAACCAACATCAAACAAGAAGACAAACTATGCGAACAAGTCGAATGATCAGCCGATGGCACCAGCTTATGGAAAAGGAAAAGAAA
CCGACTCTCAGACTGCAACATATGAAAAGAAGGCAGAGATGATCAACAGAAGCAAGAACCAAAATAATTACACTCGCCCATCATTGGGCAAGTGCTTCAGATGCGGACAA
CCAGGTCACCTATCAAATTCTTGCCCTCAAAGAAAAACAATTGCCTTAGCCGATGAAGAAGATAACTTACCTGGCGAGGGTGAGTCAGAACCAGAAGAGGAAACGGAAGA
AATTGAAGCCGATGAAGGAGATAGGATCTCTTGTGTTATCCATAGAGTACTCATTGCTCCCAAGGAAGAAAAGAGCCCACAACGCCACAGTCTTTTCAAGACAAGGTGCA
CCATTAATGGGAAGGTATGCGATGTTATTATCGATGGAGGTAGCAGTGAAAACTTTATAACCAAGAAGATAGTCTCCAACCTGAACTTGAAGGTTCAGCCACATCCAAAC
CCCTACAAGATAGGATGGGTCAAGAAAGGGAATGAATCCACAGTTAACGAGGTATGTACGGTTCCCCTTTCTATCGGCAGCAGCTACAAGGATCAGATCGTATGCGATGT
AATCGACATGGATGTATGCCATGTCCTCCTAGGCAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTCCACTGGATGGGTAAAC
GGATAACTCTACTGCCCTTGACCAAAAAGAACGAGGAGAACAGTAAGACAAGGGGCCAGCTATTCACAACGTGCAGTGGCAAAACTCTTTTAAAAGAAAGAAAGCAGGAT
ATTTTAGCCCTTCAAGAACCTGACGGACTGCCACCTCTTCGAGACATTCAGCACCATATAGATCTGATCCCCGGAGCATCACTGCCAAACTTGGCTCATTACAGAATGAC
CCCTAAAGAGTATGCAGCGCTCCATGAACATATCGAAGACCTACTCAAGAAAGGGCATATTAAGCCAAGCCTCAGTCCTTGTGCTGTCCCAGCTCTCCTCACCCCCAAGA
AGGATGGAAGTTGGAGAATGTGTGTAGATAGCCGAGCCATTAACCGAATCACAGTAAAATACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAG
GCCACCATCTTCTCGAAAATTGACCTAAGGAGTGGATACCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACCGCCTTCAAGACGAATGAAGGACTGTTTGAATG
GATGGTAATGCCCTTCGGTCTATCCAATGCTCCCAGTACCTTCATGAGGCTAATGAATCAGGTACTGCACCCCTTCCTCAACAAATTCATTGTGGTCTATTTCGATGATA
TATTGGTATACAGCAGTGGAAACGACGAACACCTGCTCCACCTTAAGAAGTTGTTTCAAGTTTTGACAGAAAAGGAACTATACATCAATCAAAAGAAGTGTGAATTCTTG
AAGTCTGAAATTACATTTCTTGGTTTTATAATTAAGAAAGGAGAAGTAAGCATGGAACCAAGAAAGGTTGAAGCAATACGAGAGTGGTTGGCTCCAACCACTGTCAAAGA
AGTTCAAGCCTTCCTAGGACTGGCTTCTTTTTACAGAAAGTTTATAAAGAACTTCAGCTCCATCTGCGCACCACTAACCGACTGCTTAAAGAAGGGAAACTTTAAGTGGA
CTTCATTCCAACAAGAGAGCTTCGAGGAAATAAAGAAAAGGTTAGCTTCTAACCCTGTTCAGCAACTGCCAGATTTCTCTTCTCCTTTCGAAGTAGCAGTAGACGCCTGT
TGCACAGGAATTGGGGCAGTCTTATCCCAACGAGGTCACCCGATTGAATACCTCAGTGAGAAATTGAGCACCGCACGACAAACATGGAGCACATACGAGCAAGAATTATA
TGCCTTAGTTCGAGCTCTCAAGCAGTGGGAACATTACTTGCTCTCCAAAGAGTTTGTTCTCTTAACAGATCATTTCTCCCTGAAATATCTTCAATCTCAAAAGAATATCA
GCCGGATGCATGCCCGTTGGATATCTTATTTACAGAGATTCGATTTTGTTATCAAACACCAGGCTGGAAAAGAGAACAAGGTGGCTGACGCACTGAGTAGAAAAGGCTCC
CTGCTCACACTTCTCTCCTCAGAAATAATTGCTTTCGAGCACCTGCCAGAACTATACGAAAGAGATGCTGACTTCGCAGACATCTGGCATAAATGCTCCAATCACCTAAG
GGCTGAAGGATACCACATCCTAGAAGGATTCCTCTTCAAGGGAGACCAATTATGCATACCACACACTTCCCTGCGGGAAGCCTTAATAAAGGAAGCTCATTCCAACGGAT
TAGCGGGCCATTTCCGGCAAGAATAG
Protein sequenceShow/hide protein sequence
MESSTTDVGSVDGEISETSTIHDTLIEEGLISTHHHNSTPTSIMSKKTNLRAVTTRTTPWDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKST
ENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKERFLPPNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFV
GGLRFDIKEKVRLQPLRFLSEAISLAETVEEMIALKTNTMNRRTWEPTSNKKTNYANKSNDQPMAPAYGKGKETDSQTATYEKKAEMINRSKNQNNYTRPSLGKCFRCGQ
PGHLSNSCPQRKTIALADEEDNLPGEGESEPEEETEEIEADEGDRISCVIHRVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSSENFITKKIVSNLNLKVQPHPN
PYKIGWVKKGNESTVNEVCTVPLSIGSSYKDQIVCDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQD
ILALQEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGK
ATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFL
KSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIKKRLASNPVQQLPDFSSPFEVAVDAC
CTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGS
LLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFRQE