CuGenDBv2

ClCG05G013100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G013100
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Reverse transcriptase
Genome location: CG_Chr05:18295510..18304359
RNA-Seq Expression: ClCG05G013100
Synteny: ClCG05G013100
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
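
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span covered by the gene could be computed as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("CG_Chr05:18295510..18304359")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` in `span_length` would need to be dropped.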


Homology
GenBank top hits (e value, % identity, alignment)
XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value: 0.0e+00; % identity: 53.12)
Query:  RPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGE
        R ++DY  PI+ +   GI R     + FE+KP ++ M+Q A QF GSP +DP+ HL  F+EI +T  +  ++ D IRL LFPFSL D+AR W  SL+PG 
Subjt:  RPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGE

Query:  ITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILD
        IT+W  M EKF+ KFFPP + A+ R +I  F+Q D E+L +AW ++K L+R CP +G PD +Q+++FY+GL   ++T  +AA+ G L+ KT   A  +L+
Subjt:  ITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILD

Query:  RISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQ---VNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFIRNNPYSNTYN
         ++ N+  W      R+  K+   +G  E    AAL AQ  +L+   +  + Q     A     ++  V   E    +V   N ++  + R NP  N Y+
Subjt:  RISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQ---VNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFIRNNPYSNTYN

Query:  PGWRNHPNFSWGGNR-----------QPEQQGAPIH-------ERGGSSGFSHGHQRQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEA----PRGTGSS
        PG RNH NFS+G  +           QP ++   +        E   ++      Q  N         ++M+ L  +  Q     N +     P  T  +
Subjt:  PGWRNHPNFSWGGNR-----------QPEQQGAPIH-------ERGGSSGFSHGHQRQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEA----PRGTGSS

Query:  GKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAE-----LQTP-PYPQRLRRKKNNERQFKRFLDVLKQ
         KEQC+A+TLRSGR +   P           +N  +K     ++   D LR +   D   S++       L TP PYPQR +++K  ++QF +FLD+ K+
Subjt:  GKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAE-----LQTP-PYPQRLRRKKNNERQFKRFLDVLKQ

Query:  LHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIA-TQ
        +HINIP  +ALEQMP YAKFLKDI+SKKR ++E ET+ L++E S ++ +++P K+KDP SFT+PC+IG     K LCDLGASINLMPLSV+ KLG+   +
Subjt:  LHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIA-TQ

Query:  PTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLD
         TT++LQLADRS+ YP G IEDVLVKVDKFI PADF++LD E D++VP+ILGRPFL+TGR L+DV KGE+ +RVN +EV FN+++A+++P +   C  +D
Subjt:  PTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLD

Query:  TEDESHERWLEENE------------------ETIANLDAQQPEQCCALVHSAFETLTPNRIN------QQIKPSLEEAPEI------------ELKVLP
          ++      +E+                      A  D+  P      +H  F      ++       ++++P + +   +            ELK LP
Subjt:  TEDESHERWLEENE------------------ETIANLDAQQPEQCCALVHSAFETLTPNRIN------QQIKPSLEEAPEI------------ELKVLP

Query:  VHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPI
         HL+YA+LG+  + PV ++A+L+P +E  LL +L++H  A+GWT++DIKGISP+ CMHKI +EE    SIE QRRLNPAMKEVV+ E+LK L+AG+I+ I
Subjt:  VHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPI

Query:  SDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCP
        SDS WVSPVQ VPKKGGMTVV N NNE I TRTVTGWR+CMDYRKLN AT+KDHFPLPFIDQMLDRLAG  ++CFLDGYSGYNQI IAPEDQEKTTFTCP
Subjt:  SDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCP

Query:  YGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQA
        YGTFAFRRMPFGLCNAP TFQRCMMAIFS+ +E+ +EIFMDDFSVFG SF+ CL NL  VL+RC+D NLVLNWEKCHFMV EGIVLGH++S KG+EVD+A
Subjt:  YGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQA

Query:  KIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQ
        KI  +EKLPPP N+K +RSFLGHAGFYRRF+KDFSK+++PL +LLE+N  + F++ CL+AF  +K  L+SAP++  PDWSQPFE+MCDASD+A+GA L Q
Subjt:  KIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQ

Query:  RRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLS
        RRD++   I YAS+T N AQ NY+TT+KE+LAVVFA +KFRSYL+ +KVI++TDH+A+RYL +KKDAKPRLIRW+LLLQEFD E+ D+KG EN+VADHLS
Subjt:  RRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLS

Query:  RLENMEHDRKQPDVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSP
        RLE  E  R    +  +FPDE +       PWYADIVNFL CK  P D    Q+KK +HD K+Y WDEP L++R PD I R CVPE   Q IL  CH S 
Subjt:  RLENMEHDRKQPDVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSP

Query:  YGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARN
        YGGHFG  RTAAKVLQSG+FWP++FRD+      CDRCQR+GNIS + ++PL +ILEVELFDVWGIDFMGPFPPS G  YIL+AVDYVSKWVEAI+   N
Subjt:  YGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARN

Query:  DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKT
        DA  V  FL KNIFTRFGTPRA+ISDEGTHF NK+  NLL KY V+H++A AYHPQ NGQAEISNRE+K ILEK VN++RKDWA KL++ALWAYRTAFKT
Subjt:  DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKT

Query:  PIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------ELFP--
        PIGMSPY LVFGKACHLP+ELEHKA WA K+ N+DLKAAGE R LQLNE++E+R  AYENAKIYKERTK+W D++I ++    GQ+        +LFP  
Subjt:  PIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------ELFP--

Query:  ----------------HGAVELMNEDGSNAFKVNGQRVKPYYGVGLERD
                         GA++L ++ G + F+VNGQR+K YYG  +ER+
Subjt:  ----------------HGAVELMNEDGSNAFKVNGQRVKPYYGVGLERD

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera] (e value: 0.0e+00; % identity: 53.38)
Query:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL
        +   R + DYA P +    P I+RP    + FE+KP ++QM+Q   QF G P EDPHAHL +F+EI +T  +  +S D IRL LFPFSL D+A+ W  S 
Subjt:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL

Query:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK
         P   TTWN + + F+ K+FPP + A+ R DI +F Q D E+L +AW +FK L R CPH+G PD + ++ FY+GLT + +   +AAA G L+ K+  EA 
Subjt:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK

Query:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI--------R
        E+L+ ++ N+  W     S          G  + + +  L A+  +L +   G  G VN+V+      C  CG  H    C Q    V F+        +
Subjt:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI--------R

Query:  NNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEYMQKN--------DAFNTEAPRGTG
        NNPYSNTYNPGWRNHPNFSW           P+H  G     S    +Q+   + +     SS     +EA + +    N           N+   RG G
Subjt:  NNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEYMQKN--------DAFNTEAPRGTG

Query:  S-------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFL
        +       + KE C+AVTLRSG+ + ++     S  TI       K+D     K+       L K  S     E   P  P+PQRL++ K  ++QF++FL
Subjt:  S-------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFL

Query:  DVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG
         V +QLHINIP  +AL Q+P Y KFLK+I+SKKR +++ ETIALT+E S ++  ++P K++DP SF+IPC+IG +   +ALCDLGAS++LMPLSV  KLG
Subjt:  DVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG

Query:  I-ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQED
        +   +PTT++LQLADRSV YP G +E+VL+KV KFI+P DFI+L+ E D ++PIILGRPFL+T   +IDV  G + ++V ++EV FN+F+A +YP   + 
Subjt:  I-ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQED

Query:  CQNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGN
           +D  DES       E   E  E  + +    + +    A V  A E   P    + I             PS  +AP +ELK LP HL YA+LGE N
Subjt:  CQNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGN

Query:  SLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCV
        +LPV +S +LS  Q   L+ IL+   +A+GWT++D++GISP+ CMH+I +E+     +E QRRLNP MKEVV+ EVLKWLDAG+I+PISDS W+SPVQ V
Subjt:  SLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCV

Query:  PKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFG
        PKKGGMTVV N+NNELI TRTVTGWR+C+DYRKLN+ T+KDHFPLPF+DQ+L+RLAG  ++CFLDGYSGYNQI I+PEDQEKTTFTCPYGTFAFRRMPFG
Subjt:  PKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFG

Query:  LCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPT
        LCNAP TFQRCMMAIFS+F+E+ +E+FMDDFSVFG+SF++CL NL +VL+RC++TNLVLNWEKCHFMV EGIVLGHKIS +GLEVD+AKIE +EKLPPPT
Subjt:  LCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPT

Query:  NIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYA
        N+K +RSFLGH GFYRRF+KDFSKI++PL +LL ++  + F+++CL AF  LK  LVSAPI+ APDWS PFELMCDASD+A+GA L QR+DR LH I YA
Subjt:  NIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYA

Query:  SKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQP
        S+  N AQ NY+TT+KELLAVVFA +KFRSYL+GSKVI+YTDHSAI+YL+ KKDAKPRLIRWVLLLQEFD EI D++GMEN VADHLSRLE      + P
Subjt:  SKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQP

Query:  DVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAA
         +N SFPDE +L V+   PWYAD+VN+LV    P D +  QKKK + D K Y+W+EP LY+   D + R CVP+   + IL  CH    GGHF   +T A
Subjt:  DVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAA

Query:  KVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKN
        KV QSG++WPT+++D R Y   CDRCQR+GNIS +N+MPLT+ILEVELFD+WGIDFMGPFP S  + YILVAVDYVSKWVEA +   ND+  V  F++KN
Subjt:  KVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKN

Query:  IFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFG
        IF+RFG PRA+ISDEG+HF N+    LL KY V H+VA AYHPQ NGQ E++NRELK ILEK V+SSRKDWA+KL++ALWAYRTAFKTP+GMSPY LVFG
Subjt:  IFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFG

Query:  KACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------
        K+CHLP+ELEH+A WA K LNMDLKAAGE R LQL+ELEE+RL AYEN +IYKE+TK W D+ +  ++  IGQ+                          
Subjt:  KACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------

Query:  ELFPHGAVELMNEDGSNAFKVNGQRVKPY
        +++P+GAVE+ +E  + AFKVNGQR+KPY
Subjt:  ELFPHGAVELMNEDGSNAFKVNGQRVKPY

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera] (e value: 0.0e+00; % identity: 53.38)
Query:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL
        +   R + DYA P +    P I+RP    + FE+KP ++QM+Q   QF G P EDPHAHL +F+EI +T  +  +S D IRL LFPFSL D+A+ W  S 
Subjt:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL

Query:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK
         P   TTWN + + F+ K+FPP + A+ R DI +F Q D E+L +AW +FK L R CPH+G PD + ++ FY+GLT + +   +AAA G L+ K+  EA 
Subjt:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK

Query:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI--------R
        E+L+ ++ N+  W     S          G  + + +  L A+  +L +   G  G VN+V+      C  CG  H    C Q    V F+        +
Subjt:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI--------R

Query:  NNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEYMQKN--------DAFNTEAPRGTG
        NNPYSNTYNPGWRNHPNFSW           P+H  G     S    +Q+   + +     SS     +EA + +    N           N+   RG G
Subjt:  NNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEYMQKN--------DAFNTEAPRGTG

Query:  S-------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFL
        +       + KE C+AVTLRSG+ + ++     S  TI       K+D     K+       L K  S     E   P  P+PQRL++ K  ++QF++FL
Subjt:  S-------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFL

Query:  DVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG
         V +QLHINIP  +AL Q+P Y KFLK+I+SKKR +++ ETIALT+E S ++  ++P K++DP SF+IPC+IG +   +ALCDLGAS++LMPLSV  KLG
Subjt:  DVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG

Query:  I-ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQED
        +   +PTT++LQLADRSV YP G +E+VL+KV KFI+P DFI+L+ E D ++PIILGRPFL+T   +IDV  G + ++V ++EV FN+F+A +YP   + 
Subjt:  I-ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQED

Query:  CQNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGN
           +D  DES       E   E  E  + +    + +    A V  A E   P    + I             PS  +AP +ELK LP HL YA+LGE N
Subjt:  CQNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGN

Query:  SLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCV
        +LPV +S +LS  Q   L+ IL+   +A+GWT++D++GISP+ CMH+I +E+     +E QRRLNP MKEVV+ EVLKWLDAG+I+PISDS W+SPVQ V
Subjt:  SLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCV

Query:  PKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFG
        PKKGGMTVV N+NNELI TRTVTGWR+C+DYRKLN+ T+KDHFPLPF+DQ+L+RLAG  ++CFLDGYSGYNQI I+PEDQEKTTFTCPYGTFAFRRMPFG
Subjt:  PKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFG

Query:  LCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPT
        LCNAP TFQRCMMAIFS+F+E+ +E+FMDDFSVFG+SF++CL NL +VL+RC++TNLVLNWEKCHFMV EGIVLGHKIS +GLEVD+AKIE +EKLPPPT
Subjt:  LCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPT

Query:  NIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYA
        N+K +RSFLGH GFYRRF+KDFSKI++PL +LL ++  + F+++CL AF  LK  LVSAPI+ APDWS PFELMCDASD+A+GA L QR+DR LH I YA
Subjt:  NIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYA

Query:  SKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQP
        S+  N AQ NY+TT+KELLAVVFA +KFRSYL+GSKVI+YTDHSAI+YL+ KKDAKPRLIRWVLLLQEFD EI D++GMEN VADHLSRLE      + P
Subjt:  SKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQP

Query:  DVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAA
         +N SFPDE +L V+   PWYAD+VN+LV    P D +  QKKK + D K Y+W+EP LY+   D + R CVP+   + IL  CH    GGHF   +T A
Subjt:  DVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAA

Query:  KVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKN
        KV QSG++WPT+++D R Y   CDRCQR+GNIS +N+MPLT+ILEVELFD+WGIDFMGPFP S  + YILVAVDYVSKWVEA +   ND+  V  F++KN
Subjt:  KVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKN

Query:  IFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFG
        IF+RFG PRA+ISDEG+HF N+    LL KY V H+VA AYHPQ NGQ E++NRELK ILEK V+SSRKDWA+KL++ALWAYRTAFKTP+GMSPY LVFG
Subjt:  IFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFG

Query:  KACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------
        K+CHLP+ELEH+A WA K LNMDLKAAGE R LQL+ELEE+RL AYEN +IYKE+TK W D+ +  ++  IGQ+                          
Subjt:  KACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------

Query:  ELFPHGAVELMNEDGSNAFKVNGQRVKPY
        +++P+GAVE+ +E  + AFKVNGQR+KPY
Subjt:  ELFPHGAVELMNEDGSNAFKVNGQRVKPY

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera] (e value: 0.0e+00; % identity: 53.27)
Query:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL
        +   R + DYA P +    P I+RP    + FE+KP ++QM+Q   QF G P EDPHAHL +F+EI +T  +  +S D IRL LFPFSL D+A+ W  S 
Subjt:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL

Query:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK
         P   TTWN + + F+ K+FPP + A+ R DI +F Q D E+L +AW +FK L R CPH+G PD + ++ FY+GLT + +   +AAA G L+ K+  EA 
Subjt:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK

Query:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI--------R
        E+L+ ++ N+  W     S          G  + + +  L A+  +L +      G VN+V+      C  CG  H    C Q    V F+        +
Subjt:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI--------R

Query:  NNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEYMQKN--------DAFNTEAPRGTG
        NNPYSNTYNPGWRNHPNFSW           P+H  G     S    +Q+   + +     SS     +EA + +    N           N+   RG G
Subjt:  NNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEYMQKN--------DAFNTEAPRGTG

Query:  S-------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFL
        +       + KE C+AVTLRSG+ + ++     S  TI       K+D     K+       L K  S     E   P  P+PQRL++ K  ++QF++FL
Subjt:  S-------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFL

Query:  DVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG
         V +QLHINIP  +AL Q+P Y KFLK+I+SKKR +++ ETIALT+E S ++  ++P K++DP SF+IPC+IG +   +ALCDLGAS++LMPLSV  KLG
Subjt:  DVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG

Query:  I-ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQED
        +   +PTT++LQLADRSV YP G +E+VL+KV KFI+P DFI+L+ E D ++PIILGRPFL+T   +IDV  G + ++V ++EV FN+F+A +YP   + 
Subjt:  I-ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQED

Query:  CQNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGN
           +D  DES       E   E  E  + +    + +    A V  A E   P    + I             PS  +AP +ELK LP HL YA+LGE N
Subjt:  CQNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGN

Query:  SLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCV
        +LPV +S +LS  Q   L+ IL+   +A+GWT++D++GISP+ CMH+I +E+     +E QRRLNP MKEVV+ EVLKWLDAG+I+PISDS W+SPVQ V
Subjt:  SLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCV

Query:  PKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFG
        PKKGGMTVV N+NNELI TRTVTGWR+C+DYRKLN+ T+KDHFPLPF+DQ+L+RLAG  ++CFLDGYSGYNQI I+PEDQEKTTFTCPYGTFAFRRMPFG
Subjt:  PKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFG

Query:  LCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPT
        LCNAP TFQRCMMAIFS+F+E+ +E+FMDDFSVFG+SF++CL NL +VL+RC++TNLVLNWEKCHFMV EGI+LGHKIS +GLEVD+AKIE +EKLPPPT
Subjt:  LCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPT

Query:  NIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYA
        N+K +RSFLGH GFYRRF+KDFSKI++PL +LL ++  + F+++CL AF  LK  LVSAPI+ APDWS PFELMCDASD+A+GA L QR+DR LH I YA
Subjt:  NIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYA

Query:  SKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQP
        S+  N AQ NY+TT+KELLAVVFA +KFRSYL+GSKVI+YTDHSAI+YL+ KKDAKPRLIRWVLLLQEFD EI D++GMEN VADHLSRLE      + P
Subjt:  SKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQP

Query:  DVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAA
         +N SFPDE +L V+   PWYAD+VN+LV    P D +  QKKK + D K Y+W+EP LY+   D + R CVP+   + IL  CH    GGHF   +T A
Subjt:  DVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAA

Query:  KVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKN
        KV QSG++WPT+++D R Y   CDRCQR+GNIS +N+MPLT+ILEVELFD+WGIDFMGPFP S  + YILVAVDYVSKWVEA +   ND+  V  F++KN
Subjt:  KVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKN

Query:  IFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFG
        IF+RFG PRA+ISDEG+HF N+    LL KY V H+VA AYHPQ NGQ E++NRELK ILEK V+SSRKDWA+KL++ALWAYRTAFKTP+GMSPY LVFG
Subjt:  IFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFG

Query:  KACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------
        K+CHLP+ELEH+A WA K LNMDLKAAGE R LQL+ELEE+RL AYEN +IYKE+TK W D+ +  ++  IGQ+                          
Subjt:  KACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------

Query:  ELFPHGAVELMNEDGSNAFKVNGQRVKPY
        +++P+GAVE+ +E  + AFKVNGQR+KPY
Subjt:  ELFPHGAVELMNEDGSNAFKVNGQRVKPY

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera] (e value: 0.0e+00; % identity: 53.47)
Query:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL
        +   R + DYA P +    P I+RP    + FE+KP ++QM+Q   QF G P EDPHAHL +F+EI +T  +  +S D IRL LFPFSL D+A+ W  S 
Subjt:  HDRNRPIRDYASPILYNFSPGIMRPESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSL

Query:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK
         P   TTWN + + F+ K+FPP + A+ R DI +F Q D E+L +AW +FK L R CPH+G PD + ++ FY+GLT + +   +AAA G L+ K+  EA 
Subjt:  EPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAK

Query:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYE-------VCPQNPQSVCFIRN
        E+L+ ++ N+  W     S          G  + + +  L A+  +L +   G  G VN+V+      C  CG  H          V   N Q     +N
Subjt:  EILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYE-------VCPQNPQSVCFIRN

Query:  NPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEY--------MQKNDAFNTEAPRGTGS
        NPYSNTYNPGWRNHPNFSW           P+H  G     S    +Q+   + +     SS     +EA + +         MQ     N+   RG G+
Subjt:  NPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQN---SYQSAPGPSSS----MEALLKEY--------MQKNDAFNTEAPRGTGS

Query:  -------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFLD
               + KE C+AVTLRSG+ + ++     S  TI       K+D     K+       L K  S     E   P  P+PQRL++ K  ++QF++FL 
Subjt:  -------SGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTP--PYPQRLRRKKNNERQFKRFLD

Query:  VLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGI
        V +QLHINIP  +AL Q+P Y KFLK+I+SKKR +++ ETIALT+E S ++  ++P K++DP SF+IPC+IG +   +ALCDLGAS++LMPLSV  KLG+
Subjt:  VLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGI

Query:  -ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDC
           +PTT++LQLADRSV YP G +E+VL+KV KFI+P DFI+L+ E D ++PIILGRPFL+T   +IDV  G + ++V ++EV FN+F+A +YP   +  
Subjt:  -ATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDC

Query:  QNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGNS
          +D  DES       E   E  E  + +    + +    A V  A E   P    + I             PS  +AP +ELK LP HL YA+LGE N+
Subjt:  QNLDTEDES------HERWLEENEETIANLDAQQPEQC-CALVHSAFETLTPNRINQQI------------KPSLEEAPEIELKVLPVHLKYAYLGEGNS

Query:  LPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVP
        LPV +S +LS  Q   L+ IL+   +A+GWT++D++GISP+ CMH+I +E+     +E QRRLNP MKEVV+ EVLKWLDAG+I+PISDS W+SPVQ VP
Subjt:  LPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVP

Query:  KKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGL
        KKGGMTVV N+NNELI TRTVTGWR+C+DYRKLN+ T+KDHFPLPF+DQ+L+RLAG  ++CFLDGYSGYNQI I+PEDQEKTTFTCPYGTFAFRRMPFGL
Subjt:  KKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGL

Query:  CNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTN
        CNAP TFQRCMMAIFS+F+E+ +EIFMDDFSVFG+SF++CL NL +VL+RC++TNLVLNWEKCHFMV EGIVLGHKIS +GLEVD+AKIE +EKLPPPTN
Subjt:  CNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTN

Query:  IKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYAS
        +K +RSFLGH GFYRRF+KDFSKI++PL +LL ++  + F+++CL AF  LK  LVSAPI+ APDWS PFELMCDASD+A+GA L QR+DR LH I YAS
Subjt:  IKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYAS

Query:  KTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPD
        +  N AQ NY+TT+KELLAVVFA +KFRSYL+GSKVI+YTDHSAI+YL+ KKDAKPRLIRWVLLLQEFD EI D++GMEN VADHLSRLE      + P 
Subjt:  KTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPD

Query:  VNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAK
        +N SFPDE +L V+   PWYAD+VN+LV    P D +  QKKK + D K Y+W+EP LY+   D + R CVP+   + IL  CH    GGHF   +T AK
Subjt:  VNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAK

Query:  VLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNI
        V QSG++WPT+++D R Y   CDRCQR+GNIS +N+MPLT+ILEVELFD+WGIDFMGPFP S  + YILVAVDYVSKWVEA +   ND+  V  F++KNI
Subjt:  VLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNI

Query:  FTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGK
        F+RFG PRA+ISDEG+HF N+    LL KY V H+VA AYHPQ NGQ E++NRELK ILEK V+SSRKDWA+KL++ALWAYRTAFKTP+GMSPY LVFGK
Subjt:  FTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGK

Query:  ACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------E
        +CHLP+ELEH+A WA K LNMDLKAAGE R LQL+ELEE+RL AYEN +IYKE+TK W D+ +  ++  IGQ+                          +
Subjt:  ACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQK--------------------------E

Query:  LFPHGAVELMNEDGSNAFKVNGQRVKPY
        ++P+GAVE+ +E  + AFKVNGQR+KPY
Subjt:  LFPHGAVELMNEDGSNAFKVNGQRVKPY
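
In the alignments above, the unlabelled middle line of each block marks identical residues with the residue letter and conservative substitutions with `+`. As a rough sketch, assuming that convention holds throughout and that trailing blanks on the middle line may have been trimmed, identity over a segment could be tallied from the `Query:` line and the middle line alone. The function name `midline_stats` is hypothetical, and the ten-residue example is the start of the first Quercus suber alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one aligned segment.

    ASSUMPTION: a letter in the midline marks an identical column,
    '+' marks a conservative substitution, and a blank marks a mismatch
    or a gap column.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query segment")
    # Scraped midlines may have lost trailing blanks, so pad to query width.
    midline = midline.ljust(len(query))
    columns = len(query)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "query_gaps": query.count("-"),
        "percent_identity": 100.0 * identities / columns if columns else 0.0,
    }


stats = midline_stats("RPIRDYASPI", "R ++DY  PI")
print(stats)
```

Summing `identities` and `columns` over every segment of a hit before dividing gives a whole-alignment figure. It may differ slightly from the listed % identity if the database computes that value differently.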

TrEMBL top hits (e value, % identity, alignment)
A0A2G9FWY3 Reverse transcriptase (e value: 0.0e+00; % identity: 53.27)
Query:  MLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDR
        M+Q   QF G   E+P+ H+ +F++I +T     +S D +RL LF FSL+ +A  W +SL    ITTW                                
Subjt:  MLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDR

Query:  ETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQN--HASGAPEHNSVA
        ET+ +AW++F++++RNCP++  P  +Q+  FY GLTE  +   +       L  T  E   +L+ +  NH       Y +   +     A+G  E + V 
Subjt:  ETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQN--HASGAPEHNSVA

Query:  ALQAQTMALNQS--NTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGG
        AL A+   L QS  N G    VN V  I    C  CGE H  + CP + +S+ F+      +NNPYSNTYNPGWR HPNFSW  N+   Q  AP  ++GG
Subjt:  ALQAQTMALNQS--NTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGG

Query:  SSGFSHGHQ------RQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEAPRGTGSS---------GKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTT
                Q       +   Q     +++ + +  +  Q  +A N+  P+G+  S         GK QCQAVTLR+GR + E                  
Subjt:  SSGFSHGHQ------RQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEAPRGTGSS---------GKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTT

Query:  KLDQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQE
         + + T+ K+ +++    GK        E++ P                      L++LHINIP  EALEQMP+Y KF+KDILSKKR + ++ET+ALT+E
Subjt:  KLDQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQE

Query:  SSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEA
         S ++  ++P K+KDP SFTIPC+IG    G+ALCDLG             LG A +PT++TLQLADRS+ YP+G IED+LVKVDKFI PADF++LD E 
Subjt:  SSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEA

Query:  DKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKAL-------EYPGEQEDCQNLDTEDESHERWLEENEETIANLDAQQPEQCCALVHSAF
        D +VPIILGRPFL+TGRTLIDV K       +D+    ++F  L       E P +  +   LD  DE +    EE+ E +  LDA +  +   +   + 
Subjt:  DKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKAL-------EYPGEQEDCQNLDTEDESHERWLEENEETIANLDAQQPEQCCALVHSAF

Query:  ETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEA
        E   P+++   +KPS+EE P +ELK LP HL YAYLGE ++LPV IS++LS  Q   LL +L+ H  A+GWT+ADIKGISP++CMHKI LE+ +  S+E+
Subjt:  ETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEA

Query:  QRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEF
        QRRLNP MKEVVKKE++KWLDAG+I+PISDS WVSPVQCVPKKGG+TVVPN +NELI TRTVTGWR+CMDYRKLN AT+KDHFPL FIDQMLDRLAG EF
Subjt:  QRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEF

Query:  FCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLN
        +CFLDGYSGYNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNAP TFQRCMMAIF++ +E  +E+FMDDFSV+GNSF+ CL NL  VLKRC+DTNL+LN
Subjt:  FCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLN

Query:  WEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAP
        WEKCHFMV EGIVLGHK+S +G+EVD+AK+E +EKLPPPT++K +RSFLGHAGFYRRF+KDFSKI++PL +LLE++ P+ F++ C  AF  LK  L+SAP
Subjt:  WEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAP

Query:  ILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLI
        I+  PDWS PFELMCDASD+A+GA L QR+D++   I YASKT N AQ NY+TT+KELLAVVFA +KFRSYL+G+KVI+YTDH+AIRYL+ KKDAKPRLI
Subjt:  ILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLI

Query:  RWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTES-APWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQL
        RWVLLLQEFD EI DRKG EN +ADHLSRLE+     +   +N +FPDE +L +  S  PWYADIVN+L C   P D +AQQKKK + D + Y+WD+P L
Subjt:  RWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTES-APWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQL

Query:  YRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGP
        +++GPD+I R CVPEI    IL QCH SPYGGHF G RTAAK+LQSG+FWP LF+DA  +   CDRCQR GNIS +++MPL +ILEVELFDVWGIDFMGP
Subjt:  YRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGP

Query:  FPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTI
        F PS G+ YILVAVDYVSKWVEA +   ND+  V +F++KNIFTRFGTPRA+ISD GTHF N+    LL KY V+H+++T YHPQ +GQ E+SNRE+K I
Subjt:  FPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTI

Query:  LEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRW
        LEK V+S+RKDW+ +L+EALWAYRTA+KTPIGMSPY LVFGKACHLP+ELEH A WA ++LN D++AAGE R LQLNEL+E+RL AYENAKIYKE+ KRW
Subjt:  LEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRW

Query:  QDQRISKKSLHIGQ--------------------------KELFPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNL
         +++I ++    GQ                           E+FPHGAVEL N++  N FKVN QR+K Y+G  ++R   ++
Subjt:  QDQRISKKSLHIGQ--------------------------KELFPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNL

A0A2G9G6G2 Reverse transcriptase | 0.0e+00 | 52.02
Query:  MLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDR
        M+Q   QF G   E+P+ H+ +F++I +T     +S D +RL LF FSL+ +A  W  SL    ITTW                                
Subjt:  MLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDR

Query:  ETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRK--QNHASGAPEHNSVA
        ET+ +AW++F++++RNCP++  P  +Q+  FY GLTE  +   +       L  T  E   +L+ +  NH       Y +   +   + A G  E + V 
Subjt:  ETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRK--QNHASGAPEHNSVA

Query:  ALQAQTMALNQS--NTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGG
        AL A+   L QS  N G    VN V Q     C  CGE H  + CP + +S+ F+      +NNPYSNTYNPGWR HPNFSW  N+   Q  AP  ++ G
Subjt:  ALQAQTMALNQS--NTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGG

Query:  SSGFSHGHQ------RQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEAPRGTGSSGKE--QCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQ
                Q       +   Q     +++ + +  +  Q  +A N+  P+G+  S  E    Q VTLR+GR + E+                  + + T+
Subjt:  SSGFSHGHQ------RQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEAPRGTGSSGKE--QCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQ

Query:  QKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCE
         K+ +++        S     E++TP                      L++LHINIP  EALEQMP+Y KF+KDILSKKR + ++E + LT+E S ++  
Subjt:  QKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCE

Query:  RIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIA-TQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPI
        ++P K+K+P SFTIPC+IG    G+ALCDLGASINLMP S++  LG+   +PT++TLQLADRS+ YP+G I+D+LVKVDKFI PADF++LD E D +VPI
Subjt:  RIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIA-TQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPI

Query:  ILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDC------QNLDTEDESHERWLEENEETIANLDAQQPEQCCALVHS---------
        ILGRPFL+TGRTLIDV                   KA+++P E ++C       NL   +   E+ L+  E  + +L  ++ E+ C +V +         
Subjt:  ILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDC------QNLDTEDESHERWLEENEETIANLDAQQPEQCCALVHS---------

Query:  -AFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGS
           E+L     ++ +KPS+EE P +ELK LP HL YAYLGE ++LPV IS++LS  Q   LL +LK H   +GWT+ADIKGISP++CMHKI LE+ +  S
Subjt:  -AFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGS

Query:  IEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAG
        IE+QRRLNP MKEVVKKE++KWLDAG+I+PISDS WVSPVQCVPKKGG+TVVPN +NELI TRTVTGWR+CMDYRKLN AT+KDHFPLPFIDQMLDRLAG
Subjt:  IEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAG

Query:  NEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNL
         EF+CFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCMMAIF++ +E  +E+FMD+FSV+G+SF+ CL NL  VLKRC+DTNL
Subjt:  NEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNL

Query:  VLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALV
        VLNWEKCHFMV EGIVLGHK+S +G+EVD+AK+E +EKLPPPT++K +RSFLGHAGFYRRF+KDFSKI++PL +LLE++ P+ FN+ C  AF  LK  L+
Subjt:  VLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALV

Query:  SAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKP
        SAPI+  PD            D+A+GA L QR+D++   I YASKT N AQ NY+TT+KELLAVVFA +KFRSYL+ +KVI+YTDH+AIRYL+ KKDA P
Subjt:  SAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKP

Query:  RLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVL-KVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDE
         LI WVLLLQEFD EI DRKG EN +ADHLSRLE+     +   +N +FPDE +L  V  + PWYADIVN+L C   P D + QQKKK++ D + Y+W++
Subjt:  RLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVL-KVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDE

Query:  PQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDF
        P L ++GPD+I R CVPEI    IL QCH SPYGGHF G RTAAK+LQSG+FWP LF+DA  +   CDRCQR  NIS +++MPL +ILEVELFDVWGIDF
Subjt:  PQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDF

Query:  MGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNREL
        MGPF PS G+ YILVAVDYVSKWVEA +   ND+  V +F++KNIFTRFGTPRA+ISD  T+F N+    LL KY V+H++ T YHPQ +G  E+SNRE+
Subjt:  MGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNREL

Query:  KTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERT
        K ILEK V+S+RKDW+ +L+EALWAYRTA+KTPIGMSPY L+FGKACHLP+ELEH A WA  +LN D++AAGE R LQLNEL+E+RL AYENAKIYKE+T
Subjt:  KTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERT

Query:  KRWQDQRISKKSLHIGQ--------------------------KELFPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLER
        KRW D++I ++    GQ                           E+FPHGAVEL NE+  N FK+N +R+K Y+G  ++R
Subjt:  KRWQDQRISKKSLHIGQ--------------------------KELFPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLER

A0A2G9HWF8 Reverse transcriptase | 0.0e+00 | 50.97
Query:  DPEIERTFRKRNKTFQRWRQLYREKKMQNQANTANQPLRANNEANDRREAVNYALVQAQLQAQQIDQQTHPILLAHD-RNRPIRDYASPILYNFSPGIMR
        DPEIERTFR R                                   RR+   +   + +++  QI      I++A +  N P+R+ A P        ++ 
Subjt:  DPEIERTFRKRNKTFQRWRQLYREKKMQNQANTANQPLRANNEANDRREAVNYALVQAQLQAQQIDQQTHPILLAHD-RNRPIRDYASPILYNFSPGIMR

Query:  PE-SQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPT
        PE   G + E+   M+QM+Q   QF G   E+P+ H+ +F++I +T     +S D +RL LF FSL+ +A  W  SL    ITTW Q+ E+F+ KFF P 
Subjt:  PE-SQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPT

Query:  ENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGR
        + A  R +I  F+Q   ET+ +AW++F++++RNCP++  P  +Q+  FY GLTE  +   +       L  T  E   +L+ +  NH       Y +   
Subjt:  ENARRRRDIANFQQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGR

Query:  KQN--HASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSWGGN
        +     A+G  E + V AL A+   L QS                     CGE H  + CP + +S+ F+      +NNPYSNTYNPGWR HPNFSW  N
Subjt:  KQN--HASGAPEHNSVAALQAQTMALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSWGGN

Query:  RQPEQQGAPIHERGGSSGFSHGHQRQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEAPRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTT
        +   Q  AP  ++GG             + + P P                             GK QCQAVTLR+GR + E+      +PT       +
Subjt:  RQPEQQGAPIHERGGSSGFSHGHQRQNSYQSAPGPSSSMEALLKEYMQKNDAFNTEAPRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTT

Query:  KLDQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQE
        K  + T +++   + + L     T++      PP+PQRL+++K  ++QF +FL+V K+LHINIP  EALEQMP+Y KF+KDILSKKR + ++ET+ALT+E
Subjt:  KLDQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQE

Query:  SSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG-IATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYE
         S ++  ++P K+KDP               +ALCDLGASINLMP S++  LG +  +PT++TLQLADRS+ YP+G IED+LVKVDKFI PADF++LD E
Subjt:  SSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLG-IATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYE

Query:  ADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLDTED-------------ESHERWL--------EENEETIANLD
         D +VPIILGRPFL+TGRTLIDV KGE+ MRV DQ++ FNVFKA+++P E ++C ++   D             +  ER L        EE+ E +  LD
Subjt:  ADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLDTED-------------ESHERWL--------EENEETIANLD

Query:  AQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCM
        A +  +   +   + E   P+++   +KPS+EE P +ELK LP HL YAYLGE ++LPV IS++LS  Q   LL +L+ H  A+GWT+ADIKGISP++CM
Subjt:  AQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCM

Query:  HKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPL
        HKI LE+ +  S+E+QRRLNP MKEVVKKE++KWLDAG+I+PISD  W+SPVQCVPKKGG+TVVPN +NE I T+TVTGWR+CMDYRKLN AT+KDHFPL
Subjt:  HKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPL

Query:  PFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANL
        PFIDQMLDRLAG EF+CFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR+PF LCNAP TFQRCMMAIF++ +E  +E+FMDDFSV+G+SF+ CL NL
Subjt:  PFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANL

Query:  EKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEEC
          VLKRC+DTNLVLNWEKCHFMV EGIVLGHK+S +G+EVD+AK+E +EKLPP T++K +RSFLGHAGFYRRF+KDF KI++PL  LLE++ P+ F++ C
Subjt:  EKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEEC

Query:  LKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSA
        L AF+ LK  L+SAPI+  PDWS PFELMCDASD+A+GA L QR+D++   I YASKT N AQ NY+TT+KELLAVVFA +KFRSYL+G+KVI+YTDH+A
Subjt:  LKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSA

Query:  IRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTES-APWYADIVNFLVCKQFPEDFNAQQKKK
        IRYL+ KKDAKPRLIRWVLLLQEFD EI DRKG+EN +ADHLSRLE+     +   +N +FPDE +L +  S  PWYADIVN+L C   P D +AQQKKK
Subjt:  IRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTES-APWYADIVNFLVCKQFPEDFNAQQKKK

Query:  LMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSIL
         + D + Y+WD+P L+++GPD+I R CVPEI    I  QCH SPYGGHF   RTAAK+LQSG+FWP LF+D   +   CDRCQR GNIS +++MPL +IL
Subjt:  LMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSIL

Query:  EVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQ
        EVELFDVWGIDFMGPF PS G+ YILVAVDY+SKWVEA++   ND+  V +F++KNIFTRFGTPRA+ISD GTHF N+    LL KY V+H+++T YHPQ
Subjt:  EVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQ

Query:  MNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQ
         +GQ E+SNRE+K  LEK V+S+RKDW+ +L+EALWAYRTAFKTPIGMSPY LVFGKACHLP                              ++ E R +
Subjt:  MNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQ

Query:  AYENAKIYKERTKRWQDQRISKKSLHIGQKELFPHGAVELMNEDGSNAFKVNGQRVKPYY
          +   ++  R K + ++  S+ S      E+ PHGAVEL N++  N FKVN QR+K Y+
Subjt:  AYENAKIYKERTKRWQDQRISKKSLHIGQKELFPHGAVELMNEDGSNAFKVNGQRVKPYY

A0A5N6MBJ1 Reverse transcriptase | 0.0e+00 | 48.97
Query:  GQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSD
        G F G   EDP AH+ SFIEI +TF    +S D I+L +FPFSL D A+ W  SL PG +TTW  + +KF+ K+FPP++ AR R +I +F Q D E+L D
Subjt:  GQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLSD

Query:  AWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNS-VAALQAQT
        AW ++K L+R CPH+G    +Q+  FY+GL    +   +A A G   DKT  E   +L++++ N+  W+    +R    +       ++ S VA ++A T
Subjt:  AWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNS-VAALQAQT

Query:  MALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVC-----PQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSW---GGN-------RQPEQQGA
          +NQ           +NQ+    C  CG PH    C       + + V F+      +NNPYSNTYNPGW+NHPNFSW   G N       R P QQ  
Subjt:  MALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVC-----PQNPQSVCFI------RNNPYSNTYNPGWRNHPNFSW---GGN-------RQPEQQGA

Query:  PIHERGGSSGFSHGHQRQNSYQ----------SAPGPSSSMEALLKEYM-------QKNDAFNTEA----------------------------------
           E    +     +Q QN YQ          S P   S++E ++ +++       QK++A + +A                                  
Subjt:  PIHERGGSSGFSHGHQRQNSYQ----------SAPGPSSSMEALLKEYM-------QKNDAFNTEA----------------------------------

Query:  ------PRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGK-DASTSMTAELQTPPYPQRLRRKKNNERQFK
              P  T ++ KE C+AVTLRSG+T         S      S P  + +   Q +  +  + S GK      +     T PYP RL + +N E+ + 
Subjt:  ------PRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGK-DASTSMTAELQTPPYPQRLRRKKNNERQFK

Query:  RFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFN
        +FLD+ KQLHIN+P VEAL QMP YAKFLKD+L+ K+ ++E   + L +E S ++  ++P KMKDP SFTIPC IGG+ +  AL DLGASINLMP S+F+
Subjt:  RFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFN

Query:  KLGIA-TQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGE
        KL +   +PT +++QLADRSV YP G +E++LVK+ KF+ P DF+ILD + D++VP+ILGRPFL+T R L+DV +G++ +RV+++EV F +  ++++   
Subjt:  KLGIA-TQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGE

Query:  QEDC---------------------QNLDT--------EDESHERWLEENEETIANLDAQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVL
         +D                        LDT        +     R  +E  + +AN  +  PE C       FE +  +    + KPS+EE P +ELK L
Subjt:  QEDC---------------------QNLDT--------EDESHERWLEENEETIANLDAQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVL

Query:  PVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFP
        P HL+YAYL E + LPV I++ L+  ++  LL +LK H +A+ W + DIKGI+P++C HKI +E+     ++ QRRLNP M+EVVKKEV+K LDAG+I+P
Subjt:  PVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFP

Query:  ISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTC
        ISDS WVSPVQ VPKKGGMTVV N+ NELI TRT+TGWR+C+DYRKLN AT+KDHFPLPFIDQML+RL+GN F+CFLDG+SGY QI IAPEDQEKTTFTC
Subjt:  ISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTC

Query:  PYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQ
        PYGTFA+RRMPFGLCNAP TFQRCM+AIF + +EES+E+FMDDFSVFG+SF+ CL+NL+K+L RC+++NLVLNWEKCHFMV EGIVLGHKIS  GLEVD+
Subjt:  PYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQ

Query:  AKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALV
        AK++ + KLPPPT+++ +RSFLGHAGFYRRF+KDFSKIARP++ LLE++  ++F++ECLKAF  LK  LV+API++APDW+ PFELMCDASDYA+G  L 
Subjt:  AKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASDYAMGAALV

Query:  QRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHL
        QR+D+  HPI YASKT N AQ NY+TT+KELLAVVFA +KFRSYL+ SK ++YTDH+A+RYL  K+DAKPRLIRW+LLLQEFD EI D+KG EN  ADHL
Subjt:  QRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHL

Query:  SRLENME-HDRKQPDVNASFPDEAVLKVTE--SAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQC
        SRLEN    + ++ ++N +FP E +L+V      PW+AD  N+L      +    QQ++K   D K+Y+W++P L+R   D + R CV     + IL  C
Subjt:  SRLENME-HDRKQPDVNASFPDEAVLKVTE--SAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQC

Query:  HDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAIS
        H+ P GGH     TA KV  SG++WPT+F+DA      CD CQR GNIS +++MP  SI   E+FDVWGIDFMGPFP S GH YILVAVDYVSKWVEA +
Subjt:  HDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAIS

Query:  CARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRT
           NDA  V  FL+K +F RFG P+ LISD GTHF N  +   L +Y V HR +T YHPQ +GQ E++NRELK ILE+ V  +RK+WA KL++ALWA+RT
Subjt:  CARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRT

Query:  AFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRI-SKKSLHIGQ----------
        A+KTPIG +PY LV+GKACHLP+ELEHKA WA K +N+DL +AGE R +Q++ELE+ R QAYEN++IYKERTK+  D  +   K   +G           
Subjt:  AFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRI-SKKSLHIGQ----------

Query:  ----------------KELFPHGAVELMNEDGSNAFKVNGQRVKPY
                        KE+F +G VE+ + DG   FKVNG R+K Y
Subjt:  ----------------KELFPHGAVELMNEDGSNAFKVNGQRVKPY

A0A6P8CBX2 Reverse transcriptase | 0.0e+00 | 51.23
Query:  AGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLS
        + QF G P E P  H+  F++  NT  + N++ D IRL LFPFSL D+AR W  SL    ITTW  +  KF+++FFPP   AR R +I NF + + E+L 
Subjt:  AGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANFQQKDRETLS

Query:  DAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQT
        +AW +FK  +R CPH+G PD + +E+FY  L +  ++  +AAA G L+ K Y EA  +++ ++ +  +W+      + R ++  +   + +++A L  Q 
Subjt:  DAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQT

Query:  MALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVC-------PQNPQSVCFIRN------NPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGS
         AL  +        ++ N    A C  C  PH+   C         N + V F+ N       PYSNTYNPGWRNHPNFSW  N     +  P       
Subjt:  MALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVC-------PQNPQSVCFIRN------NPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGS

Query:  SGFSHGHQRQNSYQSAPGPSSS--MEALLKEYMQKNDAFNTEAPRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQP------------TIHHSNPTTKL
             G Q+Q   Q+AP   S   ME L+  YMQK D           +   +  Q     S R    LP      P             +   N   + 
Subjt:  SGFSHGHQRQNSYQSAPGPSSS--MEALLKEYMQKNDAFNTEAPRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQP------------TIHHSNPTTKL

Query:  DQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESS
         + + +K     +    +  S  +   +   P+P RL++++  + QF +FLDV K+L INIP  EAL+QMP+YA+F+KD+L+KKR  D  E + LT E S
Subjt:  DQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFKRFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESS

Query:  DMVCER----IPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIA-TQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILD
         M+ ++    +P K +D  SFT+PC+IG  H    L D GASINLMPLS+F KLG+   + T +TLQLADRS+ YP+G +E+VLVKVDKFI P DFI+L+
Subjt:  DMVCER----IPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIA-TQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILD

Query:  YEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLDTEDESHERWLEEN------EETIANL-----DAQQPEQCC
         E D++VP+ILGRPFL+TG+ LIDV +G++ +RV ++++ FNV+ A++   + + C  +D  DE     +EE       E  + +L     D +  E+  
Subjt:  YEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLDTEDESHERWLEEN------EETIANL-----DAQQPEQCC

Query:  ALVHSAFETLTPNRINQQIKP--SLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLE
          V                KP  SL ++P +ELK LP HLKYAYLG  ++LP+ IS++L+  QE  LL +L++H  A+GWT+ADIKGISP  C H+I LE
Subjt:  ALVHSAFETLTPNRINQQIKP--SLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHKIRLE

Query:  EGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQM
              ++ QRRLNP +KEVVKKEVLK LDAG+I+PISDSKWVSPVQ VPKKGGMTVV N+ N+LI TRTVTGWR+C+DYRKLN AT+KDHFPLPFIDQM
Subjt:  EGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQM

Query:  LDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKR
        L++LAG++++CFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCMM+IFS+ LE  +EIFMDDFSVFG SFE+CL NL  VLKR
Subjt:  LDRLAGNEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKR

Query:  CKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFET
        CK+TNL+LNWEKCHFMV EGIVLGHK+SKKG+EVD+AK+E +EKLPPPT+ K +RSFLGHAGFYRRF+KDFSKI+RPL +LLE++  ++FN+ CL+AF  
Subjt:  CKDTNLVLNWEKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFET

Query:  LKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMT
        LK  L SAP+++AP+W  PFELMCDASDYA+GA L QRR ++ H I YAS+T N AQ NY+TT+KELLAV+FA +KFR YL+GSK+I+YTDH+A++YL  
Subjt:  LKAALVSAPILIAPDWSQPFELMCDASDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMT

Query:  KKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAV-LKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAK
        K DAKPRLIRW+LLLQEFD EI D KG EN VADHLSRLE+   D     +N  FPDE + +   +  PWYADIVN++V    P   ++QQKKK +HD K
Subjt:  KKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAV-LKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAK

Query:  FYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFD
        +Y+WDEP L++   D + R CVPE     I+  CH    GGHFG +RTA K+L  G++WP +F D R+Y + C  CQR GNIS ++++P  SIL +ELFD
Subjt:  FYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFD

Query:  VWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAE
        VWGIDFMGPFP S  + YILVAVDYVSKWVEA++   NDA  V  FL+KNIF+RFG PRA+ISD G+HF N+    LL KY V H++AT YHPQ  GQ E
Subjt:  VWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAE

Query:  ISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAK
        +SNRE+K ILEK VN+SRKDW+ KL++ALWAYRTAFKTPIGMSPY +V+GK+CHLP+ELEHKA WA K LN DL+AAGE R LQLN++ E R +AYENA+
Subjt:  ISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAK

Query:  IYKERTKRWQDQRISKKSLHIGQKEL--------------------------FPHGAVELMNEDGSNAFKVNGQRVKPYY-GVGLERDKGNLKRFYQITQ
        IYKER KRW D+ I K+    GQK L                          FP+GAVEL +ED    FKVNG  +K Y+ G  ++ D   +     + +
Subjt:  IYKERTKRWQDQRISKKSLHIGQKEL--------------------------FPHGAVELMNEDGSNAFKVNGQRVKPYY-GVGLERDKGNLKRFYQITQ

Query:  IKNS
        ++ S
Subjt:  IKNS

SwissProt top hits | e value | % identity | Alignment
P0CT34 Transposon Tf2-1 polyprotein | 8.5e-83 | 28.4
Query:  LNPAMKEVVKKEVLKWLDAGVIFPISDSKWVS--PVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFF
        L P   + +  E+ + L +G+   I +SK ++  PV  VPKK G                    R+ +DY+ LN   K + +PLP I+Q+L ++ G+  F
Subjt:  LNPAMKEVVKKEVLKWLDAGVIFPISDSKWVS--PVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFF

Query:  CFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNW
          LD  S Y+ I +   D+ K  F CP G F +  MP+G+  AP  FQ  +  I  E  E  V  +MDD  +   S    + +++ VL++ K+ NL++N 
Subjt:  CFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNW

Query:  EKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPI
         KC F  ++   +G+ IS+KG    Q  I+ V +   P N K LR FLG   + R+F+   S++  PL++LL+++  + +     +A E +K  LVS P+
Subjt:  EKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPI

Query:  LIAPDWSQPFELMCDASDYAMGAALVQRR-DRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGS--KVIIYTDHSAI--RYLMTKKDAK
        L   D+S+   L  DASD A+GA L Q+  D   +P+ Y S   + AQ NYS + KE+LA++ +++ +R YL  +     I TDH  +  R     +   
Subjt:  LIAPDWSQPFELMCDASDYAMGAALVQRR-DRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGS--KVIIYTDHSAI--RYLMTKKDAK

Query:  PRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRL----ENMEHDRKQPDVN----ASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHD
         RL RW L LQ+F+ EI  R G  N++AD LSR+    E +  D +   +N     S  D+   +V         ++N L  +    + N Q K  L+ +
Subjt:  PRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRL----ENMEHDRKQPDVN----ASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHD

Query:  AKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVEL
        +K              D I      +++ + I+ + H+     H  G      ++   + W  + +  ++Y   C  CQ   + +++   PL  I   E 
Subjt:  AKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVEL

Query:  -FDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMN
         ++   +DF+   P S+G+N + V VD  SK    + C ++  A   +    + +   FG P+ +I+D    F ++   +   KYN   + +  Y PQ +
Subjt:  -FDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMN

Query:  GQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHL-PLEL
        GQ E +N+ ++ +L  V ++    W   ++    +Y  A  +   M+P+ +V   +  L PLEL
Subjt:  GQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHL-PLEL

P0CT41 Transposon Tf2-12 polyprotein | 8.5e-83 | 28.4
Query:  LNPAMKEVVKKEVLKWLDAGVIFPISDSKWVS--PVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFF
        L P   + +  E+ + L +G+   I +SK ++  PV  VPKK G                    R+ +DY+ LN   K + +PLP I+Q+L ++ G+  F
Subjt:  LNPAMKEVVKKEVLKWLDAGVIFPISDSKWVS--PVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFF

Query:  CFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNW
          LD  S Y+ I +   D+ K  F CP G F +  MP+G+  AP  FQ  +  I  E  E  V  +MDD  +   S    + +++ VL++ K+ NL++N 
Subjt:  CFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNW

Query:  EKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPI
         KC F  ++   +G+ IS+KG    Q  I+ V +   P N K LR FLG   + R+F+   S++  PL++LL+++  + +     +A E +K  LVS P+
Subjt:  EKCHFMVTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPI

Query:  LIAPDWSQPFELMCDASDYAMGAALVQRR-DRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGS--KVIIYTDHSAI--RYLMTKKDAK
        L   D+S+   L  DASD A+GA L Q+  D   +P+ Y S   + AQ NYS + KE+LA++ +++ +R YL  +     I TDH  +  R     +   
Subjt:  LIAPDWSQPFELMCDASDYAMGAALVQRR-DRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGS--KVIIYTDHSAI--RYLMTKKDAK

Query:  PRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRL----ENMEHDRKQPDVN----ASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHD
         RL RW L LQ+F+ EI  R G  N++AD LSR+    E +  D +   +N     S  D+   +V         ++N L  +    + N Q K  L+ +
Subjt:  PRLIRWVLLLQEFDAEIIDRKGMENNVADHLSRL----ENMEHDRKQPDVN----ASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHD

Query:  AKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVEL
        +K              D I      +++ + I+ + H+     H  G      ++   + W  + +  ++Y   C  CQ   + +++   PL  I   E 
Subjt:  AKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVEL

Query:  -FDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMN
         ++   +DF+   P S+G+N + V VD  SK    + C ++  A   +    + +   FG P+ +I+D    F ++   +   KYN   + +  Y PQ +
Subjt:  -FDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMN

Query:  GQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHL-PLEL
        GQ E +N+ ++ +L  V ++    W   ++    +Y  A  +   M+P+ +V   +  L PLEL
Subjt:  GQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHL-PLEL

P10394 Retrovirus-related Pol polyprotein from transposon 412 | 8.5e-99 | 26.07
Query:  LCDLGASINLMP--LSVFNKLGIATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPI----ILGRPFLSTGRTLIDVHKGEI
        L D GA I+++      F+ + I  +   + +Q   +  +   G+   + ++  K+++P DF ++    DK+ PI    I+G  F+      ID+++ E 
Subjt:  LCDLGASINLMP--LSVFNKLGIATQPTTVTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPI----ILGRPFLSTGRTLIDVHKGEI

Query:  IMRVNDQEVRFNVFKALEYPGEQEDCQNLDTEDESHERWLEENEETIANLDAQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYA
           +    ++F ++  + Y     +   L    +   R +  +++    +  Q+ +     ++ A    T +    +I  + +    + +      LKY 
Subjt:  IMRVNDQEVRFNVFKALEYPGEQEDCQNLDTEDESHERWLEENEETIANLDAQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYA

Query:  YLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYC--------------MHKIRLEEGKDGSIEAQRRLNP-AMKEVVKKEVLKW
         L   N     +  A S  +   +L  LKK+   +    + ++ I   Y               ++K +L    D  +  +   +P +  E ++ +V K 
Subjt:  YLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYC--------------MHKIRLEEGKDGSIEAQRRLNP-AMKEVVKKEVLKW

Query:  LDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPED
        +   ++ P S S++ SP+  VPKK      PN + +         WR+ +DYR++N     D FPLP ID +LD+L   ++F  LD  SG++QI +    
Subjt:  LDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYSGYNQIMIAPED

Query:  QEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKIS
        ++ T+F+   G++ F R+PFGL  AP +FQR M   FS        ++MDD  V G S +  L NL +V  +C++ NL L+ EKC F + E   LGHK +
Subjt:  QEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHKIS

Query:  KKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASD
         KG+  D  K + ++  P P +  + R F+    +YRRF+K+F+  +R ++ L ++N P+ + +EC KAF  LK+ L++  +L  PD+S+ F +  DAS 
Subjt:  KKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDASD

Query:  YAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGM
         A GA L Q  +    P+AYAS+ F   ++N STT++EL A+ +A+  FR Y+ G    + TDH  + YL +  +   +L R  L L+E++  +   KG 
Subjt:  YAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGM

Query:  ENNVADHLSRL-----------------------------ENMEHDRK------QPDVNASFPDEAVLKVT---------------------ESAPWYA-
        +N+VAD LSR+                             E ++  ++      +P+V     ++ V KV                      +    Y  
Subjt:  ENNVADHLSRL-----------------------------ENMEHDRK------QPDVNASFPDEAVLKVT---------------------ESAPWYA-

Query:  ---DIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFR-----LCVP------EISYQCILSQCHDSP-YGGHFGGQRTAAKVLQSGY
           D+  FL   +         + K+    K +       ++   + I +     L  P      E   + ILS  HD P  GGH G  +T AKV +  Y
Subjt:  ---DIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFR-----LCVP------EISYQCILSQCHDSP-YGGHFGGQRTAAKVLQSGY

Query:  FWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVE-LFDVWGIDFMGPFPPS-NGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRF
        +W  + +  ++Y  +C +CQ+    +   K P+T     E  FD   +D +GP P S NG+ Y +  +  ++K++ AI  A   A TV+  + ++   ++
Subjt:  FWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVE-LFDVWGIDFMGPFPPS-NGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRF

Query:  GTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHL
        G  +  I+D GT + N II +L     +++  +TA+H Q  G  E S+R L   +   +++ + DW   L   ++ + T         PY LVFG+  +L
Subjt:  GTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHL

Query:  PLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQKELFPHGAVELMNEDGSNA-FKVNG
        P    +K        N+D   A E++      LE    +A +  + +KE+ K   D ++    L +G K L       L NE G    FK  G
Subjt:  PLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQKELFPHGAVELMNEDGSNA-FKVNG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 1.4e-88 | % identity: 29.86
Query:  KEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYS
        ++ + K V K LD   I P S S   SPV  VPKK G                   +R+C+DYR LN AT  D FPLP ID +L R+   + F  LD +S
Subjt:  KEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYS

Query:  GYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMV
        GY+QI + P+D+ KT F  P G + +  MPFGL NAP TF R M   F +     V +++DD  +F  S E    +L+ VL+R K+ NL++  +KC F  
Subjt:  GYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMV

Query:  TEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWS
         E   LG+ I  + +   Q K  A+   P P  +K  + FLG   +YRRF+ + SKIA+P+   +     +   ++  KA E LKAAL ++P+L+  +  
Subjt:  TEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWS

Query:  QPFELMCDASDYAMGAAL--VQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLL
          + L  DAS   +GA L  V  +++++  + Y SK+  +AQ NY   + ELL ++ A+  FR  L G    + TDH ++  L  K +   R+ RW+  L
Subjt:  QPFELMCDASDYAMGAAL--VQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLL

Query:  QEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTESAP-WYADIVNFLVCKQF---PED---FNAQQKKKLMHDA--KFYYWDEP
          +D  +    G +N VAD +SR          P+ +     E+     +S P   A +++     Q    PED   F + QKK  + +   K Y  ++ 
Subjt:  QEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTESAP-WYADIVNFLVCKQF---PED---FNAQQKKKLMHDA--KFYYWDEP

Query:  QLYRRGPDHIFRLCVPEISYQCILSQCHD-SPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQ--NKMPLTSILEVELFDVWGI
         +Y +      RL VP      ++   HD + +GGHFG   T AK+    Y+WP L      Y   C +CQ I +   +    +    I E    D+  +
Subjt:  QLYRRGPDHIFRLCVPEISYQCILSQCHD-SPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQ--NKMPLTSILEVELFDVWGI

Query:  DFMGPFPP-SNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEIS
        DF+   PP SN  N ILV VD  SK    I+  +  DA  +   L + IF+  G PR + SD            L  +  ++  +++A HPQ +GQ+E +
Subjt:  DFMGPFPP-SNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEIS

Query:  NRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY
         + L  +L   V+++ ++W   L +  + Y +     +G SP+ +  G   + P       + A     ++L    +A  +Q  E         E+A+I 
Subjt:  NRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY

Query:  KERTKRWQDQRISKKSLHIGQKEL------FPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNLKRFYQITQIKNSKRL
         E      +QR     L+IG   L      F  GA   + +     F+V    VK       E D  + K+ +++  ++  K L
Subjt:  KERTKRWQDQRISKKSLHIGQKEL------FPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNLKRFYQITQIKNSKRL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 4.4e-87 | % identity: 29.45
Query:  KEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYS
        ++ + K V K LD   I P S S   SPV  VPKK G                   +R+C+DYR LN AT  D FPLP ID +L R+   + F  LD +S
Subjt:  KEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAGNEFFCFLDGYS

Query:  GYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMV
        GY+QI + P+D+ KT F  P G + +  MPFGL NAP TF R M   F +     V +++DD  +F  S E    +L+ VL+R K+ NL++  +KC F  
Subjt:  GYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFMV

Query:  TEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWS
         E   LG+ I  + +   Q K  A+   P P  +K  + FLG   +YRRF+ + SKIA+P+   +     +   ++  KA + LK AL ++P+L+  +  
Subjt:  TEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWS

Query:  QPFELMCDASDYAMGAAL--VQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLL
          + L  DAS   +GA L  V  +++++  + Y SK+  +AQ NY   + ELL ++ A+  FR  L G    + TDH ++  L  K +   R+ RW+  L
Subjt:  QPFELMCDASDYAMGAAL--VQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLL

Query:  QEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTESAP-WYADIVNFLVCKQF---PED---FNAQQKKKLMHDA--KFYYWDEP
          +D  +    G +N VAD +SR          P+ +     E+     +S P   A +++     Q    PED   F + QKK  + +   K Y  ++ 
Subjt:  QEFDAEIIDRKGMENNVADHLSRLENMEHDRKQPDVNASFPDEAVLKVTESAP-WYADIVNFLVCKQF---PED---FNAQQKKKLMHDA--KFYYWDEP

Query:  QLYRRGPDHIFRLCVPEISYQCILSQCHD-SPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQ--NKMPLTSILEVELFDVWGI
         +Y +      RL VP      ++   HD + +GGHFG   T AK+    Y+WP L      Y   C +CQ I +   +    +    I E    D+  +
Subjt:  QLYRRGPDHIFRLCVPEISYQCILSQCHD-SPYGGHFGGQRTAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQ--NKMPLTSILEVELFDVWGI

Query:  DFMGPFPP-SNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEIS
        DF+   PP SN  N ILV VD  SK    I+  +  DA  +   L + IF+  G PR + SD            L  +  ++  +++A HPQ +GQ+E +
Subjt:  DFMGPFPP-SNGHNYILVAVDYVSKWVEAISCARN-DAATVSSFLQKNIFTRFGTPRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEIS

Query:  NRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY
         + L  +L    +++ ++W   L +  + Y +     +G SP+ +  G   + P       + A     ++L    +A  +Q  E         E+A+I 
Subjt:  NRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY

Query:  KERTKRWQDQRISKKSLHIGQKEL------FPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNLKRFYQITQIKNSKR
         E      +QR     L+IG   L      F  GA   + +     F+V    VK       E D  + K+ +++  ++  K+
Subjt:  KERTKRWQDQRISKKSLHIGQKEL------FPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNLKRFYQITQIKNSKR

Arabidopsis top hits | e value | % identity | Alignment
ATMG00750.1 GAG/POL/ENV polyprotein | e value: 1.7e-14 | % identity: 58.93
Query:  VLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFM
        VLQ+G++WPT F+DA  +   CD CQR GN + +N+MP   ILEVE+FDVWGI FM
Subjt:  VLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFM

ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 1.5e-18 | % identity: 40.77
Query:  NLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHK--ISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIF
        +L  VL+  +      N +KC F   +   LGH+  IS +G+  D AK+EA+   P P N   LR FLG  G+YRRFVK++ KI RPL+ LL++N    +
Subjt:  NLEKVLKRCKDTNLVLNWEKCHFMVTEGIVLGHK--ISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIF

Query:  NEECLKAFETLKAALVSAPILIAPDWSQPF
         E    AF+ LK A+ + P+L  PD   PF
Subjt:  NEECLKAFETLKAALVSAPILIAPDWSQPF


Sequences
CDS sequence
ATGAACAACGAACAGCCTGTGTTCATTTCTGACCCTGAGATTGAGAGAACCTTTCGGAAAAGAAATAAGACATTTCAAAGGTGGCGACAACTCTACAGAGAGAAAAAGAT
GCAAAATCAAGCAAACACTGCCAACCAGCCTTTGAGAGCTAACAATGAGGCTAACGACAGACGAGAAGCAGTAAACTATGCGTTAGTCCAAGCACAACTTCAAGCTCAGC
AAATTGATCAACAAACTCATCCCATTTTATTAGCACATGATCGGAATCGTCCAATCAGGGATTATGCGTCGCCCATCTTATATAATTTTTCACCTGGAATTATGCGACCC
GAGTCCCAAGGATCAAGATTTGAAATGAAGCCAGTCATGCTTCAAATGCTGCAAACAGCAGGTCAGTTTGAGGGATCACCTGGTGAGGACCCTCATGCTCACCTGAAGAG
CTTCATTGAAATCTACAACACTTTTGTCATCCCAAATATAAGTGCAGATGACATCCGATTAACGCTGTTCCCATTCTCCCTTATAGATGAGGCAAGACAGTGGGCATATT
CTCTAGAACCAGGGGAGATCACCACCTGGAACCAAATGATAGAAAAATTCATGAAAAAGTTCTTCCCACCAACGGAGAATGCTCGAAGAAGAAGAGACATCGCCAATTTC
CAACAAAAAGACAGAGAAACCCTGAGCGATGCTTGGGCAAAATTCAAGAGATTGGTGAGAAATTGCCCGCATAATGGTTTTCCAGACTGTGTTCAAATGGAGATCTTCTA
TGATGGATTGACCGAGGCCTCTCAGACGGCTACAAATGCTGCCGCAGCGGGAGGACTGTTGGATAAAACTTATACTGAGGCCAAGGAAATTCTCGATAGAATATCAAGAA
ATCACGAGGATTGGGAGTACCATGGATATAGCCGATCTGGCCGCAAGCAAAACCATGCGTCAGGAGCACCTGAGCATAATAGTGTTGCTGCATTGCAAGCTCAAACTATG
GCCCTGAATCAGTCGAATACAGGAGGCAGCGGCCAAGTGAATGCAGTAAATCAAATAAATGCTGCGGGGTGCGTGGGATGTGGAGAGCCACATGCGTATGAAGTATGCCC
ACAGAACCCTCAGTCTGTGTGTTTTATACGTAACAATCCGTATTCCAATACCTACAACCCTGGCTGGAGGAATCATCCGAATTTCTCATGGGGTGGTAATCGCCAGCCTG
AGCAGCAAGGTGCGCCTATACACGAAAGAGGAGGGTCATCTGGATTCTCCCATGGACATCAAAGGCAAAATTCATATCAGTCAGCGCCTGGTCCATCATCATCCATGGAA
GCTCTTCTTAAGGAATACATGCAGAAGAATGACGCCTTCAATACCGAAGCCCCACGCGGTACGGGCAGTTCAGGGAAAGAACAGTGTCAAGCTGTGACACTGCGAAGTGG
AAGAACAATGAATGAACTTCCCCACACAGGGAACTCTCAACCAACAATTCATCATTCCAACCCCACAACTAAGTTGGACCAATCAACACAACAAAAACAAGCTGACATGC
TAAGAAGTTCATTAGGAAAAGACGCATCAACAAGCATGACAGCCGAGCTGCAAACGCCTCCTTATCCTCAGCGGTTAAGAAGGAAGAAAAACAATGAGAGGCAGTTCAAA
CGCTTCCTTGATGTACTCAAACAATTGCACATCAACATCCCGCTTGTGGAAGCATTGGAACAAATGCCTACCTATGCTAAATTCTTAAAGGACATACTGTCTAAAAAGAG
AGGAATGGATGAACATGAGACCATAGCTCTAACGCAGGAGAGCAGTGACATGGTCTGTGAACGCATCCCCACAAAGATGAAAGACCCAAGGAGCTTCACCATTCCTTGTT
CTATTGGAGGAATTCACATAGGAAAAGCGTTGTGTGATCTAGGGGCTAGCATTAACCTTATGCCCCTGTCAGTATTCAACAAGTTGGGGATCGCAACGCAGCCAACAACA
GTTACTCTACAATTGGCGGATCGATCCGTGGTATATCCTGAAGGAAGGATTGAGGATGTGTTGGTAAAGGTCGACAAGTTTATTCTCCCTGCAGACTTCATTATCCTGGA
TTATGAAGCAGACAAGGACGTCCCAATCATACTTGGACGCCCGTTTCTGTCAACGGGTCGTACGCTCATTGACGTCCACAAAGGAGAAATCATCATGAGAGTTAATGATC
AGGAAGTGAGATTCAACGTCTTTAAGGCATTGGAATATCCAGGGGAACAGGAAGATTGCCAGAACCTCGACACAGAGGATGAGTCCCATGAAAGATGGCTTGAAGAAAAT
GAGGAAACTATCGCAAATCTGGACGCGCAGCAGCCTGAACAATGCTGTGCTTTGGTCCATTCGGCTTTTGAAACCTTGACTCCGAATCGAATCAACCAACAAATAAAACC
ATCTCTGGAAGAAGCACCAGAAATCGAGTTAAAAGTATTGCCAGTTCACCTTAAGTATGCCTACTTAGGAGAAGGTAACTCTCTTCCAGTTTTTATTTCAGCTGCTTTAT
CCCCTAGTCAAGAATCAGCATTGTTATGCATTCTGAAAAAGCACATCCGAGCTGTCGGATGGACGTTAGCTGATATCAAGGGGATCAGCCCCACATACTGTATGCATAAA
ATTCGTCTAGAAGAAGGAAAAGACGGATCTATCGAGGCCCAAAGAAGGCTCAATCCAGCAATGAAAGAAGTTGTCAAGAAAGAAGTGCTTAAATGGTTGGACGCAGGCGT
CATCTTCCCCATCTCTGATAGTAAGTGGGTAAGTCCTGTCCAATGCGTTCCCAAAAAGGGAGGTATGACTGTAGTCCCAAATAAGAATAATGAATTGATCTCCACCCGCA
CTGTCACCGGGTGGAGAATTTGCATGGACTACAGAAAGCTCAACGCGGCAACGAAGAAGGATCATTTCCCGTTACCCTTCATTGACCAAATGTTAGATAGACTTGCGGGA
AATGAATTTTTCTGCTTTTTGGATGGCTATTCAGGCTACAACCAGATCATGATTGCTCCAGAAGACCAGGAGAAGACCACATTTACATGTCCTTATGGTACGTTTGCATT
TCGACGCATGCCCTTTGGATTATGTAACGCCCCAGGCACCTTTCAGAGGTGTATGATGGCAATCTTCTCTGAATTTTTAGAAGAATCTGTGGAGATATTCATGGATGATT
TTTCGGTCTTCGGAAATTCGTTTGAAGCATGTTTAGCCAATTTGGAGAAAGTACTGAAAAGATGCAAGGACACAAACCTAGTGCTTAACTGGGAAAAATGCCACTTCATG
GTAACTGAAGGAATTGTCCTCGGTCACAAGATATCCAAGAAGGGCCTGGAAGTTGATCAGGCCAAAATTGAAGCCGTAGAAAAGCTTCCACCGCCTACAAACATAAAGAC
GCTTCGTAGCTTCTTGGGGCATGCAGGCTTTTATAGGAGGTTCGTCAAGGACTTTTCAAAAATTGCACGCCCTTTAAGCTCATTGCTGGAGCAGAATCGACCCTACATTT
TTAATGAAGAATGTCTCAAGGCATTTGAGACGCTGAAAGCAGCACTGGTGTCAGCACCAATTTTAATTGCCCCAGATTGGTCGCAGCCATTTGAACTAATGTGCGATGCC
AGCGACTATGCGATGGGCGCCGCGTTAGTTCAACGCAGAGACAGAATGTTGCACCCCATAGCATATGCCAGCAAAACTTTCAACGCAGCCCAGACGAATTATAGCACTAC
AAAGAAAGAGCTGTTAGCAGTAGTGTTCGCTGTTGAGAAATTCAGGTCATATCTTCTAGGGTCGAAAGTCATCATCTATACTGACCACTCAGCTATTCGATACTTGATGA
CTAAAAAAGATGCCAAGCCGAGGCTCATCCGATGGGTTCTGCTTCTTCAAGAGTTTGACGCTGAAATAATTGATCGCAAAGGTATGGAGAACAACGTAGCAGATCATCTC
TCCAGACTCGAGAATATGGAACACGATCGTAAACAGCCGGATGTCAACGCAAGCTTTCCAGATGAAGCTGTATTGAAAGTCACCGAGTCCGCGCCATGGTATGCAGACAT
TGTCAACTTCTTGGTGTGCAAGCAATTTCCCGAAGACTTCAACGCACAGCAAAAGAAGAAGTTGATGCACGATGCGAAGTTCTATTATTGGGACGAACCCCAACTTTACA
GAAGGGGGCCTGATCACATCTTCAGACTCTGCGTCCCAGAGATTTCATATCAATGCATCCTATCTCAATGTCATGACTCCCCCTATGGAGGACATTTTGGAGGACAGCGA
ACTGCAGCCAAGGTCCTACAAAGTGGATACTTCTGGCCAACCCTTTTCAGAGATGCCAGGGACTATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCGAA
TCAGAACAAAATGCCACTTACCTCTATCTTAGAAGTTGAACTCTTTGACGTTTGGGGTATTGATTTTATGGGCCCATTTCCACCATCAAATGGCCACAACTATATATTGG
TAGCTGTGGATTATGTATCAAAATGGGTCGAAGCAATCTCATGCGCTAGGAATGATGCGGCGACAGTCTCAAGTTTTCTTCAGAAGAATATATTCACGAGATTTGGGACG
CCGAGAGCCCTTATTAGCGATGAAGGTACGCATTTCATTAATAAAATCATTCCCAACTTGCTAATAAAATACAATGTGCAGCACAGGGTGGCAACTGCGTACCACCCACA
GATGAATGGACAAGCAGAAATTTCCAATAGAGAACTGAAAACTATTTTAGAAAAGGTAGTTAACTCTTCTCGAAAAGACTGGGCATCCAAATTAAACGAGGCGTTATGGG
CATATCGTACAGCATTCAAAACGCCTATCGGAATGTCACCGTACACGCTAGTATTTGGAAAGGCTTGCCATCTGCCGTTGGAGCTGGAACATAAAGCATTGTGGGCTGCC
AAGAGATTAAACATGGACCTAAAAGCAGCTGGAGAAGCGCGTCAGCTTCAACTGAATGAACTGGAGGAATGGAGACTACAGGCCTATGAGAACGCAAAAATATACAAGGA
ACGAACAAAGCGTTGGCAAGATCAACGCATCAGTAAGAAATCTTTGCATATAGGTCAAAAGGAACTCTTTCCTCATGGTGCCGTAGAGCTGATGAATGAAGACGGCAGCA
ACGCATTCAAAGTTAATGGTCAACGCGTGAAACCATATTATGGAGTTGGCTTGGAACGCGACAAAGGGAATCTGAAGCGATTCTATCAAATCACTCAGATCAAGAATTCG
AAAAGGCTGGCGGCGTTTAAGTCGGCTGATATCGACGGATCCGATAACTCACGCGCTCCGCCTTCTAAACCCTATATAAACCCCTCTTCACCCTTTTTTTTAGGTTACGC
TTCCATCTCTCAGTTTTCGAATTACCGGCGCGAATTCCGGCGATTTCCGTTTCAGATTCACCCTTTGTTTTTTCTTCTTTTCACCGATACCATCCCCATGGCATCCCAAC
AATCTGCTTCATCCAAAAACCCTAAACCCACGACCTCATCCTCGAGGCAGGGTAGTCGAAATCCGTCCACTACCTCCACCACACCCGAACCCCTCACCATTGTTCGCCCC
AACAGTCCTGAGTTGGAGGCGACATCATCCCCATCTCCACCACGCTCCAGGGACCTGCCTGAAGCCCAAATGGTGGGAAGGAACGACGAATCCACCGTCTTCGGAGACAT
TCTCGACTCCATCGATGAAGAAGACGATGCCTGGTTGGACGGCGGTCTTGAGGAATCAGACTCCGATGATAGACCCATCAGAAAAGTAATAGCGGCGGTAGAAAAGAAGA
AATCCGCCAAAGAGAAGCAAATCAGGCCAATTAGGGCCAGGGAAGAAGCCGAGGAAGATGTCCCGCCACTGAGAAGAAAAAGCAAGAAAGTAAAGGATGCGGGGACATCC
GAGGCTGTCGCATCTTCAGTCGCCGCATCATCTGTCGAGCGGCTAGCAGCAAAAGCGGCAGAGAAGTCCAAAAAGCTGAAAGATGACATCCAAAAGATGAAAGAGAGGAC
AGAGTCGTTTCTCGCCGCAAAGAAGAAAAGAAAAGAGGAAATTGCGGCTAGTGAGCAAGCTGTTGCGAAGGCCGCTAGAAGAGTCGAGTTTCTCCGTCGCATTGCAGAGA
TTGCGACGGAACTAGAAGTAGAAGTGGAAGTGAGTGATGCGGAGAGGCCACCCAAAGCTGAACGCATCAAGAAAAACATAAAGAAAATAAAGGAGGAGAAGAAGAAAGAA
AAAGAGGTCACGATAACCAAAGAGAAGCGACCGGAGAAAAAGAAGACTGAGGTGCCTAGGGCACTTGGAGTTGTCATACGGGATTCGGATGCAGGCAGAGCAAGGAGAAC
GCCTGCTAGTCCCGTCAGAAGAAATGAGAAAGGAAAAGAGAAAATGGTTGAGGAACCAACGAAGGAGGCGGCGAAGAGCCGAAAGAAGCAGTTCAGTGGGCTCTACACCG
AAGTGGGATTTTTTCCAGAACCGATAGAGCTGCCTGCCTTCATCATACAAGGAGTCGACGCATTGGGTTGGAGGCAATTGTGTGAAAGCGGCCAAGTCATCCAACCCACC
GCCGTGGAGGCATTTTATGAAGGAACAATTCACCGAAGAGCACACTTGGTCAAAATAGAAAATGAGGTGATTTTCTTCGAGCCTCAAGAGATTAACGCGTTGTTTGATCT
GCCCAACATCGCGGCTGCGGAAGGAAACAGAATAATGTCGACGCCTACAGACGCCGAAATGAATGACGCCCTCTCAATCATCGCCAAACCGGGGTCAGAGTGGAACACTT
CCCCAAAGGGTACAGCCAACCTGTGGCTCTATTTCATCAAGCGGTCGCTGCTCCCCACAACGCATGACGCCTCAATTTCAAGGGACCGTGCTATGGTCATTTACTGCATC
ATGCGGGGAATTCAGCTGGACGTGGGACACATCATTGCCCCACAGATTCGGGGGCTGTTCTTCAAGCCAAGGGGTCAGCTATTCTTCCCCTTTCTTGTGACAAGACTGTG
CGCCAACGCACAATTCATGGAGGATGCGCCAGTCAGAGCAGTTAACGAAGTCCTTTCAGCGCAGAGTTTGAGACGGATATTAAAGGACTCACCGCATCTGCTTGACGCAG
CCAACCCAAAGAAGAGGCCGCCAATGTCACAAGAACCATCATCCCCCCAACCTCAACCCAAGAAGAGGAAGACGGTCAAAAAGAATTCTGAGATTCAGGCGTCGAGCTCA
CAACCTAACTTAAACGAAGAAACGAAGGAAGTGTCGCCGCCTTCTTCCCCCCATCTGGAGTTAACCCTTTCCCCGCCTCAATCATCCACCTTTCGAGAGGCAACTCCACC
ACCACCACCACATTTTGAGCCGACCCTGACTCTGGATCCTGAGGAGTCAGTGCCCTTTCAATTCTCACAGCCTCGCCATGAACCTCAAGCTTCCAACCTTCCAACCCCAA
CTACCGAACCTTCTGCCCGCCCAACATCTCAACCACAGGCAGAAGCAACTACCAGCACGCATCATCAAGAAGAACCATACCACTTGCGCCACGCCTCTGCTCCGCTGCCG
TTTGTTGATTTAAACTTGGATGATCTGCTGAGGTACTTGGATGATGGGATCCTTCACCCCATCATGGGAGACCTGGATGAACTTCGGCGCAAGGAGATGGAAAGTCTCCA
GCGGCAAGAAGAGTTGGCTCAACAAGTTTCACAATTGGGTCAACAAGTGACTCAGATGGCGCAGCAACAATCGGAGCTTCGGAGTTTTGTTCAACGCCAAGCTCGGCGTC
AGGATGAACAATTCAGAACGTTGATGAATTATATCTATGAAGTGTTCGTCCAACGCATCCCAGCGCCAGTTATTCCTCCATCTCTTCAGCAACCCCTCCCGTTTGACGAT
CCAAATCCTCCCCCTGCGGACAACAATGACAATGCCCCGGGGAGAACTTAA
mRNA sequence
ATGAACAACGAACAGCCTGTGTTCATTTCTGACCCTGAGATTGAGAGAACCTTTCGGAAAAGAAATAAGACATTTCAAAGGTGGCGACAACTCTACAGAGAGAAAAAGAT
GCAAAATCAAGCAAACACTGCCAACCAGCCTTTGAGAGCTAACAATGAGGCTAACGACAGACGAGAAGCAGTAAACTATGCGTTAGTCCAAGCACAACTTCAAGCTCAGC
AAATTGATCAACAAACTCATCCCATTTTATTAGCACATGATCGGAATCGTCCAATCAGGGATTATGCGTCGCCCATCTTATATAATTTTTCACCTGGAATTATGCGACCC
GAGTCCCAAGGATCAAGATTTGAAATGAAGCCAGTCATGCTTCAAATGCTGCAAACAGCAGGTCAGTTTGAGGGATCACCTGGTGAGGACCCTCATGCTCACCTGAAGAG
CTTCATTGAAATCTACAACACTTTTGTCATCCCAAATATAAGTGCAGATGACATCCGATTAACGCTGTTCCCATTCTCCCTTATAGATGAGGCAAGACAGTGGGCATATT
CTCTAGAACCAGGGGAGATCACCACCTGGAACCAAATGATAGAAAAATTCATGAAAAAGTTCTTCCCACCAACGGAGAATGCTCGAAGAAGAAGAGACATCGCCAATTTC
CAACAAAAAGACAGAGAAACCCTGAGCGATGCTTGGGCAAAATTCAAGAGATTGGTGAGAAATTGCCCGCATAATGGTTTTCCAGACTGTGTTCAAATGGAGATCTTCTA
TGATGGATTGACCGAGGCCTCTCAGACGGCTACAAATGCTGCCGCAGCGGGAGGACTGTTGGATAAAACTTATACTGAGGCCAAGGAAATTCTCGATAGAATATCAAGAA
ATCACGAGGATTGGGAGTACCATGGATATAGCCGATCTGGCCGCAAGCAAAACCATGCGTCAGGAGCACCTGAGCATAATAGTGTTGCTGCATTGCAAGCTCAAACTATG
GCCCTGAATCAGTCGAATACAGGAGGCAGCGGCCAAGTGAATGCAGTAAATCAAATAAATGCTGCGGGGTGCGTGGGATGTGGAGAGCCACATGCGTATGAAGTATGCCC
ACAGAACCCTCAGTCTGTGTGTTTTATACGTAACAATCCGTATTCCAATACCTACAACCCTGGCTGGAGGAATCATCCGAATTTCTCATGGGGTGGTAATCGCCAGCCTG
AGCAGCAAGGTGCGCCTATACACGAAAGAGGAGGGTCATCTGGATTCTCCCATGGACATCAAAGGCAAAATTCATATCAGTCAGCGCCTGGTCCATCATCATCCATGGAA
GCTCTTCTTAAGGAATACATGCAGAAGAATGACGCCTTCAATACCGAAGCCCCACGCGGTACGGGCAGTTCAGGGAAAGAACAGTGTCAAGCTGTGACACTGCGAAGTGG
AAGAACAATGAATGAACTTCCCCACACAGGGAACTCTCAACCAACAATTCATCATTCCAACCCCACAACTAAGTTGGACCAATCAACACAACAAAAACAAGCTGACATGC
TAAGAAGTTCATTAGGAAAAGACGCATCAACAAGCATGACAGCCGAGCTGCAAACGCCTCCTTATCCTCAGCGGTTAAGAAGGAAGAAAAACAATGAGAGGCAGTTCAAA
CGCTTCCTTGATGTACTCAAACAATTGCACATCAACATCCCGCTTGTGGAAGCATTGGAACAAATGCCTACCTATGCTAAATTCTTAAAGGACATACTGTCTAAAAAGAG
AGGAATGGATGAACATGAGACCATAGCTCTAACGCAGGAGAGCAGTGACATGGTCTGTGAACGCATCCCCACAAAGATGAAAGACCCAAGGAGCTTCACCATTCCTTGTT
CTATTGGAGGAATTCACATAGGAAAAGCGTTGTGTGATCTAGGGGCTAGCATTAACCTTATGCCCCTGTCAGTATTCAACAAGTTGGGGATCGCAACGCAGCCAACAACA
GTTACTCTACAATTGGCGGATCGATCCGTGGTATATCCTGAAGGAAGGATTGAGGATGTGTTGGTAAAGGTCGACAAGTTTATTCTCCCTGCAGACTTCATTATCCTGGA
TTATGAAGCAGACAAGGACGTCCCAATCATACTTGGACGCCCGTTTCTGTCAACGGGTCGTACGCTCATTGACGTCCACAAAGGAGAAATCATCATGAGAGTTAATGATC
AGGAAGTGAGATTCAACGTCTTTAAGGCATTGGAATATCCAGGGGAACAGGAAGATTGCCAGAACCTCGACACAGAGGATGAGTCCCATGAAAGATGGCTTGAAGAAAAT
GAGGAAACTATCGCAAATCTGGACGCGCAGCAGCCTGAACAATGCTGTGCTTTGGTCCATTCGGCTTTTGAAACCTTGACTCCGAATCGAATCAACCAACAAATAAAACC
ATCTCTGGAAGAAGCACCAGAAATCGAGTTAAAAGTATTGCCAGTTCACCTTAAGTATGCCTACTTAGGAGAAGGTAACTCTCTTCCAGTTTTTATTTCAGCTGCTTTAT
CCCCTAGTCAAGAATCAGCATTGTTATGCATTCTGAAAAAGCACATCCGAGCTGTCGGATGGACGTTAGCTGATATCAAGGGGATCAGCCCCACATACTGTATGCATAAA
ATTCGTCTAGAAGAAGGAAAAGACGGATCTATCGAGGCCCAAAGAAGGCTCAATCCAGCAATGAAAGAAGTTGTCAAGAAAGAAGTGCTTAAATGGTTGGACGCAGGCGT
CATCTTCCCCATCTCTGATAGTAAGTGGGTAAGTCCTGTCCAATGCGTTCCCAAAAAGGGAGGTATGACTGTAGTCCCAAATAAGAATAATGAATTGATCTCCACCCGCA
CTGTCACCGGGTGGAGAATTTGCATGGACTACAGAAAGCTCAACGCGGCAACGAAGAAGGATCATTTCCCGTTACCCTTCATTGACCAAATGTTAGATAGACTTGCGGGA
AATGAATTTTTCTGCTTTTTGGATGGCTATTCAGGCTACAACCAGATCATGATTGCTCCAGAAGACCAGGAGAAGACCACATTTACATGTCCTTATGGTACGTTTGCATT
TCGACGCATGCCCTTTGGATTATGTAACGCCCCAGGCACCTTTCAGAGGTGTATGATGGCAATCTTCTCTGAATTTTTAGAAGAATCTGTGGAGATATTCATGGATGATT
TTTCGGTCTTCGGAAATTCGTTTGAAGCATGTTTAGCCAATTTGGAGAAAGTACTGAAAAGATGCAAGGACACAAACCTAGTGCTTAACTGGGAAAAATGCCACTTCATG
GTAACTGAAGGAATTGTCCTCGGTCACAAGATATCCAAGAAGGGCCTGGAAGTTGATCAGGCCAAAATTGAAGCCGTAGAAAAGCTTCCACCGCCTACAAACATAAAGAC
GCTTCGTAGCTTCTTGGGGCATGCAGGCTTTTATAGGAGGTTCGTCAAGGACTTTTCAAAAATTGCACGCCCTTTAAGCTCATTGCTGGAGCAGAATCGACCCTACATTT
TTAATGAAGAATGTCTCAAGGCATTTGAGACGCTGAAAGCAGCACTGGTGTCAGCACCAATTTTAATTGCCCCAGATTGGTCGCAGCCATTTGAACTAATGTGCGATGCC
AGCGACTATGCGATGGGCGCCGCGTTAGTTCAACGCAGAGACAGAATGTTGCACCCCATAGCATATGCCAGCAAAACTTTCAACGCAGCCCAGACGAATTATAGCACTAC
AAAGAAAGAGCTGTTAGCAGTAGTGTTCGCTGTTGAGAAATTCAGGTCATATCTTCTAGGGTCGAAAGTCATCATCTATACTGACCACTCAGCTATTCGATACTTGATGA
CTAAAAAAGATGCCAAGCCGAGGCTCATCCGATGGGTTCTGCTTCTTCAAGAGTTTGACGCTGAAATAATTGATCGCAAAGGTATGGAGAACAACGTAGCAGATCATCTC
TCCAGACTCGAGAATATGGAACACGATCGTAAACAGCCGGATGTCAACGCAAGCTTTCCAGATGAAGCTGTATTGAAAGTCACCGAGTCCGCGCCATGGTATGCAGACAT
TGTCAACTTCTTGGTGTGCAAGCAATTTCCCGAAGACTTCAACGCACAGCAAAAGAAGAAGTTGATGCACGATGCGAAGTTCTATTATTGGGACGAACCCCAACTTTACA
GAAGGGGGCCTGATCACATCTTCAGACTCTGCGTCCCAGAGATTTCATATCAATGCATCCTATCTCAATGTCATGACTCCCCCTATGGAGGACATTTTGGAGGACAGCGA
ACTGCAGCCAAGGTCCTACAAAGTGGATACTTCTGGCCAACCCTTTTCAGAGATGCCAGGGACTATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCGAA
TCAGAACAAAATGCCACTTACCTCTATCTTAGAAGTTGAACTCTTTGACGTTTGGGGTATTGATTTTATGGGCCCATTTCCACCATCAAATGGCCACAACTATATATTGG
TAGCTGTGGATTATGTATCAAAATGGGTCGAAGCAATCTCATGCGCTAGGAATGATGCGGCGACAGTCTCAAGTTTTCTTCAGAAGAATATATTCACGAGATTTGGGACG
CCGAGAGCCCTTATTAGCGATGAAGGTACGCATTTCATTAATAAAATCATTCCCAACTTGCTAATAAAATACAATGTGCAGCACAGGGTGGCAACTGCGTACCACCCACA
GATGAATGGACAAGCAGAAATTTCCAATAGAGAACTGAAAACTATTTTAGAAAAGGTAGTTAACTCTTCTCGAAAAGACTGGGCATCCAAATTAAACGAGGCGTTATGGG
CATATCGTACAGCATTCAAAACGCCTATCGGAATGTCACCGTACACGCTAGTATTTGGAAAGGCTTGCCATCTGCCGTTGGAGCTGGAACATAAAGCATTGTGGGCTGCC
AAGAGATTAAACATGGACCTAAAAGCAGCTGGAGAAGCGCGTCAGCTTCAACTGAATGAACTGGAGGAATGGAGACTACAGGCCTATGAGAACGCAAAAATATACAAGGA
ACGAACAAAGCGTTGGCAAGATCAACGCATCAGTAAGAAATCTTTGCATATAGGTCAAAAGGAACTCTTTCCTCATGGTGCCGTAGAGCTGATGAATGAAGACGGCAGCA
ACGCATTCAAAGTTAATGGTCAACGCGTGAAACCATATTATGGAGTTGGCTTGGAACGCGACAAAGGGAATCTGAAGCGATTCTATCAAATCACTCAGATCAAGAATTCG
AAAAGGCTGGCGGCGTTTAAGTCGGCTGATATCGACGGATCCGATAACTCACGCGCTCCGCCTTCTAAACCCTATATAAACCCCTCTTCACCCTTTTTTTTAGGTTACGC
TTCCATCTCTCAGTTTTCGAATTACCGGCGCGAATTCCGGCGATTTCCGTTTCAGATTCACCCTTTGTTTTTTCTTCTTTTCACCGATACCATCCCCATGGCATCCCAAC
AATCTGCTTCATCCAAAAACCCTAAACCCACGACCTCATCCTCGAGGCAGGGTAGTCGAAATCCGTCCACTACCTCCACCACACCCGAACCCCTCACCATTGTTCGCCCC
AACAGTCCTGAGTTGGAGGCGACATCATCCCCATCTCCACCACGCTCCAGGGACCTGCCTGAAGCCCAAATGGTGGGAAGGAACGACGAATCCACCGTCTTCGGAGACAT
TCTCGACTCCATCGATGAAGAAGACGATGCCTGGTTGGACGGCGGTCTTGAGGAATCAGACTCCGATGATAGACCCATCAGAAAAGTAATAGCGGCGGTAGAAAAGAAGA
AATCCGCCAAAGAGAAGCAAATCAGGCCAATTAGGGCCAGGGAAGAAGCCGAGGAAGATGTCCCGCCACTGAGAAGAAAAAGCAAGAAAGTAAAGGATGCGGGGACATCC
GAGGCTGTCGCATCTTCAGTCGCCGCATCATCTGTCGAGCGGCTAGCAGCAAAAGCGGCAGAGAAGTCCAAAAAGCTGAAAGATGACATCCAAAAGATGAAAGAGAGGAC
AGAGTCGTTTCTCGCCGCAAAGAAGAAAAGAAAAGAGGAAATTGCGGCTAGTGAGCAAGCTGTTGCGAAGGCCGCTAGAAGAGTCGAGTTTCTCCGTCGCATTGCAGAGA
TTGCGACGGAACTAGAAGTAGAAGTGGAAGTGAGTGATGCGGAGAGGCCACCCAAAGCTGAACGCATCAAGAAAAACATAAAGAAAATAAAGGAGGAGAAGAAGAAAGAA
AAAGAGGTCACGATAACCAAAGAGAAGCGACCGGAGAAAAAGAAGACTGAGGTGCCTAGGGCACTTGGAGTTGTCATACGGGATTCGGATGCAGGCAGAGCAAGGAGAAC
GCCTGCTAGTCCCGTCAGAAGAAATGAGAAAGGAAAAGAGAAAATGGTTGAGGAACCAACGAAGGAGGCGGCGAAGAGCCGAAAGAAGCAGTTCAGTGGGCTCTACACCG
AAGTGGGATTTTTTCCAGAACCGATAGAGCTGCCTGCCTTCATCATACAAGGAGTCGACGCATTGGGTTGGAGGCAATTGTGTGAAAGCGGCCAAGTCATCCAACCCACC
GCCGTGGAGGCATTTTATGAAGGAACAATTCACCGAAGAGCACACTTGGTCAAAATAGAAAATGAGGTGATTTTCTTCGAGCCTCAAGAGATTAACGCGTTGTTTGATCT
GCCCAACATCGCGGCTGCGGAAGGAAACAGAATAATGTCGACGCCTACAGACGCCGAAATGAATGACGCCCTCTCAATCATCGCCAAACCGGGGTCAGAGTGGAACACTT
CCCCAAAGGGTACAGCCAACCTGTGGCTCTATTTCATCAAGCGGTCGCTGCTCCCCACAACGCATGACGCCTCAATTTCAAGGGACCGTGCTATGGTCATTTACTGCATC
ATGCGGGGAATTCAGCTGGACGTGGGACACATCATTGCCCCACAGATTCGGGGGCTGTTCTTCAAGCCAAGGGGTCAGCTATTCTTCCCCTTTCTTGTGACAAGACTGTG
CGCCAACGCACAATTCATGGAGGATGCGCCAGTCAGAGCAGTTAACGAAGTCCTTTCAGCGCAGAGTTTGAGACGGATATTAAAGGACTCACCGCATCTGCTTGACGCAG
CCAACCCAAAGAAGAGGCCGCCAATGTCACAAGAACCATCATCCCCCCAACCTCAACCCAAGAAGAGGAAGACGGTCAAAAAGAATTCTGAGATTCAGGCGTCGAGCTCA
CAACCTAACTTAAACGAAGAAACGAAGGAAGTGTCGCCGCCTTCTTCCCCCCATCTGGAGTTAACCCTTTCCCCGCCTCAATCATCCACCTTTCGAGAGGCAACTCCACC
ACCACCACCACATTTTGAGCCGACCCTGACTCTGGATCCTGAGGAGTCAGTGCCCTTTCAATTCTCACAGCCTCGCCATGAACCTCAAGCTTCCAACCTTCCAACCCCAA
CTACCGAACCTTCTGCCCGCCCAACATCTCAACCACAGGCAGAAGCAACTACCAGCACGCATCATCAAGAAGAACCATACCACTTGCGCCACGCCTCTGCTCCGCTGCCG
TTTGTTGATTTAAACTTGGATGATCTGCTGAGGTACTTGGATGATGGGATCCTTCACCCCATCATGGGAGACCTGGATGAACTTCGGCGCAAGGAGATGGAAAGTCTCCA
GCGGCAAGAAGAGTTGGCTCAACAAGTTTCACAATTGGGTCAACAAGTGACTCAGATGGCGCAGCAACAATCGGAGCTTCGGAGTTTTGTTCAACGCCAAGCTCGGCGTC
AGGATGAACAATTCAGAACGTTGATGAATTATATCTATGAAGTGTTCGTCCAACGCATCCCAGCGCCAGTTATTCCTCCATCTCTTCAGCAACCCCTCCCGTTTGACGAT
CCAAATCCTCCCCCTGCGGACAACAATGACAATGCCCCGGGGAGAACTTAA
Protein sequence
MNNEQPVFISDPEIERTFRKRNKTFQRWRQLYREKKMQNQANTANQPLRANNEANDRREAVNYALVQAQLQAQQIDQQTHPILLAHDRNRPIRDYASPILYNFSPGIMRP
ESQGSRFEMKPVMLQMLQTAGQFEGSPGEDPHAHLKSFIEIYNTFVIPNISADDIRLTLFPFSLIDEARQWAYSLEPGEITTWNQMIEKFMKKFFPPTENARRRRDIANF
QQKDRETLSDAWAKFKRLVRNCPHNGFPDCVQMEIFYDGLTEASQTATNAAAAGGLLDKTYTEAKEILDRISRNHEDWEYHGYSRSGRKQNHASGAPEHNSVAALQAQTM
ALNQSNTGGSGQVNAVNQINAAGCVGCGEPHAYEVCPQNPQSVCFIRNNPYSNTYNPGWRNHPNFSWGGNRQPEQQGAPIHERGGSSGFSHGHQRQNSYQSAPGPSSSME
ALLKEYMQKNDAFNTEAPRGTGSSGKEQCQAVTLRSGRTMNELPHTGNSQPTIHHSNPTTKLDQSTQQKQADMLRSSLGKDASTSMTAELQTPPYPQRLRRKKNNERQFK
RFLDVLKQLHINIPLVEALEQMPTYAKFLKDILSKKRGMDEHETIALTQESSDMVCERIPTKMKDPRSFTIPCSIGGIHIGKALCDLGASINLMPLSVFNKLGIATQPTT
VTLQLADRSVVYPEGRIEDVLVKVDKFILPADFIILDYEADKDVPIILGRPFLSTGRTLIDVHKGEIIMRVNDQEVRFNVFKALEYPGEQEDCQNLDTEDESHERWLEEN
EETIANLDAQQPEQCCALVHSAFETLTPNRINQQIKPSLEEAPEIELKVLPVHLKYAYLGEGNSLPVFISAALSPSQESALLCILKKHIRAVGWTLADIKGISPTYCMHK
IRLEEGKDGSIEAQRRLNPAMKEVVKKEVLKWLDAGVIFPISDSKWVSPVQCVPKKGGMTVVPNKNNELISTRTVTGWRICMDYRKLNAATKKDHFPLPFIDQMLDRLAG
NEFFCFLDGYSGYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSEFLEESVEIFMDDFSVFGNSFEACLANLEKVLKRCKDTNLVLNWEKCHFM
VTEGIVLGHKISKKGLEVDQAKIEAVEKLPPPTNIKTLRSFLGHAGFYRRFVKDFSKIARPLSSLLEQNRPYIFNEECLKAFETLKAALVSAPILIAPDWSQPFELMCDA
SDYAMGAALVQRRDRMLHPIAYASKTFNAAQTNYSTTKKELLAVVFAVEKFRSYLLGSKVIIYTDHSAIRYLMTKKDAKPRLIRWVLLLQEFDAEIIDRKGMENNVADHL
SRLENMEHDRKQPDVNASFPDEAVLKVTESAPWYADIVNFLVCKQFPEDFNAQQKKKLMHDAKFYYWDEPQLYRRGPDHIFRLCVPEISYQCILSQCHDSPYGGHFGGQR
TAAKVLQSGYFWPTLFRDARDYAIRCDRCQRIGNISNQNKMPLTSILEVELFDVWGIDFMGPFPPSNGHNYILVAVDYVSKWVEAISCARNDAATVSSFLQKNIFTRFGT
PRALISDEGTHFINKIIPNLLIKYNVQHRVATAYHPQMNGQAEISNRELKTILEKVVNSSRKDWASKLNEALWAYRTAFKTPIGMSPYTLVFGKACHLPLELEHKALWAA
KRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWQDQRISKKSLHIGQKELFPHGAVELMNEDGSNAFKVNGQRVKPYYGVGLERDKGNLKRFYQITQIKNS
KRLAAFKSADIDGSDNSRAPPSKPYINPSSPFFLGYASISQFSNYRREFRRFPFQIHPLFFLLFTDTIPMASQQSASSKNPKPTTSSSRQGSRNPSTTSTTPEPLTIVRP
NSPELEATSSPSPPRSRDLPEAQMVGRNDESTVFGDILDSIDEEDDAWLDGGLEESDSDDRPIRKVIAAVEKKKSAKEKQIRPIRAREEAEEDVPPLRRKSKKVKDAGTS
EAVASSVAASSVERLAAKAAEKSKKLKDDIQKMKERTESFLAAKKKRKEEIAASEQAVAKAARRVEFLRRIAEIATELEVEVEVSDAERPPKAERIKKNIKKIKEEKKKE
KEVTITKEKRPEKKKTEVPRALGVVIRDSDAGRARRTPASPVRRNEKGKEKMVEEPTKEAAKSRKKQFSGLYTEVGFFPEPIELPAFIIQGVDALGWRQLCESGQVIQPT
AVEAFYEGTIHRRAHLVKIENEVIFFEPQEINALFDLPNIAAAEGNRIMSTPTDAEMNDALSIIAKPGSEWNTSPKGTANLWLYFIKRSLLPTTHDASISRDRAMVIYCI
MRGIQLDVGHIIAPQIRGLFFKPRGQLFFPFLVTRLCANAQFMEDAPVRAVNEVLSAQSLRRILKDSPHLLDAANPKKRPPMSQEPSSPQPQPKKRKTVKKNSEIQASSS
QPNLNEETKEVSPPSSPHLELTLSPPQSSTFREATPPPPPHFEPTLTLDPEESVPFQFSQPRHEPQASNLPTPTTEPSARPTSQPQAEATTSTHHQEEPYHLRHASAPLP
FVDLNLDDLLRYLDDGILHPIMGDLDELRRKEMESLQRQEELAQQVSQLGQQVTQMAQQQSELRSFVQRQARRQDEQFRTLMNYIYEVFVQRIPAPVIPPSLQQPLPFDD
PNPPPADNNDNAPGRT