CuGenDBv2

Lag0010616 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010616
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr1:2260266..2270253
RNA-Seq Expression: Lag0010616
Synteny: Lag0010616
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
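
The genome location in this record uses a `chromosome:start..end` notation. As a minimal sketch, the coordinates can be split and the span length computed as below. The helper name `parse_location` is hypothetical, and the length assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into its parts.

    Returns (chrom, start, end, length). The length assumes
    1-based inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("chr1:2260266..2270253")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the gene model spans 9988 positions on chr1.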


Homology
GenBank top hits: e value | % identity | Alignment
CAN74381.1 hypothetical protein VITISV_007944 [Vitis vinifera] | e value: 7.3e-253 | % identity: 38.8
Query:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKF-IKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE
        MA  E  L+IQ+FHQCSSL+SIKL  SN LLW+SQ+LPLVRSLG+  HL SEN+H        +  E  +     W +NDGLLTSWLLG ++E+++ +++
Subjt:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKF-IKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE

Query:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL
         T+TA  +W SL E+LL M+KE E+ LT  L  +KKG  S+D+YL++ KGICD LAA++KPV D+ K F +A+GLGT Y  F+ AMLSK PYP++N+FVL
Subjt:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL

Query:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTV
        AL+ HE  I T+++  K ++ N EQA++TQ+GR R RG  F SRGRGF P GR  +++TS+Q  N + + T    N  N    P   + +N   S  +  
Subjt:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTV

Query:  SKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLE
                +      +ICQIC K NH+AL+CWNRFD+ YQSEEIP+ALAAM L+ E+ DP  Y DSGAT+H+ NDP   S+  + K          DQL+
Subjt:  SKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLE

Query:  ISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL------------------------------------
                 ++LA+G++K GLYALEE  ++   V      ++KAS  +WH+RM H   KS++ L                                    
Subjt:  ISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL------------------------------------

Query:  ----------------------------------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHR
                                                            +IK+FQSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV ERKHR
Subjt:  ----------------------------------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHR

Query:  HLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEI---------------QC----------------------
        H++E GL    N  KL  S              +    T+ Y ++  P+T   ++ P   +               QC                      
Subjt:  HLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEI---------------QC----------------------

Query:  -----QNGVQNDSPTNKAAQM-------------DEQSREQEENSLPIE-----------DSTNIAEQT---KENLVQPLLEER---HNDTTTMMEDENL
               G +   P+ K   +             D    ++    +PIE           +++ I  QT   KEN        +   H   TT   ++  
Subjt:  -----QNGVQNDSPTNKAAQM-------------DEQSREQEENSLPIE-----------DSTNIAEQT---KENLVQPLLEER---HNDTTTMMEDENL

Query:  MQPQIEERCTTKVTHTGSIVHN------------------NLQNSTPQQENSKSHSLSQFDH----------LPDISS---------NLYIDLTLPSAGN
        ++ +I  RC T      S V++                  N  N+    E     +L   DH          L  I S         N ++D   PS  N
Subjt:  MQPQIEERCTTKVTHTGSIVHN------------------NLQNSTPQQENSKSHSLSQFDH----------LPDISS---------NLYIDLTLPSAGN

Query:  QSNGSQGSNNTSTQSK------------HLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMN
        +S      +     SK            H  ++ TH M+TR KL  DP++   +             +         EPK Y++AL+IPHW  AM+EE+ 
Subjt:  QSNGSQGSNNTSTQSK------------HLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMN

Query:  ALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKET
        AL QN TW LVP+P   N++GSKW+FKTKLKEDGT++RYKARLVA+GF+Q+ GLD+ ETFSPV+K TTIR+I ++A+   W ++QLDVKNAFLHG LKE 
Subjt:  ALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKET

Query:  IYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIK
        ++M QPPGF +  LPNHVCKLN+SLYGLKQAPRAWF+RLSQ LLH+GF C ++D SLFI +                                       
Subjt:  IYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIK

Query:  DLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQ
                                                                                              VNK CQH Q PT  
Subjt:  DLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQ

Query:  HMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLG
         +  VKRILRY+KGT E+G+ F K SSL L GFCDADWAGC  TRRST+G+CIFLG NCISW+SK+QPTV+RSS+EAEYR++AS+ AEITW++FLLRD+G
Subjt:  HMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLG

Query:  IPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
        I L++PPQL CDNLSAL+M+VNPVFHAR+KHIE+DYHFVREKVA G LITR++PS LQVAD+ TK+L KTSF+  R KLGVH
Subjt:  IPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
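
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of tallying these per block is shown below. The function name `midline_counts` is hypothetical, and the sketch assumes the midline has been re-aligned column-for-column with the query string, since whitespace on this page may have shifted during extraction.

```python
def midline_counts(query: str, midline: str):
    """Count identical and positive columns in one alignment block.

    `query` is the residue string from a 'Query:' line and `midline`
    is the match line beneath it, assumed to start at the same column.
    Returns (identical, positive, columns).
    """
    # Pad the midline so trailing spaces lost in extraction still count as columns.
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = sum(1 for ch in padded if ch == "+")
    return identical, positive, len(query)


identical, positive, columns = midline_counts("MAKTE", "MA +E")
print(identical, positive, columns)
```

The % identity reported for each hit covers the whole alignment, so the tally for a single block will generally not reproduce it.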

GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterraneum] | e value: 5.9e-287 | % identity: 44.92
Query:  EPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSE-NQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDT
        EP LTIQSFHQCSSLISIKL++SN+LLWKSQILPL+RSLG+E H+ ++ ++    I +    ++ NP   QWI NDGLLTSWLLG + E+ ++MI   DT
Subjt:  EPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSE-NQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDT

Query:  AKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKA
        A  IW+SL EQLL  +++ E  L  +L  L KG LS+D+Y++K K +CD+L A+ KPV D+ KVF +++GLG  Y+ F+ A+LSK PYP+FN+F+++L+ 
Subjt:  AKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKA

Query:  HEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTVSK-SG
         E    T+  ++     +  QAF+ Q    RGRGR+ T  GRG  +GR ++SS  N+ +N                         N  AS+ +++ K + 
Subjt:  HEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTVSK-SG

Query:  NFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIG
        N+  +P +     CQICG+ NH A  C+ R++++ + E   +ALAA+ +NE+ DP  YADSGAT+HM N  G + SL+ Y G D +FVGNG  L I+H G
Subjt:  NFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIG

Query:  QG------KGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL-----------KIKVFQSDGGGEFTSLELKELLEQ
        +         +ILA+G +K  LYALE  K+E      A     +A   +WH R+GH N K L+ L           KIKVFQSDGGGEFTS E       
Subjt:  QG------KGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL-----------KIKVFQSDGGGEFTSLELKELLEQ

Query:  SGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRS------QTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND
            H   C +      ++ R    ++ TG E  T + KL        D   I  + L +        T    +  +     N  P + +        +D
Subjt:  SGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRS------QTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND

Query:  SPTNKAAQMDEQSREQEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLM---QPQIEERCTTKVTHTGSIVHNN---LQNSTPQQENSKS
        S      + DE   E+      ++D T             + ++  +DT     D + +    P ++    T       I+ NN   +QN      +   
Subjt:  SPTNKAAQMDEQSREQEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLM---QPQIEERCTTKVTHTGSIVHNN---LQNSTPQQENSKS

Query:  HSLSQFDHLPDISSNLYIDLTLP-SAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKS
         S  Q  ++ + SS       LP    N  +    + N        +S+   P     K +  PTI          T     H+  ++  +  EPK YK+
Subjt:  HSLSQFDHLPDISSNLYIDLTLP-SAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKS

Query:  ALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLK
        AL+  +W+ AM++E++ALH NNTW+LV +P + NVIGSKW+F+TKL EDG+++R+KARLVA+G+TQ+ GLD+ ETFSPV+K  TIR+I+++A+HF W LK
Subjt:  ALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLK

Query:  QLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTN
        QLDVKNAFLHG L E +YM QPPGF+H  LPNHVC+L+KSLYGLKQAPRAWFE+LS  L+ +GF CS++DPSLFI+++ +   ++LVYVDDIILTGN  +
Subjt:  QLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTN

Query:  ALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDIT
         ++ L+  L  +FA+KDLG+LHYFLG+E+ H   G+ +SQTKYA DLL++ +M  AS  NTP+A   ++ P+D    DAT YRR+ GSLQYLT TRPD+T
Subjt:  ALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDIT

Query:  HAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMAS
        HAVN VCQH QNPT + ++ VKRILRYIKGT  +G+ +   SSLNL  FCDADWAGCP TRRSTTGFCI+LG +CISW SKKQPTV+RSS+EAEY+A+A+
Subjt:  HAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMAS

Query:  ATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
          AE+TW+ +LL DLGI L++ P +FCDN SA++MS NPVFHARTKHI +DYHF+REKV  G L  RY+ +  Q+ADV TKSL K SF T R KLGVH
Subjt:  ATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 2.7e-279 | % identity: 41.24
Query:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE
        MA  E  L+IQ+FHQCSSL+SIKL  SN LLW+SQ+LPLVRSLG+  HL SEN+H  +     +  E  +     W +NDGLLTSWLLG ++E+++ +++
Subjt:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE

Query:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL
         T+TA  +W SL E+LL M+KE E+ LT  L  +KKG  S+D+YL++ KGICD LAA++KPV D+ KVF +A+GLGT Y  F+ AMLSK PYP++N+FVL
Subjt:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL

Query:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYL----TAPTDQNQKNITASH
        AL+ HE  I T+++  K ++ N EQA++TQ+GR R +G  F SRGRGF P GR  +++TS+Q  N H +     ++  N+     +AP  Q   + T+ H
Subjt:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYL----TAPTDQNQKNITASH

Query:  GSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNG
                                               + ++++ +IP+ALAAM L+ E+ DP  Y DSGAT+H+ NDP   S+  + K          
Subjt:  GSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNG

Query:  DQLEISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL--------------------------------
        DQL+         ++LA+G++K GLYALEE  ++   V      ++KAS  +WH+RMGH   KS++ L                                
Subjt:  DQLEISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL--------------------------------

Query:  -----------KI-----------------------------KVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTN
                   KI                             K+FQSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV ERKHRH++E GL    N
Subjt:  -----------KI-----------------------------KVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTN

Query:  HMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEI---------------QC---------------------------QNGVQND
          KL  S              +    T+ Y ++  P+T   ++ P   +               QC                             G +  
Subjt:  HMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEI---------------QC---------------------------QNGVQND

Query:  SPTNKAAQM-------------DEQSREQEENSLPIE-----------DSTNIAEQT---KENLVQPLLEER---HNDTTTMMEDENLMQPQIEERCTTK
         P+ K   +             D    ++    +PIE           +++ I  QT   KEN        +   H   TT   ++  ++ +I  RC T 
Subjt:  SPTNKAAQM-------------DEQSREQEENSLPIE-----------DSTNIAEQT---KENLVQPLLEER---HNDTTTMMEDENLMQPQIEERCTTK

Query:  VTHTGSIV----------------------HNN-LQNSTP----------------------QQENSKSHSLSQFDHLPDISSNLYIDLTL---------
             S V                      HN  +   TP                      Q E++ S+  ++  + PDIS +L +DL+          
Subjt:  VTHTGSIV----------------------HNN-LQNSTP----------------------QQENSKSHSLSQFDHLPDISSNLYIDLTL---------

Query:  --PSAGNQSNGSQGSNNTSTQSK------------HLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKV
          PS  N+S      +     SK            H  ++ TH M+TR KL  DP++   +      TR+              EPK Y++AL+IPHW  
Subjt:  --PSAGNQSNGSQGSNNTSTQSK------------HLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKV

Query:  AMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFL
        AM+EE+ AL QN TW LVP+P   N++GSKW+FKTKLKEDGT++RYKARLVA+GF+Q+ GLD+ ETFSPV+K TTIR+I ++A+   W ++QLDVKNAFL
Subjt:  AMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFL

Query:  HGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHL
        HG LKE ++M QPPGF +  LPNHVCKLN+SLYGLKQAPRAWF+RLS                 FI+                   GND N ++DLI  L
Subjt:  HGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHL

Query:  GTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQH
         +EF++KDLG LHYFLGLEV + P GL +SQTKY +DLLE   M   +  NTPMA+       D+   D T YR++VGSLQYLT TRPDI HAVNK CQH
Subjt:  GTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQH

Query:  LQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWIS
         Q PT   +R VKRILRY+KGT E+G+ F K SSL L GFCDADWAGC  TRRST+G+CIFLG NCISW+SK+QPTV+RSS+EAEYR++AS+ AEITW++
Subjt:  LQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWIS

Query:  FLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
        FLLRD+GI L++PPQL CDNLSAL+M+VNPVFHAR+KHIE+DYHFVREKVA G LITR++PS LQVAD+ TK+L KTSF+  R KLGVH
Subjt:  FLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 0.0e+00 | % identity: 44.63
Query:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE
        MA  E  L+IQ+FHQCSSL+SIKL  SN LLW+SQ+LPLVRSLG+  HL SEN+H  K     +  E  +     W +NDGLLTSWLLG ++E+++ +++
Subjt:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE

Query:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL
         T+TA  +W SL E+LL M+KE E+ LT  L  +KKG  S+D+YL++ KGICD LAA++KPV D+ KVF +A+GLGT Y  F+ AMLSK PYP++N+FVL
Subjt:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL

Query:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTV
        AL+ HE  I T+++  K ++ N EQA++TQ+GR R RG  F SRGRGF P GR  +++TS+Q  N + + T    N  N    P   + +N   S  +  
Subjt:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTV

Query:  SKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLE
                +      +ICQIC K NH+AL+CWNRFD+ YQ EEIP+ALAAM L+ E+ DP  Y DSGAT+H+ NDPGK+S +  YKGHD IFVGNG+ L 
Subjt:  SKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLE

Query:  ISHIGQGK--------------------------------------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWH
        ISHIG+ +                                                   ++LA+G++K GLYALEE  ++   V      ++KAS  +WH
Subjt:  ISHIGQGK--------------------------------------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWH

Query:  KRMGHLNEKSLRSL--------------------------------------------------------------------------------------
        +RMGH   KS++ L                                                                                      
Subjt:  KRMGHLNEKSLRSL--------------------------------------------------------------------------------------

Query:  -----------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLG
                         +IK+FQSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV ERKHRH++E GL    N  KL  S              + 
Subjt:  -----------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLG

Query:  RSQTSSYDLDENPTTEHNVDPPNQEI---------------QC-----QNGVQNDSPTNKA------AQMDEQSREQEENSLPIEDSTNIAEQTKENLVQ
           T+ Y ++  P+T   ++ P   +               QC       G    SP          + + +  R    ++  +  S ++       L+ 
Subjt:  RSQTSSYDLDENPTTEHNVDPPNQEI---------------QC-----QNGVQNDSPTNKA------AQMDEQSREQEENSLPIEDSTNIAEQTKENLVQ

Query:  PLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHL
         L +  HN   T     ++     +    T  T TG++   +L  +T         S S  D +PD S  + +D++ P                 Q +H 
Subjt:  PLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHL

Query:  TSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKL
         ++ TH M+TR KL  DP++   +      TR+              EPK Y++ L+IPHW  AM+EE+ AL QN TW LVP+P   N++GSKW+FKTKL
Subjt:  TSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKL

Query:  KEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQ
        KEDGT++RYKARLVA+GF+Q+ GLD+ ETFSPV+K TTIR+I ++A+   W ++QLDVKNAFLHG LKE ++M QPPGF +  L NHVCKLN+SLYGLKQ
Subjt:  KEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQ

Query:  APRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKD
        APRAWF+RLSQ LLH+GF C ++D SLFI +    I+++L+YVDDII+TGND N ++DLI  L +EF++KDLG LHYFLGLEV + P GL +SQTKY +D
Subjt:  APRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKD

Query:  LLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNL
        LLE   M   +  NTPMA+       D+   D T YR++VGSLQYLT TRPDI HAVNK CQH Q PT   +R VKRILRY+KGT E+G+ F K SSL L
Subjt:  LLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNL

Query:  YGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTK
         GFCDADWAGC  TRRST+G+CIFLG NCISW+SK+QPTV+RSS+EAEYR++AS+ AEITW++FLLRD+GI L++PPQL CDNLSAL+M VN VFHAR+K
Subjt:  YGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTK

Query:  HIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
        HIE+DYHFVREKVA G LITR++PS LQVAD+ TK+L KTSF+  R KLGVH
Subjt:  HIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

RVX04589.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 4.0e-259 | % identity: 45.6
Query:  MSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKY
        M+KE E+ LT  L  +KKG  S+D+YL++ KGICD LAA++KPV D+ KVF +A+GLGT Y  F+ AMLSK PYP++N+FVLAL+ HE  I T+++  K 
Subjt:  MSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKY

Query:  TLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPT----KTGATHNIANYLTAPTDQNQKNITASHGSTVSKSGNFISNPYKSD
        ++ N EQA++TQ+GR R RG  F SRGRGF P GR  +++TS+Q  N  P     +  +T +  +  +AP  Q   + T+ H                  
Subjt:  TLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPT----KTGATHNIANYLTAPTDQNQKNITASHGSTVSKSGNFISNPYKSD

Query:  LIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIGQGKGEILAK
                             + ++++ +IP+ALAAM L+ E+ DP  Y DSGAT+H+ NDP   S+  + K          DQL+         ++LA+
Subjt:  LIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIGQGKGEILAK

Query:  GSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSLKIKVFQSDGGGEFTSLELKE-LLEQSGIVHQLSCPHTPQQNGVVERKHRH
        G++K GLYALEE  ++   V      ++KAS  +WH+RMGH   KS++ L  K F         +   KE    QS   +   C H P      E++++ 
Subjt:  GSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSLKIKVFQSDGGGEFTSLELKE-LLEQSGIVHQLSCPHTPQQNGVVERKHRH

Query:  LIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSREQEENSLPIEDSTNIAE
         +E  +    N        +Q T+   +N+T    +  +          +HN             V  ++PT+ A          + NS     +  +++
Subjt:  LIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSREQEENSLPIEDSTNIAE

Query:  QTKENLVQPLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNN
            N  +P       D +  +   +L  PQ  ++           V  +LQN        KS S S  D +PD S  + +D++ P              
Subjt:  QTKENLVQPLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNN

Query:  TSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGS
           Q +H  ++ TH M+TR KL  DP++   +      TR+              EPK Y++ L+IPHW  AM+EE+ AL QN TW LVP+P   N++GS
Subjt:  TSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGS

Query:  KWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLN
        KW+FKTKLKEDGT++RYKARLVA+GF+Q+ GLD+ ETFSPV+K TTIR+I ++A+   W ++QLDVKNAFLHG LKE ++M QPPGF +  LPNHVCKLN
Subjt:  KWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLN

Query:  KSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHL
        +SLYGLKQAPRAWF+RLSQ LLH+GF C ++D SLFI +    I+++L+YVDDII+TGND N ++DLI  L +EF++KDLG LHYFLGLEV + P GL +
Subjt:  KSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHL

Query:  SQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTF
        SQTKY +DLLE   M   +  NTPMA+       D+   D T YR++VGSLQYLT TRPDI HAVNK CQH Q PT   +R VKRILRY+KGT E+G+ F
Subjt:  SQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTF

Query:  HKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVN
         K SSL L GFCDADWAGC  TRRST+G+CIFLG NCISW+SK+QPTV+RSS+EAEYR++AS+ AEITW++FLLRD+GI L++PPQL CDNLSAL+M+VN
Subjt:  HKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVN

Query:  PVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
        PVFHAR+KHIE+DYHFVREKVA G LITR++PS LQVAD+ TK+L KTSF+  R KLGVH
Subjt:  PVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

TrEMBL top hits: e value | % identity | Alignment
A0A2N9EWB3 Integrase catalytic domain-containing protein | e value: 1.1e-259 | % identity: 38.33
Query:  AKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIEST
        + T  +L+  + H   +++S+KL S+NYL+W+ QI PL++SL + +HL  E      ++E  E  ++NP + +W   D LL SW+ GT+SE+ L  +   
Subjt:  AKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIEST

Query:  DTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLAL
        +TA+++W  LE + L  +KE E+ L   L   KK  +S+D YL+  K ICD LAA++KPV D  K   ++  LG  Y  F T MLSK P+PTFN+FV AL
Subjt:  DTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLAL

Query:  KAHEVRIK-TDHDTEKYTLPNQEQAFYTQK---GRYRGRGRH----FTSRGRGF-PQGRSTHSSTSNQF-SNFHPTKTGATHNIANYLTAPTDQNQKNIT
        + + +R + TD D +  +  N   AF T +   GR RGRGRH    F SRGRGF P     HSS   Q    FHP+ T   H      + P  QN +   
Subjt:  KAHEVRIK-TDHDTEKYTLPNQEQAFYTQK---GRYRGRGRH----FTSRGRGF-PQGRSTHSSTSNQF-SNFHPTKTGATHNIANYLTAPTDQNQKNIT

Query:  ASHGSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQ-SEEIPKALAAMNLNEDVDPL-MYADSGATSHMVNDPGKISSLQLYKGHDKIF
         S+ S +S      +N    +   CQICG+  H+AL+CW R+D++Y+ +E I +ALA   L++  D    Y D+GATSHM +  G + S   Y GHD + 
Subjt:  ASHGSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQ-SEEIPKALAAMNLNEDVDPL-MYADSGATSHMVNDPGKISSLQLYKGHDKIF

Query:  VGNGDQLEISHIGQ----------------------------GK-----------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLN
        VGNG +L ISH+G                             GK                       G I+A G +  GLYAL+          +A +  
Subjt:  VGNGDQLEISHIGQ----------------------------GK-----------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLN

Query:  NKASYSIWHKRMGHLNEKSLRSL-----------------------------------------------------------------------------
         KA   IWH+R+GH + K L  L                                                                             
Subjt:  NKASYSIWHKRMGHLNEKSLRSL-----------------------------------------------------------------------------

Query:  --------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDD
                                  KI++FQ DGGGEF+       L   GIV  +SCP TP+QNGV ERKHRH++ETGL     H +L +        
Subjt:  --------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDD

Query:  ININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND-----------------------------------SPTNKAAQMDEQSREQ-----
            N  +    T+ Y ++  P+++  +D P  ++   +GV  D                                   SP +K  +      ++     
Subjt:  ININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND-----------------------------------SPTNKAAQMDEQSREQ-----

Query:  ----EENSLPIEDSTNIAEQTKEN-------------LVQPLLEERHNDT------TTM-----------------------MEDENLMQPQI-EERCTT
            +E  LP  D   +   T  +             L  P + + H  T      TT+                        E  + +QP       + 
Subjt:  ----EENSLPIEDSTNIAEQTKEN-------------LVQPLLEERHNDT------TTM-----------------------MEDENLMQPQI-EERCTT

Query:  KVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHLT--------------------------SQS
          T + S+    L   +P+Q    S      D  P +   L    T P+  + S+ +  ++ T+  S   T                          S +
Subjt:  KVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHLT--------------------------SQS

Query:  THPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDG
        THPMVTR K H                   HQ    L      EPK  KSAL+  HW+ AM +E++ALHQN TWSLVP+ A++N++GS+W+FKTKLK DG
Subjt:  THPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDG

Query:  TVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRA
        ++ER+KARLVA+G+ Q+EGLD+ ETFSPV+KPTTIRL++++AI   WSL+QLDVKNAFLHG LKE +YM QPPGF     P HVC L+K++YGLKQAPRA
Subjt:  TVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRA

Query:  WFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEK
        WF+R S +LL IGF CS +D SLF+++  S  I++LVYVDDII+T +  + L+ LI  L +EFA+KDLG L+YFLG++V H   GL LSQ KYAK++L K
Subjt:  WFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEK

Query:  NNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFC
         +M +     TP+A           L +AT YR IVG+LQYLTLTRPD+THAVN VCQ +  P+  H + VKRILRY++GT +YG+    +SSL LYGF 
Subjt:  NNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFC

Query:  DADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEM
        DADWAGCP TRRSTTG+CI+LG NCISW SKKQ TV+RSS+EAEYRAMASA AE+TW+++LLRDLG+     P LFCDN SAL+M+VNPVFHARTKHIE+
Subjt:  DADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEM

Query:  DYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGV
        DYHFVREKVA G+L TRY+PS  Q+AD+ TK++SK  F   RSKLGV
Subjt:  DYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGV

A0A2N9FTN5 Integrase catalytic domain-containing protein | e value: 3.0e-260 | % identity: 38.4
Query:  AKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIEST
        + T  +L+  + H   +++S+KL S+NYL+W+ QI PL++SL + +HL  E      ++E  E  ++NP + +W   D LL SW+ GT+SE+ L  +   
Subjt:  AKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIEST

Query:  DTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLAL
        +TA+++W  LE + L  +KE E+ L   L   KK  +S+D YL+  K ICD LAA++KPV D  K   ++  LG  Y  F T MLSK P+PTFN+FV AL
Subjt:  DTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLAL

Query:  KAHEVRIK-TDHDTEKYTLPNQEQAFYTQK---GRYRGRGRH----FTSRGRGF-PQGRSTHSSTSNQF-SNFHPTKTGATHNIANYLTAPTDQNQKNIT
        + + +R + TD D +  +  N   AF T +   GR RGRGRH    F SRGRGF P     HSS   Q    FHP+ T   H      + P  QN +   
Subjt:  KAHEVRIK-TDHDTEKYTLPNQEQAFYTQK---GRYRGRGRH----FTSRGRGF-PQGRSTHSSTSNQF-SNFHPTKTGATHNIANYLTAPTDQNQKNIT

Query:  ASHGSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQ-SEEIPKALAAMNLNEDVDPL-MYADSGATSHMVNDPGKISSLQLYKGHDKIF
         S+ S +S      +N    +   CQICG+  H+AL+CW R+D++Y+ +E I +ALA   L++  D    Y D+GATSHM +  G + S   Y GHD + 
Subjt:  ASHGSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQ-SEEIPKALAAMNLNEDVDPL-MYADSGATSHMVNDPGKISSLQLYKGHDKIF

Query:  VGNGDQLEISHIGQ----------------------------GK-----------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLN
        VGNG +L ISH+G                             GK                       G I+A G +  GLYAL+          +A +  
Subjt:  VGNGDQLEISHIGQ----------------------------GK-----------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLN

Query:  NKASYSIWHKRMGHLNEKSLRSL-----------------------------------------------------------------------------
         KA   IWH+R+GH + K L  L                                                                             
Subjt:  NKASYSIWHKRMGHLNEKSLRSL-----------------------------------------------------------------------------

Query:  --------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDD
                                  KI++FQ DGGGEF+       L   GIV  +SCP TP+QNGV ERKHRH++ETGL     H +L +        
Subjt:  --------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDD

Query:  ININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND-----------------------------------SPTNKAAQMDEQSREQ-----
            N  +    T+ Y ++  P+++  +D P  ++   NGV  D                                   SP +K  +      ++     
Subjt:  ININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND-----------------------------------SPTNKAAQMDEQSREQ-----

Query:  ----EENSLPIEDSTNIAEQTKEN-------------LVQPLLEERHNDT------TTM-----------------------MEDENLMQPQI-EERCTT
            +E  LP  D   +   T  +             L  P + + H  T      TT+                        E  + +QP       + 
Subjt:  ----EENSLPIEDSTNIAEQTKEN-------------LVQPLLEERHNDT------TTM-----------------------MEDENLMQPQI-EERCTT

Query:  KVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHLT--------------------------SQS
          T + S+    L   +P+Q    S      D  P +   L    T P+  + S+ +  ++ T+  S   T                          S +
Subjt:  KVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHLT--------------------------SQS

Query:  THPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDG
        THPMVTR K H                   HQ    L      EPK  KSAL+  HW+ AM +E++ALHQN TWSLVP+ A++N++GS+W+FKTKLK DG
Subjt:  THPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDG

Query:  TVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRA
        ++ER+KARLVA+G+ Q+EGLD+ ETFSPV+KPTTIRL++++AI   WSL+QLDVKNAFLHG LKE +YM QPPGF     P HVC L+K++YGLKQAPRA
Subjt:  TVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRA

Query:  WFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEK
        WF+R S +LL IGF CS +D SLF+++  S  I++LVYVDDII+T +  + L+ LI  L +EFA+KDLG L+YFLG++V H   GL LSQ KYAK++L K
Subjt:  WFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEK

Query:  NNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFC
         +M +     TP+A           L +AT YR IVG+LQYLTLTRPD+THAVN VCQ +  P+  H + VKRILRY++GT +YG+    +SSL LYGF 
Subjt:  NNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFC

Query:  DADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEM
        DADWAGCP TRRSTTG+CI+LG NCISW SKKQ TV+RSS+EAEYRAMASA AE+TW+++LLRDLG+     P LFCDN SAL+M+VNPVFHARTKHIE+
Subjt:  DADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEM

Query:  DYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGV
        DYHFVREKVA G+L TRY+PS  Q+AD+ TK++SK  F   RSKLGV
Subjt:  DYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGV

A0A2Z6P7T0 | Reverse transcriptase Ty1/copia-type domain-containing protein | 2.9e-287 | 44.92
Query:  EPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSE-NQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDT
        EP LTIQSFHQCSSLISIKL++SN+LLWKSQILPL+RSLG+E H+ ++ ++    I +    ++ NP   QWI NDGLLTSWLLG + E+ ++MI   DT
Subjt:  EPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSE-NQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDT

Query:  AKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKA
        A  IW+SL EQLL  +++ E  L  +L  L KG LS+D+Y++K K +CD+L A+ KPV D+ KVF +++GLG  Y+ F+ A+LSK PYP+FN+F+++L+ 
Subjt:  AKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKA

Query:  HEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTVSK-SG
         E    T+  ++     +  QAF+ Q    RGRGR+ T  GRG  +GR ++SS  N+ +N                         N  AS+ +++ K + 
Subjt:  HEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTVSK-SG

Query:  NFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIG
        N+  +P +     CQICG+ NH A  C+ R++++ + E   +ALAA+ +NE+ DP  YADSGAT+HM N  G + SL+ Y G D +FVGNG  L I+H G
Subjt:  NFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIG

Query:  QG------KGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL-----------KIKVFQSDGGGEFTSLELKELLEQ
        +         +ILA+G +K  LYALE  K+E      A     +A   +WH R+GH N K L+ L           KIKVFQSDGGGEFTS E       
Subjt:  QG------KGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL-----------KIKVFQSDGGGEFTSLELKELLEQ

Query:  SGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRS------QTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND
            H   C +      ++ R    ++ TG E  T + KL        D   I  + L +        T    +  +     N  P + +        +D
Subjt:  SGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRS------QTSSYDLDENPTTEHNVDPPNQEIQCQNGVQND

Query:  SPTNKAAQMDEQSREQEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLM---QPQIEERCTTKVTHTGSIVHNN---LQNSTPQQENSKS
        S      + DE   E+      ++D T             + ++  +DT     D + +    P ++    T       I+ NN   +QN      +   
Subjt:  SPTNKAAQMDEQSREQEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLM---QPQIEERCTTKVTHTGSIVHNN---LQNSTPQQENSKS

Query:  HSLSQFDHLPDISSNLYIDLTLP-SAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKS
         S  Q  ++ + SS       LP    N  +    + N        +S+   P     K +  PTI          T     H+  ++  +  EPK YK+
Subjt:  HSLSQFDHLPDISSNLYIDLTLP-SAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKS

Query:  ALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLK
        AL+  +W+ AM++E++ALH NNTW+LV +P + NVIGSKW+F+TKL EDG+++R+KARLVA+G+TQ+ GLD+ ETFSPV+K  TIR+I+++A+HF W LK
Subjt:  ALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLK

Query:  QLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTN
        QLDVKNAFLHG L E +YM QPPGF+H  LPNHVC+L+KSLYGLKQAPRAWFE+LS  L+ +GF CS++DPSLFI+++ +   ++LVYVDDIILTGN  +
Subjt:  QLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTN

Query:  ALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDIT
         ++ L+  L  +FA+KDLG+LHYFLG+E+ H   G+ +SQTKYA DLL++ +M  AS  NTP+A   ++ P+D    DAT YRR+ GSLQYLT TRPD+T
Subjt:  ALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDIT

Query:  HAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMAS
        HAVN VCQH QNPT + ++ VKRILRYIKGT  +G+ +   SSLNL  FCDADWAGCP TRRSTTGFCI+LG +CISW SKKQPTV+RSS+EAEY+A+A+
Subjt:  HAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMAS

Query:  ATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
          AE+TW+ +LL DLGI L++ P +FCDN SA++MS NPVFHARTKHI +DYHF+REKV  G L  RY+ +  Q+ADV TKSL K SF T R KLGVH
Subjt:  ATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

A0A438C9J9 | Retrovirus-related Pol polyprotein from transposon RE1 | 1.3e-279 | 41.24
Query:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE
        MA  E  L+IQ+FHQCSSL+SIKL  SN LLW+SQ+LPLVRSLG+  HL SEN+H  +     +  E  +     W +NDGLLTSWLLG ++E+++ +++
Subjt:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE

Query:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL
         T+TA  +W SL E+LL M+KE E+ LT  L  +KKG  S+D+YL++ KGICD LAA++KPV D+ KVF +A+GLGT Y  F+ AMLSK PYP++N+FVL
Subjt:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL

Query:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYL----TAPTDQNQKNITASH
        AL+ HE  I T+++  K ++ N EQA++TQ+GR R +G  F SRGRGF P GR  +++TS+Q  N H +     ++  N+     +AP  Q   + T+ H
Subjt:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYL----TAPTDQNQKNITASH

Query:  GSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNG
                                               + ++++ +IP+ALAAM L+ E+ DP  Y DSGAT+H+ NDP   S+  + K          
Subjt:  GSTVSKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNG

Query:  DQLEISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL--------------------------------
        DQL+         ++LA+G++K GLYALEE  ++   V      ++KAS  +WH+RMGH   KS++ L                                
Subjt:  DQLEISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL--------------------------------

Query:  -----------KI-----------------------------KVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTN
                   KI                             K+FQSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV ERKHRH++E GL    N
Subjt:  -----------KI-----------------------------KVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTN

Query:  HMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEI---------------QC---------------------------QNGVQND
          KL  S              +    T+ Y ++  P+T   ++ P   +               QC                             G +  
Subjt:  HMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEI---------------QC---------------------------QNGVQND

Query:  SPTNKAAQM-------------DEQSREQEENSLPIE-----------DSTNIAEQT---KENLVQPLLEER---HNDTTTMMEDENLMQPQIEERCTTK
         P+ K   +             D    ++    +PIE           +++ I  QT   KEN        +   H   TT   ++  ++ +I  RC T 
Subjt:  SPTNKAAQM-------------DEQSREQEENSLPIE-----------DSTNIAEQT---KENLVQPLLEER---HNDTTTMMEDENLMQPQIEERCTTK

Query:  VTHTGSIV----------------------HNN-LQNSTP----------------------QQENSKSHSLSQFDHLPDISSNLYIDLTL---------
             S V                      HN  +   TP                      Q E++ S+  ++  + PDIS +L +DL+          
Subjt:  VTHTGSIV----------------------HNN-LQNSTP----------------------QQENSKSHSLSQFDHLPDISSNLYIDLTL---------

Query:  --PSAGNQSNGSQGSNNTSTQSK------------HLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKV
          PS  N+S      +     SK            H  ++ TH M+TR KL  DP++   +      TR+              EPK Y++AL+IPHW  
Subjt:  --PSAGNQSNGSQGSNNTSTQSK------------HLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKV

Query:  AMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFL
        AM+EE+ AL QN TW LVP+P   N++GSKW+FKTKLKEDGT++RYKARLVA+GF+Q+ GLD+ ETFSPV+K TTIR+I ++A+   W ++QLDVKNAFL
Subjt:  AMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFL

Query:  HGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHL
        HG LKE ++M QPPGF +  LPNHVCKLN+SLYGLKQAPRAWF+RLS                 FI+                   GND N ++DLI  L
Subjt:  HGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHL

Query:  GTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQH
         +EF++KDLG LHYFLGLEV + P GL +SQTKY +DLLE   M   +  NTPMA+       D+   D T YR++VGSLQYLT TRPDI HAVNK CQH
Subjt:  GTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQH

Query:  LQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWIS
         Q PT   +R VKRILRY+KGT E+G+ F K SSL L GFCDADWAGC  TRRST+G+CIFLG NCISW+SK+QPTV+RSS+EAEYR++AS+ AEITW++
Subjt:  LQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWIS

Query:  FLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
        FLLRD+GI L++PPQL CDNLSAL+M+VNPVFHAR+KHIE+DYHFVREKVA G LITR++PS LQVAD+ TK+L KTSF+  R KLGVH
Subjt:  FLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

A0A438E6Z5 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 0.0e+00 | 44.63
Query:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE
        MA  E  L+IQ+FHQCSSL+SIKL  SN LLW+SQ+LPLVRSLG+  HL SEN+H  K     +  E  +     W +NDGLLTSWLLG ++E+++ +++
Subjt:  MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQH-GKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIE

Query:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL
         T+TA  +W SL E+LL M+KE E+ LT  L  +KKG  S+D+YL++ KGICD LAA++KPV D+ KVF +A+GLGT Y  F+ AMLSK PYP++N+FVL
Subjt:  STDTAKKIWTSLEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVL

Query:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTV
        AL+ HE  I T+++  K ++ N EQA++TQ+GR R RG  F SRGRGF P GR  +++TS+Q  N + + T    N  N    P   + +N   S  +  
Subjt:  ALKAHEVRIKTDHDTEKYTLPNQEQAFYTQKGRYRGRGRHFTSRGRGF-PQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTV

Query:  SKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLE
                +      +ICQIC K NH+AL+CWNRFD+ YQ EEIP+ALAAM L+ E+ DP  Y DSGAT+H+ NDPGK+S +  YKGHD IFVGNG+ L 
Subjt:  SKSGNFISNPYKSDLIICQICGKGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLE

Query:  ISHIGQGK--------------------------------------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWH
        ISHIG+ +                                                   ++LA+G++K GLYALEE  ++   V      ++KAS  +WH
Subjt:  ISHIGQGK--------------------------------------------------GEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWH

Query:  KRMGHLNEKSLRSL--------------------------------------------------------------------------------------
        +RMGH   KS++ L                                                                                      
Subjt:  KRMGHLNEKSLRSL--------------------------------------------------------------------------------------

Query:  -----------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLG
                         +IK+FQSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV ERKHRH++E GL    N  KL  S              + 
Subjt:  -----------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLG

Query:  RSQTSSYDLDENPTTEHNVDPPNQEI---------------QC-----QNGVQNDSPTNKA------AQMDEQSREQEENSLPIEDSTNIAEQTKENLVQ
           T+ Y ++  P+T   ++ P   +               QC       G    SP          + + +  R    ++  +  S ++       L+ 
Subjt:  RSQTSSYDLDENPTTEHNVDPPNQEI---------------QC-----QNGVQNDSPTNKA------AQMDEQSREQEENSLPIEDSTNIAEQTKENLVQ

Query:  PLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHL
         L +  HN   T     ++     +    T  T TG++   +L  +T         S S  D +PD S  + +D++ P                 Q +H 
Subjt:  PLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHL

Query:  TSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKL
         ++ TH M+TR KL  DP++   +      TR+              EPK Y++ L+IPHW  AM+EE+ AL QN TW LVP+P   N++GSKW+FKTKL
Subjt:  TSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKL

Query:  KEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQ
        KEDGT++RYKARLVA+GF+Q+ GLD+ ETFSPV+K TTIR+I ++A+   W ++QLDVKNAFLHG LKE ++M QPPGF +  L NHVCKLN+SLYGLKQ
Subjt:  KEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQ

Query:  APRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKD
        APRAWF+RLSQ LLH+GF C ++D SLFI +    I+++L+YVDDII+TGND N ++DLI  L +EF++KDLG LHYFLGLEV + P GL +SQTKY +D
Subjt:  APRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKD

Query:  LLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNL
        LLE   M   +  NTPMA+       D+   D T YR++VGSLQYLT TRPDI HAVNK CQH Q PT   +R VKRILRY+KGT E+G+ F K SSL L
Subjt:  LLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNL

Query:  YGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTK
         GFCDADWAGC  TRRST+G+CIFLG NCISW+SK+QPTV+RSS+EAEYR++AS+ AEITW++FLLRD+GI L++PPQL CDNLSAL+M VN VFHAR+K
Subjt:  YGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTK

Query:  HIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH
        HIE+DYHFVREKVA G LITR++PS LQVAD+ TK+L KTSF+  R KLGVH
Subjt:  HIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVH

SwissProt top hits | e value | % identity | Alignment
P04146 | Copia protein | 7.7e-96 | 30.97
Query:  SLKIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTT
        +LK+     D G E+ S E+++   + GI + L+ PHTPQ NGV ER  R + E       +  KL +S      +  +  T L     S   +D + T 
Subjt:  SLKIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTT

Query:  E---HNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSREQ-----EENSLPIEDSTN----IAEQTKENLVQPLLEERHNDTTTMMED----ENLMQPQIE
            HN  P  + ++           NK  + D++S +      E N   + D+ N    +A     +    +        T  ++D    EN   P   
Subjt:  E---HNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSREQ-----EENSLPIEDSTN----IAEQTKENLVQPLLEERHNDTTTMMED----ENLMQPQIE

Query:  ERCTTKVTHTGSIVHNNLQ--NSTPQQEN----------------SKSHSLSQFDHLPDI-SSNLYI--DLTLPSAGNQSNGSQGSNN-----TSTQSKH
         +         S   +N+Q    + + EN                ++S        L D   SN Y   +       +  N S+GS N      S  ++H
Subjt:  ERCTTKVTHTGSIVHNNLQ--NSTPQQEN----------------SKSHSLSQFDHLPDI-SSNLYI--DLTLPSAGNQSNGSQGSNN-----TSTQSKH

Query:  L--------TSQSTHPMVTR--QKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNV
        L        T      ++ R  ++L   P I  +           + H+ +     + +   Y+       W+ A+  E+NA   NNTW++  +P N N+
Subjt:  L--------TSQSTHPMVTR--QKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNV

Query:  IGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVC
        + S+W+F  K  E G   RYKARLVA+GFTQ   +DYEETF+PV + ++ R I+++ I +N  + Q+DVK AFL+G LKE IYM  P G   +   ++VC
Subjt:  IGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVC

Query:  KLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSI--IIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTP
        KLNK++YGLKQA R WFE   Q L    F  S  D  ++I    +I   I +L+YVDD+++   D   +N+   +L  +F + DL  + +F+G+ +    
Subjt:  KLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSI--IIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTP

Query:  YGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTL-TRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTT
          ++LSQ+ Y K +L K NM N +  +TP+   ++    +      T  R ++G L Y+ L TRPD+T AVN + ++      +  + +KR+LRY+KGT 
Subjt:  YGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTL-TRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTT

Query:  EYGMTFHKNSSL--NLYGFCDADWAGCPLTRRSTTGFCI-FLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDN
        +  + F KN +    + G+ D+DWAG  + R+STTG+       N I W +K+Q +VA SS+EAEY A+  A  E  W+ FLL  + I L+ P +++ DN
Subjt:  EYGMTFHKNSSL--NLYGFCDADWAGCPLTRRSTTGFCI-FLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDN

Query:  LSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVHAQAHSN
           + ++ NP  H R KHI++ YHF RE+V    +   YIP++ Q+AD+ TK L    F  LR KLG+     SN
Subjt:  LSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVHAQAHSN

P10978 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-98 | 26.62
Query:  SNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDTAKKIWTSLEEQLLTMSKENEIHL
        + +  W+ ++  L+   G+ + L  +++    +K +D           W + D    S +   +S+D++  I   DTA+ IWT LE   ++ +  N+++L
Subjt:  SNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDTAKKIWTSLEEQLLTMSKENEIHL

Query:  TETLLTLKKGK-LSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKYTLPNQEQA
         + L  L   +  +   +L    G+  QLA +   +++  K   +   L +SY    T +L         +   AL  +E   K           NQ QA
Subjt:  TETLLTLKKGK-LSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKYTLPNQEQA

Query:  FYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTS--------NQFSNF-----HPTK----TGATHNIANYLTAPTDQNQKNITASHGSTVSKSGNFISNP
          T+ GR R   R   + GR   +G+S + S S        NQ  +F     +P K    T    N  N  TA   QN  N+         +    +S P
Subjt:  FYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTS--------NQFSNF-----HPTK----TGATHNIANYLTAPTDQNQKNITASHGSTVSKSGNFISNP

Query:  YKSDLIICQICGKGNHTALECWNRFDHSYQSE--EIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQL---------YKGHDKIFVGNGDQLE
         +S+ ++       +H A    + F      +   +     + +    +  +    +   + ++ D   +  L++           G++  F     +L 
Subjt:  YKSDLIICQICGKGNHTALECWNRFDHSYQSE--EIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQL---------YKGHDKIFVGNGDQLE

Query:  ISHIGQGKGE-ILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL-----------------------------------
               KG  ++AKG  +  LY    E + + E+  A    ++ S  +WHKRMGH++EK L+ L                                   
Subjt:  ISHIGQGKGE-ILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNKASYSIWHKRMGHLNEKSLRSL-----------------------------------

Query:  -------------------------------------------------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSC
                                                                           K+K  +SD GGE+TS E +E     GI H+ + 
Subjt:  -------------------------------------------------------------------KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSC

Query:  PHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSRE
        P TPQ NGV ER +R ++E  +       KL +S                  QT+ Y ++ +P+     + P +             TNK         E
Subjt:  PHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEHNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSRE

Query:  QEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIV--HNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYID
           + L +      A   KE   +  L+++      +   +     ++ +    KV  +  +V   + ++ +    E  K+  +  F             
Subjt:  QEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIV--HNNLQNSTPQQENSKSHSLSQFDHLPDISSNLYID

Query:  LTLPSAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLH------KDPTIDPHLHQEIQRTRTPHQHS------AYLIHKQTMEPKGYKSALRIP---
        +T+PS    SN    + +T+ +      Q    +   ++L       + PT     HQ ++R+  P   S       Y++     EP+  K  L  P   
Subjt:  LTLPSAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLH------KDPTIDPHLHQEIQRTRTPHQHS------AYLIHKQTMEPKGYKSALRIP---

Query:  HWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVK
            AM+EEM +L +N T+ LV  P     +  KW+FK K   D  + RYKARLV +GF Q +G+D++E FSPVVK T+IR I+++A   +  ++QLDVK
Subjt:  HWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVK

Query:  NAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYK-HHSIIIIMLVYVDDIILTGNDTNALND
         AFLHG L+E IYM QP GF+ +   + VCKLNKSLYGLKQAPR W+ +   ++    +  + SDP ++  +   +  II+L+YVDD+++ G D   +  
Subjt:  NAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYK-HHSIIIIMLVYVDDIILTGNDTNALND

Query:  LILHLGTEFAIKDLGRLHYFLGLEV--HHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMA--VGLSQFPNDQVLAD-----ATTYRRIVGSLQY-LTL
        L   L   F +KDLG     LG+++    T   L LSQ KY + +LE+ NM NA   +TP+A  + LS+      + +        Y   VGSL Y +  
Subjt:  LILHLGTEFAIKDLGRLHYFLGLEV--HHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMA--VGLSQFPNDQVLAD-----ATTYRRIVGSLQY-LTL

Query:  TRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAE
        TRPDI HAV  V + L+NP  +H   VK ILRY++GTT   + F  +  + L G+ DAD AG    R+S+TG+        ISW SK Q  VA S++EAE
Subjt:  TRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAE

Query:  YRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSK
        Y A      E+ W+   L++LG+  Q+   ++CD+ SA+ +S N ++HARTKHI++ YH++RE V   SL    I ++   AD+LTK + +  F   +  
Subjt:  YRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSK

Query:  LGVHA
        +G+H+
Subjt:  LGVHA

P92519 | Uncharacterized mitochondrial protein AtMg00810 | 6.3e-66 | 53.78
Query:  MLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRR
        +L+YVDDI+LTG+    LN LI  L + F++KDLG +HYFLG+++   P GL LSQTKYA+ +L    M +    +TP+ + L+   +     D + +R 
Subjt:  MLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRR

Query:  IVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQP
        IVG+LQYLTLTRPDI++AVN VCQ +  PT+     +KR+LRY+KGT  +G+  HKNS LN+  FCD+DWAGC  TRRSTTGFC FLG N ISW++K+QP
Subjt:  IVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQP

Query:  TVARSSSEAEYRAMASATAEITWIS
        TV+RSS+E EYRA+A   AE+TW S
Subjt:  TVARSSSEAEYRAMASATAEITWIS

Q94HW2 | Retrovirus-related Pol polyprotein from transposon RE1 | 1.0e-161 | 30.61
Query:  KLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDTAKKIWTSLEEQLLTMSKEN
        KLTS+NYL+W  Q+  L     +   L             D    VNP +T+W   D L+ S +LG IS  +   +    TA +IW +L +     S  +
Subjt:  KLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDTAKKIWTSLEEQLLTMSKEN

Query:  EIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKYTLPNQ
           L   L    KG  ++DDY++ +    DQLA + KP+D   +V  V   L   Y+     + +K   PT  E    L  HE +I              
Subjt:  EIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKYTLPNQ

Query:  EQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITA--SHGSTVSKSGNFISNPYKSDLIICQICG
                           S     P        T+N  S+ + T T   +N  N      ++N  N +      ST     N  S PY   L  CQICG
Subjt:  EQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITA--SHGSTVSKSGNFISNPYKSDLIICQICG

Query:  KGNHTALEC--WNRFDHSYQSEEIPKALAAMNLNEDV-------DPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIGQGKGEILAK
           H+A  C     F  S  S++ P          ++             DSGAT H+ +D   +S  Q Y G D + V +G  + ISH G      L+ 
Subjt:  KGNHTALEC--WNRFDHSYQSEEIPKALAAMNLNEDV-------DPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIGQGKGEILAK

Query:  GSRKAGLY-ALEEEKVEKNEVYIAGLLN---------------------------------------------------NKASYSIWHKRMGHLNEKSLR
         SR   L+  L    + KN + +  L N                                                   +KA++S WH R+GH     L 
Subjt:  GSRKAGLY-ALEEEKVEKNEVYIAGLLN---------------------------------------------------NKASYSIWHKRMGHLNEKSLR

Query:  SL--------------------------------------------------------------------------------------------------
        S+                                                                                                  
Subjt:  SL--------------------------------------------------------------------------------------------------

Query:  ----KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDE--
            +I  F SD GGEF +  L E   Q GI H  S PHTP+ NG+ ERKHRH++ETGL    +H  + ++       + +    + R  T    L+   
Subjt:  ----KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDE--

Query:  ------------------------NPTTEHNVDPPNQE----------------------IQCQNGVQNDSP----TNKAAQMDEQSREQEENSLPIEDS
                                 P  +H +D  +++                      +     V+ D      +N  A +     ++ E+S      
Subjt:  ------------------------NPTTEHNVDPPNQE----------------------IQCQNGVQNDSP----TNKAAQMDEQSREQEENSLPIEDS

Query:  TNIAEQTKENLVQPLLEERHNDTT-----------TMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHS---LSQFDHLPDISSNLYI
        T +  +T   L  P   + H+  T           + +   NL          +    T    +     + P Q  +++HS    SQ +   +  S L  
Subjt:  TNIAEQTKENLVQPLLEERHNDTT-----------TMMEDENLMQPQIEERCTTKVTHTGSIVHNNLQNSTPQQENSKSHS---LSQFDHLPDISSNLYI

Query:  DLTLPSAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPH---------------QHSAYLIHKQTMEPKGYKSALRI
         L+ P+   QS+ S  S  TS      +S ST P      +H  P +   ++   Q     H               ++S  +      EP+    AL+ 
Subjt:  DLTLPSAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPH---------------QHSAYLIHKQTMEPKGYKSALRI

Query:  PHWKVAMEEEMNALHQNNTWSLV-PKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLD
          W+ AM  E+NA   N+TW LV P P++V ++G +WIF  K   DG++ RYKARLVA+G+ Q  GLDY ETFSPV+K T+IR+++ VA+  +W ++QLD
Subjt:  PHWKVAMEEEMNALHQNNTWSLV-PKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLD

Query:  VKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALN
        V NAFL G L + +YM+QPPGF     PN+VCKL K+LYGLKQAPRAW+  L  YLL IGF  S SD SLF+ +    I+ MLVYVDDI++TGND   L+
Subjt:  VKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALN

Query:  DLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVG--LSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITH
        + + +L   F++KD   LHYFLG+E    P GLHLSQ +Y  DLL + NM  A    TPMA    LS +   + L D T YR IVGSLQYL  TRPDI++
Subjt:  DLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVG--LSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITH

Query:  AVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASA
        AVN++ Q +  PT +H++ +KRILRY+ GT  +G+   K ++L+L+ + DADWAG      ST G+ ++LG + ISW+SKKQ  V RSS+EAEYR++A+ 
Subjt:  AVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASA

Query:  TAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGV
        ++E+ WI  LL +LGI L +PP ++CDN+ A Y+  NPVFH+R KHI +DYHF+R +V  G+L   ++ +  Q+AD LTK LS+T+F+   SK+GV
Subjt:  TAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGV

Q9ZT94 | Retrovirus-related Pol polyprotein from transposon RE2 | 7.6e-144 | 37.34
Query:  KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEH
        +I    SD GGEF  + L++ L Q GI H  S PHTP+ NG+ ERKHRH++E GL    +H  + ++       + +    + R  T    L ++P  + 
Subjt:  KIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSSYDLDENPTTEH

Query:  NVDPPNQE---------------------------------------IQCQNGVQNDSPTNKAAQMDEQ-------------SREQEENSLPIEDSTNIA
           PPN E                                         C +       T++  Q DE+             S+EQ  +S P   S    
Subjt:  NVDPPNQE---------------------------------------IQCQNGVQNDSPTNKAAQMDEQ-------------SREQEENSLPIEDSTNIA

Query:  EQTKENLVQPLLEERHNDT------------TTMMEDENLMQPQIEERCTTKVT---HTG---SIVHNNLQNS--------TPQQENSKSHSLSQFDHLP
          T   L  P     H DT            TT +   NL    I    +++ T   H G   +   +  QNS         P   +   +S +Q   LP
Subjt:  EQTKENLVQPLLEERHNDT------------TTMMEDENLMQPQIEERCTTKVT---HTG---SIVHNNLQNS--------TPQQENSKSHSLSQFDHLP

Query:  D--ISSNLYIDLTLPSAGNQSNGSQGSNNTST-------------QSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIH-KQTMEP
           ISS     +  PS       S  S++TST             Q       +TH M TR K                  R P+Q  +Y        EP
Subjt:  D--ISSNLYIDLTLPSAGNQSNGSQGSNNTST-------------QSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIH-KQTMEP

Query:  KGYKSALRIPHWKVAMEEEMNALHQNNTWSLV-PKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIH
        +    A++   W+ AM  E+NA   N+TW LV P P +V ++G +WIF  K   DG++ RYKARLVA+G+ Q  GLDY ETFSPV+K T+IR+++ VA+ 
Subjt:  KGYKSALRIPHWKVAMEEEMNALHQNNTWSLV-PKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIH

Query:  FNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIIL
         +W ++QLDV NAFL G L + +YM+QPPGF     P++VC+L K++YGLKQAPRAW+  L  YLL +GF  S SD SLF+ +    II MLVYVDDI++
Subjt:  FNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIIL

Query:  TGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLS-QFPNDQVLADATTYRRIVGSLQYLT
        TGNDT  L   +  L   F++K+   LHYFLG+E    P GLHLSQ +Y  DLL + NM  A    TPMA        +   L D T YR IVGSLQYL 
Subjt:  TGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLS-QFPNDQVLADATTYRRIVGSLQYLT

Query:  LTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEA
         TRPD+++AVN++ Q++  PT  H   +KR+LRY+ GT ++G+   K ++L+L+ + DADWAG      ST G+ ++LG + ISW+SKKQ  V RSS+EA
Subjt:  LTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEA

Query:  EYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRS
        EYR++A+ ++E+ WI  LL +LGI L  PP ++CDN+ A Y+  NPVFH+R KHI +DYHF+R +V  G+L   ++ +  Q+AD LTK LS+ +F+    
Subjt:  EYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRS

Query:  KLGVHAQAHSNDGLSR
        K+GV     S  G+ R
Subjt:  KLGVHAQAHSNDGLSR

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    8.6e-119    45.06
Query:  TPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSP
        +P  HS  +   +  EP  Y  A     W  AM++E+ A+   +TW +   P N   IG KW++K K   DGT+ERYKARLVA+G+TQ EG+D+ ETFSP
Subjt:  TPHQHSAYLIHKQTMEPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSP

Query:  VVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQ----LPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLF
        V K T+++LI+A++  +N++L QLD+ NAFL+G L E IYM  PPG+   Q     PN VC L KS+YGLKQA R WF + S  L+  GF  S SD + F
Subjt:  VVKPTTIRLIIAVAIHFNWSLKQLDVKNAFLHGILKETIYMAQPPGFQHSQ----LPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLF

Query:  IYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPM--AVGLSQFPN
        +    ++ + +LVYVDDII+  N+  A+++L   L + F ++DLG L YFLGLE+  +  G+++ Q KYA DLL++  +      + PM  +V  S    
Subjt:  IYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPM--AVGLSQFPN

Query:  DQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLG
             DA  YRR++G L YL +TR DI+ AVNK+ Q  + P + H + V +IL YIKGT   G+ +   + + L  F DA +  C  TRRST G+C+FLG
Subjt:  DQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLG

Query:  PNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREK
         + ISW SKKQ  V++SS+EAEYRA++ AT E+ W++   R+L +PL +P  LFCDN +A++++ N VFH RTKHIE D H VRE+
Subjt:  PNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVNPVFHARTKHIEMDYHFVREK

ATMG00240.1 Gag-Pol-related retrotransposon family protein    1.1e-15    48.72
Query:  YLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFC
        YLT+TRPD+T AVN++ Q         M+ V ++L Y+KGT   G+ +   S L L  F D+DWA CP TRRS TGFC
Subjt:  YLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFC

ATMG00810.1 DNA/RNA polymerases superfamily protein    4.5e-67    53.78
Query:  MLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRR
        +L+YVDDI+LTG+    LN LI  L + F++KDLG +HYFLG+++   P GL LSQTKYA+ +L    M +    +TP+ + L+   +     D + +R 
Subjt:  MLVYVDDIILTGNDTNALNDLILHLGTEFAIKDLGRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRR

Query:  IVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQP
        IVG+LQYLTLTRPDI++AVN VCQ +  PT+     +KR+LRY+KGT  +G+  HKNS LN+  FCD+DWAGC  TRRSTTGFC FLG N ISW++K+QP
Subjt:  IVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYIKGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQP

Query:  TVARSSSEAEYRAMASATAEITWIS
        TV+RSS+E EYRA+A   AE+TW S
Subjt:  TVARSSSEAEYRAMASATAEITWIS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    1.3e-26    53.1
Query:  EPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAI
        EPK    AL+ P W  AM+EE++AL +N TW LVP P N N++G KW+FKTKL  DGT++R KARLVA+GF Q EG+ + ET+SPVV+  TIR I+ VA 
Subjt:  EPKGYKSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAI

Query:  H------FNWSLK
                NW  K
Subjt:  H------FNWSLK


Sequences
CDS sequence
ATGGCAAAAACAGAACCAAACTTGACCATCCAATCCTTTCATCAATGCTCCAGCTTGATTTCAATCAAACTCACCTCCTCAAACTACCTACTCTGGAAATCACAGATTCT
ACCATTAGTTAGGAGTTTAGGAATTGAGGAACATTTGAAATCAGAAAATCAGCATGGAAAGTTCATCAAAGAAAAAGATGAAATGGAAGTAGTCAACCCACATTTCACTC
AATGGATCAACAATGATGGATTGTTGACATCTTGGCTACTTGGAACAATCTCTGAAGATATCCTTGCTATGATCGAAAGCACAGACACGGCCAAGAAAATATGGACATCT
CTGGAAGAACAATTACTTACGATGAGTAAGGAGAACGAGATCCACTTGACCGAGACGTTACTCACCCTGAAGAAAGGTAAACTCTCAGTAGATGATTATCTTAAGAAAAT
TAAAGGAATATGTGATCAACTTGCAGCAATGAAGAAACCAGTAGACGACATGACAAAAGTCTTCCATGTTGCTCGAGGACTTGGTACGAGTTATCAAGGCTTCAAAACAG
CCATGCTCTCAAAGGCTCCATATCCTACTTTCAATGAATTTGTACTAGCACTTAAAGCGCATGAGGTAAGAATTAAAACAGATCACGATACGGAAAAATATACTCTACCA
AACCAGGAGCAAGCCTTTTATACGCAAAAGGGAAGATACAGAGGGCGAGGAAGGCACTTTACCTCTAGAGGAAGAGGATTTCCTCAAGGTCGATCAACACACTCTAGCAC
TTCAAACCAATTTTCAAATTTTCATCCAACCAAAACTGGTGCAACACACAACATAGCAAATTATTTAACAGCGCCTACAGATCAAAATCAGAAAAACATTACAGCAAGTC
ATGGCTCAACAGTATCCAAATCAGGTAATTTCATCTCCAATCCTTACAAATCTGACCTGATCATTTGTCAAATATGTGGCAAAGGAAACCATACAGCCTTGGAATGCTGG
AATAGATTTGATCATTCCTACCAATCAGAAGAAATTCCAAAAGCATTAGCAGCCATGAATTTGAATGAAGATGTTGATCCATTGATGTATGCTGATTCCGGAGCCACTTC
TCACATGGTTAATGACCCTGGTAAGATCTCATCCCTACAACTTTACAAGGGACATGATAAAATATTTGTTGGAAATGGAGATCAACTTGAAATATCTCACATTGGACAAG
GAAAAGGGGAGATACTTGCTAAAGGATCTCGAAAAGCTGGATTATATGCACTAGAAGAGGAAAAAGTAGAGAAAAATGAAGTCTATATTGCTGGCCTTCTTAATAATAAA
GCCTCATACTCTATTTGGCATAAGAGGATGGGACATTTGAATGAAAAGTCTTTAAGATCTTTGAAAATTAAAGTCTTTCAGAGTGATGGTGGTGGAGAGTTCACTTCTCT
TGAGCTTAAAGAACTACTTGAACAAAGCGGAATTGTGCATCAACTTTCTTGTCCACACACTCCTCAACAAAATGGAGTAGTTGAAAGGAAGCATCGCCATCTTATTGAAA
CAGGACTTGAGGAGTGGACCAATCATATGAAATTAACACAATCTAGACACCAAGTGACAGATGACATCAATATAAATAACACACGTCTGGGGAGATCACAAACTTCCTCC
TATGACTTGGATGAAAATCCAACAACAGAACACAACGTTGATCCTCCTAATCAAGAAATTCAATGTCAAAATGGAGTACAAAATGATTCTCCAACCAATAAAGCAGCACA
GATGGATGAGCAATCAAGAGAGCAAGAGGAAAATTCCTTACCAATTGAAGATAGTACCAATATAGCAGAACAAACAAAAGAAAACCTTGTGCAGCCATTGCTTGAGGAAA
GGCATAATGACACAACCACTATGATGGAGGATGAGAATCTTATGCAGCCACAAATTGAGGAAAGATGTACCACCAAAGTTACTCATACAGGATCAATTGTCCATAACAAT
TTGCAAAATTCAACACCACAACAGGAGAATTCTAAATCTCATTCTCTCTCTCAGTTTGATCACTTACCTGACATTAGCTCCAACTTATATATTGACTTGACTTTACCTTC
TGCAGGAAATCAATCGAATGGGTCACAAGGCAGTAATAACACAAGCACACAAAGCAAACATCTAACGAGCCAATCCACACATCCCATGGTCACAAGGCAAAAGCTTCACA
AGGACCCAACTATTGATCCTCATTTGCATCAAGAAATTCAAAGGACGAGAACACCTCACCAACATTCAGCCTACTTAATCCACAAGCAAACGATGGAGCCTAAAGGGTAC
AAAAGTGCTCTACGAATTCCCCATTGGAAAGTTGCCATGGAAGAAGAGATGAATGCACTTCACCAGAACAACACTTGGTCTCTCGTACCAAAACCAGCTAATGTTAATGT
GATTGGGTCGAAATGGATCTTCAAAACAAAATTAAAGGAAGATGGAACAGTTGAGAGATATAAAGCACGTTTGGTTGCACAAGGCTTCACACAAGTTGAAGGTCTAGACT
ATGAAGAAACTTTCAGCCCTGTGGTCAAACCTACCACTATTCGCTTAATAATTGCTGTGGCTATCCATTTCAATTGGTCACTAAAACAATTGGATGTAAAGAACGCCTTT
CTACATGGGATTCTCAAAGAAACAATCTACATGGCTCAACCGCCTGGTTTCCAACACTCACAATTACCAAATCATGTGTGCAAACTCAACAAGTCACTTTATGGTCTAAA
ACAAGCTCCAAGGGCGTGGTTCGAGAGGCTATCACAATACCTACTTCACATTGGGTTCACTTGTTCCCGATCAGACCCTTCTCTTTTTATCTATAAACATCATTCAATCA
TCATTATCATGCTTGTTTATGTTGATGACATCATACTTACTGGAAATGATACTAATGCTCTTAATGATCTCATCCTACATCTAGGTACAGAATTTGCCATCAAAGACCTT
GGACGGCTCCACTACTTCTTAGGCTTAGAGGTGCATCATACACCCTATGGACTTCATTTATCTCAAACCAAATATGCCAAGGATCTCCTAGAAAAGAATAATATGGCAAA
TGCTTCCCATTTTAACACTCCTATGGCGGTGGGACTCTCACAGTTTCCAAATGATCAAGTACTAGCAGATGCTACAACCTATAGAAGAATCGTCGGCTCGTTACAATACC
TTACCTTGACTCGACCCGACATCACTCATGCTGTGAACAAAGTGTGTCAACACCTACAAAATCCAACGATGCAACACATGAGAACTGTGAAACGCATCTTACGATATATC
AAAGGAACCACTGAATATGGTATGACGTTTCACAAAAATAGCTCACTAAATTTATATGGTTTTTGTGATGCAGATTGGGCCGGATGTCCACTTACTCGACGTAGTACCAC
TGGATTTTGTATCTTCCTTGGACCAAATTGCATCTCATGGACCTCTAAAAAGCAACCAACCGTGGCTCGCTCAAGCTCAGAAGCTGAATATAGAGCAATGGCTAGTGCAA
CAGCTGAAATTACATGGATATCCTTCCTTCTCAGAGATCTTGGCATCCCACTTCAACAACCTCCTCAACTCTTCTGTGACAATCTAAGTGCTCTCTACATGTCAGTTAAT
CCAGTTTTCCATGCTCGCACAAAACATATCGAGATGGACTATCATTTTGTTCGAGAAAAAGTGGCATTAGGTTCTCTAATCACAAGGTATATACCGTCTGATCTCCAAGT
CGCTGATGTTCTCACCAAATCACTATCCAAGACTTCCTTTAGAACACTTCGAAGCAAACTTGGAGTTCATGCTCAAGCCCACTCCAACGATGGGTTAAGTAGGGAGGAAG
TTGAAACTCACAATGATAACGATGAGATGTTCGATCTTCTAGCTGATTTGCAAGGGCCAATGGTAGAAGGGGAAGGGGATGTAGATGACGAAGAAGATTTTGGGAATGAA
ATGCCTACTAATATGCCTGGAGAAAATGAGACTTCAAACACTTTCGAGGAATTGATGGTTGAAGCACGTAACCCATTGTACCTTGGTTGTACAAAATATTCTTCATTGAA
TTTCTTGGTGAAATTGATGCATATCAAGGTTCCCAAAAACTGCAGTAACAAAACGTTTGATATGTCATTAGAATTATTGAAAGACGTCTTTCCTTCTGGTACATCTGTAC
CTAGTTCATTTTATGAGGCTAAGAGGAAGCTACGTGACTTAGGTTTGGGATATGAACACATTCATGCTTACCCAAGGAATTTTCAATTAGCATTAGCTTCAGATGGATTC
AATCCATTCGGGAACATGAGTATTGCCTATATTCATTTGCCATATGAAACGAAGGTCGTTGATCTGGTTAACTACAAAACAAGGTTCAATAGCGATGCACGTAATGATGA
TTCACTTCCTGATGGAGCATATTGTGGTGATTTCGAAGTGTTCAAGAAAAGTATTTACTCATTACGTGAAAGAGGAGAAGCATTTGATCATCTTTGCTCGCTAGCACTGG
GGCCTTCTAATCAAGTCCTCTCTTACAATGGTTGCATCGTCAATGGAGTTCGTTTCCATTGTTTAGACCGCGACAATCATCGAGCTACACAGAATAGTAGTATCATGGTG
CCTGGAGAAACTGAAGTGGATGAAATGAATTTCTATGGTGTACTACATGAGGTTTTGGACTTAGAGTATTCTAAGAGAAGACGTGTGGTCATATTTAGATACCAACCTTT
CATTCTCGTTGCTCAAGCGAAACAAGTTTCTTACATCAATGACCACAAACTTGGTAATAGTTGGAAAGTGGTGCAAATTGTCCAAAATAAACAAGTGTGGGACGTTCCAG
AAGCTGAAGACATTGAAAACGACGAAATGGAGTTATTAGAAGTATCAAACGTGATCGAGGTTGATGAGTCCATTCATGAGGCCACACTTTATAGAGATGATGTTGACCCT
ATTTTCGTTCCACCTCAATATTTGAATAAACAGTTGAAGAATAAAAGTGTACCATCATTCAATGACAATTTCATAAATGATAAACCTGAAGATGTCGAAGCATCCACTGA
CAATAGTTCGAACTCATGTGAATATTTAGATTTTGATGATCAGGATAAGACTGAGGAGCCTAAGACTGAAGAGTACAACACACCTACTGGAGAAGCTGAGGAGGACACAT
CAGATGAAGCTGAAAAGCTTGATCCTGAGCTTCTTATTCCTTCTCCCACGGTGTTGGTCCTCAAAGAGAAGAAAAAGAAGAAAAAGAACAAACGGGCTGAATTTGACAAG
TTTATGAAAGCTTTTATGAATCTAAATATCGATATTCCTTTTGCAGAAGCACTAGAGATGCCCTAG
mRNA sequence
ATGGCAAAAACAGAACCAAACTTGACCATCCAATCCTTTCATCAATGCTCCAGCTTGATTTCAATCAAACTCACCTCCTCAAACTACCTACTCTGGAAATCACAGATTCT
ACCATTAGTTAGGAGTTTAGGAATTGAGGAACATTTGAAATCAGAAAATCAGCATGGAAAGTTCATCAAAGAAAAAGATGAAATGGAAGTAGTCAACCCACATTTCACTC
AATGGATCAACAATGATGGATTGTTGACATCTTGGCTACTTGGAACAATCTCTGAAGATATCCTTGCTATGATCGAAAGCACAGACACGGCCAAGAAAATATGGACATCT
CTGGAAGAACAATTACTTACGATGAGTAAGGAGAACGAGATCCACTTGACCGAGACGTTACTCACCCTGAAGAAAGGTAAACTCTCAGTAGATGATTATCTTAAGAAAAT
TAAAGGAATATGTGATCAACTTGCAGCAATGAAGAAACCAGTAGACGACATGACAAAAGTCTTCCATGTTGCTCGAGGACTTGGTACGAGTTATCAAGGCTTCAAAACAG
CCATGCTCTCAAAGGCTCCATATCCTACTTTCAATGAATTTGTACTAGCACTTAAAGCGCATGAGGTAAGAATTAAAACAGATCACGATACGGAAAAATATACTCTACCA
AACCAGGAGCAAGCCTTTTATACGCAAAAGGGAAGATACAGAGGGCGAGGAAGGCACTTTACCTCTAGAGGAAGAGGATTTCCTCAAGGTCGATCAACACACTCTAGCAC
TTCAAACCAATTTTCAAATTTTCATCCAACCAAAACTGGTGCAACACACAACATAGCAAATTATTTAACAGCGCCTACAGATCAAAATCAGAAAAACATTACAGCAAGTC
ATGGCTCAACAGTATCCAAATCAGGTAATTTCATCTCCAATCCTTACAAATCTGACCTGATCATTTGTCAAATATGTGGCAAAGGAAACCATACAGCCTTGGAATGCTGG
AATAGATTTGATCATTCCTACCAATCAGAAGAAATTCCAAAAGCATTAGCAGCCATGAATTTGAATGAAGATGTTGATCCATTGATGTATGCTGATTCCGGAGCCACTTC
TCACATGGTTAATGACCCTGGTAAGATCTCATCCCTACAACTTTACAAGGGACATGATAAAATATTTGTTGGAAATGGAGATCAACTTGAAATATCTCACATTGGACAAG
GAAAAGGGGAGATACTTGCTAAAGGATCTCGAAAAGCTGGATTATATGCACTAGAAGAGGAAAAAGTAGAGAAAAATGAAGTCTATATTGCTGGCCTTCTTAATAATAAA
GCCTCATACTCTATTTGGCATAAGAGGATGGGACATTTGAATGAAAAGTCTTTAAGATCTTTGAAAATTAAAGTCTTTCAGAGTGATGGTGGTGGAGAGTTCACTTCTCT
TGAGCTTAAAGAACTACTTGAACAAAGCGGAATTGTGCATCAACTTTCTTGTCCACACACTCCTCAACAAAATGGAGTAGTTGAAAGGAAGCATCGCCATCTTATTGAAA
CAGGACTTGAGGAGTGGACCAATCATATGAAATTAACACAATCTAGACACCAAGTGACAGATGACATCAATATAAATAACACACGTCTGGGGAGATCACAAACTTCCTCC
TATGACTTGGATGAAAATCCAACAACAGAACACAACGTTGATCCTCCTAATCAAGAAATTCAATGTCAAAATGGAGTACAAAATGATTCTCCAACCAATAAAGCAGCACA
GATGGATGAGCAATCAAGAGAGCAAGAGGAAAATTCCTTACCAATTGAAGATAGTACCAATATAGCAGAACAAACAAAAGAAAACCTTGTGCAGCCATTGCTTGAGGAAA
GGCATAATGACACAACCACTATGATGGAGGATGAGAATCTTATGCAGCCACAAATTGAGGAAAGATGTACCACCAAAGTTACTCATACAGGATCAATTGTCCATAACAAT
TTGCAAAATTCAACACCACAACAGGAGAATTCTAAATCTCATTCTCTCTCTCAGTTTGATCACTTACCTGACATTAGCTCCAACTTATATATTGACTTGACTTTACCTTC
TGCAGGAAATCAATCGAATGGGTCACAAGGCAGTAATAACACAAGCACACAAAGCAAACATCTAACGAGCCAATCCACACATCCCATGGTCACAAGGCAAAAGCTTCACA
AGGACCCAACTATTGATCCTCATTTGCATCAAGAAATTCAAAGGACGAGAACACCTCACCAACATTCAGCCTACTTAATCCACAAGCAAACGATGGAGCCTAAAGGGTAC
AAAAGTGCTCTACGAATTCCCCATTGGAAAGTTGCCATGGAAGAAGAGATGAATGCACTTCACCAGAACAACACTTGGTCTCTCGTACCAAAACCAGCTAATGTTAATGT
GATTGGGTCGAAATGGATCTTCAAAACAAAATTAAAGGAAGATGGAACAGTTGAGAGATATAAAGCACGTTTGGTTGCACAAGGCTTCACACAAGTTGAAGGTCTAGACT
ATGAAGAAACTTTCAGCCCTGTGGTCAAACCTACCACTATTCGCTTAATAATTGCTGTGGCTATCCATTTCAATTGGTCACTAAAACAATTGGATGTAAAGAACGCCTTT
CTACATGGGATTCTCAAAGAAACAATCTACATGGCTCAACCGCCTGGTTTCCAACACTCACAATTACCAAATCATGTGTGCAAACTCAACAAGTCACTTTATGGTCTAAA
ACAAGCTCCAAGGGCGTGGTTCGAGAGGCTATCACAATACCTACTTCACATTGGGTTCACTTGTTCCCGATCAGACCCTTCTCTTTTTATCTATAAACATCATTCAATCA
TCATTATCATGCTTGTTTATGTTGATGACATCATACTTACTGGAAATGATACTAATGCTCTTAATGATCTCATCCTACATCTAGGTACAGAATTTGCCATCAAAGACCTT
GGACGGCTCCACTACTTCTTAGGCTTAGAGGTGCATCATACACCCTATGGACTTCATTTATCTCAAACCAAATATGCCAAGGATCTCCTAGAAAAGAATAATATGGCAAA
TGCTTCCCATTTTAACACTCCTATGGCGGTGGGACTCTCACAGTTTCCAAATGATCAAGTACTAGCAGATGCTACAACCTATAGAAGAATCGTCGGCTCGTTACAATACC
TTACCTTGACTCGACCCGACATCACTCATGCTGTGAACAAAGTGTGTCAACACCTACAAAATCCAACGATGCAACACATGAGAACTGTGAAACGCATCTTACGATATATC
AAAGGAACCACTGAATATGGTATGACGTTTCACAAAAATAGCTCACTAAATTTATATGGTTTTTGTGATGCAGATTGGGCCGGATGTCCACTTACTCGACGTAGTACCAC
TGGATTTTGTATCTTCCTTGGACCAAATTGCATCTCATGGACCTCTAAAAAGCAACCAACCGTGGCTCGCTCAAGCTCAGAAGCTGAATATAGAGCAATGGCTAGTGCAA
CAGCTGAAATTACATGGATATCCTTCCTTCTCAGAGATCTTGGCATCCCACTTCAACAACCTCCTCAACTCTTCTGTGACAATCTAAGTGCTCTCTACATGTCAGTTAAT
CCAGTTTTCCATGCTCGCACAAAACATATCGAGATGGACTATCATTTTGTTCGAGAAAAAGTGGCATTAGGTTCTCTAATCACAAGGTATATACCGTCTGATCTCCAAGT
CGCTGATGTTCTCACCAAATCACTATCCAAGACTTCCTTTAGAACACTTCGAAGCAAACTTGGAGTTCATGCTCAAGCCCACTCCAACGATGGGTTAAGTAGGGAGGAAG
TTGAAACTCACAATGATAACGATGAGATGTTCGATCTTCTAGCTGATTTGCAAGGGCCAATGGTAGAAGGGGAAGGGGATGTAGATGACGAAGAAGATTTTGGGAATGAA
ATGCCTACTAATATGCCTGGAGAAAATGAGACTTCAAACACTTTCGAGGAATTGATGGTTGAAGCACGTAACCCATTGTACCTTGGTTGTACAAAATATTCTTCATTGAA
TTTCTTGGTGAAATTGATGCATATCAAGGTTCCCAAAAACTGCAGTAACAAAACGTTTGATATGTCATTAGAATTATTGAAAGACGTCTTTCCTTCTGGTACATCTGTAC
CTAGTTCATTTTATGAGGCTAAGAGGAAGCTACGTGACTTAGGTTTGGGATATGAACACATTCATGCTTACCCAAGGAATTTTCAATTAGCATTAGCTTCAGATGGATTC
AATCCATTCGGGAACATGAGTATTGCCTATATTCATTTGCCATATGAAACGAAGGTCGTTGATCTGGTTAACTACAAAACAAGGTTCAATAGCGATGCACGTAATGATGA
TTCACTTCCTGATGGAGCATATTGTGGTGATTTCGAAGTGTTCAAGAAAAGTATTTACTCATTACGTGAAAGAGGAGAAGCATTTGATCATCTTTGCTCGCTAGCACTGG
GGCCTTCTAATCAAGTCCTCTCTTACAATGGTTGCATCGTCAATGGAGTTCGTTTCCATTGTTTAGACCGCGACAATCATCGAGCTACACAGAATAGTAGTATCATGGTG
CCTGGAGAAACTGAAGTGGATGAAATGAATTTCTATGGTGTACTACATGAGGTTTTGGACTTAGAGTATTCTAAGAGAAGACGTGTGGTCATATTTAGATACCAACCTTT
CATTCTCGTTGCTCAAGCGAAACAAGTTTCTTACATCAATGACCACAAACTTGGTAATAGTTGGAAAGTGGTGCAAATTGTCCAAAATAAACAAGTGTGGGACGTTCCAG
AAGCTGAAGACATTGAAAACGACGAAATGGAGTTATTAGAAGTATCAAACGTGATCGAGGTTGATGAGTCCATTCATGAGGCCACACTTTATAGAGATGATGTTGACCCT
ATTTTCGTTCCACCTCAATATTTGAATAAACAGTTGAAGAATAAAAGTGTACCATCATTCAATGACAATTTCATAAATGATAAACCTGAAGATGTCGAAGCATCCACTGA
CAATAGTTCGAACTCATGTGAATATTTAGATTTTGATGATCAGGATAAGACTGAGGAGCCTAAGACTGAAGAGTACAACACACCTACTGGAGAAGCTGAGGAGGACACAT
CAGATGAAGCTGAAAAGCTTGATCCTGAGCTTCTTATTCCTTCTCCCACGGTGTTGGTCCTCAAAGAGAAGAAAAAGAAGAAAAAGAACAAACGGGCTGAATTTGACAAG
TTTATGAAAGCTTTTATGAATCTAAATATCGATATTCCTTTTGCAGAAGCACTAGAGATGCCCTAG
Protein sequence
MAKTEPNLTIQSFHQCSSLISIKLTSSNYLLWKSQILPLVRSLGIEEHLKSENQHGKFIKEKDEMEVVNPHFTQWINNDGLLTSWLLGTISEDILAMIESTDTAKKIWTS
LEEQLLTMSKENEIHLTETLLTLKKGKLSVDDYLKKIKGICDQLAAMKKPVDDMTKVFHVARGLGTSYQGFKTAMLSKAPYPTFNEFVLALKAHEVRIKTDHDTEKYTLP
NQEQAFYTQKGRYRGRGRHFTSRGRGFPQGRSTHSSTSNQFSNFHPTKTGATHNIANYLTAPTDQNQKNITASHGSTVSKSGNFISNPYKSDLIICQICGKGNHTALECW
NRFDHSYQSEEIPKALAAMNLNEDVDPLMYADSGATSHMVNDPGKISSLQLYKGHDKIFVGNGDQLEISHIGQGKGEILAKGSRKAGLYALEEEKVEKNEVYIAGLLNNK
ASYSIWHKRMGHLNEKSLRSLKIKVFQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVERKHRHLIETGLEEWTNHMKLTQSRHQVTDDININNTRLGRSQTSS
YDLDENPTTEHNVDPPNQEIQCQNGVQNDSPTNKAAQMDEQSREQEENSLPIEDSTNIAEQTKENLVQPLLEERHNDTTTMMEDENLMQPQIEERCTTKVTHTGSIVHNN
LQNSTPQQENSKSHSLSQFDHLPDISSNLYIDLTLPSAGNQSNGSQGSNNTSTQSKHLTSQSTHPMVTRQKLHKDPTIDPHLHQEIQRTRTPHQHSAYLIHKQTMEPKGY
KSALRIPHWKVAMEEEMNALHQNNTWSLVPKPANVNVIGSKWIFKTKLKEDGTVERYKARLVAQGFTQVEGLDYEETFSPVVKPTTIRLIIAVAIHFNWSLKQLDVKNAF
LHGILKETIYMAQPPGFQHSQLPNHVCKLNKSLYGLKQAPRAWFERLSQYLLHIGFTCSRSDPSLFIYKHHSIIIIMLVYVDDIILTGNDTNALNDLILHLGTEFAIKDL
GRLHYFLGLEVHHTPYGLHLSQTKYAKDLLEKNNMANASHFNTPMAVGLSQFPNDQVLADATTYRRIVGSLQYLTLTRPDITHAVNKVCQHLQNPTMQHMRTVKRILRYI
KGTTEYGMTFHKNSSLNLYGFCDADWAGCPLTRRSTTGFCIFLGPNCISWTSKKQPTVARSSSEAEYRAMASATAEITWISFLLRDLGIPLQQPPQLFCDNLSALYMSVN
PVFHARTKHIEMDYHFVREKVALGSLITRYIPSDLQVADVLTKSLSKTSFRTLRSKLGVHAQAHSNDGLSREEVETHNDNDEMFDLLADLQGPMVEGEGDVDDEEDFGNE
MPTNMPGENETSNTFEELMVEARNPLYLGCTKYSSLNFLVKLMHIKVPKNCSNKTFDMSLELLKDVFPSGTSVPSSFYEAKRKLRDLGLGYEHIHAYPRNFQLALASDGF
NPFGNMSIAYIHLPYETKVVDLVNYKTRFNSDARNDDSLPDGAYCGDFEVFKKSIYSLRERGEAFDHLCSLALGPSNQVLSYNGCIVNGVRFHCLDRDNHRATQNSSIMV
PGETEVDEMNFYGVLHEVLDLEYSKRRRVVIFRYQPFILVAQAKQVSYINDHKLGNSWKVVQIVQNKQVWDVPEAEDIENDEMELLEVSNVIEVDESIHEATLYRDDVDP
IFVPPQYLNKQLKNKSVPSFNDNFINDKPEDVEASTDNSSNSCEYLDFDDQDKTEEPKTEEYNTPTGEAEEDTSDEAEKLDPELLIPSPTVLVLKEKKKKKKNKRAEFDK
FMKAFMNLNIDIPFAEALEMP