; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014937 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014937
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr07:8570806..8577018
RNA-Seq ExpressionPay0014937
SyntenyPay0014937
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74695.1 hypothetical protein VITISV_024648 [Vitis vinifera]1.6e-19937.18Show/hide
Query:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN
        QILT L  + L++FL   S  PS++L+S                   +LI S LL S+++ +L +M++C ++ ++W TL+  F ++  A+  QFK +LHN
Subjt:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN

Query:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN
         KK  +S+ +Y LKI+  VD LA +   +S  DHI  I   L  DY++ I  +++R D   V+E                                N   
Subjt:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN

Query:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------
        + H N+  + R                       +G GR  RG     NKPQCQ+C ++GH                                       
Subjt:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------

Query:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK
           + S    +  +  DNNWY DSGAT+HLT +L+NL T+ ++   ++++  NG  LPI H G  SF+SS +P K+  L  LLHVP ITKNL+SVS+FA 
Subjt:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK

Query:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI
        DNHVFFEFHPT C++K+  T  V + G L   LY F        LH+S              VP S      LWH RLGHP    + +VL   +      
Subjt:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI

Query:  DKMIFCEAYALGNHHALPFSH-------PSLIIHI----------------------IYNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDG
           + C    +G  H   F H       P  +IH                        Y+    IY + H    F  F  FK+ VE  LG  IK+ Q+D 
Subjt:  DKMIFCEAYALGNHHALPFSH-------PSLIIHI----------------------IYNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDG

Query:  GTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGC
        G +++ F  +L  +GI HRI+ PYT +QN + ERKHRHI +  + LL+QA+LP   WD+AF TSVYLINRLPTPV+   SPL+ LF  KP++  L+ FGC
Subjt:  GTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGC

Query:  KCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASF-----------SSHSSIPKSKNV------------LSPPL
         CY                 C  L YS +HKGYKCL+ +G + ISR V+FDE +F +A             SS +S+P   ++             S P 
Subjt:  KCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASF-----------SSHSSIPKSKNV------------LSPPL

Query:  HSIIQSSLMNHNEDRRHTDTVSD--NTDHL------------------NPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIK
        +  I  +  NHN   +   + +    + H+                   PT V             T+E+ AL +N+TW L P   ++K++GCKWVFK+K
Subjt:  HSIIQSSLMNHNEDRRHTDTVSD--NTDHL------------------NPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIK

Query:  RNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLK
         N  G I++YKARLVAKGFHQ    D+NETFSPVVK  TI ++LTIA+   W +RQLDVNNAFL+G+L E+++M Q  GF    +   VC L K++YGLK
Subjt:  RNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLK

Query:  QAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYI
        QAPRAW+E L   L  LGF ++K+D SL I +T T   Y L+YVDD++I G++ + V  ++  LN+QFALKDLG + YFLG++V + T+ G+ LSQ+KYI
Subjt:  QAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYI

Query:  TDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDN
        ++LLQ+TKML  K +  PMVS   LS     PF +  L RS VGALQYAT+T P+I+YS+                 VKRILRYL G L+HGL L  + N
Subjt:  TDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDN

Query:  TSLMI
        + L I
Subjt:  TSLMI

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0069.53Show/hide
Query:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS
        MSS SSLLGVENTEASSPINQIFGSGNKI L            FQILTALEAYDLENFL+SESEPPSKYL S        TGT            RLISS
Subjt:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIW TLQGIF SRYLAQAMQFKNKLHNIKK SM LKEYFLKI QCVDALASINKPVSSDDHILYILA LGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV

Query:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--
        ISARTDSP VQE M+                                       NQNNYHNNHSYNQRG RGNGRSNRG RGNRNKPQCQIC KLG+S  
Subjt:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--

Query:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL
                                        MSAMVA+ +LNID+NWY DSGATNHLTHSLSNLS   EY GGNQIYAANGS LPI H+GSMSFNSSTL
Subjt:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL

Query:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH
        PFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEFHPTLCY+K++DT QV LQGLLND LYKFTI+PSHKRLHHS SNTK VFNTVVPKSN PLLDLWH
Subjt:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH

Query:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF
        +RLGHPHLP +K VL HID SSGTI+K+ FCEA ALG HHALPFSH  +L  H +  +   ++G    V H                            F
Subjt:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF

Query:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV
        LAFQKFKTCVEKSLGQSIKS QTDGGT+FKPFKPFLDQHGIEHRIT PYTSKQNDIVERKHR+I +M LTLLSQATLPLS WD+AFSTSVYLINRLPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV

Query:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP
        +D ISPL+KLF  KPNFPSLR FGCKCY                 C  L YSTSHKGYKCLASDGRLFISR VLFDENSF YASF+SHSS PKSK+VLSP
Subjt:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP

Query:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN
        PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLET   E+ + +       Q+P+
Subjt:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN

KYP46257.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.3e-18737.89Show/hide
Query:  RLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQ
        +L+ SWL  SMS+++L +++ CKS+ ++W  +   F S   A+A Q +N+L +   +++S+ EY L+IQ  VDAL +I   VS  +H+  IL  L  +Y+
Subjt:  RLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQ

Query:  SMISVISARTDSPFVQE-----------------------------------AMNNQNN--YHNNHS--YNQRGDRGNGRSNR------GGRGNRNKPQC
        S +S+IS+R D   + E                                   A N Q +  + NN S   ++RG R N R  R       GRG     QC
Subjt:  SMISVISARTDSPFVQE-----------------------------------AMNNQNN--YHNNHS--YNQRGDRGNGRSNR------GGRGNRNKPQC

Query:  QICTKLGHSMSAMV----------------ASPNLN-------IDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL
        Q+C + GH  SA                  A P+ N        +NNWY DSGA+NH+T+   N+     + G +QI+  NG  L I   G  +F+S   
Subjt:  QICTKLGHSMSAMV----------------ASPNLN-------IDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL

Query:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLL-NDSLYKF-TIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDL
        P   F LNNLL VPSITKNLISVSQF+KDN V+FEFHP +C +K+ DT++V LQG + +D LYKF  + P+       +     +F+     ++      
Subjt:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLL-NDSLYKF-TIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDL

Query:  WHKRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHPSLIIHIIYNLL------------------LVIYGVLH------------LM
        WH RLGHP++  +K V ++ +    + +K  FC    LG  H LP +    I +  ++L+                   V +   H              
Subjt:  WHKRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHPSLIIHIIYNLL------------------LVIYGVLH------------LM

Query:  PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPT
         F  F++F   V+      IK+ QTD G +++ F  +L++ GI+HR+  P+T  QN + ERKHRHI ++ LTL++QA LP+  WD +F T+VYLINRLP+
Subjt:  PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPT

Query:  PVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHS---------
          I    P  KLF   P++ SLR FGC C+                 C  L YSTSHKGYKCLA+DGRL+IS+ V+F+E  F Y    S S         
Subjt:  PVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHS---------

Query:  SIPKSKNVLSPPLH--------------SIIQSSLMNHNEDRRHTDTVSDNTDH---------LNPTIVYPLETE-------------------------
        S+P + N    P H              S+I  S   +        T+S  T           ++P  V+P+ T                          
Subjt:  SIPKSKNVLSPPLH--------------SIIQSSLMNHNEDRRHTDTVSDNTDH---------LNPTIVYPLETE-------------------------

Query:  --------------EFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSI
                      E+ AL  N+TW L P  P++  +GCKWVF++K N  G + +YKARLVAKGF+Q    DY+ETFSPV+K VT+ ++LT+A+   W I
Subjt:  --------------EFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSI

Query:  RQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSK
        +QLDVNNAFL+G L+E VYM+Q  GFE  S   +VC L KAIYGLKQAPRAW++ L S L  L F  SK D SL I     +  Y L+YVDD+II G++ 
Subjt:  RQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSK

Query:  KDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHP
          +++LV  L+S F+LKDLG L +FLG+EV    +G L L+QSKYI DLL RT M  +KPIS PM+SG  LS    E F D  L RSVVGALQYAT+T P
Subjt:  KDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHP

Query:  EISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTS
        EIS+S+                 VKRILRYLKG    GL L  + ++S
Subjt:  EISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTS

RVX03305.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.8e-20338.59Show/hide
Query:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN
        QILT L  + L++FL   S  PS++L+S                   +LI SWLL S+++ +L +M++C ++ ++W TL+  F ++  A+  QFK +LHN
Subjt:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN

Query:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN
         KK  +S+ +Y LKI+  VD LA +   +S  DHI  I   L  DY++ I  +++R D   V+E                                N   
Subjt:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN

Query:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------
        + H N+  + R                       +G GR  RG     NKPQCQ+C ++GH                                       
Subjt:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------

Query:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK
           ++S    +  +  DNNWY DSGAT+HLT +L+NL T+ ++   ++++  NG  LPI H G  SF+SS +P K+  L  LLHVP ITKNL+SVS+FA 
Subjt:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK

Query:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI
        DNHVFFEFHPT C++K++ T  V +   L   LY F        LH+S              VP S      LWH RLGHP    + +VL          
Subjt:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI

Query:  DKMIFCEAYALGNHHALPFSH-PSLIIHII--YNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTS
                    N  + P SH     IH I  Y+    IY + H    F  F  FK+ VE  LG  IK+ Q+D G +++ F  +L  +GI HRI+ PYT 
Subjt:  DKMIFCEAYALGNHHALPFSH-PSLIIHII--YNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTS

Query:  KQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRY
        +QN + ERKHRHI +  + LL+QA+LP   WD+AF TSVYLINRLPTPV+   SPL+ LF  KP++  L+ FGC CY                 C  L Y
Subjt:  KQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRY

Query:  STSHKGYKCLASDGRLFISRRVLFDENSFSYA----------SFSSHSS------------IPKSKNV-LSPPLHSIIQSSLMNHNE------------D
        S +HKGYKCL+ +G + ISR V+FDE++F +A          SFSS S+            +P S +   S P +  I  +  NHN              
Subjt:  STSHKGYKCLASDGRLFISRRVLFDENSFSYA----------SFSSHSS------------IPKSKNV-LSPPLHSIIQSSLMNHNE------------D

Query:  RRHTDTVSDN--------TDHLNPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNID
          H  T S N             PT V           + T+E+ AL +N+TW L P   ++K++GCKWVFK+K N  G I++YKARLVAKGFHQ    D
Subjt:  RRHTDTVSDN--------TDHLNPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNID

Query:  YNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADT
        +NETFSPVVK  TI ++LTIA+   W +RQLDVNNAFL+G+L E+++M Q  GF    +   VC L K++YGLKQAPRAW+E L   L  LGF ++K+D 
Subjt:  YNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADT

Query:  SLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLS
        SL I +T T   Y L+YVDD++I G++ + V  ++  LN+QFALKDLG + YFLG++V + T+ G+ LSQ+KYI++LLQ+TKML  KP+  PMVS   LS
Subjt:  SLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLS

Query:  AFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTSLMI
             PF +  L RS VGALQYAT+T P+I+YS+                 VKRILRYL G L+HGL L  S N+ L I
Subjt:  AFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTSLMI

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0069.65Show/hide
Query:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS
        MSS SSLLGVENTEASSPINQIFGSGNKI L            FQILTALEAYDLENFL+SESEPPSKYL S        TGT            RLISS
Subjt:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIW TLQGIF SRYLAQAMQFKNKLHNIKK SM LKEYFLKI QCVDALASINKPVSSDDHILYILA LGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV

Query:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--
        ISARTDSP VQE M+                                       NQNNYHNNHSYNQRG RGNGRSNRG RGNRNKPQCQIC KLG+S  
Subjt:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--

Query:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL
                                        MSAMVA+ +LNID+NWY DSGATNHLTHSLSNLS   EY GGNQIYAANGS LPI H+GSMSFNSSTL
Subjt:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL

Query:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH
        PFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEFHPTLCY+K++DT QV LQGLLND LYKFTI+PSHKRLHHS SNTK VFNTVVPKSN PLLDLWH
Subjt:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH

Query:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF
        +RLGHPHLP +K VL HID SSGTI+K+ FCEA ALG HHALPFSH  +L  H +  +   ++G    V H                            F
Subjt:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF

Query:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV
        LAFQKFKTCVEKSLGQSIKS QTDGGT+FKPFKPFLDQHGIEHRIT PYTSKQNDIVERKHR+I +M LTLLSQATLPLS WD+AFSTSVYLINRLPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV

Query:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP
        +D ISPL+KLF  KPNFPSLR FGCKCY                 C  L YSTSHKGYKCLASDGRLFISR VLFDENSF YASF+SHSSIPKSK+VLSP
Subjt:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP

Query:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN
        PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLET   E+ + +       Q+P+
Subjt:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN

TrEMBL top hitse value%identityAlignment
A0A151RUP0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-18737.89Show/hide
Query:  RLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQ
        +L+ SWL  SMS+++L +++ CKS+ ++W  +   F S   A+A Q +N+L +   +++S+ EY L+IQ  VDAL +I   VS  +H+  IL  L  +Y+
Subjt:  RLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQ

Query:  SMISVISARTDSPFVQE-----------------------------------AMNNQNN--YHNNHS--YNQRGDRGNGRSNR------GGRGNRNKPQC
        S +S+IS+R D   + E                                   A N Q +  + NN S   ++RG R N R  R       GRG     QC
Subjt:  SMISVISARTDSPFVQE-----------------------------------AMNNQNN--YHNNHS--YNQRGDRGNGRSNR------GGRGNRNKPQC

Query:  QICTKLGHSMSAMV----------------ASPNLN-------IDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL
        Q+C + GH  SA                  A P+ N        +NNWY DSGA+NH+T+   N+     + G +QI+  NG  L I   G  +F+S   
Subjt:  QICTKLGHSMSAMV----------------ASPNLN-------IDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL

Query:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLL-NDSLYKF-TIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDL
        P   F LNNLL VPSITKNLISVSQF+KDN V+FEFHP +C +K+ DT++V LQG + +D LYKF  + P+       +     +F+     ++      
Subjt:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLL-NDSLYKF-TIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDL

Query:  WHKRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHPSLIIHIIYNLL------------------LVIYGVLH------------LM
        WH RLGHP++  +K V ++ +    + +K  FC    LG  H LP +    I +  ++L+                   V +   H              
Subjt:  WHKRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHPSLIIHIIYNLL------------------LVIYGVLH------------LM

Query:  PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPT
         F  F++F   V+      IK+ QTD G +++ F  +L++ GI+HR+  P+T  QN + ERKHRHI ++ LTL++QA LP+  WD +F T+VYLINRLP+
Subjt:  PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPT

Query:  PVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHS---------
          I    P  KLF   P++ SLR FGC C+                 C  L YSTSHKGYKCLA+DGRL+IS+ V+F+E  F Y    S S         
Subjt:  PVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHS---------

Query:  SIPKSKNVLSPPLH--------------SIIQSSLMNHNEDRRHTDTVSDNTDH---------LNPTIVYPLETE-------------------------
        S+P + N    P H              S+I  S   +        T+S  T           ++P  V+P+ T                          
Subjt:  SIPKSKNVLSPPLH--------------SIIQSSLMNHNEDRRHTDTVSDNTDH---------LNPTIVYPLETE-------------------------

Query:  --------------EFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSI
                      E+ AL  N+TW L P  P++  +GCKWVF++K N  G + +YKARLVAKGF+Q    DY+ETFSPV+K VT+ ++LT+A+   W I
Subjt:  --------------EFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSI

Query:  RQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSK
        +QLDVNNAFL+G L+E VYM+Q  GFE  S   +VC L KAIYGLKQAPRAW++ L S L  L F  SK D SL I     +  Y L+YVDD+II G++ 
Subjt:  RQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSK

Query:  KDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHP
          +++LV  L+S F+LKDLG L +FLG+EV    +G L L+QSKYI DLL RT M  +KPIS PM+SG  LS    E F D  L RSVVGALQYAT+T P
Subjt:  KDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHP

Query:  EISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTS
        EIS+S+                 VKRILRYLKG    GL L  + ++S
Subjt:  EISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTS

A0A438J300 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-20338.59Show/hide
Query:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN
        QILT L  + L++FL   S  PS++L+S                   +LI SWLL S+++ +L +M++C ++ ++W TL+  F ++  A+  QFK +LHN
Subjt:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN

Query:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN
         KK  +S+ +Y LKI+  VD LA +   +S  DHI  I   L  DY++ I  +++R D   V+E                                N   
Subjt:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN

Query:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------
        + H N+  + R                       +G GR  RG     NKPQCQ+C ++GH                                       
Subjt:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------

Query:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK
           ++S    +  +  DNNWY DSGAT+HLT +L+NL T+ ++   ++++  NG  LPI H G  SF+SS +P K+  L  LLHVP ITKNL+SVS+FA 
Subjt:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK

Query:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI
        DNHVFFEFHPT C++K++ T  V +   L   LY F        LH+S              VP S      LWH RLGHP    + +VL          
Subjt:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI

Query:  DKMIFCEAYALGNHHALPFSH-PSLIIHII--YNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTS
                    N  + P SH     IH I  Y+    IY + H    F  F  FK+ VE  LG  IK+ Q+D G +++ F  +L  +GI HRI+ PYT 
Subjt:  DKMIFCEAYALGNHHALPFSH-PSLIIHII--YNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTS

Query:  KQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRY
        +QN + ERKHRHI +  + LL+QA+LP   WD+AF TSVYLINRLPTPV+   SPL+ LF  KP++  L+ FGC CY                 C  L Y
Subjt:  KQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRY

Query:  STSHKGYKCLASDGRLFISRRVLFDENSFSYA----------SFSSHSS------------IPKSKNV-LSPPLHSIIQSSLMNHNE------------D
        S +HKGYKCL+ +G + ISR V+FDE++F +A          SFSS S+            +P S +   S P +  I  +  NHN              
Subjt:  STSHKGYKCLASDGRLFISRRVLFDENSFSYA----------SFSSHSS------------IPKSKNV-LSPPLHSIIQSSLMNHNE------------D

Query:  RRHTDTVSDN--------TDHLNPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNID
          H  T S N             PT V           + T+E+ AL +N+TW L P   ++K++GCKWVFK+K N  G I++YKARLVAKGFHQ    D
Subjt:  RRHTDTVSDN--------TDHLNPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNID

Query:  YNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADT
        +NETFSPVVK  TI ++LTIA+   W +RQLDVNNAFL+G+L E+++M Q  GF    +   VC L K++YGLKQAPRAW+E L   L  LGF ++K+D 
Subjt:  YNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADT

Query:  SLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLS
        SL I +T T   Y L+YVDD++I G++ + V  ++  LN+QFALKDLG + YFLG++V + T+ G+ LSQ+KYI++LLQ+TKML  KP+  PMVS   LS
Subjt:  SLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLS

Query:  AFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTSLMI
             PF +  L RS VGALQYAT+T P+I+YS+                 VKRILRYL G L+HGL L  S N+ L I
Subjt:  AFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDNTSLMI

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0069.53Show/hide
Query:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS
        MSS SSLLGVENTEASSPINQIFGSGNKI L            FQILTALEAYDLENFL+SESEPPSKYL S        TGT            RLISS
Subjt:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIW TLQGIF SRYLAQAMQFKNKLHNIKK SM LKEYFLKI QCVDALASINKPVSSDDHILYILA LGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV

Query:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--
        ISARTDSP VQE M+                                       NQNNYHNNHSYNQRG RGNGRSNRG RGNRNKPQCQIC KLG+S  
Subjt:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--

Query:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL
                                        MSAMVA+ +LNID+NWY DSGATNHLTHSLSNLS   EY GGNQIYAANGS LPI H+GSMSFNSSTL
Subjt:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL

Query:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH
        PFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEFHPTLCY+K++DT QV LQGLLND LYKFTI+PSHKRLHHS SNTK VFNTVVPKSN PLLDLWH
Subjt:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH

Query:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF
        +RLGHPHLP +K VL HID SSGTI+K+ FCEA ALG HHALPFSH  +L  H +  +   ++G    V H                            F
Subjt:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF

Query:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV
        LAFQKFKTCVEKSLGQSIKS QTDGGT+FKPFKPFLDQHGIEHRIT PYTSKQNDIVERKHR+I +M LTLLSQATLPLS WD+AFSTSVYLINRLPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV

Query:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP
        +D ISPL+KLF  KPNFPSLR FGCKCY                 C  L YSTSHKGYKCLASDGRLFISR VLFDENSF YASF+SHSS PKSK+VLSP
Subjt:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP

Query:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN
        PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLET   E+ + +       Q+P+
Subjt:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0069.65Show/hide
Query:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS
        MSS SSLLGVENTEASSPINQIFGSGNKI L            FQILTALEAYDLENFL+SESEPPSKYL S        TGT            RLISS
Subjt:  MSSNSSLLGVENTEASSPINQIFGSGNKIYL------------FQILTALEAYDLENFLKSESEPPSKYLTS--------TGT------------RLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIW TLQGIF SRYLAQAMQFKNKLHNIKK SM LKEYFLKI QCVDALASINKPVSSDDHILYILA LGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISV

Query:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--
        ISARTDSP VQE M+                                       NQNNYHNNHSYNQRG RGNGRSNRG RGNRNKPQCQIC KLG+S  
Subjt:  ISARTDSPFVQEAMN---------------------------------------NQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQICTKLGHS--

Query:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL
                                        MSAMVA+ +LNID+NWY DSGATNHLTHSLSNLS   EY GGNQIYAANGS LPI H+GSMSFNSSTL
Subjt:  --------------------------------MSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTL

Query:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH
        PFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEFHPTLCY+K++DT QV LQGLLND LYKFTI+PSHKRLHHS SNTK VFNTVVPKSN PLLDLWH
Subjt:  PFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWH

Query:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF
        +RLGHPHLP +K VL HID SSGTI+K+ FCEA ALG HHALPFSH  +L  H +  +   ++G    V H                            F
Subjt:  KRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFSHP-SLIIHIIYNLLLVIYG----VLH-------------------------LMPF

Query:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV
        LAFQKFKTCVEKSLGQSIKS QTDGGT+FKPFKPFLDQHGIEHRIT PYTSKQNDIVERKHR+I +M LTLLSQATLPLS WD+AFSTSVYLINRLPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPV

Query:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP
        +D ISPL+KLF  KPNFPSLR FGCKCY                 C  L YSTSHKGYKCLASDGRLFISR VLFDENSF YASF+SHSSIPKSK+VLSP
Subjt:  IDTISPLDKLFGWKPNFPSLRAFGCKCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSP

Query:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN
        PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLET   E+ + +       Q+P+
Subjt:  PLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPN

A5BK17 Integrase catalytic domain-containing protein7.6e-20037.18Show/hide
Query:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN
        QILT L  + L++FL   S  PS++L+S                   +LI S LL S+++ +L +M++C ++ ++W TL+  F ++  A+  QFK +LHN
Subjt:  QILTALEAYDLENFLKSESEPPSKYLTSTG----------------TRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQFKNKLHN

Query:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN
         KK  +S+ +Y LKI+  VD LA +   +S  DHI  I   L  DY++ I  +++R D   V+E                                N   
Subjt:  IKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE------------------------------AMNNQN

Query:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------
        + H N+  + R                       +G GR  RG     NKPQCQ+C ++GH                                       
Subjt:  NYHNNHSYNQRG---------------------DRGNGRSNRGGRGNRNKPQCQICTKLGH---------------------------------------

Query:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK
           + S    +  +  DNNWY DSGAT+HLT +L+NL T+ ++   ++++  NG  LPI H G  SF+SS +P K+  L  LLHVP ITKNL+SVS+FA 
Subjt:  ---SMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAK

Query:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI
        DNHVFFEFHPT C++K+  T  V + G L   LY F        LH+S              VP S      LWH RLGHP    + +VL   +      
Subjt:  DNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVF---NTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTI

Query:  DKMIFCEAYALGNHHALPFSH-------PSLIIHI----------------------IYNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDG
           + C    +G  H   F H       P  +IH                        Y+    IY + H    F  F  FK+ VE  LG  IK+ Q+D 
Subjt:  DKMIFCEAYALGNHHALPFSH-------PSLIIHI----------------------IYNLLLVIYGVLHLM-PFLAFQKFKTCVEKSLGQSIKSHQTDG

Query:  GTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGC
        G +++ F  +L  +GI HRI+ PYT +QN + ERKHRHI +  + LL+QA+LP   WD+AF TSVYLINRLPTPV+   SPL+ LF  KP++  L+ FGC
Subjt:  GTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGC

Query:  KCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASF-----------SSHSSIPKSKNV------------LSPPL
         CY                 C  L YS +HKGYKCL+ +G + ISR V+FDE +F +A             SS +S+P   ++             S P 
Subjt:  KCY-----------------CNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASF-----------SSHSSIPKSKNV------------LSPPL

Query:  HSIIQSSLMNHNEDRRHTDTVSD--NTDHL------------------NPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIK
        +  I  +  NHN   +   + +    + H+                   PT V             T+E+ AL +N+TW L P   ++K++GCKWVFK+K
Subjt:  HSIIQSSLMNHNEDRRHTDTVSD--NTDHL------------------NPTIV---------YPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIK

Query:  RNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLK
         N  G I++YKARLVAKGFHQ    D+NETFSPVVK  TI ++LTIA+   W +RQLDVNNAFL+G+L E+++M Q  GF    +   VC L K++YGLK
Subjt:  RNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLK

Query:  QAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYI
        QAPRAW+E L   L  LGF ++K+D SL I +T T   Y L+YVDD++I G++ + V  ++  LN+QFALKDLG + YFLG++V + T+ G+ LSQ+KYI
Subjt:  QAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYI

Query:  TDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDN
        ++LLQ+TKML  K +  PMVS   LS     PF +  L RS VGALQYAT+T P+I+YS+                 VKRILRYL G L+HGL L  + N
Subjt:  TDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSDN

Query:  TSLMI
        + L I
Subjt:  TSLMI

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-4433.05Show/hide
Query:  EFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNL
        E  A + N+TW +T +  N+ IV  +WVF +K N  G   RYKARLVA+GF Q   IDY ETF+PV +  +   +L++ I     + Q+DV  AFL+G L
Subjt:  EFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNL

Query:  DENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLI--RVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNS
         E +YM    G    S    VC L KAIYGLKQA R W+E     L    F  S  D  + I  +  +    Y L+YVDD++I       + +    L  
Subjt:  DENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLI--RVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNS

Query:  QFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATL-THPEISYSL-----
        +F + DL ++ +F+G+ +    +  ++LSQS Y+  +L +  M +   +S P+ S         +   +    RS++G L Y  L T P+++ ++     
Subjt:  QFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATL-THPEISYSL-----

Query:  ------------VKRILRYLKGVLYHGLWLSKSDNTSLMIENPLLVYV
                    +KR+LRYLKG +   L   K    +L  EN ++ YV
Subjt:  ------------VKRILRYLKGVLYHGLWLSKSDNTSLMIENPLLVYV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.3e-5437.16Show/hide
Query:  DRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICM
        D R  +++ +   H     +     EE E+LQKN T++L      ++ + CKWVFK+K++    + RYKARLV KGF Q   ID++E FSPVVK  +I  
Subjt:  DRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICM

Query:  LLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLI-RVTLTSCCYAL
        +L++A      + QLDV  AFLHG+L+E +YMEQ  GFEV     MVC L K++YGLKQAPR WY    S + S  +  + +D  +   R +  +    L
Subjt:  LLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLI-RVTLTSCCYAL

Query:  IYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVE-VSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLS------AFQGEPFH
        +YVDD++I+G  K  +  L   L+  F +KDLG     LG++ V   T+  L+LSQ KYI  +L+R  M +AKP+S P+     LS        + +   
Subjt:  IYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVE-VSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLS------AFQGEPFH

Query:  DVHLLRSVVGALQYATL-THPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSD
              S VG+L YA + T P+I++++                 VK ILRYL+G     L    SD
Subjt:  DVHLLRSVVGALQYATL-THPEISYSL-----------------VKRILRYLKGVLYHGLWLSKSD

P92519 Uncharacterized mitochondrial protein AtMg008102.2e-2640.96Show/hide
Query:  YALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVH
        Y L+YVDD+++ GSS   +  L+  L+S F++KDLG + YFLG+++ ++P+  GLFLSQ+KY   +L    MLD KP+S P+    L S+     + D  
Subjt:  YALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVH

Query:  LLRSVVGALQYATLTHPEISYS-----------------LVKRILRYLKGVLYHGLWLSKSDNTSL
          RS+VGALQY TLT P+ISY+                 L+KR+LRY+KG ++HGL++ K+   ++
Subjt:  LLRSVVGALQYATLTHPEISYS-----------------LVKRILRYLKGVLYHGLWLSKSDNTSL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-14030.11Show/hide
Query:  SNSSLLGVENTEASSPINQIFGSGNKIYLFQILTALEAYDLENFLK-SESEPPSKYLTSTGTR-------------LISSWLLGSMSEEILNQMLHCKSA
        +N+S+L V      S + ++  +   ++  Q+    + Y+L  FL  S + PP+   T    R             LI S +LG++S  +   +    +A
Subjt:  SNSSLLGVENTEASSPINQIFGSGNKIYLFQILTALEAYDLENFLK-SESEPPSKYLTSTGTR-------------LISSWLLGSMSEEILNQMLHCKSA

Query:  KEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE--------
         +IW TL+ I+ +       Q + +L    K + ++ +Y   +    D LA + KP+  D+ +  +L +L  +Y+ +I  I+A+   P + E        
Subjt:  KEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE--------

Query:  --------------------------AMNNQNNYHNNHSYNQRGDRGNGR------SNRGGRGNRNKP---QCQICTKLGHSM--------------SAM
                                    NN NN + N+ Y+ R +  N +      +N     N++KP   +CQIC   GHS               S  
Subjt:  --------------------------AMNNQNNYHNNHSYNQRGDRGNGR------SNRGGRGNRNKP---QCQICTKLGHSM--------------SAM

Query:  VASP--------NLNI-----DNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISV
          SP        NL +      NNW LDSGAT+H+T   +NLS    Y GG+ +  A+GS +PI H GS S ++ + P     L+N+L+VP+I KNLISV
Subjt:  VASP--------NLNI-----DNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISV

Query:  SQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSG
         +    N V  EF P    +K+++T    LQG   D LY++ I  S      +  ++K   ++            WH RLGHP    +  V+   +YS  
Subjt:  SQFAKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSG

Query:  TID---KMIFCEAYALGNHHALPFSHPS------------------LIIHIIYNLLLVIYGVLHLMPFL-----------AFQKFKTCVEKSLGQSIKSH
         ++   K + C    +   + +PFS  +                  ++ H  Y   ++         +L            F  FK  +E      I + 
Subjt:  TID---KMIFCEAYALGNHHALPFSHPS------------------LIIHIIYNLLLVIYGVLHLMPFL-----------AFQKFKTCVEKSLGQSIKSH

Query:  QTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLR
         +D G +F     +  QHGI H  + P+T + N + ERKHRHI +  LTLLS A++P + W  AF+ +VYLINRLPTP++   SP  KLFG  PN+  LR
Subjt:  QTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLR

Query:  AFGCKCY-----------------CNALRYSTSHKGYKCL-ASDGRLFISRRVLFDENSFSYASF------------------SSHSSIPKSKNVLSPP-
         FGC CY                 C  L YS +   Y CL     RL+ISR V FDEN F ++++                  S H+++P    VL  P 
Subjt:  AFGCKCY-----------------CNALRYSTSHKGYKCL-ASDGRLFISRRVLFDENSFSYASF------------------SSHSSIPKSKNVLSPP-

Query:  ---------------------------LHSIIQSSLMNHNE---------------DRRHTDT-VSDNTDHLNPT-------------------------
                                   L S   SS  +  E                +  T T  S NT   NPT                         
Subjt:  ---------------------------LHSIIQSSLMNHNE---------------DRRHTDT-VSDNTDHLNPT-------------------------

Query:  ---------------IVYP-------------------------------------------LETEEFEALQK-------------------NDTWRLTP
                       +++P                                            E+E   A+Q                    N TW L P
Subjt:  ---------------IVYP-------------------------------------------LETEEFEALQK-------------------NDTWRLTP

Query:  QNPNQ-KIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEV
          P+   IVGC+W+F  K NS G ++RYKARLVAKG++Q P +DY ETFSPV+KS +I ++L +A+ + W IRQLDVNNAFL G L ++VYM Q  GF  
Subjt:  QNPNQ-KIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEV

Query:  KSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGV
        K     VC L+KA+YGLKQAPRAWY  L + L ++GF  S +DTSL +     S  Y L+YVDD++I G+    +++ + +L+ +F++KD  +L YFLG+
Subjt:  KSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGV

Query:  EVS-YPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRI
        E    PT  GL LSQ +YI DLL RT M+ AKP++ PM   P LS + G    D    R +VG+LQY   T P+ISY++                 +KRI
Subjt:  EVS-YPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRI

Query:  LRYLKGVLYHGLWLSKSDNTSL
        LRYL G   HG++L K +  SL
Subjt:  LRYLKGVLYHGLWLSKSDNTSL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.1e-13429.7Show/hide
Query:  NSSLLGVENTEASSPINQIFGSGNKIYLFQILTALEAYDLENFLK-SESEPPSKYLTSTGTR-------------LISSWLLGSMSEEILNQMLHCKSAK
        N+++L V      S + ++  +   ++  Q+    + Y+L  FL  S   PP+   T    R             LI S +LG++S  +   +    +A 
Subjt:  NSSLLGVENTEASSPINQIFGSGNKIYLFQILTALEAYDLENFLK-SESEPPSKYLTSTGTR-------------LISSWLLGSMSEEILNQMLHCKSAK

Query:  EIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE---------
        +IW TL+ I+ +       Q                   L+     D LA + KP+  D+ +  +L +L  DY+ +I  I+A+   P + E         
Subjt:  EIWGTLQGIFFSRYLAQAMQFKNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQE---------

Query:  ----AMN---------------------NQNNYHNNHSYNQRGDRGNG--RSNRGGRGNRNKP-----QCQICTKLGHS-----------------MSAM
            A+N                     NQNN  +N +YN   +R N    S+ G R +  +P     +CQIC+  GHS                  S  
Subjt:  ----AMN---------------------NQNNYHNNHSYNQRGDRGNG--RSNRGGRGNRNKP-----QCQICTKLGHS-----------------MSAM

Query:  VASP-----NLNID-----NNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQF
          +P     NL ++     NNW LDSGAT+H+T   +NLS    Y GG+ +  A+GS +PI H GS S  +S+   +S  LN +L+VP+I KNLISV + 
Subjt:  VASP-----NLNID-----NNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQF

Query:  AKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWHKRLGHPHLPTI-KVVLKHIDYSSGTI
           N V  EF P    +K+++T    LQG   D LY++ I  S      +   +K   ++            WH RLGHP L  +  V+  H        
Subjt:  AKDNHVFFEFHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWHKRLGHPHLPTI-KVVLKHIDYSSGTI

Query:  DKMIFCEAYALGNHHALPFSHPSLI----IHIIY------------NLLLVIYGVLHLMPFL-------------AFQKFKTCVEKSLGQSIKSHQTDGG
         K++ C    +   H +PFS+ ++     +  IY            N    +  V H   +               F  FK+ VE      I +  +D G
Subjt:  DKMIFCEAYALGNHHALPFSHPSLI----IHIIY------------NLLLVIYGVLHLMPFL-------------AFQKFKTCVEKSLGQSIKSHQTDGG

Query:  TKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCK
         +F   + +L QHGI H  + P+T + N + ERKHRHI +M LTLLS A++P + W  AFS +VYLINRLPTP++   SP  KLFG  PN+  L+ FGC 
Subjt:  TKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDAFSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCK

Query:  CY-----------------CNALRYSTSHKGYKCL-ASDGRLFISRRVLFDENSFSYA------------------SFSSHSSIP---------------
        CY                 C  + YS +   Y CL    GRL+ SR V FDE  F ++                  ++ SH+++P               
Subjt:  CY-----------------CNALRYSTSHKGYKCL-ASDGRLFISRRVLFDENSFSYA------------------SFSSHSSIP---------------

Query:  ------------------------KSKNVLSP-------PLHSIIQSSLMNHNEDRRHTD----------------------------------------
                                 S ++ SP       P H+  Q +   H     +++                                        
Subjt:  ------------------------KSKNVLSP-------PLHSIIQSSLMNHNEDRRHTD----------------------------------------

Query:  -------TVSDNTDHLNPTIVYP--------------------------------------LETEEFEALQ--KNDTWR------------------LTP
               + S +T  L P +  P                                        +E   A+Q  K+D WR                  + P
Subjt:  -------TVSDNTDHLNPTIVYP--------------------------------------LETEEFEALQ--KNDTWR------------------LTP

Query:  QNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVK
          P+  IVGC+W+F  K NS G ++RYKARLVAKG++Q P +DY ETFSPV+KS +I ++L +A+ + W IRQLDVNNAFL G L + VYM Q  GF  K
Subjt:  QNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVK

Query:  SSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVE
             VC L+KAIYGLKQAPRAWY  L + L ++GF  S +DTSL +     S  Y L+YVDD++I G+    +   + +L+ +F++K+   L YFLG+E
Subjt:  SSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVE

Query:  VSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILR
               GL LSQ +Y  DLL RT ML AKP++ PM + P L+   G    D    R +VG+LQY   T P++SY++                 +KR+LR
Subjt:  VSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL-----------------VKRILR

Query:  YLKGVLYHGLWLSKSDNTSL
        YL G   HG++L K +  SL
Subjt:  YLKGVLYHGLWLSKSDNTSL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-6440.06Show/hide
Query:  EEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGN
        +E  A++   TW +    PN+K +GCKWV+KIK NS G I RYKARLVAKG+ Q   ID+ ETFSPV K  ++ ++L I+ +  +++ QLD++NAFL+G+
Subjt:  EEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIMKGWSIRQLDVNNAFLHGN

Query:  LDENVYMEQIFGFEVK--SSYP--MVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHS
        LDE +YM+   G+  +   S P   VC+LKK+IYGLKQA R W+   S  L   GF  S +D +  +++T T     L+YVDD+II  ++   V  L   
Subjt:  LDENVYMEQIFGFEVK--SSYP--MVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYSLVHS

Query:  LNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL---
        L S F L+DLG L YFLG+E++  +  G+ + Q KY  DLL  T +L  KP S+PM      SA  G  F D    R ++G L Y  +T  +IS+++   
Subjt:  LNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSL---

Query:  --------------VKRILRYLKGVLYHGLWLSKSDNTSLMI
                      V +IL Y+KG +  GL+ S      L +
Subjt:  --------------VKRILRYLKGVLYHGLWLSKSDNTSLMI

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-2740.96Show/hide
Query:  YALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVH
        Y L+YVDD+++ GSS   +  L+  L+S F++KDLG + YFLG+++ ++P+  GLFLSQ+KY   +L    MLD KP+S P+    L S+     + D  
Subjt:  YALIYVDDLIIMGSSKKDVYSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVH

Query:  LLRSVVGALQYATLTHPEISYS-----------------LVKRILRYLKGVLYHGLWLSKSDNTSL
          RS+VGALQY TLT P+ISY+                 L+KR+LRY+KG ++HGL++ K+   ++
Subjt:  LLRSVVGALQYATLTHPEISYS-----------------LVKRILRYLKGVLYHGLWLSKSDNTSL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.8e-1956.25Show/hide
Query:  EEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIA
        EE +AL +N TW L P   NQ I+GCKWVFK K +S G + R KARLVAKGFHQ   I + ET+SPVV++ TI  +L +A
Subjt:  EEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCAAACTCATCCCTGCTCGGTGTTGAGAACACTGAAGCATCTTCACCGATTAATCAAATATTTGGATCGGGTAACAAAATATATTTATTCCAAATTCTTACAGC
ATTAGAAGCCTATGACTTGGAAAATTTTCTTAAATCTGAATCAGAACCACCATCAAAATATCTCACATCCACTGGGACTCGCCTTATCTCCTCATGGCTTCTAGGGTCCA
TGAGTGAAGAAATACTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAAATTTGGGGAACTCTTCAAGGTATTTTCTTTTCCCGTTACTTGGCACAAGCTATGCAATTC
AAAAACAAACTTCACAATATAAAGAAAGAATCTATGTCATTAAAAGAATACTTTCTCAAAATACAGCAGTGTGTTGATGCCTTAGCTTCAATTAACAAACCAGTTTCATC
TGATGATCATATTTTGTACATATTAGCTAGTTTAGGATCTGATTATCAATCAATGATATCTGTTATTTCCGCCAGAACTGACTCTCCTTTTGTACAAGAAGCTATGAACA
ACCAAAACAACTATCACAACAATCACTCCTACAATCAAAGGGGTGATCGTGGGAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAACAAACCACAATGTCAAATA
TGTACAAAGCTTGGACACAGTATGTCTGCTATGGTGGCTTCCCCCAACCTGAATATTGACAATAATTGGTATCTTGATTCGGGAGCTACAAACCATTTAACTCATAGTTT
GAGCAACCTATCTACTAGATTTGAGTATGTGGGAGGAAATCAAATATATGCAGCAAATGGGTCATGTTTGCCAATCATTCATCATGGTTCCATGTCATTTAACTCCTCTA
CATTACCATTCAAATCATTTACACTAAATAACTTGCTCCATGTTCCATCCATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAA
TTTCACCCTACTTTGTGTTATATGAAGAATATGGATACTGACCAAGTATTTCTTCAAGGACTACTCAATGATAGCCTCTACAAATTTACCATCCAACCATCACATAAAAG
ACTTCACCATTCTAAATCCAACACCAAGTTTGTTTTCAATACAGTTGTACCTAAATCTAATATTCCCTTACTTGATCTATGGCATAAAAGACTAGGTCATCCCCATTTAC
CTACTATTAAAGTTGTTTTGAAACACATTGACTATTCTTCTGGCACTATAGATAAAATGATTTTTTGTGAAGCATATGCATTGGGCAACCACCATGCCCTTCCTTTCTCT
CACCCCTCACTCATTATACACATCATTTACAACTTATTACTTGTGATTTATGGGGTCCTGCATCTGATGCCTTTTTTAGCCTTTCAAAAATTCAAAACCTGTGTTGAAAA
GTCTCTTGGTCAATCAATTAAAAGTCATCAAACAGATGGTGGTACTAAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATTTCCTT
ACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCAAGAAAATGGTTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTATGGGATGATGCC
TTTTCCACTAGTGTCTATCTCATAAATCGTTTGCCTACCCCAGTCATTGATACTATAAGCCCGTTGGATAAGCTATTTGGCTGGAAACCTAACTTTCCTTCTCTTCGAGC
CTTTGGCTGCAAGTGTTATTGTAATGCCCTAAGATACAGTACCTCACATAAAGGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAGACGTGTATTATTTG
ATGAAAATTCATTTTCATATGCATCATTTTCATCTCATTCTAGCATACCCAAATCCAAAAATGTCCTATCTCCACCACTTCACTCAATAATTCAATCATCCCTTATGAAC
CATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATCATCTAAACCCTACTATTGTGTATCCTTTAGAGACAGAAGAGTTTGAAGCCTTACAGAAAAA
TGACACCTGGAGACTTACTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATGGGTTTTTAAGATAAAAAGGAATTCATATGGGCCTATTTCTAGATATAAAGCAC
GCTTAGTTGCTAAAGGGTTTCATCAAACACCTAATATTGATTACAATGAAACATTTAGCCCTGTTGTGAAATCCGTTACTATTTGCATGCTATTAACTATAGCAATTATG
AAAGGATGGAGTATACGTCAATTAGATGTTAATAATGCTTTTCTTCATGGAAATTTAGATGAAAATGTTTACATGGAACAAATATTTGGTTTTGAAGTTAAAAGCTCTTA
TCCTATGGTTTGTCATTTGAAAAAGGCTATTTATGGTCTTAAACAAGCCCCTCGAGCTTGGTATGAAAACTTGAGCTCAGGTTTACATTCCCTTGGATTTAGAACTTCCA
AGGCTGATACATCTTTATTAATACGTGTTACTCTTACATCTTGTTGCTATGCCTTGATTTATGTTGATGATTTGATTATTATGGGCAGCTCTAAGAAAGATGTGTATTCT
TTAGTTCATTCTTTAAACAGTCAATTTGCACTTAAAGATTTGGGAAAGCTAAGCTACTTTCTTGGAGTTGAGGTGTCATACCCAACTAATGGAGGTTTGTTTTTATCTCA
ATCAAAGTATATTACTGATTTATTACAGAGAACAAAAATGTTGGATGCTAAACCTATCTCTATACCTATGGTAAGTGGTCCTTTACTTTCTGCTTTTCAAGGGGAACCAT
TTCATGATGTGCATTTGCTTAGAAGTGTTGTTGGTGCATTACAGTATGCCACACTTACTCATCCTGAGATATCATATAGTCTTGTGAAAAGAATTCTAAGATATCTTAAA
GGTGTACTATATCATGGTTTATGGCTTAGCAAGTCTGATAATACATCCTTAATGATAGAAAATCCACTTCTGGTTTATGTGTTTACTTTGGAAATAACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCAAACTCATCCCTGCTCGGTGTTGAGAACACTGAAGCATCTTCACCGATTAATCAAATATTTGGATCGGGTAACAAAATATATTTATTCCAAATTCTTACAGC
ATTAGAAGCCTATGACTTGGAAAATTTTCTTAAATCTGAATCAGAACCACCATCAAAATATCTCACATCCACTGGGACTCGCCTTATCTCCTCATGGCTTCTAGGGTCCA
TGAGTGAAGAAATACTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAAATTTGGGGAACTCTTCAAGGTATTTTCTTTTCCCGTTACTTGGCACAAGCTATGCAATTC
AAAAACAAACTTCACAATATAAAGAAAGAATCTATGTCATTAAAAGAATACTTTCTCAAAATACAGCAGTGTGTTGATGCCTTAGCTTCAATTAACAAACCAGTTTCATC
TGATGATCATATTTTGTACATATTAGCTAGTTTAGGATCTGATTATCAATCAATGATATCTGTTATTTCCGCCAGAACTGACTCTCCTTTTGTACAAGAAGCTATGAACA
ACCAAAACAACTATCACAACAATCACTCCTACAATCAAAGGGGTGATCGTGGGAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAACAAACCACAATGTCAAATA
TGTACAAAGCTTGGACACAGTATGTCTGCTATGGTGGCTTCCCCCAACCTGAATATTGACAATAATTGGTATCTTGATTCGGGAGCTACAAACCATTTAACTCATAGTTT
GAGCAACCTATCTACTAGATTTGAGTATGTGGGAGGAAATCAAATATATGCAGCAAATGGGTCATGTTTGCCAATCATTCATCATGGTTCCATGTCATTTAACTCCTCTA
CATTACCATTCAAATCATTTACACTAAATAACTTGCTCCATGTTCCATCCATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAA
TTTCACCCTACTTTGTGTTATATGAAGAATATGGATACTGACCAAGTATTTCTTCAAGGACTACTCAATGATAGCCTCTACAAATTTACCATCCAACCATCACATAAAAG
ACTTCACCATTCTAAATCCAACACCAAGTTTGTTTTCAATACAGTTGTACCTAAATCTAATATTCCCTTACTTGATCTATGGCATAAAAGACTAGGTCATCCCCATTTAC
CTACTATTAAAGTTGTTTTGAAACACATTGACTATTCTTCTGGCACTATAGATAAAATGATTTTTTGTGAAGCATATGCATTGGGCAACCACCATGCCCTTCCTTTCTCT
CACCCCTCACTCATTATACACATCATTTACAACTTATTACTTGTGATTTATGGGGTCCTGCATCTGATGCCTTTTTTAGCCTTTCAAAAATTCAAAACCTGTGTTGAAAA
GTCTCTTGGTCAATCAATTAAAAGTCATCAAACAGATGGTGGTACTAAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATTTCCTT
ACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCAAGAAAATGGTTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTATGGGATGATGCC
TTTTCCACTAGTGTCTATCTCATAAATCGTTTGCCTACCCCAGTCATTGATACTATAAGCCCGTTGGATAAGCTATTTGGCTGGAAACCTAACTTTCCTTCTCTTCGAGC
CTTTGGCTGCAAGTGTTATTGTAATGCCCTAAGATACAGTACCTCACATAAAGGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAGACGTGTATTATTTG
ATGAAAATTCATTTTCATATGCATCATTTTCATCTCATTCTAGCATACCCAAATCCAAAAATGTCCTATCTCCACCACTTCACTCAATAATTCAATCATCCCTTATGAAC
CATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATCATCTAAACCCTACTATTGTGTATCCTTTAGAGACAGAAGAGTTTGAAGCCTTACAGAAAAA
TGACACCTGGAGACTTACTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATGGGTTTTTAAGATAAAAAGGAATTCATATGGGCCTATTTCTAGATATAAAGCAC
GCTTAGTTGCTAAAGGGTTTCATCAAACACCTAATATTGATTACAATGAAACATTTAGCCCTGTTGTGAAATCCGTTACTATTTGCATGCTATTAACTATAGCAATTATG
AAAGGATGGAGTATACGTCAATTAGATGTTAATAATGCTTTTCTTCATGGAAATTTAGATGAAAATGTTTACATGGAACAAATATTTGGTTTTGAAGTTAAAAGCTCTTA
TCCTATGGTTTGTCATTTGAAAAAGGCTATTTATGGTCTTAAACAAGCCCCTCGAGCTTGGTATGAAAACTTGAGCTCAGGTTTACATTCCCTTGGATTTAGAACTTCCA
AGGCTGATACATCTTTATTAATACGTGTTACTCTTACATCTTGTTGCTATGCCTTGATTTATGTTGATGATTTGATTATTATGGGCAGCTCTAAGAAAGATGTGTATTCT
TTAGTTCATTCTTTAAACAGTCAATTTGCACTTAAAGATTTGGGAAAGCTAAGCTACTTTCTTGGAGTTGAGGTGTCATACCCAACTAATGGAGGTTTGTTTTTATCTCA
ATCAAAGTATATTACTGATTTATTACAGAGAACAAAAATGTTGGATGCTAAACCTATCTCTATACCTATGGTAAGTGGTCCTTTACTTTCTGCTTTTCAAGGGGAACCAT
TTCATGATGTGCATTTGCTTAGAAGTGTTGTTGGTGCATTACAGTATGCCACACTTACTCATCCTGAGATATCATATAGTCTTGTGAAAAGAATTCTAAGATATCTTAAA
GGTGTACTATATCATGGTTTATGGCTTAGCAAGTCTGATAATACATCCTTAATGATAGAAAATCCACTTCTGGTTTATGTGTTTACTTTGGAAATAACTTAG
Protein sequenceShow/hide protein sequence
MSSNSSLLGVENTEASSPINQIFGSGNKIYLFQILTALEAYDLENFLKSESEPPSKYLTSTGTRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFFSRYLAQAMQF
KNKLHNIKKESMSLKEYFLKIQQCVDALASINKPVSSDDHILYILASLGSDYQSMISVISARTDSPFVQEAMNNQNNYHNNHSYNQRGDRGNGRSNRGGRGNRNKPQCQI
CTKLGHSMSAMVASPNLNIDNNWYLDSGATNHLTHSLSNLSTRFEYVGGNQIYAANGSCLPIIHHGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFE
FHPTLCYMKNMDTDQVFLQGLLNDSLYKFTIQPSHKRLHHSKSNTKFVFNTVVPKSNIPLLDLWHKRLGHPHLPTIKVVLKHIDYSSGTIDKMIFCEAYALGNHHALPFS
HPSLIIHIIYNLLLVIYGVLHLMPFLAFQKFKTCVEKSLGQSIKSHQTDGGTKFKPFKPFLDQHGIEHRITFPYTSKQNDIVERKHRHIKKMVLTLLSQATLPLSLWDDA
FSTSVYLINRLPTPVIDTISPLDKLFGWKPNFPSLRAFGCKCYCNALRYSTSHKGYKCLASDGRLFISRRVLFDENSFSYASFSSHSSIPKSKNVLSPPLHSIIQSSLMN
HNEDRRHTDTVSDNTDHLNPTIVYPLETEEFEALQKNDTWRLTPQNPNQKIVGCKWVFKIKRNSYGPISRYKARLVAKGFHQTPNIDYNETFSPVVKSVTICMLLTIAIM
KGWSIRQLDVNNAFLHGNLDENVYMEQIFGFEVKSSYPMVCHLKKAIYGLKQAPRAWYENLSSGLHSLGFRTSKADTSLLIRVTLTSCCYALIYVDDLIIMGSSKKDVYS
LVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLDAKPISIPMVSGPLLSAFQGEPFHDVHLLRSVVGALQYATLTHPEISYSLVKRILRYLK
GVLYHGLWLSKSDNTSLMIENPLLVYVFTLEIT