; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0018382 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0018382
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr04:20052587..20056825
RNA-Seq ExpressionPay0018382
SyntenyPay0018382
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]1.4e-22336.48Show/hide
Query:  VKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFINIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQ
        VKL + NFL+WK Q+++A     L++       + + F   + R    + +     +  I    LG  S   L+Q             L+  F S+  A+
Subjt:  VKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFINIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQ

Query:  AMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVN
        A QFK +L + KKG   + EY  K++ CVD+LAS+   +S+ DH+  IL  L +DY+S ++ +  R D  SV+E+ +LL+  ES+ E    S  + PS +
Subjt:  AMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVN

Query:  IV-TQTTEKG----AESYIRNSQNNYHN-----------------------NHSYNQRGNR------------------------NKPQCEICTKLGHSV
        +  +   EKG     + Y  NSQ ++                         N +YN R NR                         KP C++C K+GH V
Subjt:  IV-TQTTEKG----AESYIRNSQNNYHN-----------------------NHSYNQRGNR------------------------NKPQCEICTKLGHSV

Query:  DCCFFRY--TPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHS
          C++R+  T +   +  S NS   +Y + +  PQ++ ++ + ++  D NWY DSGA+NH+T +  NL   +++ G NQ++  NG+GL I + G   F S
Subjt:  DCCFFRY--TPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHS

Query:  STLPF--KSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKF-------------TIQPSHKRLDHHSKPNT-
           PF  K   LN+LLHVPSITKNL+SVS+FAKDN VFFEF+    +VKD   + VL+ G + DG Y F             +  PS       SK  T 
Subjt:  STLPF--KSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKF-------------TIQPSHKRLDHHSKPNT-

Query:  ----------KRLGYPHLPTVKAVLKHIDYSSSTINKM--NFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAFQ
                  KRLG+P   T+K VL   + +   INKM  NFC +C LGK H FPFS   T YT PL+LI  DLWG T+ +S +G+ YYI F D F  F 
Subjt:  ----------KRLGYPHLPTVKAVLKHIDYSSSTINKM--NFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAFQ

Query:  ----------------KFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIE---------------IDEAFST
                         FKT VE      IKSLQ D G EF+ F+ +L + GI H+++CP+  +QN + ERKHR I+E                DE+F T
Subjt:  ----------------KFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIE---------------IDEAFST

Query:  SVYLINRLPTLILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSH
         VYL NRLPT IL +  P+           +L+V     +P LRPY +HKL  RS  CTFLGY   HKGYKC++S+GR++IS  V+F+E SFPY+     
Subjt:  SVYLINRLPTLILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSH

Query:  SSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDD--------GNSSSITQSPSPM-------EPQHQTDSGMN
            K+  + S  L ++  S+  +H         +S     + PT   P+ +    S  D+         NS+  T +P+ +         QH   S  +
Subjt:  SSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDD--------GNSSSITQSPSPM-------EPQHQTDSGMN

Query:  TQFQST------SIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTP-----------------QNPN---QKI
             T      + HPMITR+K GI KPK F+    +  P +   A +   WKKAM  E++ALQ+N+ W L P                 +NP+   QK 
Subjt:  TQFQST------SIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTP-----------------QNPN---QKI

Query:  VARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYHMVCHL---------------KKL
         ARLVAKGFH+    D+ ETFSP+VK  T+                             E V+M+QP GF  + + ++VC L               +KL
Subjt:  VARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYHMVCHL---------------KKL

Query:  SSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--------------------------
          +L S GF ++K+  SL +R TP    YVL+YVDD++++GS    + SL+  LN++F+LKDLG++ YFLG++V                          
Subjt:  SSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--------------------------

Query:  ----------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADG
                  G  L    G+P  D+H YRS VGALQY T+T PE+S+SVNK CQFM  P   HW++VKRILR L+  L HGL L KS N+ L+GF DAD 
Subjt:  ----------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADG

Query:  ASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYF
        ASD DDR+STSG CV+ G NL+SW  KKQ I+S+S+ E E+R LA L  ++ W+RSLL++L + L  PP++WCD LS+V L+ANP+LH++TKH+ELD+YF
Subjt:  ASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYF

Query:  VRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKETYR
        VR+ + + ++ +RH+P+ +Q+AD+LTK +S+  F   +   R
Subjt:  VRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKETYR

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0078.56Show/hide
Query:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF
        MSS SSLLGV+NTEASSPINQIFGSGNK+SLVKL+DD FLLWKFQ+LTALE   ++ + +  E        I+        T TPN  YKVWKRQDRLI 
Subjt:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF

Query:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS
        SWLLGSMSEEILNQMLHCK AKEIW TLQGIF SRYLAQAMQFKNKLHNIKKGSMPLKEYFLK+ QCVDALASINK VSSDDHILYILA LGSDYQSMIS
Subjt:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS

Query:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV
        VISARTDSPSVQEVMSLLLTQESQNESKLISET LPSVNIVTQTTEKGAESYIR +QNNYHNNHSYNQ             RGNRNKPQC+IC KLG+S 
Subjt:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV

Query:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST
        D CFFRYTPRSNSS YSPNSHNTSYTNMNNHPQMSAMVA+ DLNIDSNWY DSGATNHLTHSLSNLS  S+YGGGNQIYAANGSGLPIT+YGSMSF+SST
Subjt:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST

Query:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------
        LPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTL YVKDLD  QVLLQGLLNDG YKFTI+PSHKRL HHS  NTK                 
Subjt:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------

Query:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD
           RLG+PHLP VKAVL HID SS TINK+NFCEACALGKHHA PFSH LT YTHPLQLITCDLWG  VNVS+NGF YYISF                SD
Subjt:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD

Query:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT
         FLAFQKFKTCVEKSLGQSIKSLQ DGGTEFKPFKPFLDQ+GIEH+ITCPY SKQNDIVERKHR+I+E+               DEAFSTSVYLINRLPT
Subjt:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT

Query:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII
         +LDNISPL            L+V     YPYLRPYQSHKLSLRSTPCTFLGY TSHKGYKCLASDGRLFISRHVLFDENSFPYASF+SHSS PKSK+++
Subjt:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII

Query:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK
        S PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNS  ITQSPS MEP HQTDSGMNTQ QSTSIHPMIT+SK
Subjt:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.7e-23236.68Show/hide
Query:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC
        T     +  +    +++  ++L DDNFL+WK+Q+  A     LE        +  + +   I    PN +++ ++RQD L+ SWLL S+    L Q++ C
Subjt:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC

Query:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL
          A E                               + +++Y  KM+   D LA+    +S  DHIL I+  LG +Y+S+I+VIS++  SPS+Q V S L
Subjt:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL

Query:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR
        +  E +   K+ S     SVN  +Q + +G  S   ++    + + N + +  NQ  RG    NR           KPQC++C K GH+V  CF+RY P 
Subjt:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR

Query:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG
                               S S S + N + T Y    N  + +M AMVA+P+   +  W+ DSGATNH+TH L NL++ ++Y G ++I+  NG+G
Subjt:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG

Query:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------
        L I++ G   F SS+ P K   L N+L VP+I KNL+SVSQFA+DN+V+FEF+P + +VKD     +LLQG L+ G Y+F +                  
Subjt:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------

Query:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV
                   +   D   K N+         KRLG+P    V  VL       ST +  + C AC LGK H  PF    T YT PLQL+  DLWG + +
Subjt:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV

Query:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI
        N SY GF YY+SF D +                 AF  FK   E   G  +K+ Q D G EF+  K + +Q GI H+++CP+ SKQN I+ERKHRHI+E+
Subjt:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI

Query:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL
                        +AFST+V+LINRLPT +L          N  P + Q      + +P+LRPY  HKL  RS+PCTFLGY + HKGYKCL   GR+
Subjt:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL

Query:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES
        FISR V+FDE  FP+A          SHS+     IP  KN+        +SLP  S   S  ++ N   D R       NTD  +   +         S
Subjt:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES

Query:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------
        S       +I  S +  EP    ++   T  Q    H M+TRSK+GIFKPK + +D    +P   +EA  HP WK+AM+EEF AL KN  W L       
Subjt:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------

Query:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-
                      P     +  ARLVAKG+ + P  D+ ETFSP+VK  TI                             E VYM+QP GF+ K++   
Subjt:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-

Query:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--
         +VC L                KL  SL   GF ++K+  SL +R T  S  +VL+YVDD+++ GSS ++++ L+  L   F+LKDLG+LSYFLG+EV  
Subjt:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--

Query:  -----------------------------------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLK
                                           G  L A  G+P  +V  YRSVVGALQY T+T PEI++SVNK CQFM  P  THW+ VKRILR L 
Subjt:  -----------------------------------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLK

Query:  SVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDY
             G+ L  S+ M+LVGF DAD  SD DDR+STSG CV+ G +LVSW  KKQ   S+S+TEAE+R LA L ++++W++SLL++L   + + P++WCD 
Subjt:  SVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDY

Query:  LSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE
        +S+V L+ANP+LHS+TKH+ELD+YFVR+ + + KL + H+PT +Q+AD+ TKPLS + F  L+E
Subjt:  LSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.2e-24437.69Show/hide
Query:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC
        T     +  +    +++  ++L DDNFL+WK+Q+  A     LE        +  + +   I    PN +++ ++RQD L+ SWLL S+    L Q++ C
Subjt:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC

Query:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL
          A E+W T+   F S+  A+ M +K+++  +KK  + +++Y  KM+   D LA+    +S  DHIL I+  LG +Y+S+I+VIS++  SPS+Q V S L
Subjt:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL

Query:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR
        +  E +   K+ S     SVN  +Q + +G  S   ++    + + N + +  NQ  RG    NR           KPQC++C K GH+V  CF+RY P 
Subjt:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR

Query:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG
                               S S S + N + T Y    N  + +M AMVA+P+   +  W+ DSGATNH+TH L NL++ ++Y G ++I+  NG+G
Subjt:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG

Query:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------
        L I++ G   F SS+ P K   L N+L VP+I KNL+SVSQFA+DN+V+FEF+P + +VKD     +LLQG L+ G Y+F +                  
Subjt:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------

Query:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV
                   +   D   K N+         KRLG+P    V  VL       ST +  + C AC LGK H  PF    T YT PLQL+  DLWG + +
Subjt:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV

Query:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI
        N SY GF YY+SF D +                 AF  FK   E   G  +K+ Q D G EF+  K + +Q GI H+++CP+ SKQN I+ERKHRHI+E+
Subjt:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI

Query:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL
                        +AFST+V+LINRLPT +L          N  P + Q      + +P+LRPY  HKL  RS+PCTFLGY + HKGYKCL   GR+
Subjt:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL

Query:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES
        FISR V+FDE  FP+A          SHS+     IP  KN+        +SLP  S   S  ++ N   D R       NTD  +   +         S
Subjt:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES

Query:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------
        S       +I  S +  EP    ++   T  Q    H M+TRSK+GIFKPK + +D    +P   +EA  HP WK+AM+EEF AL KN  W L       
Subjt:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------

Query:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-
                      P     +  ARLVAKG+ + P  D+ ETFSP+VK  TI                             E VYM+QP GF+ K++   
Subjt:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-

Query:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE---
         +VC L                KL  SL   GF ++K+  SL +R T  S  +VL+YVDD+++ GSS ++++ L+  L   F+LKDLG+LSYFLG+E   
Subjt:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE---

Query:  ----------------VGPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLV
                         G  L A  G+P  +V  YRSVVGALQY T+T PEI++SVNK CQFM  P  THW+ VKRILR L      G+ L  S+ M+LV
Subjt:  ----------------VGPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLV

Query:  GFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKH
        GF DAD  SD DDR+STSG CV+ G +LVSW  KKQ   S+S+TEAE+R LA L ++++W++SLL++L   + + P++WCD +S+V L+ANP+LHS+TKH
Subjt:  GFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKH

Query:  VELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE
        +ELD+YFVR+ + + KL + H+PT +Q+AD+ TKPLS + F  L+E
Subjt:  VELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0078.68Show/hide
Query:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF
        MSS SSLLGV+NTEASSPINQIFGSGNK+SLVKL+DD FLLWKFQ+LTALE   ++ + +  E        I+        T TPN  YKVWKRQDRLI 
Subjt:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF

Query:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS
        SWLLGSMSEEILNQMLHCK AKEIW TLQGIF SRYLAQAMQFKNKLHNIKKGSMPLKEYFLK+ QCVDALASINK VSSDDHILYILA LGSDYQSMIS
Subjt:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS

Query:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV
        VISARTDSPSVQEVMSLLLTQESQNESKLISET LPSVNIVTQTTEKGAESYIR +QNNYHNNHSYNQ             RGNRNKPQC+IC KLG+S 
Subjt:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV

Query:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST
        D CFFRYTPRSNSS YSPNSHNTSYTNMNNHPQMSAMVA+ DLNIDSNWY DSGATNHLTHSLSNLS  S+YGGGNQIYAANGSGLPIT+YGSMSF+SST
Subjt:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST

Query:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------
        LPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTL YVKDLD  QVLLQGLLNDG YKFTI+PSHKRL HHS  NTK                 
Subjt:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------

Query:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD
           RLG+PHLP VKAVL HID SS TINK+NFCEACALGKHHA PFSH LT YTHPLQLITCDLWG  VNVS+NGF YYISF                SD
Subjt:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD

Query:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT
         FLAFQKFKTCVEKSLGQSIKSLQ DGGTEFKPFKPFLDQ+GIEH+ITCPY SKQNDIVERKHR+I+E+               DEAFSTSVYLINRLPT
Subjt:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT

Query:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII
         +LDNISPL            L+V     YPYLRPYQSHKLSLRSTPCTFLGY TSHKGYKCLASDGRLFISRHVLFDENSFPYASF+SHSSIPKSK+++
Subjt:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII

Query:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK
        S PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNS  ITQSPS MEP HQTDSGMNTQ QSTSIHPMIT+SK
Subjt:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK

TrEMBL top hitse value%identityAlignment
A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-948.1e-23336.68Show/hide
Query:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC
        T     +  +    +++  ++L DDNFL+WK+Q+  A     LE        +  + +   I    PN +++ ++RQD L+ SWLL S+    L Q++ C
Subjt:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC

Query:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL
          A E                               + +++Y  KM+   D LA+    +S  DHIL I+  LG +Y+S+I+VIS++  SPS+Q V S L
Subjt:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL

Query:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR
        +  E +   K+ S     SVN  +Q + +G  S   ++    + + N + +  NQ  RG    NR           KPQC++C K GH+V  CF+RY P 
Subjt:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR

Query:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG
                               S S S + N + T Y    N  + +M AMVA+P+   +  W+ DSGATNH+TH L NL++ ++Y G ++I+  NG+G
Subjt:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG

Query:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------
        L I++ G   F SS+ P K   L N+L VP+I KNL+SVSQFA+DN+V+FEF+P + +VKD     +LLQG L+ G Y+F +                  
Subjt:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------

Query:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV
                   +   D   K N+         KRLG+P    V  VL       ST +  + C AC LGK H  PF    T YT PLQL+  DLWG + +
Subjt:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV

Query:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI
        N SY GF YY+SF D +                 AF  FK   E   G  +K+ Q D G EF+  K + +Q GI H+++CP+ SKQN I+ERKHRHI+E+
Subjt:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI

Query:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL
                        +AFST+V+LINRLPT +L          N  P + Q      + +P+LRPY  HKL  RS+PCTFLGY + HKGYKCL   GR+
Subjt:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL

Query:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES
        FISR V+FDE  FP+A          SHS+     IP  KN+        +SLP  S   S  ++ N   D R       NTD  +   +         S
Subjt:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES

Query:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------
        S       +I  S +  EP    ++   T  Q    H M+TRSK+GIFKPK + +D    +P   +EA  HP WK+AM+EEF AL KN  W L       
Subjt:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------

Query:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-
                      P     +  ARLVAKG+ + P  D+ ETFSP+VK  TI                             E VYM+QP GF+ K++   
Subjt:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-

Query:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--
         +VC L                KL  SL   GF ++K+  SL +R T  S  +VL+YVDD+++ GSS ++++ L+  L   F+LKDLG+LSYFLG+EV  
Subjt:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--

Query:  -----------------------------------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLK
                                           G  L A  G+P  +V  YRSVVGALQY T+T PEI++SVNK CQFM  P  THW+ VKRILR L 
Subjt:  -----------------------------------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLK

Query:  SVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDY
             G+ L  S+ M+LVGF DAD  SD DDR+STSG CV+ G +LVSW  KKQ   S+S+TEAE+R LA L ++++W++SLL++L   + + P++WCD 
Subjt:  SVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDY

Query:  LSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE
        +S+V L+ANP+LHS+TKH+ELD+YFVR+ + + KL + H+PT +Q+AD+ TKPLS + F  L+E
Subjt:  LSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-24437.69Show/hide
Query:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC
        T     +  +    +++  ++L DDNFL+WK+Q+  A     LE        +  + +   I    PN +++ ++RQD L+ SWLL S+    L Q++ C
Subjt:  TEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFI--NIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHC

Query:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL
          A E+W T+   F S+  A+ M +K+++  +KK  + +++Y  KM+   D LA+    +S  DHIL I+  LG +Y+S+I+VIS++  SPS+Q V S L
Subjt:  KFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLL

Query:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR
        +  E +   K+ S     SVN  +Q + +G  S   ++    + + N + +  NQ  RG    NR           KPQC++C K GH+V  CF+RY P 
Subjt:  LTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNS---QNNYHNNHSY--NQ--RG----NRN----------KPQCEICTKLGHSVDCCFFRYTPR

Query:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG
                               S S S + N + T Y    N  + +M AMVA+P+   +  W+ DSGATNH+TH L NL++ ++Y G ++I+  NG+G
Subjt:  -----------------------SNSSSYSPNSHNTSYTNMNN--HPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSG

Query:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------
        L I++ G   F SS+ P K   L N+L VP+I KNL+SVSQFA+DN+V+FEF+P + +VKD     +LLQG L+ G Y+F +                  
Subjt:  LPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPS---------------

Query:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV
                   +   D   K N+         KRLG+P    V  VL       ST +  + C AC LGK H  PF    T YT PLQL+  DLWG + +
Subjt:  -----------HKRLDHHSKPNT---------KRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWG-STV

Query:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI
        N SY GF YY+SF D +                 AF  FK   E   G  +K+ Q D G EF+  K + +Q GI H+++CP+ SKQN I+ERKHRHI+E+
Subjt:  NVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI

Query:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL
                        +AFST+V+LINRLPT +L          N  P + Q      + +P+LRPY  HKL  RS+PCTFLGY + HKGYKCL   GR+
Subjt:  ---------------DEAFSTSVYLINRLPTLILD---------NISPLWLQ------VLYPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRL

Query:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES
        FISR V+FDE  FP+A          SHS+     IP  KN+        +SLP  S   S  ++ N   D R       NTD  +   +         S
Subjt:  FISRHVLFDENSFPYA-------SFSSHSS-----IPKSKNI--------ISLPLHSIIQSSLMNHN--EDRRHTDTVSDNTDHLNPTIVYPLETGTQES

Query:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------
        S       +I  S +  EP    ++   T  Q    H M+TRSK+GIFKPK + +D    +P   +EA  HP WK+AM+EEF AL KN  W L       
Subjt:  SRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRL-------

Query:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-
                      P     +  ARLVAKG+ + P  D+ ETFSP+VK  TI                             E VYM+QP GF+ K++   
Subjt:  -------------TPQNPNQKIVARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYH-

Query:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE---
         +VC L                KL  SL   GF ++K+  SL +R T  S  +VL+YVDD+++ GSS ++++ L+  L   F+LKDLG+LSYFLG+E   
Subjt:  -MVCHL---------------KKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE---

Query:  ----------------VGPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLV
                         G  L A  G+P  +V  YRSVVGALQY T+T PEI++SVNK CQFM  P  THW+ VKRILR L      G+ L  S+ M+LV
Subjt:  ----------------VGPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLV

Query:  GFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKH
        GF DAD  SD DDR+STSG CV+ G +LVSW  KKQ   S+S+TEAE+R LA L ++++W++SLL++L   + + P++WCD +S+V L+ANP+LHS+TKH
Subjt:  GFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKH

Query:  VELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE
        +ELD+YFVR+ + + KL + H+PT +Q+AD+ TKPLS + F  L+E
Subjt:  VELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0078.56Show/hide
Query:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF
        MSS SSLLGV+NTEASSPINQIFGSGNK+SLVKL+DD FLLWKFQ+LTALE   ++ + +  E        I+        T TPN  YKVWKRQDRLI 
Subjt:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF

Query:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS
        SWLLGSMSEEILNQMLHCK AKEIW TLQGIF SRYLAQAMQFKNKLHNIKKGSMPLKEYFLK+ QCVDALASINK VSSDDHILYILA LGSDYQSMIS
Subjt:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS

Query:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV
        VISARTDSPSVQEVMSLLLTQESQNESKLISET LPSVNIVTQTTEKGAESYIR +QNNYHNNHSYNQ             RGNRNKPQC+IC KLG+S 
Subjt:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV

Query:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST
        D CFFRYTPRSNSS YSPNSHNTSYTNMNNHPQMSAMVA+ DLNIDSNWY DSGATNHLTHSLSNLS  S+YGGGNQIYAANGSGLPIT+YGSMSF+SST
Subjt:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST

Query:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------
        LPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTL YVKDLD  QVLLQGLLNDG YKFTI+PSHKRL HHS  NTK                 
Subjt:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------

Query:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD
           RLG+PHLP VKAVL HID SS TINK+NFCEACALGKHHA PFSH LT YTHPLQLITCDLWG  VNVS+NGF YYISF                SD
Subjt:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD

Query:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT
         FLAFQKFKTCVEKSLGQSIKSLQ DGGTEFKPFKPFLDQ+GIEH+ITCPY SKQNDIVERKHR+I+E+               DEAFSTSVYLINRLPT
Subjt:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT

Query:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII
         +LDNISPL            L+V     YPYLRPYQSHKLSLRSTPCTFLGY TSHKGYKCLASDGRLFISRHVLFDENSFPYASF+SHSS PKSK+++
Subjt:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII

Query:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK
        S PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNS  ITQSPS MEP HQTDSGMNTQ QSTSIHPMIT+SK
Subjt:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0078.68Show/hide
Query:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF
        MSS SSLLGV+NTEASSPINQIFGSGNK+SLVKL+DD FLLWKFQ+LTALE   ++ + +  E        I+        T TPN  YKVWKRQDRLI 
Subjt:  MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWE-------FINIR------TRTPNLEYKVWKRQDRLIF

Query:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS
        SWLLGSMSEEILNQMLHCK AKEIW TLQGIF SRYLAQAMQFKNKLHNIKKGSMPLKEYFLK+ QCVDALASINK VSSDDHILYILA LGSDYQSMIS
Subjt:  SWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMIS

Query:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV
        VISARTDSPSVQEVMSLLLTQESQNESKLISET LPSVNIVTQTTEKGAESYIR +QNNYHNNHSYNQ             RGNRNKPQC+IC KLG+S 
Subjt:  VISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQ-------------RGNRNKPQCEICTKLGHSV

Query:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST
        D CFFRYTPRSNSS YSPNSHNTSYTNMNNHPQMSAMVA+ DLNIDSNWY DSGATNHLTHSLSNLS  S+YGGGNQIYAANGSGLPIT+YGSMSF+SST
Subjt:  DCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSST

Query:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------
        LPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTL YVKDLD  QVLLQGLLNDG YKFTI+PSHKRL HHS  NTK                 
Subjt:  LPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK-----------------

Query:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD
           RLG+PHLP VKAVL HID SS TINK+NFCEACALGKHHA PFSH LT YTHPLQLITCDLWG  VNVS+NGF YYISF                SD
Subjt:  ---RLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISF----------------SD

Query:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT
         FLAFQKFKTCVEKSLGQSIKSLQ DGGTEFKPFKPFLDQ+GIEH+ITCPY SKQNDIVERKHR+I+E+               DEAFSTSVYLINRLPT
Subjt:  VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPT

Query:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII
         +LDNISPL            L+V     YPYLRPYQSHKLSLRSTPCTFLGY TSHKGYKCLASDGRLFISRHVLFDENSFPYASF+SHSSIPKSK+++
Subjt:  LILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNII

Query:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK
        S PLHSII SSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNS  ITQSPS MEP HQTDSGMNTQ QSTSIHPMIT+SK
Subjt:  SLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSK

A5BFT3 Integrase catalytic domain-containing protein6.9e-22436.48Show/hide
Query:  VKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFINIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQ
        VKL + NFL+WK Q+++A     L++       + + F   + R    + +     +  I    LG  S   L+Q             L+  F S+  A+
Subjt:  VKLSDDNFLLWKFQVLTA-----LERTTIKISHIHWEFINIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSRYLAQ

Query:  AMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVN
        A QFK +L + KKG   + EY  K++ CVD+LAS+   +S+ DH+  IL  L +DY+S ++ +  R D  SV+E+ +LL+  ES+ E    S  + PS +
Subjt:  AMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETTLPSVN

Query:  IV-TQTTEKG----AESYIRNSQNNYHN-----------------------NHSYNQRGNR------------------------NKPQCEICTKLGHSV
        +  +   EKG     + Y  NSQ ++                         N +YN R NR                         KP C++C K+GH V
Subjt:  IV-TQTTEKG----AESYIRNSQNNYHN-----------------------NHSYNQRGNR------------------------NKPQCEICTKLGHSV

Query:  DCCFFRY--TPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHS
          C++R+  T +   +  S NS   +Y + +  PQ++ ++ + ++  D NWY DSGA+NH+T +  NL   +++ G NQ++  NG+GL I + G   F S
Subjt:  DCCFFRY--TPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHS

Query:  STLPF--KSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKF-------------TIQPSHKRLDHHSKPNT-
           PF  K   LN+LLHVPSITKNL+SVS+FAKDN VFFEF+    +VKD   + VL+ G + DG Y F             +  PS       SK  T 
Subjt:  STLPF--KSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKF-------------TIQPSHKRLDHHSKPNT-

Query:  ----------KRLGYPHLPTVKAVLKHIDYSSSTINKM--NFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAFQ
                  KRLG+P   T+K VL   + +   INKM  NFC +C LGK H FPFS   T YT PL+LI  DLWG T+ +S +G+ YYI F D F  F 
Subjt:  ----------KRLGYPHLPTVKAVLKHIDYSSSTINKM--NFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAFQ

Query:  ----------------KFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIE---------------IDEAFST
                         FKT VE      IKSLQ D G EF+ F+ +L + GI H+++CP+  +QN + ERKHR I+E                DE+F T
Subjt:  ----------------KFKTCVEKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIE---------------IDEAFST

Query:  SVYLINRLPTLILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSH
         VYL NRLPT IL +  P+           +L+V     +P LRPY +HKL  RS  CTFLGY   HKGYKC++S+GR++IS  V+F+E SFPY+     
Subjt:  SVYLINRLPTLILDNISPL-----------WLQVL----YPYLRPYQSHKLSLRSTPCTFLGYITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSH

Query:  SSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDD--------GNSSSITQSPSPM-------EPQHQTDSGMN
            K+  + S  L ++  S+  +H         +S     + PT   P+ +    S  D+         NS+  T +P+ +         QH   S  +
Subjt:  SSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDD--------GNSSSITQSPSPM-------EPQHQTDSGMN

Query:  TQFQST------SIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTP-----------------QNPN---QKI
             T      + HPMITR+K GI KPK F+    +  P +   A +   WKKAM  E++ALQ+N+ W L P                 +NP+   QK 
Subjt:  TQFQST------SIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTP-----------------QNPN---QKI

Query:  VARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYHMVCHL---------------KKL
         ARLVAKGFH+    D+ ETFSP+VK  T+                             E V+M+QP GF  + + ++VC L               +KL
Subjt:  VARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQPFGFEVKSSYHMVCHL---------------KKL

Query:  SSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--------------------------
          +L S GF ++K+  SL +R TP    YVL+YVDD++++GS    + SL+  LN++F+LKDLG++ YFLG++V                          
Subjt:  SSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV--------------------------

Query:  ----------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADG
                  G  L    G+P  D+H YRS VGALQY T+T PE+S+SVNK CQFM  P   HW++VKRILR L+  L HGL L KS N+ L+GF DAD 
Subjt:  ----------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADG

Query:  ASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYF
        ASD DDR+STSG CV+ G NL+SW  KKQ I+S+S+ E E+R LA L  ++ W+RSLL++L + L  PP++WCD LS+V L+ANP+LH++TKH+ELD+YF
Subjt:  ASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYF

Query:  VRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKETYR
        VR+ + + ++ +RH+P+ +Q+AD+LTK +S+  F   +   R
Subjt:  VRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKETYR

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.1e-4024.51Show/hide
Query:  MNHNEDRRHTDTVSDNTDHLNP-------TIVYPLETGTQESSRDDG----NSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFL
        +N ++ R+  D ++++    NP       T  +  E G    +++DG    N  S      P    ++ D+ +N           +  + H IF      
Subjt:  MNHNEDRRHTDTVSDNTDHLNP-------TIVYPLETGTQESSRDDG----NSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFL

Query:  IDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV--------------------ARLVAKGFHRTPNIDYNETFSPIVKLV----
         D  Q +   +        W++A+  E  A + N+ W +T +  N+ IV                    ARLVA+GF +   IDY ETF+P+ ++     
Subjt:  IDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV--------------------ARLVAKGFHRTPNIDYNETFSPIVKLV----

Query:  ------------------------TIHENVYMEQPFGFEVKSSYHMVCHLKK---------------LSSSLHSLGFRTSKAGTSLLI--RVTPTSCCYV
                                T+ E +YM  P G    S    VC L K                  +L    F  S     + I  +       YV
Subjt:  ------------------------TIHENVYMEQPFGFEVKSSYHMVCHLKK---------------LSSSLHSLGFRTSKAGTSLLI--RVTPTSCCYV

Query:  LIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE---------------VGPLLYAFQGEPFHDVHL--------------------YRSV
        L+YVDD++I       +N+    L  +F + DL ++ +F+G+                V  +L  F  E  + V                       RS+
Subjt:  LIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE---------------VGPLLYAFQGEPFHDVHL--------------------YRSV

Query:  VGALQYATL-THPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNK--SDNMSLVGFVDADGASDPDDRKSTSGFCV-YFGNNLVSWGFK
        +G L Y  L T P+++ +VN   ++        WQ +KR+LR LK  +   L   K  +    ++G+VD+D A    DRKST+G+    F  NL+ W  K
Subjt:  VGALQYATL-THPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNK--SDNMSLVGFVDADGASDPDDRKSTSGFCV-YFGNNLVSWGFK

Query:  KQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTK
        +Q+ ++ S+TEAE+  L     + +W++ LL  + I L  P  ++ D    + +  NP  H + KH+++  +F R+ +Q   + + ++PT  Q+ADI TK
Subjt:  KQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTK

Query:  PLSAQSFHNLKE
        PL A  F  L++
Subjt:  PLSAQSFHNLKE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-8225.4Show/hide
Query:  SGNKVSLVKLSDDN-FLLWKFQVLTALERTTIKISHIHWEFINIRTRTPN-LEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSR
        SG K  + K + DN F  W+ ++     R  +    +H + +++ ++ P+ ++ + W   D    S +   +S++++N ++    A+ IW  L+ ++ S+
Subjt:  SGNKVSLVKLSDDN-FLLWKFQVLTALERTTIKISHIHWEFINIRTRTPN-LEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGTLQGIFFSR

Query:  YLAQAMQFKNKLH--NIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISET
         L   +  K +L+  ++ +G+  L  +       +  LA++   +  +D  + +L  L S Y ++ + I     +  +++V S LL  E   +       
Subjt:  YLAQAMQFKNKLH--NIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISET

Query:  TLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQRGNRNKPQ---CEICTKLGHSVDCCFFRYTPR--SNSSSYSPNSHNTSYTNMNNHPQMSAMVASP
          P        TE    SY R+S NNY  + +  +  NR+K +   C  C + GH    C     PR     +S   N  NT+    NN   +  +    
Subjt:  TLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQRGNRNKPQ---CEICTKLGHSVDCCFFRYTPR--SNSSSYSPNSHNTSYTNMNNHPQMSAMVASP

Query:  DL----NIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEF
        +       +S W +D+ A++H T  + +L  R   G    +   N S   I   G +   ++     +  L ++ HVP +  NLIS     +D    +E 
Subjt:  DL----NIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEF

Query:  Y----------PTLYYVKDL------DIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTKRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKH
        Y           +L   K +           + QG LN    + ++   HKR+ H S+   + L    L         I Y+  T  K   C+ C  GK 
Subjt:  Y----------PTLYYVKDL------DIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTKRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKH

Query:  HAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSD----------------VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEF--KPFKPFLD
        H   F        + L L+  D+ G     S  G  Y+++F D                VF  FQKF   VE+  G+ +K L+ D G E+  + F+ +  
Subjt:  HAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSD----------------VFLAFQKFKTCVEKSLGQSIKSLQIDGGTEF--KPFKPFLD

Query:  QYGIEHKITCPYASKQNDIVERKHRHIIE---------------IDEAFSTSVYLINRLPTLILDNISP--LWL--QVLYPYLRPY-----------QSH
         +GI H+ T P   + N + ER +R I+E                 EA  T+ YLINR P++ L    P  +W   +V Y +L+ +           Q  
Subjt:  QYGIEHKITCPYASKQNDIVERKHRHIIE---------------IDEAFSTSVYLINRLPTLILDNISP--LWL--QVLYPYLRPY-----------QSH

Query:  KLSLRSTPCTFLGYITSHKGYKCLASDGRLFI-SRHVLFDENSFPYASFSSHSSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVY
        KL  +S PC F+GY     GY+      +  I SR V+F E+    A+  S     K KN I +P    I S+  N       TD VS+  +        
Subjt:  KLSLRSTPCTFLGYITSHKGYKCLASDGRLFI-SRHVLFDENSFPYASFSSHSSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVY

Query:  PLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHP---HWKKAMEEEFEALQKN
        P E   Q    D+G      + P+  E QHQ            S  P +   +   +    +++     +P + KE   HP      KAM+EE E+LQKN
Subjt:  PLETGTQESSRDDGNSSSITQSPSPMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHP---HWKKAMEEEFEALQKN

Query:  DIWRLT-----------------PQNPNQKIV---ARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQ
          ++L                   ++ + K+V   ARLV KGF +   ID++E FSP+VK+ +I                             E +YMEQ
Subjt:  DIWRLT-----------------PQNPNQKIV---ARLVAKGFHRTPNIDYNETFSPIVKLVTI----------------------------HENVYMEQ

Query:  PFGFEVKSSYHMVCHLKKLSSSLHSLG-------------------FRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKD
        P GFEV    HMVC   KL+ SL+ L                     +T         R +  +   +L+YVDD++I+G  K  +  L   L+  F +KD
Subjt:  PFGFEVKSSYHMVCHLKKLSSSLHSLG-------------------FRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKD

Query:  LGKLSYFLGVEV-----GPLLYAFQGEPFHDV------------------HL---------------------YRSVVGALQYATL-THPEISYSVNKAC
        LG     LG+++        L+  Q +    V                  HL                     Y S VG+L YA + T P+I+++V    
Subjt:  LGKLSYFLGVEV-----GPLLYAFQGEPFHDV------------------HL---------------------YRSVVGALQYATL-THPEISYSVNKAC

Query:  QFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVW
        +F+  P   HW+ VK ILR L+      L    SD + L G+ DAD A D D+RKS++G+   F    +SW  K Q  ++ S TEAE+        +++W
Subjt:  QFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVW

Query:  IRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE
        ++  L +L ++     +++CD  S++ L+ N + H++TKH+++  +++R+++    L +  + T E  AD+LTK +    F   KE
Subjt:  IRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHNLKE

P92519 Uncharacterized mitochondrial protein AtMg008105.0e-3837.5Show/hide
Query:  YVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV---------GPLLYAFQ--------------------------GEPFHDVHLYR
        Y+L+YVDD+++ GSS   +N L+  L++ F++KDLG + YFLG+++             YA Q                             + D   +R
Subjt:  YVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV---------GPLLYAFQ--------------------------GEPFHDVHLYR

Query:  SVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQ
        S+VGALQY TLT P+ISY+VN  CQ MH P    + L+KR+LR +K  ++HGL ++K+  +++  F D+D A     R+ST+GFC + G N++SW  K+Q
Subjt:  SVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQ

Query:  SIISKSNTEAEHRCLALLETKLVW
          +S+S+TE E+R LAL   +L W
Subjt:  SIISKSNTEAEHRCLALLETKLVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.9e-16530.7Show/hide
Query:  NKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWEFINIRTRTP------------NLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGT
        N  ++ KL+  N+L+W  QV    +   +        F++  T  P            N +Y  WKRQD+LI+S +LG++S  +   +     A +IW T
Subjt:  NKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWEFINIRTRTP------------NLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGT

Query:  LQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNES
        L+ I+ +       Q + +L    KG+  + +Y   +    D LA + K +  D+ +  +L +L  +Y+ +I  I+A+   P++ E+   LL  ES+  +
Subjt:  LQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNES

Query:  KLISETTLP-SVNIV----TQTTEKGAESYIRNSQNNYHNNH----------SYNQRGNRNKP---QCEICTKLGHSVDCCFFRYTPRSNSSSYSPNSHN
         + S T +P + N V    T TT         N  +N +NN+          +++   N++KP   +C+IC   GHS   C       S+ +S  P S  
Subjt:  KLISETTLP-SVNIV----TQTTEKGAESYIRNSQNNYHNNH----------SYNQRGNRNKP---QCEICTKLGHSVDCCFFRYTPRSNSSSYSPNSHN

Query:  TSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLI
        T +      P+ +  + SP     +NW LDSGAT+H+T   +NLS    Y GG+ +  A+GS +PI++ GS S  + + P     L+N+L+VP+I KNLI
Subjt:  TSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLI

Query:  SVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK--------RLGYPHLPTVKAVLKHIDYSSSTIN---KMN
        SV +    N V  EF+P  + VKDL+    LLQG   D  Y++ I  S + +   + P++K        RLG+P    + +V+   +YS S +N   K  
Subjt:  SVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK--------RLGYPHLPTVKAVLKHIDYSSSTIN---KMN

Query:  FCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEF
         C  C + K +  PFS    + T PL+ I  D+W S + +S++ + YY+ F D F                  F  FK  +E      I +   D G EF
Subjt:  FCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFL----------------AFQKFKTCVEKSLGQSIKSLQIDGGTEF

Query:  KPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPTLILDNISPLW-----------LQVL----YP
             +  Q+GI H  + P+  + N + ERKHRHI+E                  AF+ +VYLINRLPT +L   SP             L+V     YP
Subjt:  KPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPTLILDNISPLW-----------LQVL----YP

Query:  YLRPYQSHKLSLRSTPCTFLGYITSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------SSHSSIPKSKNIISLP---------
        +LRPY  HKL  +S  C FLGY  +   Y CL     RL+ISRHV FDEN FP++++                  S H+++P    ++  P         
Subjt:  YLRPYQSHKLSLRSTPCTFLGYITSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------SSHSSIPKSKNIISLP---------

Query:  -------------------LHSIIQSSLMNHNE---------------DRRHTDT-VSDNTDHLNPT------IVYPLETGTQESSRDDG---NSSSITQ
                           L S   SS  +  E                +  T T  S NT   NPT      +   L T  Q SS       ++SS + 
Subjt:  -------------------LHSIIQSSLMNHNE---------------DRRHTDT-VSDNTDHLNPT------IVYPLETGTQESSRDDG---NSSSITQ

Query:  SPSP----MEPQHQTDSGMNTQFQS-TSIHPMITRSKHGIFKPK---AFLIDY-TQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV
        SP+P    + P       +N   Q+  + H M TR+K GI KP    +  +    +++P  A +A K   W+ AM  E  A   N  W L P  P+   +
Subjt:  SPSP----MEPQHQTDSGMNTQFQS-TSIHPMITRSKHGIFKPK---AFLIDY-TQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV

Query:  ---------------------ARLVAKGFHRTPNIDYNETFSPIVKLV----------------------------TIHENVYMEQPFGFEVKSSYHMVC
                             ARLVAKG+++ P +DY ETFSP++K                              T+ ++VYM QP GF  K   + VC
Subjt:  ---------------------ARLVAKGFHRTPNIDYNETFSPIVKLV----------------------------TIHENVYMEQPFGFEVKSSYHMVC

Query:  HLKK---------------LSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV------
         L+K               L + L ++GF  S + TSL +     S  Y+L+YVDD++I G+    +++ + +L+ +F++KD  +L YFLG+E       
Subjt:  HLKK---------------LSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV------

Query:  ------------------------------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYH
                                       P L  + G    D   YR +VG+LQY   T P+ISY+VN+  QFMH+P   H Q +KRILR L     H
Subjt:  ------------------------------GPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYH

Query:  GLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVH
        G+ L K + +SL  + DAD A D DD  ST+G+ VY G++ +SW  KKQ  + +S+TEAE+R +A   +++ WI SLL +L I L  PP+++CD + + +
Subjt:  GLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVH

Query:  LNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHN
        L ANP+ HS+ KH+ +D +F+R+ +Q G L + H+ T +Q+AD LTKPLS  +F N
Subjt:  LNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-15629.97Show/hide
Query:  NKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWEFINIRTRTP------------NLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGT
        N  ++ KL+  N+L+W  QV    +   +        F++  T  P            N +Y  W+RQD+LI+S +LG++S  +   +     A +IW T
Subjt:  NKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWEFINIRTRTP------------NLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKEIWGT

Query:  LQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNES
        L+ I+ +       Q                   L+     D LA + K +  D+ +  +L +L  DY+ +I  I+A+   PS+ E+   L+ +ES+  +
Subjt:  LQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNES

Query:  KLISETTLPSVNIVT-QTTEKGAESYIRNSQNNYHNNH---------SYNQRGNRNKP-----QCEICTKLGHSVDCCFFRYTPRSNSSSYSPNSHNTSY
           +E    + N+VT + T        R    NY+NN+         S   R +  +P     +C+IC+  GHS   C     P+ +    + N   ++ 
Subjt:  KLISETTLPSVNIVT-QTTEKGAESYIRNSQNNYHNNH---------SYNQRGNRNKP-----QCEICTKLGHSVDCCFFRYTPRSNSSSYSPNSHNTSY

Query:  TNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVS
              P+ +  V SP     +NW LDSGAT+H+T   +NLS    Y GG+ +  A+GS +PIT+ GS S  +S+   +S  LN +L+VP+I KNLISV 
Subjt:  TNMNNHPQMSAMVASPDLNIDSNWYLDSGATNHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVS

Query:  QFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK--------RLGYPHLPTVKAVLKHIDYSSSTIN---KMNFCE
        +    N V  EF+P  + VKDL+    LLQG   D  Y++ I  S + +   + P +K        RLG+P L  + +V+   ++S   +N   K+  C 
Subjt:  QFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFTIQPSHKRLDHHSKPNTK--------RLGYPHLPTVKAVLKHIDYSSSTIN---KMNFCE

Query:  ACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAF-------QK---------FKTCVEKSLGQSIKSLQIDGGTEFKPF
         C + K H  PFS+     + PL+ I  D+W S + +S + + YY+ F D F  +       QK         FK+ VE      I +L  D G EF   
Subjt:  ACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAF-------QK---------FKTCVEKSLGQSIKSLQIDGGTEFKPF

Query:  KPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPTLILDNISPLW-----------LQVL----YPYLR
        + +L Q+GI H  + P+  + N + ERKHRHI+E+                 AFS +VYLINRLPT +L   SP             L+V     YP+LR
Subjt:  KPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEI---------------DEAFSTSVYLINRLPTLILDNISPLW-----------LQVL----YPYLR

Query:  PYQSHKLSLRSTPCTFLGYITSHKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFSSHSSIPKSKNII---------------
        PY  HKL  +S  C F+GY  +   Y CL    GRL+ SRHV FDE  FP++                  ++ SH+++P +  ++               
Subjt:  PYQSHKLSLRSTPCTFLGYITSHKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFSSHSSIPKSKNII---------------

Query:  ----------------SLPLHSIIQSS-----LMNHN--------EDRRHTDTVSDNTDHLNPTIVYP------------------LETGTQESSRDDGN
                        +LP  SI   S       +HN           +++++ S   ++ NP    P                  + T +   S  +  
Subjt:  ----------------SLPLHSIIQSS-----LMNHN--------EDRRHTDTVSDNTDHLNPTIVYP------------------LETGTQESSRDDGN

Query:  SSSITQSPSPMEPQHQTDSGMNTQFQS-TSIHPMITRSKHGIFKPKAFLIDYT----QTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQK
        SSS T +P P+ P       +    Q+  + H M TR+K GI KP       T     ++P  A +A K   W++AM  E  A   N  W L P  P   
Subjt:  SSSITQSPSPMEPQHQTDSGMNTQFQS-TSIHPMITRSKHGIFKPKAFLIDYT----QTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQK

Query:  IV---------------------ARLVAKGFHRTPNIDYNETFSPIVKLV----------------------------TIHENVYMEQPFGFEVKSSYHM
         +                     ARLVAKG+++ P +DY ETFSP++K                              T+ + VYM QP GF  K     
Subjt:  IV---------------------ARLVAKGFHRTPNIDYNETFSPIVKLV----------------------------TIHENVYMEQPFGFEVKSSYHM

Query:  VCHLKK---------------LSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE-----
        VC L+K               L + L ++GF  S + TSL +     S  Y+L+YVDD++I G+    +   + +L+ +F++K+   L YFLG+E     
Subjt:  VCHLKK---------------LSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVE-----

Query:  -------------------------------VGPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVL
                                         P L    G    D   YR +VG+LQY   T P++SY+VN+  Q+MH+P   HW  +KR+LR L    
Subjt:  -------------------------------VGPLLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVL

Query:  YHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSS
         HG+ L K + +SL  + DAD A D DD  ST+G+ VY G++ +SW  KKQ  + +S+TEAE+R +A   ++L WI SLL +L I L  PP+++CD + +
Subjt:  YHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSS

Query:  VHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHN
         +L ANP+ HS+ KH+ LD +F+R+ +Q G L + H+ T +Q+AD LTKPLS  +F N
Subjt:  VHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQSFHN

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.1e-5629.85Show/hide
Query:  AFLIDYTQTK-PCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV--------------------ARLVAKGFHRTPNIDYNETFSPIVKLV
        +FL+   + K P    EA +   W  AM++E  A++    W +    PN+K +                    ARLVAKG+ +   ID+ ETFSP+ KL 
Subjt:  AFLIDYTQTK-PCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV--------------------ARLVAKGFHRTPNIDYNETFSPIVKLV

Query:  --------------TIH--------------ENVYMEQPFGFEVKSSYHM----VCHLK---------------KLSSSLHSLGFRTSKAGTSLLIRVTP
                      T+H              E +YM+ P G+  +    +    VC+LK               K S +L   GF  S +  +  +++T 
Subjt:  --------------TIH--------------ENVYMEQPFGFEVKSSYHM----VCHLK---------------KLSSSLHSLGFRTSKAGTSLLIRVTP

Query:  TSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEVG-------------------------------PL-----LYAFQGEPFHD
        T    VL+YVDD+II  ++   V+ L   L + F L+DLG L YFLG+E+                                P+       A  G  F D
Subjt:  TSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEVG-------------------------------PL-----LYAFQGEPFHD

Query:  VHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSW
           YR ++G L Y  +T  +IS++VNK  QF   P+  H Q V +IL  +K  +  GL  +    M L  F DA   S  D R+ST+G+C++ G +L+SW
Subjt:  VHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSW

Query:  GFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRD
          KKQ ++SKS+ EAE+R L+    +++W+     +L + L  P +L+CD  +++H+  N + H +TKH+E D + VR+
Subjt:  GFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRD

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.1e-1530.46Show/hide
Query:  TRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCK-FAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSS
        T TP  E K WK +D L+  W+ G++++ +L+ ++     A+++W +L+ +F     A+A+QF+N+L       + + EY  K++   D L +++  +S 
Subjt:  TRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCK-FAKEIWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSS

Query:  DDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQ--NESK-LISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQRGNRNK
           ++++L  L   Y  +++VI  ++  PS  E  S+LL +ES+  N+SK  +S T  PS++ V  T  +  E Y       YHNN+S   RG   K
Subjt:  DDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQ--NESK-LISETTLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQRGNRNK

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.9e-0932.73Show/hide
Query:  YATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIIS---
        Y T+T P+++++VN+  QF    +    Q V ++L  +K  +  GL  + + ++ L  F D+D AS PD R+S +GFC      L   G  ++SI+S   
Subjt:  YATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQSIIS---

Query:  --KSNTEAEH
          + N EA H
Subjt:  --KSNTEAEH

ATMG00810.1 DNA/RNA polymerases superfamily protein3.5e-3937.5Show/hide
Query:  YVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV---------GPLLYAFQ--------------------------GEPFHDVHLYR
        Y+L+YVDD+++ GSS   +N L+  L++ F++KDLG + YFLG+++             YA Q                             + D   +R
Subjt:  YVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEV---------GPLLYAFQ--------------------------GEPFHDVHLYR

Query:  SVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQ
        S+VGALQY TLT P+ISY+VN  CQ MH P    + L+KR+LR +K  ++HGL ++K+  +++  F D+D A     R+ST+GFC + G N++SW  K+Q
Subjt:  SVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLVSWGFKKQ

Query:  SIISKSNTEAEHRCLALLETKLVW
          +S+S+TE E+R LAL   +L W
Subjt:  SIISKSNTEAEHRCLALLETKLVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-1137.14Show/hide
Query:  MITRSKHGIFK--PKAFLIDYTQTK--PCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV--------------------ARLVAKGFHRT
        M+TRSK GI K  PK  L   T  K  P +   A K P W +AM+EE +AL +N  W L P   NQ I+                    ARLVAKGFH+ 
Subjt:  MITRSKHGIFK--PKAFLIDYTQTK--PCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIV--------------------ARLVAKGFHRT

Query:  PNIDYNETFSPIVKLVTIHE--NVYMEQPFGFEVKSSYHM
          I + ET+SP+V+  TI    NV  +   G  +   + M
Subjt:  PNIDYNETFSPIVKLVTIHE--NVYMEQPFGFEVKSSYHM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCAAACTCATCCCTACTCGGTGTTGACAACACTGAAGCATCTTCACCAATTAATCAAATATTTGGATCGGGTAACAAAGTATCTTTAGTGAAGCTCAGCGATGA
TAATTTTCTCTTATGGAAGTTCCAAGTTCTTACAGCATTAGAAAGAACCACCATTAAAATATCTCACATCCACTGGGAGTTCATCAACATCCGTACAAGAACACCAAATC
TGGAATATAAGGTATGGAAACGCCAAGATCGCCTTATCTTCTCATGGCTTCTAGGGTCCATGAGTGAAGAAATACTGAATCAGATGCTTCATTGCAAATTTGCAAAAGAA
ATTTGGGGAACTCTTCAAGGTATTTTCTTTTCCCGTTACTTGGCACAAGCTATGCAATTCAAAAACAAACTTCACAATATAAAGAAAGGATCCATGCCATTAAAAGAATA
CTTTCTCAAAATGCAGCAGTGTGTTGATGCCTTAGCTTCAATTAACAAACTGGTTTCATCTGATGATCATATTCTTTACATATTGGCCGATTTAGGATCTGATTATCAAT
CCATGATATCTGTTATTTCCGCTAGAACTGACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCAGGAATCTCAAAATGAGAGCAAATTAATCAGTGAGACT
ACTCTACCTTCTGTTAACATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGAACAGCCAAAACAACTATCACAACAATCACTCCTACAATCAAAGGGG
AAATCGTAATAAACCACAATGTGAAATCTGTACAAAGCTTGGACACAGTGTTGATTGTTGTTTCTTTCGATATACTCCAAGATCAAATTCATCAAGTTACTCACCAAACT
CACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTTCCCCCGACCTGAATATTGACAGCAATTGGTATCTTGATTCGGGAGCTACA
AACCATTTAACTCATAGTTTGAGCAACCTATCTACTAGATCTAAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGGTTTGCCAATCACTTATTATGGTTC
CATGTCATTTCACTCCTCTACATTACCATTCAAATCATTTACACTAAATAACTTGCTTCATGTTCCATCCATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAG
ATAATCATGTTTTCTTTGAATTTTACCCTACTTTGTATTATGTGAAGGATCTGGATATTCGCCAAGTACTTCTTCAAGGACTACTCAATGATGGGTTCTACAAGTTTACC
ATCCAACCATCACATAAAAGACTTGATCACCATTCTAAACCCAACACCAAAAGACTAGGTTATCCCCATTTACCTACTGTTAAAGCTGTTTTGAAACACATTGACTATTC
TTCTAGCACTATAAATAAAATGAATTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCTTTCCTTTCTCTCACTTCCTTACTCATTATACACATCCTTTACAACTTA
TTACTTGTGATTTATGGGGTTCTACAGTAAATGTATCTTATAATGGTTTTGGTTACTATATAAGTTTTTCTGATGTTTTTTTAGCCTTTCAAAAATTCAAAACCTGTGTT
GAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAATTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAATATGGCATTGAACATAAGATAACATG
TCCTTACGCTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCATTGAAATAGATGAAGCCTTTTCCACTAGTGTCTATCTCATAAATCGTTTGCCTACCC
TAATCCTTGATAATATAAGCCCCCTTTGGCTGCAAGTGTTATATCCTTACCTTCGACCTTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGA
TACATTACCTCACATAAAGGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAGACATGTATTATTTGATGAAAATTCATTTCCATATGCATCATTTTCATC
TCATTCTAGCATACCTAAATCCAAAAATATCATATCTCTACCACTTCACTCAATAATTCAATCATCCCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTT
CTGATAATACTGATCATCTAAACCCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTCGGGATGATGGTAACAGTAGCAGTATTACTCAGTCTCCAAGT
CCTATGGAACCACAACATCAAACCGATTCTGGTATGAACACTCAATTTCAATCTACATCAATTCATCCCATGATAACACGGAGTAAGCATGGTATTTTCAAACCAAAAGC
ATTCTTGATTGATTATACTCAAACTAAACCTTGTAATGCCAAGGAAGCTTTTAAACATCCTCACTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAGAAAAATG
ACATCTGGAGACTTACTCCACAAAATCCTAATCAGAAAATTGTTGCACGCTTAGTTGCTAAAGGGTTTCATCGAACACCTAATATTGATTACAATGAAACATTTAGCCCT
ATTGTGAAACTAGTTACTATTCATGAAAATGTTTACATGGAACAACCATTTGGTTTTGAAGTTAAAAGCTCTTATCATATGGTTTGTCATTTGAAAAAGTTGAGCTCAAG
TTTACATTCTCTTGGATTTAGAACTTCCAAGGCTGGTACATCTTTATTAATACGTGTTACTCCTACATCTTGTTGCTATGTCTTGATTTATGTTGATGATTTGATTATTA
TGGGCAGCTCTAAGAAAGATGTGAATTCTTTAGTTCATTCTTTAAATAATCAATTTGCACTTAAGGATTTGGGAAAGCTGAGCTACTTTCTTGGAGTTGAGGTTGGTCCT
TTACTTTATGCTTTTCAAGGGGAACCATTTCATGATGTGCATCTGTATAGAAGTGTTGTTGGTGCATTACAGTATGCCACACTTACTCATCCTGAGATATCATATAGTGT
CAATAAAGCTTGTCAATTTATGCATATTCCAAAACATACACACTGGCAGCTTGTGAAGAGAATTCTAAGATCTCTTAAAAGTGTACTATATCATGGTTTATCACTTAACA
AATCTGATAATATGTCCTTAGTTGGTTTTGTTGATGCGGATGGGGCTTCTGATCCAGATGACAGAAAATCTACTTCTGGTTTCTGTGTTTATTTTGGAAATAACTTAGTA
TCTTGGGGTTTCAAGAAACAGTCTATTATTTCCAAGTCTAATACTGAAGCTGAACATCGTTGCCTTGCTCTTTTGGAAACCAAACTGGTATGGATTCGTTCTCTCTTGAA
TGACTTATACATTAATCTACCTCTTCCACCTATTTTGTGGTGTGATTACCTAAGTTCAGTGCATCTGAATGCAAATCCTATATTACATTCCAAGACAAAGCATGTTGAAC
TTGACATCTATTTTGTTAGAGATCTTATACAAAAGGGAAAATTATCTATTAGACATCTTCCAACAACTGAACAAATTGCAGATATACTCACCAAGCCATTGTCTGCTCAA
AGTTTCCACAATCTGAAAGAAACTTACCGTCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCAAACTCATCCCTACTCGGTGTTGACAACACTGAAGCATCTTCACCAATTAATCAAATATTTGGATCGGGTAACAAAGTATCTTTAGTGAAGCTCAGCGATGA
TAATTTTCTCTTATGGAAGTTCCAAGTTCTTACAGCATTAGAAAGAACCACCATTAAAATATCTCACATCCACTGGGAGTTCATCAACATCCGTACAAGAACACCAAATC
TGGAATATAAGGTATGGAAACGCCAAGATCGCCTTATCTTCTCATGGCTTCTAGGGTCCATGAGTGAAGAAATACTGAATCAGATGCTTCATTGCAAATTTGCAAAAGAA
ATTTGGGGAACTCTTCAAGGTATTTTCTTTTCCCGTTACTTGGCACAAGCTATGCAATTCAAAAACAAACTTCACAATATAAAGAAAGGATCCATGCCATTAAAAGAATA
CTTTCTCAAAATGCAGCAGTGTGTTGATGCCTTAGCTTCAATTAACAAACTGGTTTCATCTGATGATCATATTCTTTACATATTGGCCGATTTAGGATCTGATTATCAAT
CCATGATATCTGTTATTTCCGCTAGAACTGACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCAGGAATCTCAAAATGAGAGCAAATTAATCAGTGAGACT
ACTCTACCTTCTGTTAACATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGAACAGCCAAAACAACTATCACAACAATCACTCCTACAATCAAAGGGG
AAATCGTAATAAACCACAATGTGAAATCTGTACAAAGCTTGGACACAGTGTTGATTGTTGTTTCTTTCGATATACTCCAAGATCAAATTCATCAAGTTACTCACCAAACT
CACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTTCCCCCGACCTGAATATTGACAGCAATTGGTATCTTGATTCGGGAGCTACA
AACCATTTAACTCATAGTTTGAGCAACCTATCTACTAGATCTAAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGGTTTGCCAATCACTTATTATGGTTC
CATGTCATTTCACTCCTCTACATTACCATTCAAATCATTTACACTAAATAACTTGCTTCATGTTCCATCCATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAG
ATAATCATGTTTTCTTTGAATTTTACCCTACTTTGTATTATGTGAAGGATCTGGATATTCGCCAAGTACTTCTTCAAGGACTACTCAATGATGGGTTCTACAAGTTTACC
ATCCAACCATCACATAAAAGACTTGATCACCATTCTAAACCCAACACCAAAAGACTAGGTTATCCCCATTTACCTACTGTTAAAGCTGTTTTGAAACACATTGACTATTC
TTCTAGCACTATAAATAAAATGAATTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCTTTCCTTTCTCTCACTTCCTTACTCATTATACACATCCTTTACAACTTA
TTACTTGTGATTTATGGGGTTCTACAGTAAATGTATCTTATAATGGTTTTGGTTACTATATAAGTTTTTCTGATGTTTTTTTAGCCTTTCAAAAATTCAAAACCTGTGTT
GAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAATTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAATATGGCATTGAACATAAGATAACATG
TCCTTACGCTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCATTGAAATAGATGAAGCCTTTTCCACTAGTGTCTATCTCATAAATCGTTTGCCTACCC
TAATCCTTGATAATATAAGCCCCCTTTGGCTGCAAGTGTTATATCCTTACCTTCGACCTTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGA
TACATTACCTCACATAAAGGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAGACATGTATTATTTGATGAAAATTCATTTCCATATGCATCATTTTCATC
TCATTCTAGCATACCTAAATCCAAAAATATCATATCTCTACCACTTCACTCAATAATTCAATCATCCCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTT
CTGATAATACTGATCATCTAAACCCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTCGGGATGATGGTAACAGTAGCAGTATTACTCAGTCTCCAAGT
CCTATGGAACCACAACATCAAACCGATTCTGGTATGAACACTCAATTTCAATCTACATCAATTCATCCCATGATAACACGGAGTAAGCATGGTATTTTCAAACCAAAAGC
ATTCTTGATTGATTATACTCAAACTAAACCTTGTAATGCCAAGGAAGCTTTTAAACATCCTCACTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAGAAAAATG
ACATCTGGAGACTTACTCCACAAAATCCTAATCAGAAAATTGTTGCACGCTTAGTTGCTAAAGGGTTTCATCGAACACCTAATATTGATTACAATGAAACATTTAGCCCT
ATTGTGAAACTAGTTACTATTCATGAAAATGTTTACATGGAACAACCATTTGGTTTTGAAGTTAAAAGCTCTTATCATATGGTTTGTCATTTGAAAAAGTTGAGCTCAAG
TTTACATTCTCTTGGATTTAGAACTTCCAAGGCTGGTACATCTTTATTAATACGTGTTACTCCTACATCTTGTTGCTATGTCTTGATTTATGTTGATGATTTGATTATTA
TGGGCAGCTCTAAGAAAGATGTGAATTCTTTAGTTCATTCTTTAAATAATCAATTTGCACTTAAGGATTTGGGAAAGCTGAGCTACTTTCTTGGAGTTGAGGTTGGTCCT
TTACTTTATGCTTTTCAAGGGGAACCATTTCATGATGTGCATCTGTATAGAAGTGTTGTTGGTGCATTACAGTATGCCACACTTACTCATCCTGAGATATCATATAGTGT
CAATAAAGCTTGTCAATTTATGCATATTCCAAAACATACACACTGGCAGCTTGTGAAGAGAATTCTAAGATCTCTTAAAAGTGTACTATATCATGGTTTATCACTTAACA
AATCTGATAATATGTCCTTAGTTGGTTTTGTTGATGCGGATGGGGCTTCTGATCCAGATGACAGAAAATCTACTTCTGGTTTCTGTGTTTATTTTGGAAATAACTTAGTA
TCTTGGGGTTTCAAGAAACAGTCTATTATTTCCAAGTCTAATACTGAAGCTGAACATCGTTGCCTTGCTCTTTTGGAAACCAAACTGGTATGGATTCGTTCTCTCTTGAA
TGACTTATACATTAATCTACCTCTTCCACCTATTTTGTGGTGTGATTACCTAAGTTCAGTGCATCTGAATGCAAATCCTATATTACATTCCAAGACAAAGCATGTTGAAC
TTGACATCTATTTTGTTAGAGATCTTATACAAAAGGGAAAATTATCTATTAGACATCTTCCAACAACTGAACAAATTGCAGATATACTCACCAAGCCATTGTCTGCTCAA
AGTTTCCACAATCTGAAAGAAACTTACCGTCATTGA
Protein sequenceShow/hide protein sequence
MSSNSSLLGVDNTEASSPINQIFGSGNKVSLVKLSDDNFLLWKFQVLTALERTTIKISHIHWEFINIRTRTPNLEYKVWKRQDRLIFSWLLGSMSEEILNQMLHCKFAKE
IWGTLQGIFFSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKMQQCVDALASINKLVSSDDHILYILADLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISET
TLPSVNIVTQTTEKGAESYIRNSQNNYHNNHSYNQRGNRNKPQCEICTKLGHSVDCCFFRYTPRSNSSSYSPNSHNTSYTNMNNHPQMSAMVASPDLNIDSNWYLDSGAT
NHLTHSLSNLSTRSKYGGGNQIYAANGSGLPITYYGSMSFHSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLYYVKDLDIRQVLLQGLLNDGFYKFT
IQPSHKRLDHHSKPNTKRLGYPHLPTVKAVLKHIDYSSSTINKMNFCEACALGKHHAFPFSHFLTHYTHPLQLITCDLWGSTVNVSYNGFGYYISFSDVFLAFQKFKTCV
EKSLGQSIKSLQIDGGTEFKPFKPFLDQYGIEHKITCPYASKQNDIVERKHRHIIEIDEAFSTSVYLINRLPTLILDNISPLWLQVLYPYLRPYQSHKLSLRSTPCTFLG
YITSHKGYKCLASDGRLFISRHVLFDENSFPYASFSSHSSIPKSKNIISLPLHSIIQSSLMNHNEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSSSITQSPS
PMEPQHQTDSGMNTQFQSTSIHPMITRSKHGIFKPKAFLIDYTQTKPCNAKEAFKHPHWKKAMEEEFEALQKNDIWRLTPQNPNQKIVARLVAKGFHRTPNIDYNETFSP
IVKLVTIHENVYMEQPFGFEVKSSYHMVCHLKKLSSSLHSLGFRTSKAGTSLLIRVTPTSCCYVLIYVDDLIIMGSSKKDVNSLVHSLNNQFALKDLGKLSYFLGVEVGP
LLYAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHIPKHTHWQLVKRILRSLKSVLYHGLSLNKSDNMSLVGFVDADGASDPDDRKSTSGFCVYFGNNLV
SWGFKKQSIISKSNTEAEHRCLALLETKLVWIRSLLNDLYINLPLPPILWCDYLSSVHLNANPILHSKTKHVELDIYFVRDLIQKGKLSIRHLPTTEQIADILTKPLSAQ
SFHNLKETYRH