CuGenDBv2

Pay0021078 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0021078
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr04:24588210..24591016
RNA-Seq Expression: Pay0021078
Synteny: Pay0021078
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
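
The genome location string above can be split into its parts programmatically. The following is a minimal Python sketch, assuming the `chrom:start..end` layout shown on this page and assuming the coordinates are 1-based and inclusive (the page does not state this); the function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("chr04:24588210..24591016")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 2807 bp on chr04.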


Homology
GenBank top hits | e-value | % identity | Alignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 0.0e+00 | 97.59
Query:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS
        MSSTSSLLGVENTEASSPINQIFGSGNKISL            FQILTALEAYDLENFLESESEPPSKYLIST SSSASATGTPNPAYKVWKRQDRLISS
Subjt:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV

Query:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD
        ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRG RGNRNKPQCQICAKLG+SAD
Subjt:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD

Query:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
        RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
Subjt:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL

Query:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH
        PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTK VFNTVVPKSNTPLLDLWH
Subjt:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH

Query:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
        RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPA+NVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
Subjt:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF

Query:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV
        LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV

Query:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP
        LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP
Subjt:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP

Query:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS
        PLHSIIPSSLMNHNEDRRHTDTVSDNTDHL+PTIVYPLETGTQESSRDDGNSGGITQSPSSMEP HQTDS
Subjt:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS

KAA0067212.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] | 1.0e-247 | 74.83
Query:  QMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH
        QMSAMVA  DLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLL                    
Subjt:  QMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH

Query:  VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFC
                     DLDTGQVLLQGLLNDGLYKFTI+PSHKRLHHS+SNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLP VKAVLNHID+SS         
Subjt:  VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFC

Query:  EACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP
                                                                              AFQKFKTCVEKSLGQSIKSLQTDGGTEFKP
Subjt:  EACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP

Query:  FKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYL
        FKPFLDQHGIEHRITCPYTSKQNDIVERKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPTPVLDNISPLEKLFCRKPNFP LRVFGCKCYPY 
Subjt:  FKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYL

Query:  RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLS
        RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSK+VLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTD+L+
Subjt:  RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLS

Query:  PTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSA-------------------GIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQ
         TIVYPLETGTQESSRDDGNSGGITQSPS MEPPHQTDS                     IFKPK FLIDYTQTE CNAKEAFNHPHWKKAMEEEFEALQ
Subjt:  PTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSA-------------------GIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQ

Query:  KNGT
        KNGT
Subjt:  KNGT

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 6.4e-189 | 42.46
Query:  FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFK
        +QI  A+  Y LE FL    + P K +     +       PNP ++ ++RQD L+ SWLL S+    L Q++ C SA E+W T+   F+S+  A+ M +K
Subjt:  FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFK

Query:  NKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT
        +++  +KK  + +++Y  K+    D LA+    +S  DHIL I+ GLG +Y+S+I+VIS++  SPS+Q V S L+  E +   K+ S     SVN  +Q 
Subjt:  NKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT

Query:  TEKGAESYIRTNQNNYHNNHSYNQRGG----RGNGRSNRG-GRGNRN---KPQCQICAKLGHSADRCFFRYTP--------------------RSNSSGY
        + +G  S   +N        + NQ GG    RG+   NRG GRG      KPQCQ+C K GH+  RCF+RY P                    R+ +SG 
Subjt:  TEKGAESYIRTNQNNYHNNHSYNQRGG----RGNGRSNRG-GRGNRN---KPQCQICAKLGHSADRCFFRYTP--------------------RSNSSGY

Query:  SPNSHNTSYTNMN-----NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNL
          ++ N + T  +     ++ +M AMVA  +   +  W+PDSGATNH+TH L NL++G+EY G ++I+  NG+GL I+H G   F SS+ P K   L N+
Subjt:  SPNSHNTSYTNMN-----NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNL

Query:  LQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPS--HKRLHHSNSNTKSVF-----------NTVVPK---SNTP
        L+VP+I KNL+SVSQFA+DN+V+FEFHP +C+VKD     +LLQG L+ GLY+F +      K    S SN K+             N+  P+   S+  
Subjt:  LQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPS--HKRLHHSNSNTKSVF-----------NTVVPK---SNTP

Query:  LLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLN
        + DLWH+RLGHP   IV  VLN       T +  + C AC LGK H LPF  S T+YT PLQL+  DLWGPA   S  GF YY+SFVDAYSRYTW+YFL 
Subjt:  LLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLN

Query:  SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN
        +KS    AF  FK   E   G  +K+ QTD G EF+  K + +Q+GI HR++CP+TSKQN I+ERKHR+I+E+GLTLL+QA+LPL +W +AFST+V+LIN
Subjt:  SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN

Query:  CLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFAS
         LPT VL    P E LF  KPN+  L+VFGC C+P+LRPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A          S
Subjt:  CLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFAS

Query:  HSST-----PKSKDV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEP--------
        HS+      P  K++        LS P  S   S  ++ N   D R       NTD  S   +         SS      G I  S +S EP        
Subjt:  HSST-----PKSKDV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEP--------

Query:  ------PHQ---TDSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQKNGT
              PH        GIFKPKV+ +D    EP   +EA +HP WK+AM+EEF AL KN T
Subjt:  ------PHQ---TDSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQKNGT

RVW80632.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 2.2e-189 | 42.76
Query:  QILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKN
        QILT L  + L++FL   S  PS++L    SS        NP ++ W++QD+LI SWLL S+++ +L +M++C ++ ++W+TL+  F+++  A+  QFK 
Subjt:  QILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKN

Query:  KLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKL-ISETALPSVNIVTQT
        +LHN KKG + + +Y LKI   VD LA +   +S  DHI  I  GL  DY++ I  +++R D  +V+E+  LLL QES+ E  + I++ + PS+  +  T
Subjt:  KLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKL-ISETALPSVNIVTQT

Query:  TEKGAESY---IRTNQNNYH-----NNHSYNQRGG---RGNGRSNRGGRGNRNKPQCQICAKLGHSADRCFFR----YTPRSNSSGYSPN---SHNTSYT
           G+  +     T  +N+       N   + RG    +G GR  RG     NKPQCQ+C ++GH   +C++R    +T  S   G  P    +H     
Subjt:  TEKGAESY---IRTNQNNYH-----NNHSYNQRGG---RGNGRSNRGGRGNRNKPQCQICAKLGHSADRCFFR----YTPRSNSSGYSPN---SHNTSYT

Query:  NMNNHPQMSAM-VAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVS
        + N  P  S++     ++  D+NWYPDSGAT+HLT +L+NL T S++   ++++  NG GLPI H G  SF+SS +P K+  L  LL VP ITKNL+SVS
Subjt:  NMNNHPQMSAM-VAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVS

Query:  QFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVF---NTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNS
        +FA DNHVFFEFHPT C+VKDL T  VL+ G L  GLY F        LH+S+    +        VP S+T    LWH RLGHP   IV  VLN  +  
Subjt:  QFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVF---NTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNS

Query:  SGTINKLN--FCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIK
           +NK+    C AC +GK H  PF HS + YT PL+LI  DLWGP    S +G +YYI F+DAYSR+TWIY L  KS+AF  F  FK+ VE  LG  IK
Subjt:  SGTINKLN--FCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIK

Query:  SLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPS
        ++Q+D G E++ F  +L  +GI HRI+CPYT +QN + ERKHR+I+E G+ LL+QA+LP  +WDEAF TSVYLIN LPTPVL N SPLE LF +KP++  
Subjt:  SLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPS

Query:  LRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPK--SKDVLSPPLHSIIPSSLMNHNEDR
        L+VFGC CYP LRP+  HKL  RS PCTFLGYS +HKGYKCL+ +G + ISR V+FDE++FP+A   S   T    S    S P  + +P  ++      
Subjt:  LRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPK--SKDVLSPPLHSIIPSSLMNHNEDR

Query:  RHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQT------DSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEAL
            + S +T   +   ++P  +          N    +Q P S  PP  +         GIFKPK +LI    T P +  EA    HWK+AM +E+ AL
Subjt:  RHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQT------DSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEAL

Query:  QKNGT
         +N T
Subjt:  QKNGT

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 0.0e+00 | 97.47
Query:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS
        MSSTSSLLGVENTEASSPINQIFGSGNKISL            FQILTALEAYDLENFLESESEPPSKYLIST SSSASATGTPNPAYKVWKRQDRLISS
Subjt:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV

Query:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD
        ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRG RGNRNKPQCQICAKLG+SAD
Subjt:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD

Query:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
        RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
Subjt:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL

Query:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH
        PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTK VFNTVVPKSNTPLLDLWH
Subjt:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH

Query:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
        RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPA+NVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
Subjt:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF

Query:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV
        LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV

Query:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP
        LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSKDVLSP
Subjt:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP

Query:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS
        PLHSIIPSSLMNHNEDRRHTDTVSDNTDHL+PTIVYPLETGTQESSRDDGNSGGITQSPSSMEP HQTDS
Subjt:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS
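
In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap column. The following is a minimal Python sketch, assuming that convention and assuming the 8-character left margin has already been stripped from the middle line; the function name `midline_stats` is hypothetical and not part of CuGenDB.

```python
def midline_stats(midline: str):
    """Return (identities, positives, columns) for one alignment midline.

    Assumed convention: letters are identical residues, '+' is a
    conservative substitution, spaces are mismatches or gap columns.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final block of the RVW80632.1 hit: query "QKNGT", midline " +N T".
identities, positives, columns = midline_stats(" +N T")
print(identities, positives, columns)
```

The % identity column reported for each hit refers to the whole alignment, so per-block counts from a function like this would need to be summed over all blocks of a hit before dividing by the total number of columns.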

TrEMBL top hits | e-value | % identity | Alignment
A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.1e-189 | 42.46
Query:  FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFK
        +QI  A+  Y LE FL    + P K +     +       PNP ++ ++RQD L+ SWLL S+    L Q++ C SA E+W T+   F+S+  A+ M +K
Subjt:  FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFK

Query:  NKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT
        +++  +KK  + +++Y  K+    D LA+    +S  DHIL I+ GLG +Y+S+I+VIS++  SPS+Q V S L+  E +   K+ S     SVN  +Q 
Subjt:  NKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT

Query:  TEKGAESYIRTNQNNYHNNHSYNQRGG----RGNGRSNRG-GRGNRN---KPQCQICAKLGHSADRCFFRYTP--------------------RSNSSGY
        + +G  S   +N        + NQ GG    RG+   NRG GRG      KPQCQ+C K GH+  RCF+RY P                    R+ +SG 
Subjt:  TEKGAESYIRTNQNNYHNNHSYNQRGG----RGNGRSNRG-GRGNRN---KPQCQICAKLGHSADRCFFRYTP--------------------RSNSSGY

Query:  SPNSHNTSYTNMN-----NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNL
          ++ N + T  +     ++ +M AMVA  +   +  W+PDSGATNH+TH L NL++G+EY G ++I+  NG+GL I+H G   F SS+ P K   L N+
Subjt:  SPNSHNTSYTNMN-----NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNL

Query:  LQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPS--HKRLHHSNSNTKSVF-----------NTVVPK---SNTP
        L+VP+I KNL+SVSQFA+DN+V+FEFHP +C+VKD     +LLQG L+ GLY+F +      K    S SN K+             N+  P+   S+  
Subjt:  LQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPS--HKRLHHSNSNTKSVF-----------NTVVPK---SNTP

Query:  LLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLN
        + DLWH+RLGHP   IV  VLN       T +  + C AC LGK H LPF  S T+YT PLQL+  DLWGPA   S  GF YY+SFVDAYSRYTW+YFL 
Subjt:  LLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLN

Query:  SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN
        +KS    AF  FK   E   G  +K+ QTD G EF+  K + +Q+GI HR++CP+TSKQN I+ERKHR+I+E+GLTLL+QA+LPL +W +AFST+V+LIN
Subjt:  SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN

Query:  CLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFAS
         LPT VL    P E LF  KPN+  L+VFGC C+P+LRPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A          S
Subjt:  CLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFAS

Query:  HSST-----PKSKDV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEP--------
        HS+      P  K++        LS P  S   S  ++ N   D R       NTD  S   +         SS      G I  S +S EP        
Subjt:  HSST-----PKSKDV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEP--------

Query:  ------PHQ---TDSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQKNGT
              PH        GIFKPKV+ +D    EP   +EA +HP WK+AM+EEF AL KN T
Subjt:  ------PHQ---TDSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQKNGT

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE1 | 1.1e-189 | 42.76
Query:  QILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKN
        QILT L  + L++FL   S  PS++L    SS        NP ++ W++QD+LI SWLL S+++ +L +M++C ++ ++W+TL+  F+++  A+  QFK 
Subjt:  QILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKN

Query:  KLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKL-ISETALPSVNIVTQT
        +LHN KKG + + +Y LKI   VD LA +   +S  DHI  I  GL  DY++ I  +++R D  +V+E+  LLL QES+ E  + I++ + PS+  +  T
Subjt:  KLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKL-ISETALPSVNIVTQT

Query:  TEKGAESY---IRTNQNNYH-----NNHSYNQRGG---RGNGRSNRGGRGNRNKPQCQICAKLGHSADRCFFR----YTPRSNSSGYSPN---SHNTSYT
           G+  +     T  +N+       N   + RG    +G GR  RG     NKPQCQ+C ++GH   +C++R    +T  S   G  P    +H     
Subjt:  TEKGAESY---IRTNQNNYH-----NNHSYNQRGG---RGNGRSNRGGRGNRNKPQCQICAKLGHSADRCFFR----YTPRSNSSGYSPN---SHNTSYT

Query:  NMNNHPQMSAM-VAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVS
        + N  P  S++     ++  D+NWYPDSGAT+HLT +L+NL T S++   ++++  NG GLPI H G  SF+SS +P K+  L  LL VP ITKNL+SVS
Subjt:  NMNNHPQMSAM-VAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVS

Query:  QFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVF---NTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNS
        +FA DNHVFFEFHPT C+VKDL T  VL+ G L  GLY F        LH+S+    +        VP S+T    LWH RLGHP   IV  VLN  +  
Subjt:  QFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVF---NTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNS

Query:  SGTINKLN--FCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIK
           +NK+    C AC +GK H  PF HS + YT PL+LI  DLWGP    S +G +YYI F+DAYSR+TWIY L  KS+AF  F  FK+ VE  LG  IK
Subjt:  SGTINKLN--FCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIK

Query:  SLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPS
        ++Q+D G E++ F  +L  +GI HRI+CPYT +QN + ERKHR+I+E G+ LL+QA+LP  +WDEAF TSVYLIN LPTPVL N SPLE LF +KP++  
Subjt:  SLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPS

Query:  LRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPK--SKDVLSPPLHSIIPSSLMNHNEDR
        L+VFGC CYP LRP+  HKL  RS PCTFLGYS +HKGYKCL+ +G + ISR V+FDE++FP+A   S   T    S    S P  + +P  ++      
Subjt:  LRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPK--SKDVLSPPLHSIIPSSLMNHNEDR

Query:  RHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQT------DSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEAL
            + S +T   +   ++P  +          N    +Q P S  PP  +         GIFKPK +LI    T P +  EA    HWK+AM +E+ AL
Subjt:  RHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQT------DSAGIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEAL

Query:  QKNGT
         +N T
Subjt:  QKNGT

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 0.0e+00 | 97.59
Query:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS
        MSSTSSLLGVENTEASSPINQIFGSGNKISL            FQILTALEAYDLENFLESESEPPSKYLIST SSSASATGTPNPAYKVWKRQDRLISS
Subjt:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV

Query:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD
        ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRG RGNRNKPQCQICAKLG+SAD
Subjt:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD

Query:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
        RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
Subjt:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL

Query:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH
        PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTK VFNTVVPKSNTPLLDLWH
Subjt:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH

Query:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
        RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPA+NVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
Subjt:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF

Query:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV
        LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV

Query:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP
        LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP
Subjt:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP

Query:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS
        PLHSIIPSSLMNHNEDRRHTDTVSDNTDHL+PTIVYPLETGTQESSRDDGNSGGITQSPSSMEP HQTDS
Subjt:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS

A0A5A7VFQ6 Retrotransposon protein, putative, Ty1-copia subclass | 5.0e-248 | 74.83
Query:  QMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH
        QMSAMVA  DLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLL                    
Subjt:  QMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNH

Query:  VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFC
                     DLDTGQVLLQGLLNDGLYKFTI+PSHKRLHHS+SNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLP VKAVLNHID+SS         
Subjt:  VFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFC

Query:  EACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP
                                                                              AFQKFKTCVEKSLGQSIKSLQTDGGTEFKP
Subjt:  EACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP

Query:  FKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYL
        FKPFLDQHGIEHRITCPYTSKQNDIVERKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPTPVLDNISPLEKLFCRKPNFP LRVFGCKCYPY 
Subjt:  FKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYL

Query:  RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLS
        RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSK+VLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTD+L+
Subjt:  RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLS

Query:  PTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSA-------------------GIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQ
         TIVYPLETGTQESSRDDGNSGGITQSPS MEPPHQTDS                     IFKPK FLIDYTQTE CNAKEAFNHPHWKKAMEEEFEALQ
Subjt:  PTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSA-------------------GIFKPKVFLIDYTQTEPCNAKEAFNHPHWKKAMEEEFEALQ

Query:  KNGT
        KNGT
Subjt:  KNGT

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 0.0e+00 | 97.47
Query:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS
        MSSTSSLLGVENTEASSPINQIFGSGNKISL            FQILTALEAYDLENFLESESEPPSKYLIST SSSASATGTPNPAYKVWKRQDRLISS
Subjt:  MSSTSSLLGVENTEASSPINQIFGSGNKISL------------FQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISS

Query:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
        WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV
Subjt:  WLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISV

Query:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD
        ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRG RGNRNKPQCQICAKLG+SAD
Subjt:  ISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSAD

Query:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
        RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL
Subjt:  RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTL

Query:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH
        PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTK VFNTVVPKSNTPLLDLWH
Subjt:  PFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWH

Query:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
        RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPA+NVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF
Subjt:  RRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF

Query:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV
        LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPTPV
Subjt:  LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPV

Query:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP
        LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSKDVLSP
Subjt:  LDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSP

Query:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS
        PLHSIIPSSLMNHNEDRRHTDTVSDNTDHL+PTIVYPLETGTQESSRDDGNSGGITQSPSSMEP HQTDS
Subjt:  PLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDS

SwissProt top hits | e-value | % identity | Alignment
P04146 Copia protein | 4.8e-38 | 25.14
Query:  PNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGS-MPLKEYFLKILQCVDALASINKPVSSDDH
        PN     WK+ +R   S ++  +S+  LN      +A++I E L  ++  + LA  +  + +L ++K  S M L  +F    + +  L +    +   D 
Subjt:  PNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGS-MPLKEYFLKILQCVDALASINKPVSSDDH

Query:  ILYILAGLGSDYQSMISVISART-DSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGR
        I ++L  L S Y  +I+ I   + ++ ++  V + LL QE            +   N    T++K   + +  N N Y NN   N+       +  +  +
Subjt:  ILYILAGLGSDYQSMISVISART-DSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGR

Query:  GN-RNKPQCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQI-YA
        GN + K +C  C + GH    CF  +  R  ++    N         +    M   V    +  +  +  DSGA++HL +  S  +   E     +I  A
Subjt:  GN-RNKPQCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQI-YA

Query:  ANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSN
          G  +  T  G +   +        TL ++L       NL+SV +  ++  +  EF  +   +       V   G+LN+      +   + + +  N+ 
Subjt:  ANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSN

Query:  TKSVFNTVVPKSNTPLLDLWHRRLGH-PHLPIVKAVLNHIDNSSGTINKL----NFCEACALGKHHALPFSH--SLTLYTHPLQLITCDLWGPAINVSHN
         K+ F             LWH R GH     +++    ++ +    +N L      CE C  GK   LPF      T    PL ++  D+ GP   V+ +
Subjt:  TKSVFNTVVPKSNTPLLDLWHRRLGH-PHLPIVKAVLNHIDNSSGTINKL----NFCEACALGKHHALPFSH--SLTLYTHPLQLITCDLWGPAINVSHN

Query:  GFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLT
           Y++ FVD ++ Y   Y +  KSD F  FQ F    E      +  L  D G E+     + F  + GI + +T P+T + N + ER  R I E   T
Subjt:  GFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLT

Query:  LLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNIS--PLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRL
        ++S A L  SFW EA  T+ YLIN +P+  L + S  P E    +KP    LRVFG   Y +++  Q  K   +S    F+GY  +  G+K   A + + 
Subjt:  LLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNIS--PLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRL

Query:  FISRHVLFDENS
         ++R V+ DE +
Subjt:  FISRHVLFDENS

P0C2I5 Transposon Ty1-LR2 Gag-Pol polyprotein | e value: 4.9e-14 | %identity: 23.78
Query:  DSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDL--DT
        DSGA+  L  S  ++ + S     N +  A    +PI   G + F+       + T   +L  P+I  +L+S+++ A           T C+ K++   +
Subjt:  DSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDL--DT

Query:  GQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPL-LDLWHRRLGHPHLPIVKAVLNH----------IDNSSGTINKLNFCEACALG
           +L  ++  G + +    S K L  SN +  ++ N    +S         HR L H + P ++  L +          +D SS    +   C  C +G
Subjt:  GQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPL-LDLWHRRLGHPHLPIVKAVLNH----------IDNSSGTINKLNFCEACALG

Query:  ---KHHALPFSHSLTLYTH-PLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNS-KSDAFL-AFQKFKTCVEKSLGQSIKSLQTDGGTEF--
           KH  +  S      ++ P Q +  D++GP  N+ ++   Y+ISF D  +++ W+Y L+  + D+ L  F      ++     S+  +Q D G+E+  
Subjt:  ---KHHALPFSHSLTLYTH-PLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNS-KSDAFL-AFQKFKTCVEKSLGQSIKSLQTDGGTEF--

Query:  KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTP
        +    FL+++GI    T    S+ + + ER +R +++   T L  + LP   W  A   S  + N L +P
Subjt:  KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.7e-57 | %identity: 25.92
Query:  WKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQ-CVDALASINKPVSSDDHILYILAG
        W   D   +S +   +S++++N ++   +A+ IW  L+ ++ S+ L   +  K +L+ +           L +    +  LA++   +  +D  + +L  
Subjt:  WKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQ-CVDALASINKPVSSDDHILYILAG

Query:  LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQC
        L S Y ++ + I     +  +++V S LL  E   +       AL         TE    SY R++ N       Y + G RG  + NR     RN   C
Subjt:  LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQC

Query:  QICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNI---DSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLP
          C + GH    C      +  +SG   + +  +    N++  +        +++   +S W  D+ A++H T  + +L      G    +   N S   
Subjt:  QICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNI---DSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLP

Query:  ITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNT
        I   G +   ++     +  L ++  VP +  NLIS     +D +  +  +      K      V+ +G+    LY+   E     L+ +          
Subjt:  ITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNT

Query:  VVPKSNTPLLDLWHRRLGH---PHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDA
                 +DLWH+R+GH     L I+ A  + I  + GT  K   C+ C  GK H + F  S     + L L+  D+ GP    S  G +Y+++F+D 
Subjt:  VVPKSNTPLLDLWHRRLGH---PHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDA

Query:  YSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSF
         SR  W+Y L +K   F  FQKF   VE+  G+ +K L++D G E+  + F+ +   HGI H  T P T + N + ER +R I+E   ++L  A LP SF
Subjt:  YSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSF

Query:  WDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFI-SRHVLFDENSF
        W EA  T+ YLIN  P+  L    P      ++ ++  L+VFGC+ + ++   Q  KL  +S PC F+GY     GY+      +  I SR V+F E+  
Subjt:  WDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFI-SRHVLFDENSF

Query:  PYASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVY--PLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSAGI----F
          A+  S     K K+ + P   + IPS+  N       TD VS+  +     I     L+ G +E           TQ     +P  +++   +    +
Subjt:  PYASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVY--PLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSAGI----F

Query:  KPKVFLIDYTQTEPCNAKEAFNHP---HWKKAMEEEFEALQKNGT
            +++     EP + KE  +HP      KAM+EE E+LQKNGT
Subjt:  KPKVFLIDYTQTEPCNAKEAFNHP---HWKKAMEEEFEALQKNGT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 7.0e-122 | %identity: 34.09
Query:  QILTALEAYDLENFLE-SESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFK
        Q+    + Y+L  FL+ S + PP+    + G+ +A      NP Y  WKRQD+LI S +LG++S  +   +    +A +IWETL+ I+++       Q +
Subjt:  QILTALEAYDLENFLE-SESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFK

Query:  NKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT
         +L    KG+  + +Y   ++   D LA + KP+  D+ +  +L  L  +Y+ +I  I+A+   P++ E+   LL  ES+  + + S T +P    +T  
Subjt:  NKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQT

Query:  TEKGAESYIRTNQNNYHNNHSYNQRGGRGNGR------SNRGGRGNRNKP---QCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMS
              +    N NN + N+ Y+ R    N +      +N     N++KP   +CQIC   GHSA RC       S+ +   P S  T +       Q  
Subjt:  TEKGAESYIRTNQNNYHNNHSYNQRGGRGNGR------SNRGGRGNRNKP---QCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMS

Query:  AMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFF
        A +A       +NW  DSGAT+H+T   +NLS    Y GG+ +  A+GS +PI+H GS S ++ + P     L+N+L VP+I KNLISV +    N V  
Subjt:  AMVAALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFF

Query:  EFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNS-SGTINKLNFCEA
        EF P    VKDL+TG  LLQG   D LY++ I  S      ++ ++K+  ++            WH RLGHP   I+ +V+++   S     +K   C  
Subjt:  EFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNS-SGTINKLNFCEA

Query:  CALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK
        C + K + +PFS S    T PL+ I  D+W   I +SH+ +RYY+ FVD ++RYTW+Y L  KS     F  FK  +E      I +  +D G EF    
Subjt:  CALGKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK

Query:  PFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRP
         +  QHGI H  + P+T + N + ERKHR+I+E GLTLLS A++P ++W  AF+ +VYLIN LPTP+L   SP +KLF   PN+  LRVFGC CYP+LRP
Subjt:  PFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRP

Query:  YQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSTPKSKDVL-----SPPLHSIIP--
        Y  HKL  +S  C FLGYS +   Y CL     RL+ISRHV FDEN FP++++                  + H++ P    VL     S P H+  P  
Subjt:  YQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSTPKSKDVL-----SPPLHSIIP--

Query:  --------SSLMNHNEDRRHTDTVSDNTDHLSP------TIVYPLETGTQ-ESSRDDGNSGGITQSPS----SMEPPHQTDSA
                S + + N D   + +   + +  +P          P +T TQ  SS++   +    +SPS    S+  P Q+ S+
Subjt:  --------SSLMNHNEDRRHTDTVSDNTDHLSP------TIVYPLETGTQ-ESSRDDGNSGGITQSPS----SMEPPHQTDSA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.9e-122 | %identity: 35.33
Query:  QILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKN
        Q+    + Y+L  FL+  +  P        +    A    NP Y  W+RQD+LI S +LG++S  +   +    +A +IWETL+ I+++       Q   
Subjt:  QILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKN

Query:  KLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTT
                        L+ +   D LA + KP+  D+ +  +L  L  DY+ +I  I+A+   PS+ E+   L+ +ES+  +   +E    + N+VT   
Subjt:  KLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTT

Query:  EKGAESYIRTNQNNYHNNHSYNQRGGRGNG--RSNRGGRGNRNKP-----QCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMV
             +    NQNN  +N +YN    R N    S+ G R +  +P     +CQIC+  GHSA RC   +  +S ++     S  T +       Q  A +
Subjt:  EKGAESYIRTNQNNYHNNHSYNQRGGRGNG--RSNRGGRGNRNKP-----QCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMV

Query:  AALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFH
        A       +NW  DSGAT+H+T   +NLS    Y GG+ +  A+GS +PITH GS S  +S+   +S  LN +L VP+I KNLISV +    N V  EF 
Subjt:  AALDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFH

Query:  PTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVL-NHIDNSSGTINKLNFCEACAL
        P    VKDL+TG  LLQG   D LY++ I         ++S   S+F +   K+       WH RLGHP L I+ +V+ NH        +KL  C  C +
Subjt:  PTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVL-NHIDNSSGTINKLNFCEACAL

Query:  GKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFL
         K H +PFS+S    + PL+ I  D+W   I +S + +RYY+ FVD ++RYTW+Y L  KS     F  FK+ VE      I +L +D G EF   + +L
Subjt:  GKHHALPFSHSLTLYTHPLQLITCDLWGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFL

Query:  DQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQS
         QHGI H  + P+T + N + ERKHR+I+EMGLTLLS A++P ++W  AFS +VYLIN LPTP+L   SP +KLF + PN+  L+VFGC CYP+LRPY  
Subjt:  DQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQS

Query:  HKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF---ASHSSTPKSKDVLSPPLHSIIPSSLMNHNED---RRHTDTVSDNTDH
        HKL  +S  C F+GYS +   Y CL    GRL+ SRHV FDE  FP+++     S S   +S    + P H+ +P++ +          H DT       
Subjt:  HKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF---ASHSSTPKSKDVLSPPLHSIIPSSLMNHNED---RRHTDTVSDNTDH

Query:  LSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEP-----PHQTDSAGIFKP
         SP     + +    SS     S     +PS   P     PHQT ++    P
Subjt:  LSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEP-----PHQTDSAGIFKP

Arabidopsis top hits | e value | %identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 7.7e-07 | %identity: 25.32
Query:  NPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKI
        +P Y+ W++ + ++  WL+ SM++++L  +++ ++A ++WE L+ +F      +  Q + +L  +++G   ++EYF K+
Subjt:  NPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 2.7e-07 | %identity: 22.84
Query:  WKRQDRLISSWLLGSMS-EEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAG
        W+++D ++   L G+++ ++     +   ++++IW  ++  F +   A+A++  ++L     G M + +Y+ K+ +  D+L +++ PV+  + ++Y+L G
Subjt:  WKRQDRLISSWLLGSMS-EEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAG

Query:  LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNR--GGRGNR
        L   + ++I+VI  R   PS  +  ++L     Q E   +     P+   V  ++     +               NQ G RG GR N    GRG R
Subjt:  LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNR--GGRGNR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.1e-16 | %identity: 28.36
Query:  GSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCK-SAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALAS
        G    S+T TP    K WK +D L+  W+ G++++ +L+ ++    +A+++W +L+ +F     A+A+QF+N+L       + + EY  K+    D L +
Subjt:  GSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCK-SAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALAS

Query:  INKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQ--NESK-LISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHS-----
        ++ P+S    ++++L GL   Y  +++VI  ++  PS  E  S+LL +ES+  N+SK  +S T  PS++ V  T  +  E Y       YHNN+S     
Subjt:  INKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQ--NESK-LISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHS-----

Query:  ----YNQRGGRGNGRSNRGGRGNRNKPQCQICA------KLGHSADRCFFR--YTPRSNSSGYSPNSHNTSYTNM
             N+ GG  +GR N       N+P   I           H   + F +  Y P+      S  SH  S T++
Subjt:  ----YNQRGGRGNGRSNRGGRGNRNKPQCQICA------KLGHSADRCFFR--YTPRSNSSGYSPNSHNTSYTNM
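
The %identity column can be checked against the match line of an alignment. The sketch below is an illustration only: it assumes the column is the number of identical positions (letters on the match line, where `+` marks a conservative substitution and a space a mismatch or gap) divided by the alignment length. The function name `percent_identity` is hypothetical and not part of this database. The input is the single-block AT1G21280.1 alignment listed above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return identities / alignment length as a percentage.

    Assumption: letters on the match line are identical residues,
    '+' is a conservative substitution, and a space is a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    # The aligned query row (gap characters included) spans the whole alignment.
    return 100.0 * identities / len(query)


# Query row and match line of the AT1G21280.1 hit, copied from the alignment above.
query = "NPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKI"
midline = "+P Y+ W++ + ++  WL+ SM++++L  +++ ++A ++WE L+ +F      +  Q + +L  +++G   ++EYF K+"

print(f"{percent_identity(query, midline):.2f}")
```

For this hit the match line holds 20 identical residues over 79 aligned columns, which is consistent with the listed 25.32. Hits whose alignments span several blocks would need the rows concatenated before counting.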


Sequences
CDS sequence
ATGAGTTCAACCTCATCCCTACTCGGTGTTGAGAACACTGAAGCATCTTCACCGATTAATCAAATATTTGGATCGGGCAACAAAATATCGTTATTCCAAATTCTTACAGC
ATTAGAAGCTTATGATCTGGAAAATTTTCTTGAATCTGAATCAGAACCACCATCAAAATATCTCATATCCACTGGGAGTTCATCAGCATCCGCTACTGGAACACCAAATC
CGGCATATAAGGTATGGAAACGCCAAGATCGCCTTATCTCCTCATGGCTTTTAGGATCTATGAGTGAAGAAATATTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAA
ATTTGGGAAACTCTTCAAGGTATTTTCTCTTCCCGTTACTTGGCACAAGCTATGCAATTCAAAAACAAACTTCACAATATAAAGAAAGGATCCATGCCATTAAAAGAATA
CTTTCTCAAAATACTGCAGTGTGTTGATGCCTTGGCTTCAATTAACAAACCAGTTTCATCTGATGATCATATTCTGTACATATTGGCTGGTTTAGGATCTGATTATCAAT
CCATGATATCTGTTATTTCCGCCAGAACTGACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCAGGAATCTCAAAATGAGAGCAAATTAATCAGCGAAACT
GCTCTACCTTCTGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGACCAACCAAAACAACTATCACAACAATCATTCCTACAATCAAAGGGG
TGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGCCAAATCTGTGCAAAGCTTGGACATAGTGCTGATCGCTGCTTCTTTCGAT
ATACTCCAAGATCAAATTCATCAGGTTACTCACCGAACTCACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTGCCCTCGACCTG
AATATTGACAGCAATTGGTATCCTGATTCGGGAGCTACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGC
AGCAAATGGGTCAGGTTTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTACATTACCATTTAAATCGTTTACACTCAATAACTTACTCCAAGTTCCATCTA
TTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAATTTCACCCCACTTTGTGTTATGTGAAGGATCTGGATACTGGCCAAGTACTT
CTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTGAACCATCACATAAAAGACTTCACCATTCTAACTCCAACACCAAGTCCGTTTTCAATACCGTCGTACC
TAAATCTAATACTCCCTTACTTGATTTATGGCATAGAAGACTAGGTCATCCCCATTTACCTATTGTTAAAGCTGTTTTGAATCACATTGACAATTCTTCTGGTACTATAA
ATAAACTGAATTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTTACTCTTTATACACATCCTTTACAACTTATTACTTGTGATTTA
TGGGGTCCTGCCATAAATGTATCTCATAATGGTTTTAGATATTACATAAGTTTTGTTGATGCCTATAGTAGATACACCTGGATATATTTCTTAAATTCCAAGTCTGATGC
CTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTC
TTGATCAACATGGCATTGAACATAGAATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGTATATCATGGAAATGGGTCTTACATTGCTA
TCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTCCACTAGTGTCTATCTCATAAATTGTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAA
GCTATTTTGCCGGAAACCTAACTTTCCTTCTCTTAGAGTTTTTGGCTGCAAGTGTTATCCCTACCTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCAT
GTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATAT
GCATCATTTGCATCTCATTCTAGCACACCCAAATCCAAAGATGTTCTATCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACA
CACTGACACAGTTTCTGATAACACTGATCATCTAAGCCCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTA
CTCAATCTCCAAGTTCTATGGAACCACCGCATCAAACTGATTCTGCTGGTATTTTTAAACCAAAAGTATTCTTGATTGATTATACTCAAACTGAACCTTGCAATGCCAAG
GAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGA
mRNA sequence
ATGAGTTCAACCTCATCCCTACTCGGTGTTGAGAACACTGAAGCATCTTCACCGATTAATCAAATATTTGGATCGGGCAACAAAATATCGTTATTCCAAATTCTTACAGC
ATTAGAAGCTTATGATCTGGAAAATTTTCTTGAATCTGAATCAGAACCACCATCAAAATATCTCATATCCACTGGGAGTTCATCAGCATCCGCTACTGGAACACCAAATC
CGGCATATAAGGTATGGAAACGCCAAGATCGCCTTATCTCCTCATGGCTTTTAGGATCTATGAGTGAAGAAATATTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAA
ATTTGGGAAACTCTTCAAGGTATTTTCTCTTCCCGTTACTTGGCACAAGCTATGCAATTCAAAAACAAACTTCACAATATAAAGAAAGGATCCATGCCATTAAAAGAATA
CTTTCTCAAAATACTGCAGTGTGTTGATGCCTTGGCTTCAATTAACAAACCAGTTTCATCTGATGATCATATTCTGTACATATTGGCTGGTTTAGGATCTGATTATCAAT
CCATGATATCTGTTATTTCCGCCAGAACTGACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCAGGAATCTCAAAATGAGAGCAAATTAATCAGCGAAACT
GCTCTACCTTCTGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGACCAACCAAAACAACTATCACAACAATCATTCCTACAATCAAAGGGG
TGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGCCAAATCTGTGCAAAGCTTGGACATAGTGCTGATCGCTGCTTCTTTCGAT
ATACTCCAAGATCAAATTCATCAGGTTACTCACCGAACTCACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTGCCCTCGACCTG
AATATTGACAGCAATTGGTATCCTGATTCGGGAGCTACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGC
AGCAAATGGGTCAGGTTTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTACATTACCATTTAAATCGTTTACACTCAATAACTTACTCCAAGTTCCATCTA
TTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAATTTCACCCCACTTTGTGTTATGTGAAGGATCTGGATACTGGCCAAGTACTT
CTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTGAACCATCACATAAAAGACTTCACCATTCTAACTCCAACACCAAGTCCGTTTTCAATACCGTCGTACC
TAAATCTAATACTCCCTTACTTGATTTATGGCATAGAAGACTAGGTCATCCCCATTTACCTATTGTTAAAGCTGTTTTGAATCACATTGACAATTCTTCTGGTACTATAA
ATAAACTGAATTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTTACTCTTTATACACATCCTTTACAACTTATTACTTGTGATTTA
TGGGGTCCTGCCATAAATGTATCTCATAATGGTTTTAGATATTACATAAGTTTTGTTGATGCCTATAGTAGATACACCTGGATATATTTCTTAAATTCCAAGTCTGATGC
CTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTC
TTGATCAACATGGCATTGAACATAGAATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGTATATCATGGAAATGGGTCTTACATTGCTA
TCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTCCACTAGTGTCTATCTCATAAATTGTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAA
GCTATTTTGCCGGAAACCTAACTTTCCTTCTCTTAGAGTTTTTGGCTGCAAGTGTTATCCCTACCTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCAT
GTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATAT
GCATCATTTGCATCTCATTCTAGCACACCCAAATCCAAAGATGTTCTATCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACA
CACTGACACAGTTTCTGATAACACTGATCATCTAAGCCCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTA
CTCAATCTCCAAGTTCTATGGAACCACCGCATCAAACTGATTCTGCTGGTATTTTTAAACCAAAAGTATTCTTGATTGATTATACTCAAACTGAACCTTGCAATGCCAAG
GAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGA
Protein sequence
MSSTSSLLGVENTEASSPINQIFGSGNKISLFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE
IWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISET
ALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGGRGNRNKPQCQICAKLGHSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDL
NIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVL
LQGLLNDGLYKFTIEPSHKRLHHSNSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDL
WGPAINVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLL
SQATLPLSFWDEAFSTSVYLINCLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPY
ASFASHSSTPKSKDVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDHLSPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPPHQTDSAGIFKPKVFLIDYTQTEPCNAK
EAFNHPHWKKAMEEEFEALQKNGT
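
The relationship between the CDS and protein sequences above can be spot-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard code applies to this nuclear gene and that translation starts in frame at the first base. `translate` and `CODON_TABLE` are hypothetical helper names, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGAGTTCAACCTCATCCCTACTCGGTGTT"
print(translate(cds_start))  # MSSTSSLLGV
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full CDS, joined into one string without line breaks, would allow the same comparison over the whole record.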