; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G06930 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G06930
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr08:18453493..18464244
RNA-Seq ExpressionClc08G06930
SyntenyClc08G06930
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR009943 - Protein of unknown function DUF1475
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN60456.1 hypothetical protein VITISV_019554 [Vitis vinifera]1.0e-12234.64Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLG-----------------DIRQGEDGCIYKFLAGLNVEFDEIRGRILGKSNLPT
        LVNS+ +D SCNYMCY T +ELW++V QMYS+LGNQ Q+FEL LKL                    +  ED  I+KFLAGLNVEFDE+RGRI+G+  LP+
Subjt:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLG-----------------DIRQGEDGCIYKFLAGLNVEFDEIRGRILGKSNLPT

Query:  INDVFYEVRREESRMNIMIGKKSID-SVESPALVIE-NIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNI
        I +VF EVRREESR N+M+GKK    ++E  ALV       K      K+ ++PRVWC  CNKPRHTRE CWK+HGKPANWK                  
Subjt:  INDVFYEVRREESRMNIMIGKKSID-SVESPALVIE-NIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNI

Query:  VDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEK-----------------VYIADGC
                                      +TGN   AL+C   S  WIIDSR+ +H+T+ S +FESYSP   N+K                  Y  D  
Subjt:  VDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEK-----------------VYIADGC

Query:  FTSMAGKGTILPSTKLILRSVLHVPQLAC-RLSSSVSKISKDANCRVIFCEIHCLF----------QDQD------------------SGETIARTRMID
         ++   +G    S+  +   +++   L C +     S+I+ +   ++  C  +             +DQ                   SG        I 
Subjt:  FTSMAGKGTILPSTKLILRSVLHVPQLAC-RLSSSVSKISKDANCRVIFCEIHCLF----------QDQD------------------SGETIARTRMID

Query:  GSS-----LSSPSH-NMLPDVS-----------------DLDISIAHRKATR------------------------------------------------
         SS     LS PSH    P++S                 DLD+ IA RK TR                                                
Subjt:  GSS-----LSSPSH-NMLPDVS-----------------DLDISIAHRKATR------------------------------------------------

Query:  ---------------------------------------------------------------------------------QFNFDWPLYQLHAKNAFLN
                                                                                           N +WPL+QL  KN FLN
Subjt:  ---------------------------------------------------------------------------------QFNFDWPLYQLHAKNAFLN

Query:  GEVEEEVLMDLPPALKQTLGL--------------------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKL
        G +EEEV M  PP  +++ G+                                 +QSQADHTMFYKH+   KV +LIVY+D + LTG+D   +  +K KL
Subjt:  GEVEEEVLMDLPPALKQTLGL--------------------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKL

Query:  ADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET----------------------------------------------------VSMVSQ
         ++F+IK++ + KYFLGMEFARSK G+ VN+RKY+LDLL  T                                                    VSMVSQ
Subjt:  ADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET----------------------------------------------------VSMVSQ

Query:  FMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWM
        FMHA  P HFEAV+RILRYLKGTPG+G+LFK   HLQ+E +TDADWAGS  ++RS+SGYCSFVGGNLVTWRSKKQ+VVARSS E EFRA+ HGICE +W+
Subjt:  FMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWM

Query:  KKLLEELKCSQTIPI
        ++LLEELK + + P+
Subjt:  KKLLEELKCSQTIPI

CAN76591.1 hypothetical protein VITISV_020293 [Vitis vinifera]5.5e-12935.26Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGCIYKFLAGLNVEFDEIRGRILGKSNLPTINDVF--YEVRREESRM
        LVNS+ +D S NYMCY T ++LW++V QMYS+LGNQ Q+FEL  KLG+IRQ ED  I K+   L   + ++              D+F  YE +  E   
Subjt:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGCIYKFLAGLNVEFDEIRGRILGKSNLPTINDVF--YEVRREESRM

Query:  NIMIGKKSIDSVESPALVIENIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSPF-KEQIDRILKL
            G+    ++E   +    + +      +K  ++PRVWC   NKPRHTRE CWK+HGKPANWK  K  ++        +N  ++SPF  +Q++ +L L
Subjt:  NIMIGKKSIDSVESPALVIENIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSPF-KEQIDRILKL

Query:  LKSN-SLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLS
        LKSN +    SVS+   GN   AL+C   S  WIIDS +++H+TS S +FESYSP   N+KV IADG F+ + GKG I  S  + L+SVLHVP+L C L 
Subjt:  LKSN-SLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLS

Query:  SSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDG------------------------------------------------------------
          VSK+SKD+NC VIF E HC+FQD  SG+TI   RMI+G                                                            
Subjt:  SSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDG------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------SSLSSPSH-NMLPDVS----------
                                                                                  + LS PSH +  P++S          
Subjt:  --------------------------------------------------------------------------SSLSSPSH-NMLPDVS----------

Query:  -------DLDISIAHRKATRQF-------------------------------NFDWPLYQLHAKNAFLNGEVEEEVLMDLPPALKQTLGL---------
               DLD+ IA RK TR +                               N +WPL+QL  KN FLNG++EEEV M  PP  +++ G+         
Subjt:  -------DLDISIAHRKATRQF-------------------------------NFDWPLYQLHAKNAFLNGEVEEEVLMDLPPALKQTLGL---------

Query:  -----------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVN
                                +QSQADHTMFYKH+   KV +LIVY+DD++LTG+D   +  +K KLA++F+IKDL + KYFLGMEFARSK G+ VN
Subjt:  -----------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVN

Query:  ERKYILDLLKET---VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVA
        ++KY+LD L ET   VSMVSQFMHA    HFEAV RILRYLKGT G+G+LFK   HLQ+E + DA WAGS  ++RS+SGYCSFVGGNLVTWRSKKQ+VVA
Subjt:  ERKYILDLLKET---VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVA

Query:  RSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
        RSSA+ EFRA+AHGICE +W+++LLEELK + + P+
Subjt:  RSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

CAN82268.1 hypothetical protein VITISV_009286 [Vitis vinifera]2.6e-13440.96Show/hide
Query:  VFELNLKLGDIRQGEDGC---------------------------------------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMN
        +FEL LKLG+IRQGED                                         I+KFL GLNVEFDE+RGRI+G+  L +I +VF EV REESR N
Subjt:  VFELNLKLGDIRQGEDGC---------------------------------------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMN

Query:  IMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSPF-KEQIDRILK
        +M+GKK    ++E  ALV  +    KA+       ++P+VWC  CNKPRHTRE  WK+HGK ANWK  K  ++        +N  ++S F   Q++ +L 
Subjt:  IMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSPF-KEQIDRILK

Query:  LLKSN-SLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRL
        LLKSN +    SVSL  T N   AL+C   S  WIIDS +++H+T+ S +FESYSP   N+KV IA+  F+ +A KG I  S  + L+SVLHVP+L C L
Subjt:  LLKSN-SLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRL

Query:  SSSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDG-----------------SSLSSPS-HNMLPDVSDLDISIAHRKATR-------------
          SVSK+SKD+NC VIF E HC+FQDQ SG+TI   RMI+G                 SS+SS S  + + D+ DLD+ IA RK TR             
Subjt:  SSSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDG-----------------SSLSSPS-HNMLPDVSDLDISIAHRKATR-------------

Query:  -----------------------QFNFD---WPLYQLHAKNAF-----------------------------LNGEVEEEVLMDLPPALKQTLGLT----
                               Q   D   W L      NA                               +G VE+     +     QT G+     
Subjt:  -----------------------QFNFD---WPLYQLHAKNAF-----------------------------LNGEVEEEVLMDLPPALKQTLGLT----

Query:  ------------------------------SQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSK
                                      +QSQADHTMFYKH+   KVV+LIVY+DD++LTG+D   +  +K KLA++F+IKDL + KYFLGMEFARSK
Subjt:  ------------------------------SQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSK

Query:  TGLLVNERKYILDLLKET----------------------------------------------------VSMVSQFMHAREPAHFEAVFRILRYLKGTP
         G+ VN+RKY+LDLL ET                                                    VSMVSQFMHA  P HFE V+RILRYLKGTP
Subjt:  TGLLVNERKYILDLLKET----------------------------------------------------VSMVSQFMHAREPAHFEAVFRILRYLKGTP

Query:  GKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
        G+G+LFK   HLQ+E +TDADWA S  ++ S+S YCSFVGGNLVTWRSKK + VARSS E EFRA+AHGICE +W+++L EELK + + P+
Subjt:  GKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

GAU48207.1 hypothetical protein TSUD_304870 [Trifolium subterraneum]2.6e-12336.51Show/hide
Query:  SDTKVPAAKLVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQG---------------------------------------
        S+  +  A LV+++       YM   +AK++W++V + YSN+ N  Q+F L  KL   +QG                                       
Subjt:  SDTKVPAAKLVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQG---------------------------------------

Query:  EDGCIYKFLAGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSIDS-VESPALVIENIIMKASDQSNKTHDKPRVWCYHCNKPRHTRET
        E+  ++ FLAGLN + DE+RGR+LGK  LPT+ + F E++REESR  IM+GK    S  E  AL   N       +  K  + P  WC H  +  HTRE 
Subjt:  EDGCIYKFLAGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSIDS-VESPALVIENIIMKASDQSNKTHDKPRVWCYHCNKPRHTRET

Query:  CWKLHGKPANW-------KSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALACLNSFLWIIDSRSTEHITSFSCL
        CWKL GKP NW       K  + F+ ++      S    S    EQ++++ KLL+S +   PS S+   GNS   L+   +  WI+DS +++H+T  S L
Subjt:  CWKLHGKPANW-------KSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALACLNSFLWIIDSRSTEHITSFSCL

Query:  FESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCR-------------------VIFCEIHCLFQDQ----
        F SYSP   N+K+ +ADG F+++AGKG+++ S  L L++VLHVP L+C L S ++      N +                    +  E+  L +++    
Subjt:  FESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCR-------------------VIFCEIHCLFQDQ----

Query:  ----DSGETIARTRMI------DGSSLSSPSHNMLP--------DVSDLDISIAHRKATR-----QFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPPALK
            D  +T+    +       DGS     +  +          D S+    +A     R       N DWPL+QL  KNAFLNG++EEEV M++PP  +
Subjt:  ----DSGETIARTRMI------DGSSLSSPSHNMLP--------DVSDLDISIAHRKATR-----QFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPPALK

Query:  QTLGLT-------------------------------SQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLG
           G                                  Q QA HT+F K + + K+VVLIVY+DD++LTGND   +  VKEKLA DF+IKDL S +YFLG
Subjt:  QTLGLT-------------------------------SQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLG

Query:  MEFARSKTGLLVNERKYILDLLKET---------------------------------------------------VSMVSQFMHAREPAHFEAVFRILR
        ME ARS  G+LV+++KYILDLLKET                                                   VS+VSQFMH+    H EAV+RILR
Subjt:  MEFARSKTGLLVNERKYILDLLKET---------------------------------------------------VSMVSQFMHAREPAHFEAVFRILR

Query:  YLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
        YLK  PGKG+ F+K +   V + TD DWAGS T++RS+SGYC +V GNLVTWRSKKQ VVARSSA+ EFRA+ HGICEG+W+ ++LEELK    +P+
Subjt:  YLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

RVW93810.1 Protein FORGETTER 1 [Vitis vinifera]3.6e-13641.05Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELW--DSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC----------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVF
        LVNS+ +D S NYMCY TA+ELW  D+VT+ +++L    Q  +L     + +  EDG           I+KFLAGLNVEFDE+RGRI+G+ +LP+I +VF
Subjt:  LVNSVVKDFSCNYMCYSTAKELW--DSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC----------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVF

Query:  YEVRREESRMNIMIGKKSID-SVESPALVIE-NIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSP
         E RREESR N+M+GKK    ++E  ALV       K      K+ ++P+VWC  CNKPRHTRE CWK+HGKPANWK  K  ++        +N  ++SP
Subjt:  YEVRREESRMNIMIGKKSID-SVESPALVIE-NIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSP

Query:  F-KEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEK-----------------VYIADGCFTS
        F  EQ++ +L LLKSN +    SVSL  TGN   AL+ C  S  WIIDS +++H+T+ S +FESYSP   N+K                  Y  D   ++
Subjt:  F-KEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEK-----------------VYIADGCFTS

Query:  MAGKGTILPSTKLILRSVLHVPQLACRL---SSSVSKIS----------------------------KDANCRVIF------CEIHCL--------FQDQ
           +G  L +  L + S  + P +   +   SSSV+ +S                             D +  +        C  H +          D 
Subjt:  MAGKGTILPSTKLILRSVLHVPQLACRL---SSSVSKIS----------------------------KDANCRVIF------CEIHCL--------FQDQ

Query:  DSGETIARTRMIDGS--SLSSPSHNMLPDV-------SDLDISIAHRKA-------TRQFNFDW-------------------------PLYQLHAKNAF
            T   +++++G+   +  P    +          S  D S+   K        T+ +  D+                         PL+QL  KN F
Subjt:  DSGETIARTRMIDGS--SLSSPSHNMLPDV-------SDLDISIAHRKA-------TRQFNFDW-------------------------PLYQLHAKNAF

Query:  LNGEVEEEVLMDLPPALKQTLGL--------------------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKE
        LNG++E+EV M  PP  +++ G+                                 +QSQ DHTMFYKH+   KV +LIVY+DD++LTG+D   +  +K 
Subjt:  LNGEVEEEVLMDLPPALKQTLGL--------------------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKE

Query:  KLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKETVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGST
        KL ++F+IKDL + KYFLGMEFA+SK G+ VN+RKY+LDLL ETVSMVSQFMHA  P HFE V+RILRYLKGTPG+G+LFK   HLQ+E +TDADWAGS 
Subjt:  KLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKETVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGST

Query:  TNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
         ++RS+ GYCSFVGGNLVTWRSKKQ+VVARSSAE EFRA+AHGICE +W+++LLEELK + + P+
Subjt:  TNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

TrEMBL top hitse value%identityAlignment
A0A2N9EE05 Uncharacterized protein5.9e-15343.63Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC---------------------------------------IYKFL
        LVNS+ +D S NYMCY TA+ELW++V QMYS+LGNQ Q+FEL LKLG++RQGED                                         I+KFL
Subjt:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC---------------------------------------IYKFL

Query:  AGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKP
        AGLN+EFDE+RGR++G+  LP+I DVF EVRREESR N+M+GKK    +VES ALV  +    KA     +T DKPRVWC +CNKP HTRETCWK+HGKP
Subjt:  AGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKP

Query:  ANWKSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFL-WIIDSRSTEHITSFSCLFESYSPVYCNEK
        ANWKSSK  +R+     + +    +S  KEQ++ +L LLKSN S GIPSVS+ QTGN P AL+ CLNS   WIIDS +++H+TS    FESYSP    EK
Subjt:  ANWKSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFL-WIIDSRSTEHITSFSCLFESYSPVYCNEK

Query:  VYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCRVI------------------FCEIHCL-FQDQDSGETIA-------RT
        V IADG F+S+AGKG I  S ++ L+SVLHVP+LAC L S  S      + R+I                  F  I  +  ++Q  GE I        ++
Subjt:  VYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCRVI------------------FCEIHCL-FQDQDSGETIA-------RT

Query:  RMIDGSSLSSPSHN---------------------------MLPDVSDL----------DISIAHRKATRQFNFDWPLYQLHAKNAFLNGEVEEEVLMDL
          +  + +  PS +                            LP   ++          ++ +  RK   + + D P+  +  ++  LN       ++ +
Subjt:  RMIDGSSLSSPSHN---------------------------MLPDVSDL----------DISIAHRKATRQFNFDWPLYQLHAKNAFLNGEVEEEVLMDL

Query:  PPALKQTLG-----LTSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDL
        P  +++ L      L    + +      H G           DD+ILTG+D   +  +KE+LAD+F+IKDL + KYFLGMEFA+SK G+ VN+RKY+LDL
Subjt:  PPALKQTLG-----LTSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDL

Query:  LKET----------------------------------------------------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQV
        L ET                                                    VSMVSQFMH+  P HFEAV+RILRYLKGTPGKG+LFKK  HLQ+
Subjt:  LKET----------------------------------------------------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQV

Query:  EVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
        E +TDADWAGS T++RS+SGYCSFVGGNLVTWRSKKQ+VVARSSAE EFRA+AHGICE +W+K++LEELK S ++P+
Subjt:  EVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

A0A2N9GKJ5 Uncharacterized protein1.5e-13234.83Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC---------------------------------------IYKFL
        LVNS+ +D S NYMCY TA+ELW++V QMYS+LGNQ Q+FEL LKLG++RQGED                                         I+KFL
Subjt:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC---------------------------------------IYKFL

Query:  AGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKP
         GLN+EFDE+RGR++G+  LP+I DVF EVRREESR N+M+GKK    +VES ALV  +    KA     +T DKPRVWC +CNKPRHTRETCWK+HGKP
Subjt:  AGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKP

Query:  ANWKSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALACLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYI
        ANWKSSK  +R+     + +    +S  KEQI     + K                  Q + C                                EKV I
Subjt:  ANWKSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALACLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYI

Query:  ADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCRV--------------------------------------------------
        ADG F+S+AGKG I  S ++ L+SVLHVP+LAC L S  S      N R+                                                  
Subjt:  ADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCRV--------------------------------------------------

Query:  -----------------IFCEIHCLFQDQDSG---------------------ETIARTRM---------------------------IDGSSLSSP-SH
                         IF EI    QD  SG                     E +  TR                            I G+S   P   
Subjt:  -----------------IFCEIHCLFQDQDSG---------------------ETIARTRM---------------------------IDGSSLSSP-SH

Query:  NMLP------------------DVSDLDISIAHRKATR--------------------------------------------------------------
        N +P                    SDLDI IA RK  R                                                              
Subjt:  NMLP------------------DVSDLDISIAHRKATR--------------------------------------------------------------

Query:  --------------------------------------------------------------------QFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPP
                                                                              N +WPL+QL  KNAFLNG++EEEV M LPP
Subjt:  --------------------------------------------------------------------QFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPP

Query:  ALKQTLGLTS--------------------------------QSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSK
          ++ LG+                                  QSQADHTMFYKH+   K+ +LIVY+DD+ILTG+D   +  +KE+LAD+F+IKDL + K
Subjt:  ALKQTLGLTS--------------------------------QSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSK

Query:  YFLGMEFARSKTGLLVNERKYILDLLKET----------------------------------------------------VSMVSQFMHAREPAHFEAV
        YFLGMEFA+SK G+ VN+ KY+LDLL ET                                                    VSMVSQFMH+  P HFEAV
Subjt:  YFLGMEFARSKTGLLVNERKYILDLLKET----------------------------------------------------VSMVSQFMHAREPAHFEAV

Query:  FRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTI
        +RILRYLKGTPGKG+ FKK  HLQ+E +TDADWAGS T++RS+SGYCSFVGGNLVTWRSKKQ+VVARSSAE EFRA+AHGICE +W+K++LEELK S ++
Subjt:  FRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTI

Query:  PI
        P+
Subjt:  PI

A0A2N9I543 Uncharacterized protein5.7e-14042.71Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC---------------------------------------IYKFL
        LVNS+ +D S NYMCY TA+ELW++V QMYS+LGNQ Q+FEL LKLG++RQGED                                         I+KFL
Subjt:  LVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC---------------------------------------IYKFL

Query:  AGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKP
        AGLN+EFDE+RGR++G+  LP+I DVF EVRREESR N+M+GKK    +VES ALV  +    KA     +T DKPRVWC +CNKPRHTRETCWK+HGKP
Subjt:  AGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKP

Query:  ANWKSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFL-WIIDSRSTEHITSFSCLFESYSPVYCNEK
        ANWKSSK  +R+     + +    +S  KEQ++ +L LLKSN S GIPSVS+ QTGN P AL+ CLNS   WIIDS +++H+TS    FESYSP    EK
Subjt:  ANWKSSKQFERNSHQHASNSNIVDSSPFKEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFL-WIIDSRSTEHITSFSCLFESYSPVYCNEK

Query:  VYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCRVI------------------FCEIHCL-FQDQDSGETIA-------RT
        V IADG F+S+AGKG I  S ++ L+SVLHVP+LAC L S  S      + R+I                  F  I  +  ++Q  GE I        ++
Subjt:  VYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRLSSSVSKISKDANCRVI------------------FCEIHCL-FQDQDSGETIA-------RT

Query:  RMIDGSSLSSPSHN------MLPDVS----------------------------DLDISIAHRKATRQFNFD------------WPLYQLHAKNAFLNGE
          +  + +  PS +      + P++                             + ++ +  RK   + + D            W L  L   NA     
Subjt:  RMIDGSSLSSPSHN------MLPDVS----------------------------DLDISIAHRKATRQFNFD------------WPLYQLHAKNAFLNGE

Query:  VEEEVLMDLPPALKQTLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKY
          E  L+DLP   K+ +G     +   ++ Y+  G+     ++ Y   L+  G+D   +  +KE+L D+F+IKDL + KYFLGMEFA+SK G+ VN+RKY
Subjt:  VEEEVLMDLPPALKQTLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKY

Query:  ILDLLKETVSM----------------VSQFMHAREPAHFEAVFRILRYL---------KGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCS
        +LDLL ET  +                ++Q         ++ +   L YL          GTPGKG+LFKK  HLQ+E +TDADWAGS T++RS+SGYCS
Subjt:  ILDLLKETVSM----------------VSQFMHAREPAHFEAVFRILRYL---------KGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCS

Query:  FVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
        FVGGNLVTWRSKKQ+VVARSSAE EFR +AHGICE +W+K++LEELK S ++P+
Subjt:  FVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

A0A438IAS2 Protein FORGETTER 11.7e-13641.05Show/hide
Query:  LVNSVVKDFSCNYMCYSTAKELW--DSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC----------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVF
        LVNS+ +D S NYMCY TA+ELW  D+VT+ +++L    Q  +L     + +  EDG           I+KFLAGLNVEFDE+RGRI+G+ +LP+I +VF
Subjt:  LVNSVVKDFSCNYMCYSTAKELW--DSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGC----------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVF

Query:  YEVRREESRMNIMIGKKSID-SVESPALVIE-NIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSP
         E RREESR N+M+GKK    ++E  ALV       K      K+ ++P+VWC  CNKPRHTRE CWK+HGKPANWK  K  ++        +N  ++SP
Subjt:  YEVRREESRMNIMIGKKSID-SVESPALVIE-NIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSP

Query:  F-KEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEK-----------------VYIADGCFTS
        F  EQ++ +L LLKSN +    SVSL  TGN   AL+ C  S  WIIDS +++H+T+ S +FESYSP   N+K                  Y  D   ++
Subjt:  F-KEQIDRILKLLKSN-SLGIPSVSLVQTGNSPQALA-CLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEK-----------------VYIADGCFTS

Query:  MAGKGTILPSTKLILRSVLHVPQLACRL---SSSVSKIS----------------------------KDANCRVIF------CEIHCL--------FQDQ
           +G  L +  L + S  + P +   +   SSSV+ +S                             D +  +        C  H +          D 
Subjt:  MAGKGTILPSTKLILRSVLHVPQLACRL---SSSVSKIS----------------------------KDANCRVIF------CEIHCL--------FQDQ

Query:  DSGETIARTRMIDGS--SLSSPSHNMLPDV-------SDLDISIAHRKA-------TRQFNFDW-------------------------PLYQLHAKNAF
            T   +++++G+   +  P    +          S  D S+   K        T+ +  D+                         PL+QL  KN F
Subjt:  DSGETIARTRMIDGS--SLSSPSHNMLPDV-------SDLDISIAHRKA-------TRQFNFDW-------------------------PLYQLHAKNAF

Query:  LNGEVEEEVLMDLPPALKQTLGL--------------------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKE
        LNG++E+EV M  PP  +++ G+                                 +QSQ DHTMFYKH+   KV +LIVY+DD++LTG+D   +  +K 
Subjt:  LNGEVEEEVLMDLPPALKQTLGL--------------------------------TSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKE

Query:  KLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKETVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGST
        KL ++F+IKDL + KYFLGMEFA+SK G+ VN+RKY+LDLL ETVSMVSQFMHA  P HFE V+RILRYLKGTPG+G+LFK   HLQ+E +TDADWAGS 
Subjt:  KLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKETVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGST

Query:  TNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
         ++RS+ GYCSFVGGNLVTWRSKKQ+VVARSSAE EFRA+AHGICE +W+++LLEELK + + P+
Subjt:  TNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

A5AK63 Uncharacterized protein1.2e-13440.96Show/hide
Query:  VFELNLKLGDIRQGEDGC---------------------------------------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMN
        +FEL LKLG+IRQGED                                         I+KFL GLNVEFDE+RGRI+G+  L +I +VF EV REESR N
Subjt:  VFELNLKLGDIRQGEDGC---------------------------------------IYKFLAGLNVEFDEIRGRILGKSNLPTINDVFYEVRREESRMN

Query:  IMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSPF-KEQIDRILK
        +M+GKK    ++E  ALV  +    KA+       ++P+VWC  CNKPRHTRE  WK+HGK ANWK  K  ++        +N  ++S F   Q++ +L 
Subjt:  IMIGKKSID-SVESPALVIENI-IMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHASNSNIVDSSPF-KEQIDRILK

Query:  LLKSN-SLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRL
        LLKSN +    SVSL  T N   AL+C   S  WIIDS +++H+T+ S +FESYSP   N+KV IA+  F+ +A KG I  S  + L+SVLHVP+L C L
Subjt:  LLKSN-SLGIPSVSLVQTGNSPQALAC-LNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLHVPQLACRL

Query:  SSSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDG-----------------SSLSSPS-HNMLPDVSDLDISIAHRKATR-------------
          SVSK+SKD+NC VIF E HC+FQDQ SG+TI   RMI+G                 SS+SS S  + + D+ DLD+ IA RK TR             
Subjt:  SSSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDG-----------------SSLSSPS-HNMLPDVSDLDISIAHRKATR-------------

Query:  -----------------------QFNFD---WPLYQLHAKNAF-----------------------------LNGEVEEEVLMDLPPALKQTLGLT----
                               Q   D   W L      NA                               +G VE+     +     QT G+     
Subjt:  -----------------------QFNFD---WPLYQLHAKNAF-----------------------------LNGEVEEEVLMDLPPALKQTLGLT----

Query:  ------------------------------SQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSK
                                      +QSQADHTMFYKH+   KVV+LIVY+DD++LTG+D   +  +K KLA++F+IKDL + KYFLGMEFARSK
Subjt:  ------------------------------SQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSK

Query:  TGLLVNERKYILDLLKET----------------------------------------------------VSMVSQFMHAREPAHFEAVFRILRYLKGTP
         G+ VN+RKY+LDLL ET                                                    VSMVSQFMHA  P HFE V+RILRYLKGTP
Subjt:  TGLLVNERKYILDLLKET----------------------------------------------------VSMVSQFMHAREPAHFEAVFRILRYLKGTP

Query:  GKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
        G+G+LFK   HLQ+E +TDADWA S  ++ S+S YCSFVGGNLVTWRSKK + VARSS E EFRA+AHGICE +W+++L EELK + + P+
Subjt:  GKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-2022.52Show/hide
Query:  FDWPLYQLHAKNAFLNGEVEEEVLMDLPPAL-----------KQTLGLTS-------------------QSQADHTMFYKHTGN-DKVVVLIVYLDDLIL
        ++  ++Q+  K AFLNG ++EE+ M LP  +           K   GL                      S  D  ++    GN ++ + +++Y+DD+++
Subjt:  FDWPLYQLHAKNAFLNGEVEEEVLMDLPPAL-----------KQTLGLTS-------------------QSQADHTMFYKHTGN-DKVVVLIVYLDDLIL

Query:  TGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKE--------------------------------------------
           D   ++  K  L + F++ DL   K+F+G+     +  + +++  Y+  +L +                                            
Subjt:  TGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKE--------------------------------------------

Query:  --------TVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEV--HTDADWAGSTTNKRSSSGYC-SFVGGNLVTWRSKKQSVVARSS
                 V+++S++        ++ + R+LRYLKGT    ++FKK+   + ++  + D+DWAGS  +++S++GY       NL+ W +K+Q+ VA SS
Subjt:  --------TVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEV--HTDADWAGSTTNKRSSSGYC-SFVGGNLVTWRSKKQSVVARSS

Query:  AEVEFRALAHGICEGIWMKKLLEELKCSQTIPI
         E E+ AL   + E +W+K LL  +      PI
Subjt:  AEVEFRALAHGICEGIWMKKLLEELKCSQTIPI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-2524.69Show/hide
Query:  VSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDGSSLSSPSHNMLPDVSDLDISIAHRKATRQFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPPAL
        V K+ KD +C+++  +   + +         + + ID   + SP   M    + L ++          + D  + QL  K AFL+G++EEE+ M+ P   
Subjt:  VSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDGSSLSSPSHNMLPDVSDLDISIAHRKATRQFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPPAL

Query:  -------------KQTLGLTS-------------------QSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYF
                     K   GL                     ++ +D  +++K    +  ++L++Y+DD+++ G D+  I+ +K  L+  F +KDL  ++  
Subjt:  -------------KQTLGLTS-------------------QSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYF

Query:  LGMEFARSKTG--LLVNERKYILDLLK-----------------------------------------------------------ETVSMVSQFMHARE
        LGM+  R +T   L +++ KYI  +L+                                                             V +VS+F+    
Subjt:  LGMEFARSKTG--LLVNERKYILDLLK-----------------------------------------------------------ETVSMVSQFMHARE

Query:  PAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEE
          H+EAV  ILRYL+GT G  + F   D + ++ +TDAD AG   N++SS+GY     G  ++W+SK Q  VA S+ E E+ A      E IW+K+ L+E
Subjt:  PAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEE

Query:  LKCSQ
        L   Q
Subjt:  LKCSQ

P92519 Uncharacterized mitochondrial protein AtMg008103.5e-2530.04Show/hide
Query:  LIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKE----------------------------------
        L++Y+DD++LTG+    ++++  +L+  F +KDL    YFLG++     +GL +++ KY   +L                                    
Subjt:  LIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKE----------------------------------

Query:  -----------------TVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQS
                          V++V Q MH    A F+ + R+LRY+KGT   G+   K+  L V+   D+DWAG T+ +RS++G+C+F+G N+++W +K+Q 
Subjt:  -----------------TVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQS

Query:  VVARSSAEVEFRALAHGICEGIW
         V+RSS E E+RALA    E  W
Subjt:  VVARSSAEVEFRALAHGICEGIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-3631.91Show/hide
Query:  WPLYQLHAKNAFLNGEVEEEVLMDLPP-------------------ALKQ--------------TLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLIL
        WP+ QL   NAFL G + ++V M  PP                    LKQ              T+G  + S +D ++F    G   +V ++VY+DD+++
Subjt:  WPLYQLHAKNAFLNGEVEEEVLMDLPP-------------------ALKQ--------------TLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLIL

Query:  TGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET-------------------------------------------
        TGND   +    + L+  F +KD     YFLG+E  R  TGL +++R+YILDLL  T                                           
Subjt:  TGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET-------------------------------------------

Query:  ---------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEV
                 V+ +SQFMH     H +A+ RILRYL GTP  GI  KK + L +  ++DADWAG   +  S++GY  ++G + ++W SKKQ  V RSS E 
Subjt:  ---------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEV

Query:  EFRALAHGICEGIWMKKLLEELKCSQTIP
        E+R++A+   E  W+  LL EL    T P
Subjt:  EFRALAHGICEGIWMKKLLEELKCSQTIP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-3430.75Show/hide
Query:  WPLYQLHAKNAFLNGEVEEEVLMDLPP-------------------ALKQ--------------TLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLIL
        WP+ QL   NAFL G + +EV M  PP                    LKQ              T+G  + S +D ++F    G   ++ ++VY+DD+++
Subjt:  WPLYQLHAKNAFLNGEVEEEVLMDLPP-------------------ALKQ--------------TLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLIL

Query:  TGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET-------------------------------------------
        TGND + +    + L+  F +K+     YFLG+E  R   GL +++R+Y LDLL  T                                           
Subjt:  TGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET-------------------------------------------

Query:  ---------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEV
                 V+ +SQ+MH     H+ A+ R+LRYL GTP  GI  KK + L +  ++DADWAG T +  S++GY  ++G + ++W SKKQ  V RSS E 
Subjt:  ---------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEV

Query:  EFRALAHGICEGIWMKKLLEEL
        E+R++A+   E  W+  LL EL
Subjt:  EFRALAHGICEGIWMKKLLEEL

Arabidopsis top hitse value%identityAlignment
AT1G22750.1 unknown protein5.3e-1347.95Show/hide
Query:  WTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNRQ
        W V  L++FYI+   LS W+ YKE SL+  ++W+ L I  GS  T   IV +LF+L+  DP YLVL  NSNR+
Subjt:  WTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNRQ

AT1G22750.1 unknown protein6.5e-0344.74Show/hide
Query:  MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMV
        MY +L++D+ K+GV    K+S ++ A+ VFGA+GC+M+
Subjt:  MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMV

AT1G22750.2 unknown protein1.8e-1348.65Show/hide
Query:  WTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNRQV
        W V  L++FYI+   LS W+ YKE SL+  ++W+ L I  GS  T   IV +LF+L+  DP YLVL  NSNR+V
Subjt:  WTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNRQV

AT1G22750.2 unknown protein6.5e-0344.74Show/hide
Query:  MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMV
        MY +L++D+ K+GV    K+S ++ A+ VFGA+GC+M+
Subjt:  MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMV

AT1G22750.3 unknown protein6.9e-1348.61Show/hide
Query:  WTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNR
        W V  L++FYI+   LS W+ YKE SL+  ++W+ L I  GS  T   IV +LF+L+  DP YLVL  NSNR
Subjt:  WTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNR

AT1G22750.3 unknown protein6.5e-0344.74Show/hide
Query:  MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMV
        MY +L++D+ K+GV    K+S ++ A+ VFGA+GC+M+
Subjt:  MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.4e-4332.34Show/hide
Query:  FDWPLYQLHAKNAFLNGEVEEEVLMDLPPAL-----------------KQTLGLTS-------------------QSQADHTMFYKHTGNDKVVVLIVYL
        +++ L+QL   NAFLNG+++EE+ M LPP                   K   GL                     QS +DHT F K T    + VL VY+
Subjt:  FDWPLYQLHAKNAFLNGEVEEEVLMDLPPAL-----------------KQTLGLTS-------------------QSQADHTMFYKHTGNDKVVVLIVYL

Query:  DDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET--------------------------------------
        DD+I+  N++  +  +K +L   FK++DL   KYFLG+E ARS  G+ + +RKY LDLL ET                                      
Subjt:  DDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKET--------------------------------------

Query:  --------------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVAR
                      V+ +SQF  A   AH +AV +IL Y+KGT G+G+ +     +Q++V +DA +      +RS++GYC F+G +L++W+SKKQ VV++
Subjt:  --------------VSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVAR

Query:  SSAEVEFRALAHGICEGIWMKKLLEELKCSQTIP
        SSAE E+RAL+    E +W+ +   EL+   + P
Subjt:  SSAEVEFRALAHGICEGIWMKKLLEELKCSQTIP

ATMG00810.1 DNA/RNA polymerases superfamily protein2.5e-2630.04Show/hide
Query:  LIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKE----------------------------------
        L++Y+DD++LTG+    ++++  +L+  F +KDL    YFLG++     +GL +++ KY   +L                                    
Subjt:  LIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKE----------------------------------

Query:  -----------------TVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQS
                          V++V Q MH    A F+ + R+LRY+KGT   G+   K+  L V+   D+DWAG T+ +RS++G+C+F+G N+++W +K+Q 
Subjt:  -----------------TVSMVSQFMHAREPAHFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQS

Query:  VVARSSAEVEFRALAHGICEGIW
         V+RSS E E+RALA    E  W
Subjt:  VVARSSAEVEFRALAHGICEGIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACAATGTTCTGATCAAGGATACAAACAAGAATGGCGTGCAGCAGCATGGGAAGCACTCCAATATTATGATTGCGAAAATAGTTTTTGGTGCTATGGGTTGTTTGAT
GGTGATCTACTGCCGCCGGACACCACCTAGTCCTCAAATCTCCACCGGTTTGCAAAGCCTAAAGCCGCCGACGCTCGAAGACGGGAACTCCCGTCGATGCCGATGCACGT
TTTTGGGTATACCGACGCCTGGGTTCGAGTCTCCTCCGCTGGCCGACCTCATCTGCACTACCGTCCAAGTTTTCCGACCAATAATCGCCATCGACCGTCCAGTTTTTCGT
CTCTACTGCCGCCGCGCGTTTCTTTTGGGTTTTTTTAGGTCTTTGTTTGATTTCTTTTTCAGATCTACTGTTTCTCGGTTTGGGTTTTTTTGTCTCTTAAACATGTCGGA
CACTAAGGTTCCTGCCGCTAAGCTCGTCAACTCTGTAGTGAAAGACTTCAGCTGTAACTACATGTGCTACTCTACTGCAAAGGAATTATGGGATAGTGTGACTCAGATGT
ATTCTAATTTGGGTAACCAACCACAAGTGTTTGAGTTAAATCTTAAATTGGGTGATATACGACAAGGAGAAGATGGTTGCATTTATAAATTCCTTGCTGGTCTCAATGTT
GAGTTTGATGAGATCAGAGGCAGGATACTTGGGAAAAGTAATCTTCCAACTATTAACGATGTTTTTTATGAAGTTCGCAGGGAGGAAAGCCGCATGAATATTATGATTGG
AAAGAAGTCAATTGACTCAGTTGAAAGTCCAGCGTTGGTGATTGAAAATATTATAATGAAAGCTTCTGATCAATCCAACAAAACTCATGACAAGCCTCGTGTCTGGTGCT
ACCACTGCAATAAACCCCGTCATACGCGAGAAACTTGTTGGAAACTTCATGGAAAACCTGCAAATTGGAAGAGCTCCAAACAATTTGAGAGAAATTCCCATCAGCATGCT
TCTAACTCGAATATTGTTGATTCTAGTCCATTTAAAGAGCAAATTGATAGAATCCTGAAGCTGCTGAAGTCCAATTCATTGGGTATTCCTAGTGTTTCCTTGGTACAAAC
AGGTAATTCCCCTCAAGCTCTCGCGTGTCTAAATTCCTTTCTGTGGATCATCGATTCCAGATCCACTGAACATATAACTAGTTTCTCTTGTTTATTTGAATCATACTCCC
CTGTGTATTGCAATGAGAAAGTCTATATTGCTGATGGTTGTTTCACGTCTATGGCAGGCAAAGGAACTATTCTCCCAAGTACAAAACTCATATTACGTTCTGTCCTTCAT
GTTCCACAACTAGCTTGTAGGTTATCATCATCTGTGAGCAAAATATCTAAGGATGCTAATTGTCGTGTTATCTTTTGTGAAATTCATTGCCTCTTTCAGGATCAGGATTC
AGGGGAGACGATTGCACGTACCAGGATGATTGACGGTAGTTCTCTCAGCTCTCCTTCTCATAATATGTTACCTGATGTCTCTGATCTTGATATTTCAATTGCCCATAGAA
AAGCTACCCGTCAATTTAATTTTGATTGGCCGCTTTATCAACTTCATGCTAAGAATGCGTTTCTCAATGGGGAGGTTGAAGAAGAAGTACTTATGGACTTACCGCCTGCT
TTGAAGCAGACCTTGGGATTAACAAGCCAAAGTCAAGCTGATCACACTATGTTCTACAAGCATACGGGAAATGACAAGGTTGTTGTTTTAATAGTGTATCTTGATGATCT
CATACTTACGGGCAATGATGAGATAGGAATATCTATTGTGAAGGAAAAATTGGCGGATGATTTCAAGATCAAAGACCTGAGATCCTCAAAGTACTTCCTAGGCATGGAGT
TTGCTAGGTCCAAAACTGGTCTTCTTGTCAACGAAAGAAAGTATATTCTTGATCTACTCAAAGAGACGGTTAGTATGGTAAGTCAATTCATGCATGCCCGTGAGCCAGCT
CACTTTGAAGCTGTCTTTAGAATCCTAAGATATTTGAAAGGTACTCCAGGAAAAGGGATACTCTTCAAGAAACATGACCACCTACAGGTTGAGGTTCATACTGATGCTGA
TTGGGCAGGTAGCACGACCAATAAAAGATCGTCTTCTGGATATTGCTCCTTTGTTGGAGGAAACTTAGTTACTTGGCGTAGTAAAAAACAAAGTGTGGTAGCAAGAAGTA
GTGCTGAAGTAGAATTTAGGGCATTAGCCCATGGTATTTGTGAAGGCATATGGATGAAAAAGCTGTTGGAAGAATTGAAATGCTCTCAAACAATCCCCATAGCATTTACT
GTGATAACAAGGCAGCAATCTCCATTGCCCACAATCCAGCCCTTCATGATAGAACGAAACACATTGAATTGGACGGTGGCCACGCTGATTGATTTCTATATAAATGGCAC
AGCATTATCAGCCTGGATTTTCTATAAAGAAGAATCGTTGCTTACTGCACTCGTTTGGATAGTTCTATTCATAATCTTTGGGAGTGCCTCAACATGTCCTTTCATCGTTA
AGGAGCTATTCAAGCTTAACTCTGAAGATCCAGCATACCTTGTTTTATTTAAAAATTCCAACAGGCAAGTCCACACTTCTCTTTAG
mRNA sequenceShow/hide mRNA sequence
CTTTTGTGGAGATTTTGATGTAGATTTTTATGGGGGAAAGCTCGTATGAAATTGTTAGACTAACTATAATTTCTATCCAATCCATTTCTGCTTGTCATTATTGGAAGTTC
AAAGGATTAACGTCTAGCCATCTACTTCAAAATCATTTACAGACAAAAGAACTATTGATACATATCTGTTCAAGCATTCATGGTAACTTCTCTCTCTCTCTCACGTACTT
TTATTTTTTATTTTTTAATTTATCAGACTTGGGTTTGCTATAAGGAATCAAACTGGATTGCTGCTGCGACTTGGATAGTTTTTCTTGTATGTCTTGGCAGGAACTTTGAT
TTCCAAACTCTTATTTGTTACTAAGACCAAACTTGTCCCATGCATGGGCTACTTTATTTCCATAAGAAGTTGCAGCATTTTTAGATTGGTAATACTTGCAGATTGAATCT
TTATTTTATTTTTAGATCTGATTAACTTTACTTACCCTATCCTTTTCCGCAGCGCGGCCACTTGTGCCTACATTCTTTGGCAGTTGTGGCAATTATCATCTCAGGAATCC
TTTGAAGATATTATGTACAATGTTCTGATCAAGGATACAAACAAGAATGGCGTGCAGCAGCATGGGAAGCACTCCAATATTATGATTGCGAAAATAGTTTTTGGTGCTAT
GGGTTGTTTGATGGTGATCTACTGCCGCCGGACACCACCTAGTCCTCAAATCTCCACCGGTTTGCAAAGCCTAAAGCCGCCGACGCTCGAAGACGGGAACTCCCGTCGAT
GCCGATGCACGTTTTTGGGTATACCGACGCCTGGGTTCGAGTCTCCTCCGCTGGCCGACCTCATCTGCACTACCGTCCAAGTTTTCCGACCAATAATCGCCATCGACCGT
CCAGTTTTTCGTCTCTACTGCCGCCGCGCGTTTCTTTTGGGTTTTTTTAGGTCTTTGTTTGATTTCTTTTTCAGATCTACTGTTTCTCGGTTTGGGTTTTTTTGTCTCTT
AAACATGTCGGACACTAAGGTTCCTGCCGCTAAGCTCGTCAACTCTGTAGTGAAAGACTTCAGCTGTAACTACATGTGCTACTCTACTGCAAAGGAATTATGGGATAGTG
TGACTCAGATGTATTCTAATTTGGGTAACCAACCACAAGTGTTTGAGTTAAATCTTAAATTGGGTGATATACGACAAGGAGAAGATGGTTGCATTTATAAATTCCTTGCT
GGTCTCAATGTTGAGTTTGATGAGATCAGAGGCAGGATACTTGGGAAAAGTAATCTTCCAACTATTAACGATGTTTTTTATGAAGTTCGCAGGGAGGAAAGCCGCATGAA
TATTATGATTGGAAAGAAGTCAATTGACTCAGTTGAAAGTCCAGCGTTGGTGATTGAAAATATTATAATGAAAGCTTCTGATCAATCCAACAAAACTCATGACAAGCCTC
GTGTCTGGTGCTACCACTGCAATAAACCCCGTCATACGCGAGAAACTTGTTGGAAACTTCATGGAAAACCTGCAAATTGGAAGAGCTCCAAACAATTTGAGAGAAATTCC
CATCAGCATGCTTCTAACTCGAATATTGTTGATTCTAGTCCATTTAAAGAGCAAATTGATAGAATCCTGAAGCTGCTGAAGTCCAATTCATTGGGTATTCCTAGTGTTTC
CTTGGTACAAACAGGTAATTCCCCTCAAGCTCTCGCGTGTCTAAATTCCTTTCTGTGGATCATCGATTCCAGATCCACTGAACATATAACTAGTTTCTCTTGTTTATTTG
AATCATACTCCCCTGTGTATTGCAATGAGAAAGTCTATATTGCTGATGGTTGTTTCACGTCTATGGCAGGCAAAGGAACTATTCTCCCAAGTACAAAACTCATATTACGT
TCTGTCCTTCATGTTCCACAACTAGCTTGTAGGTTATCATCATCTGTGAGCAAAATATCTAAGGATGCTAATTGTCGTGTTATCTTTTGTGAAATTCATTGCCTCTTTCA
GGATCAGGATTCAGGGGAGACGATTGCACGTACCAGGATGATTGACGGTAGTTCTCTCAGCTCTCCTTCTCATAATATGTTACCTGATGTCTCTGATCTTGATATTTCAA
TTGCCCATAGAAAAGCTACCCGTCAATTTAATTTTGATTGGCCGCTTTATCAACTTCATGCTAAGAATGCGTTTCTCAATGGGGAGGTTGAAGAAGAAGTACTTATGGAC
TTACCGCCTGCTTTGAAGCAGACCTTGGGATTAACAAGCCAAAGTCAAGCTGATCACACTATGTTCTACAAGCATACGGGAAATGACAAGGTTGTTGTTTTAATAGTGTA
TCTTGATGATCTCATACTTACGGGCAATGATGAGATAGGAATATCTATTGTGAAGGAAAAATTGGCGGATGATTTCAAGATCAAAGACCTGAGATCCTCAAAGTACTTCC
TAGGCATGGAGTTTGCTAGGTCCAAAACTGGTCTTCTTGTCAACGAAAGAAAGTATATTCTTGATCTACTCAAAGAGACGGTTAGTATGGTAAGTCAATTCATGCATGCC
CGTGAGCCAGCTCACTTTGAAGCTGTCTTTAGAATCCTAAGATATTTGAAAGGTACTCCAGGAAAAGGGATACTCTTCAAGAAACATGACCACCTACAGGTTGAGGTTCA
TACTGATGCTGATTGGGCAGGTAGCACGACCAATAAAAGATCGTCTTCTGGATATTGCTCCTTTGTTGGAGGAAACTTAGTTACTTGGCGTAGTAAAAAACAAAGTGTGG
TAGCAAGAAGTAGTGCTGAAGTAGAATTTAGGGCATTAGCCCATGGTATTTGTGAAGGCATATGGATGAAAAAGCTGTTGGAAGAATTGAAATGCTCTCAAACAATCCCC
ATAGCATTTACTGTGATAACAAGGCAGCAATCTCCATTGCCCACAATCCAGCCCTTCATGATAGAACGAAACACATTGAATTGGACGGTGGCCACGCTGATTGATTTCTA
TATAAATGGCACAGCATTATCAGCCTGGATTTTCTATAAAGAAGAATCGTTGCTTACTGCACTCGTTTGGATAGTTCTATTCATAATCTTTGGGAGTGCCTCAACATGTC
CTTTCATCGTTAAGGAGCTATTCAAGCTTAACTCTGAAGATCCAGCATACCTTGTTTTATTTAAAAATTCCAACAGGCAAGTCCACACTTCTCTTTAG
Protein sequenceShow/hide protein sequence
MYNVLIKDTNKNGVQQHGKHSNIMIAKIVFGAMGCLMVIYCRRTPPSPQISTGLQSLKPPTLEDGNSRRCRCTFLGIPTPGFESPPLADLICTTVQVFRPIIAIDRPVFR
LYCRRAFLLGFFRSLFDFFFRSTVSRFGFFCLLNMSDTKVPAAKLVNSVVKDFSCNYMCYSTAKELWDSVTQMYSNLGNQPQVFELNLKLGDIRQGEDGCIYKFLAGLNV
EFDEIRGRILGKSNLPTINDVFYEVRREESRMNIMIGKKSIDSVESPALVIENIIMKASDQSNKTHDKPRVWCYHCNKPRHTRETCWKLHGKPANWKSSKQFERNSHQHA
SNSNIVDSSPFKEQIDRILKLLKSNSLGIPSVSLVQTGNSPQALACLNSFLWIIDSRSTEHITSFSCLFESYSPVYCNEKVYIADGCFTSMAGKGTILPSTKLILRSVLH
VPQLACRLSSSVSKISKDANCRVIFCEIHCLFQDQDSGETIARTRMIDGSSLSSPSHNMLPDVSDLDISIAHRKATRQFNFDWPLYQLHAKNAFLNGEVEEEVLMDLPPA
LKQTLGLTSQSQADHTMFYKHTGNDKVVVLIVYLDDLILTGNDEIGISIVKEKLADDFKIKDLRSSKYFLGMEFARSKTGLLVNERKYILDLLKETVSMVSQFMHAREPA
HFEAVFRILRYLKGTPGKGILFKKHDHLQVEVHTDADWAGSTTNKRSSSGYCSFVGGNLVTWRSKKQSVVARSSAEVEFRALAHGICEGIWMKKLLEELKCSQTIPIAFT
VITRQQSPLPTIQPFMIERNTLNWTVATLIDFYINGTALSAWIFYKEESLLTALVWIVLFIIFGSASTCPFIVKELFKLNSEDPAYLVLFKNSNRQVHTSL