CuGenDBv2

Lag0017980 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017980
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr5:12511461..12519986
RNA-Seq Expression: Lag0017980
Synteny: Lag0017980
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (hit; e value; % identity; alignment)
CCH50966.1 T4.5 [Malus x robusta]; e value 2.9e-83; % identity 26.52
Query:  ILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR
        +LK  KL G ++G  +CPP  +      S T V               N  ++ W  + Q LM  IN+TLS   L   +G   S+  W+ LE+ +S +SR
Subjt:  ILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR

Query:  TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEK-QTKHDDAL
        T++ +L+S +Q+I  K   S+ D++  IKE+ +KLA     + E D+  Y L+G P ++ +F  S+ T ++SVT DELH LL ++E +++K +T+   + 
Subjt:  TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEK-QTKHDDAL

Query:  TQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFN---------QGRGSFYSP--QSSDGQGRVSCQICQRLGHSAINCYNR
          P   +A+QS       S   G+F +G S GR  N+NR  +  N  G    N          G G    P   SS     V CQ+C + GH A  C NR
Subjt:  TQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFN---------QGRGSFYSP--QSSDGQGRVSCQICQRLGHSAINCYNR

Query:  MNYHFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNG----VSGSSSTWLTDSGCNAHLTSDL------NNLT
        ++   Q + P     AM+A+ S             S   T    A  S  + Y  +    + +G    +S + S  +        L   L      +NL 
Subjt:  MNYHFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNG----VSGSSSTWLTDSGCNAHLTSDL------NNLT

Query:  IASKYAGDD-----------QVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVA---TNKSYSLWHNRLGHPG----HS---------------
           K+  D+            V D STGK+LFQGPS  GLYP     S+      ++  A          WH RLGHP     HS               
Subjt:  IASKYAGDD-----------QVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVA---TNKSYSLWHNRLGHPG----HS---------------

Query:  -----------------------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
                                     L+H+DVWGP+P +S  G+ +Y+  +DD  K++WLYP+  KSDV +  + F   ++ L   +++++R+D GG
Subjt:  -----------------------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

Query:  EYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF--------------------CSSPM-----ASPSYQ--------
        E++NK+L ++ +  GI HQ SC +T EQNG AERKHRH+V++  +L+SQ+ +P +F                      SP      ASP Y         
Subjt:  EYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF--------------------CSSPM-----ASPSYQ--------

Query:  -----------------------GHNIN----------------------------------VATDTANVVTNAYAPNISDVLPL------SESLPA---
                               G+++N                                   A+  +  V++   P +S  LPL       +S PA   
Subjt:  -----------------------GHNIN----------------------------------VATDTANVVTNAYAPNISDVLPL------SESLPA---

Query:  -------------------PATSVDSI-----------------PRVQNAHSMQTRGKSGISKRK---------------LTPLPPSKSD----------
                           P+++ +S+                 P   N H+M TR K+GI K K               LT LPP+ S           
Subjt:  -------------------PATSVDSI-----------------PRVQNAHSMQTRGKSGISKRK---------------LTPLPPSKSD----------

Query:  ----------------------------IGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKH--------------------------------
                                    +GCKWV++VK  PDG+I RYKARLVAKG+HQQEG+D+ +                                 
Subjt:  ----------------------------IGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKH--------------------------------

Query:  -----------------------------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLIS
                                                             S+ F  S +D+SLF+ K +    ++L+YVDDIIITG++     S+IS
Subjt:  -----------------------------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLIS

Query:  QLH----ATDVGSQCLAEDVQIFSS--------SCWCLTLPHVFTPIYFIFCQLVIPIHAVSSSCSFGGCQTG--SSVYCWYFIVWSS------------
        QL       D+G       +++  S        + + L L      +    C   +    +  S +     T   S+V    ++ W+             
Subjt:  QLH----ATDVGSQCLAEDVQIFSS--------SCWCLTLPHVFTPIYFIFCQLVIPIHAVSSSCSFGGCQTG--SSVYCWYFIVWSS------------

Query:  -----------------------------FRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
                                     F KGS   LT +SD+DWAG  +DRRST+G+ +FLG N +SW AKKQ+TV+RSSTEAEYR+LA+TAAE+
Subjt:  -----------------------------FRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]; e value 1.2e-84; % identity 35.45
Query:  ASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSS
        +SILKAH L G+IDG+  CP K +                       +QINP Y+ W  + QALMTL+NATLS  AL++ +G ++S++AW  LE+ +S+S
Subjt:  ASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSS

Query:  SRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDA
        +R+NI+ LKS L +ISK   +SI+ Y+++IK+ +D LA+VSV++++EDI IY LNG P ++N F  S+RT S+++T +E++ +LK EE  IE   K +++
Subjt:  SRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDA

Query:  LTQPAAMFASQSTPN-SSQRSNPYGNF-GRGRSFGRGRNQNRGGR--GFNPSGRGNFNQGRGSF------YSPQSSDGQGRVSCQICQRLGHSAINCYNR
           P AM A+   PN SS R     NF GRGR  GRGR  NRGGR   F      NF Q    +       S Q S+    V CQIC + GHSA++CY+R
Subjt:  LTQPAAMFASQSTPN-SSQRSNPYGNF-GRGRSFGRGRNQNRGGR--GFNPSGRGNFNQGRGSF------YSPQSSDGQGRVSCQICQRLGHSAINCYNR

Query:  MNYHFQGRHPPTQLVAMNALL---SDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGV------SGSSSTWLTDSGCNAH----LTSDLN
        M++ +QG+ P  QL AM+A     SD            +   T+ L AN +  V Y      ++ NG       SG SS    D     +    + S   
Subjt:  MNYHFQGRHPPTQLVAMNALL---SDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGV------SGSSSTWLTDSGCNAH----LTSDLN

Query:  NLTIASKYAGDD-----------QVSDKSTGKVLFQGPSINGLYPL--SSIHSSTTPSC----------------------------YVAHVATNKSYSL
        NL    ++  D+           Q+ DK+T ++LFQGPS +GLYPL  SSI   + PS                             + A++    S  L
Subjt:  NLTIASKYAGDD-----------QVSDKSTGKVLFQGPSINGLYPL--SSIHSSTTPSC----------------------------YVAHVATNKSYSL

Query:  WHNRLGHPGHS----------------------------------------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDV
        WH+RLGHP  +                                              LVHSD+WGPAP TS D F YYVSF+DD                
Subjt:  WHNRLGHPGHS----------------------------------------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDV

Query:  PTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSY
                      FS      R+DGGGEY    L+  L+ +GI H+ SC +TP+QNG+AERKHRHIV+  L+L+S+AS+P+K+ +   ++ +Y
Subjt:  PTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSY

PKU87026.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]; e value 1.9e-87; % identity 27.55
Query:  ILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR
        IL+A+    F+D     P + + +S  SS                   NP Y+ WV   Q LM  I +T+S + L Y V   S+   W+ LE+ + SS+R
Subjt:  ILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR

Query:  TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALT
        + ++ LK++L +I+ K  +S+  Y+  IK L D++A     +D+EDI +Y LNG P  + +F  ++RT    ++ D L+ LL +EE  I   T    A  
Subjt:  TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALT

Query:  QP-AAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVS-----CQICQRLGHSAINCYNRMNYHF-
         P  A++AS+               GRGR     RN N                       PQ+S+ + + S     CQIC + GH+  NC++RMN  + 
Subjt:  QP-AAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVS-----CQICQRLGHSAINCYNRMNYHF-

Query:  --QGRHPPTQLVAMN-ALLSDWREQILCCSKAESKTATSQLEANTSQN----VAYCNSAPGSMHNGV----SGSSSTWLTDSGCNAHLTSDLNNLTIASK
           G   P  +VA +    +DW       S   +     Q+ A  ++N    V    S P   H+G     + S   +L+      H+     NL   S+
Subjt:  --QGRHPPTQLVAMN-ALLSDWREQILCCSKAESKTATSQLEANTSQN----VAYCNSAPGSMHNGV----SGSSSTWLTDSGCNAHLTSDLNNLTIASK

Query:  YAGDDQVS-----------DKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSYSLWHNRLGHP-----------------------------
           D+ +S           D  T +VL +GP   GLYP++S+ S  T +   A  AT    ++WH RLGHP                             
Subjt:  YAGDDQVS-----------DKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSYSLWHNRLGHP-----------------------------

Query:  --GH---------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSN
          GH                L+HSDVWGP+P TS   F YYV F+DD  +FTWL+P+  KS+V  +F  FK  +ENL S +IK LRTDGG EY+N +L  
Subjt:  --GH---------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSN

Query:  YLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSY-------------------------QGHNINVATD----TANV
        +L  +GI HQ SC YTPEQNG+AERKHRH+++    L+  +SVP K+    + + +Y                          GH      +    T N 
Subjt:  YLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSY-------------------------QGHNINVATD----TANV

Query:  VTNAYAPNISDVLPLSESLPAP-------ATSVDSIPRVQ---------NAHSMQTRGKSGISKRK----------------------------------
        + +   P  +  + L  S           AT+   I R           + H M TR KSG  K +                                  
Subjt:  VTNAYAPNISDVLPLSESLPAP-------ATSVDSIPRVQ---------NAHSMQTRGKSGISKRK----------------------------------

Query:  --------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKH------------------------------------
                      L   P +   +GCKW YR KR+ +GSI  YKARLVA G HQ+ GIDY++                                     
Subjt:  --------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKH------------------------------------

Query:  -------------------------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL--
                                                         S+ F  S++D SL +   +   I+LL+YVDDI+ITG  +  I+ L+ QL  
Subjt:  -------------------------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL--

Query:  -------------------HATD---VGSQCLAEDV--QIFSSSC------WCLTLPHVFT-------PIYF-------IFCQLVIPIHAVSSSCSFGGC
                           H TD   +  +  A+ +  Q F  +C       C  LP+ F+       P+ +        +  L  P  A + +      
Subjt:  -------------------HATD---VGSQCLAEDV--QIFSSSC------WCLTLPHVFT-------PIYF-------IFCQLVIPIHAVSSSCSFGGC

Query:  QTGSSVYCWYFIVWSSFRKG----------SSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
            S + +       + +G          S L L +FSD+DWAG  + R+ST+G+  FLG   +SW  KKQ T +RSSTE+EYRALA+ AA++
Subjt:  QTGSSVYCWYFIVWSSFRKG----------SSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata]; e value 1.1e-87; % identity 26.14
Query:  ILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR
        +L+ + +FGF+DGS  CP K   S +   + +               I   YK W    +ALMTLI ATLS AAL+  +GC SS+  W  L++ +S+ +R
Subjt:  ILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR

Query:  TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALT
        T+IV +K DLQ+I +K  ESI+ Y+++IK+ +D+LA V V + +EDI I  L G P +FNT    +R     V+  EL   LK EEA +++  K    ++
Subjt:  TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALT

Query:  Q---------------PAAMFASQSTPNSS--------------------QRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRG----------------
                         AA  ASQ + N S                    Q ++P     +G   G G   N  G  F P G+G                
Subjt:  Q---------------PAAMFASQSTPNSS--------------------QRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRG----------------

Query:  --------NFNQGRG--SFYSP-------------QSSDGQG-------RVSCQICQRLGHSAINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCC
                NF+QG    S +SP             Q  D +G       +  CQIC R GH+A  C++R N  F    PP Q  + +   S  + Q    
Subjt:  --------NFNQGRG--SFYSP-------------QSSDGQG-------RVSCQICQRLGHSAINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCC

Query:  SKAESKTATSQLEANTSQNVAYCNS---APGSMHNGVSGSSST-----WLTDSGCNAHLTSDLNNLTIASKYAGDDQVS---------------------
        +   +    + +      + A  NS   +P +M    + S S      WL D G   H+TSDL+N+ +A+ Y+  D V+                     
Subjt:  SKAESKTATSQLEANTSQNVAYCNS---APGSMHNGVSGSSST-----WLTDSGCNAHLTSDLNNLTIASKYAGDDQVS---------------------

Query:  ------------------------------------------DKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSYSLWHNRLGHPGHSLV-
                                                  DK T ++L++G S N +YPL  + SS  P    A++    + +LWH RLGHP  S+V 
Subjt:  ------------------------------------------DKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSYSLWHNRLGHPGHSLV-

Query:  ----------------------------------------------HSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVEN
                                                      H+DVWGP+P  S++G+ YYVSFID+  ++TW++PI  K+ V  +F +F+  V N
Subjt:  ----------------------------------------------HSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVEN

Query:  LFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF----CSS-------------PMASP----
         F+  I+ L++DGGGEYI  +  N+L   GILH  SC YTP+QNG+ ERK+RHI + A++L+ QA +P +F    C++              M SP    
Subjt:  LFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF----CSS-------------PMASP----

Query:  ----------------------SYQGHNINVATD------------------------------------------TANVVTNAYAPNISDVLPLS----
                               Y+ H +   T                                               V+ +   ++S ++P +    
Subjt:  ----------------------SYQGHNINVATD------------------------------------------TANVVTNAYAPNISDVLPLS----

Query:  ---ESLPAP-------------------------ATSVDSIP--------------------------------------RVQNAHSMQTRGKSGISKRK
            S+P P                         + ++ SIP                                         N+H MQTR KSGI K+K
Subjt:  ---ESLPAP-------------------------ATSVDSIP--------------------------------------RVQNAHSMQTRGKSGISKRK

Query:  --------------------------------------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDK-------
                                                    L PLPP K+ +GCKW+Y++K++PDG++ARYKARLVAKG+ Q+ G+DY +       
Subjt:  --------------------------------------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDK-------

Query:  ------------------------------------------------HSVRFC------------------------------GSQADSSLFVLKLNGD
                                                        H    C                               S AD SLFV   N  
Subjt:  ------------------------------------------------HSVRFC------------------------------GSQADSSLFVLKLNGD

Query:  FIYLLLYVDDIIITGTNNALINSLISQLHA----TDVG--SQCLAEDVQIFSSSCW---------------------CLTLPH-----------------
         + LLLYVDDII+TG + A ++S+I QL A     ++G     L   ++  SS  +                     CLT  H                 
Subjt:  FIYLLLYVDDIIITGTNNALINSLISQLHA----TDVG--SQCLAEDVQIFSSSCW---------------------CLTLPH-----------------

Query:  -----VFTPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFRKGS----------SLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSW
             V    Y  F +  I  ++V+  C F              I+   + +G+          SL +  ++D+DWAG   DRRSTTGFV+FLG NP+SW
Subjt:  -----VFTPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFRKGS----------SLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSW

Query:  GAKKQSTVSRSSTEAEYRALASTAAEL
         +KKQ TVSRSSTEAEYRA+A+T AE+
Subjt:  GAKKQSTVSRSSTEAEYRALASTAAEL

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]; e value 1.4e-85; % identity 25.65
Query:  QALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDF
        +A+M LI ATLSP A++  +GC SS   W  L+  +S+ ++ +I  LK++LQ+I KK  +S++ Y+++IK+++D L+   VI +++DI I  L G PS++
Subjt:  QALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDF

Query:  NTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALTQ----------PAAMFASQSTPNSSQR------------SNPYGNFGRGRSFGRGRNQN
        NTF   +R     ++  E    L  EEA +E  +  +  +T            A M    S+  SSQ             S P  ++  G SF   R + 
Subjt:  NTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALTQ----------PAAMFASQSTPNSSQR------------SNPYGNFGRGRSFGRGRNQN

Query:  RGGRGFNPSGRGNFNQGRGS-------------------------------------FYSPQS-SDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHPP
        RG   F+ + + N      S                                     F    S S    +V CQIC + GH A+ CY+R N+ +QGR PP
Subjt:  RGGRGFNPSGRGNFNQGRGS-------------------------------------FYSPQS-SDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHPP

Query:  TQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDDQVS---------
        + L AMN   S                                 SAP             W+ D+G  +H+TSDL+NL +A+ ++G D V+         
Subjt:  TQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDDQVS---------

Query:  ------------------------------------------------------DKSTGKVLFQGPSINGLYPL-----------SSIHSST--TPSCYV
                                                              DK TG+++ QG    GLYP+           +  H+++    +CY 
Subjt:  ------------------------------------------------------DKSTGKVLFQGPSINGLYPL-----------SSIHSST--TPSCYV

Query:  AHVATNKSYSLWHNRLGHPGH-----------------------------------------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKF
          + +    +LWH RLGHP +                                                ++HSDVWGP+   S++G+ +YVSF+D+  +F
Subjt:  AHVATNKSYSLWHNRLGHPGH-----------------------------------------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKF

Query:  TWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMK-----
        TW++P+  KS+V  VF  F   +   FS  +K  ++DGGGEY +     YL   GILHQ SC YTP+QNG+AERKHRHI++ A++L+  AS+P K     
Subjt:  TWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMK-----

Query:  -------------------------------------------------------------------------FCSSPMASPSYQGHNINVATDT---AN
                                                                                  C +P+ +  Y   ++     T   ++
Subjt:  -------------------------------------------------------------------------FCSSPMASPSYQGHNINVATDT---AN

Query:  VVT-----------------------------------------------NAYAPNISDVLPLSE------SLPA----PATSVDSIP------------
        ++T                                               NA   ++S  LP S       SLP+    PA S  S+P            
Subjt:  VVT-----------------------------------------------NAYAPNISDVLPLSE------SLPA----PATSVDSIP------------

Query:  -----RVQNAHSMQTRGKSGISKRK---------------------------------------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARY
              + + H MQTR KSGISK+K                                             L PLP +K+ +GCKWVYR+K NPDGS+ARY
Subjt:  -----RVQNAHSMQTRGKSGISKRK---------------------------------------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARY

Query:  KARLVAKGYHQQEGIDY---------------------------------------DKH-----------------------------------------
        KARLVAKGY Q+EG+DY                                       D H                                         
Subjt:  KARLVAKGYHQQEGIDY---------------------------------------DKH-----------------------------------------

Query:  ------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL----HATDVGS------------------------QCLAEDVQIF
               + F  S AD SLFV   +   + LLLYVDDII+TG+++ LI+ +I  L       D+G                         + L E V + 
Subjt:  ------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL----HATDVGS------------------------QCLAEDVQIF

Query:  SS---SCWCLTL-------------PHVFTPI-----YFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIV---------WSSFRKGSSLHLTTFSDS
         S   +  CL               PH +  I     Y  F +  I   +V+  C F      S V     I+            F+ G  L L  +SD+
Subjt:  SS---SCWCLTL-------------PHVFTPI-----YFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIV---------WSSFRKGSSLHLTTFSDS

Query:  DWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        DWAG   DRRST+G +++LG +P+SW +KKQ TVSRSSTEAEYRALA  AAEL
Subjt:  DWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

TrEMBL top hits (hit; e value; % identity; alignment)
A0A2N9E6N0 Uncharacterized protein; e value 1.3e-116; % identity 30.43
Query:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSS
        SI + + L   +DG TV PP+                AD     ++ + N LYK W A+ QAL TLINATLSP+A+   +G T+++  W+VLE+ Y+S S
Subjt:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSS

Query:  RTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDAL
        RT+I++LK++L  + K   E+I  Y+ ++KE++DKL +V VI+D+ED+    L G P++++ FC +MRT  ++++ +ELHVLL +EE + +K  KH    
Subjt:  RTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDAL

Query:  TQPAAMFASQSTPNSSQRSNPY----GNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGS---FYSP--QSSDGQGRVSCQICQRLGHSAINCYNRMN
            AM A+ S   +   +NP       + RGR  GRG N+ RGGR  N  G  + +QG  S    +SP   SS    R  CQIC + GH A++C++RMN
Subjt:  TQPAAMFASQSTPNSSQRSNPY----GNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGS---FYSP--QSSDGQGRVSCQICQRLGHSAINCYNRMN

Query:  YHFQGRHPPTQL--VAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCN--------------AHLTSDL
        + +QGR PP +L  +A  A+ S         S   S T  +        ++  C+   G+    V    S  +T +G +               H+ S  
Subjt:  YHFQGRHPPTQL--VAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCN--------------AHLTSDL

Query:  NNLTIASKYAGDD-----------QVSDKSTGKVLFQGPSINGLYPL-SSIHSSTTPSCYVAHVATNKSYSLWHNRLGHPGHS-----------------
        +NL    ++  D+           ++ D S+G++L+ GPS +GLYP+  +I  +++P  +    A + S  LWHNRLGHP  S                 
Subjt:  NNLTIASKYAGDD-----------QVSDKSTGKVLFQGPSINGLYPL-SSIHSSTTPSCYVAHVATNKSYSLWHNRLGHPGHS-----------------

Query:  -----------------------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
                                     +VHSDVWGPAP TS +   YYV+F+DD  +FTW +P+  KS V + F  FK  +ENL S ++K LRTD GG
Subjt:  -----------------------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

Query:  EYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF--------------------------------------------
        EY   +  ++ SS G+ HQ +C +T +QNGVAERKHRHIV + L+LMSQAS+P+ F                                            
Subjt:  EYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF--------------------------------------------

Query:  --------------------------------------------------------------CSSP---MASPSYQGHNIN-------------------
                                                                      CS P    A P+   ++ N                   
Subjt:  --------------------------------------------------------------CSSP---MASPSYQGHNIN-------------------

Query:  VATDTANVVTNAYAPNISDVLPLSESLPAPATS------------VDSIPRVQNAHSMQTRGKSGISKRK------------------------------
        V + TA V T++ AP     +P S + P P++S              S P V NAH MQTRGKSGI+K+K                              
Subjt:  VATDTANVVTNAYAPNISDVLPLSESLPAPATS------------VDSIPRVQNAHSMQTRGKSGISKRK------------------------------

Query:  ----------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDK-----------------------------------
                        L P  P    IGC WV+++KRN DGS+ARYKARLVAKG HQ  GID+ +                                   
Subjt:  ----------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDK-----------------------------------

Query:  -HSVRFC--------------GSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQLHAT----DVGSQCLAEDVQI---------------
          S++ C               S AD SLF+ +     +YLLLYVDDIIITG ++  +  LI+ L +     D+G       +QI               
Subjt:  -HSVRFC--------------GSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQLHAT----DVGSQCLAEDVQI---------------

Query:  -------FSSSCWCLTLPHVF---------------TPI--------YFIFCQLVIPIHAVSSSCSFGGCQTGSSV--------YCWYFIVWSSFRKGSS
                 ++C   T P V                TP         Y  F +  +   AV+S C      T + +        Y    +      +   
Subjt:  -------FSSSCWCLTLPHVF---------------TPI--------YFIFCQLVIPIHAVSSSCSFGGCQTGSSV--------YCWYFIVWSSFRKGSS

Query:  LHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        +HLT F+D+DWAG+ +DRRSTTGF++FLG N ++W +KKQ TVSRSSTEAEYR+LA  AAE+
Subjt:  LHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

A0A2N9EFT0 Uncharacterized protein; e value 2.2e-121; % identity 31.59
Query:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSS
        SIL+A+ L  FIDGS  CP K +     S S +V               N  Y  W+++ + L+T++NATLSP+ L+  VG  S++  W+ LEK ++S +
Subjt:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSS

Query:  RTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDAL
        R+NI+NLK DL  + K   + ++ +++++KE +DKL  V V + +E+I    L G P++F++   ++RT +  ++FDEL VLL  EE++++      DA 
Subjt:  RTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDAL

Query:  TQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNF-NQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHP
          P  M A  ST N    ++    F    + GRGRN N  GRG   +GRG F NQ   S     S +   R  CQIC ++GH A++CY+RM+Y +QGRHP
Subjt:  TQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNF-NQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHP

Query:  PTQLVAM---NALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGS---MHNGVSGSSSTWLTDSGC-NAHLTSDLNNLTIASKYAGDDQVS-
        P +L A+   N LL+    Q    S   ++           QN  +    P S   +    + S++TW++D+G  + H T DL NL     Y G DQVS 
Subjt:  PTQLVAM---NALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGS---MHNGVSGSSSTWLTDSGC-NAHLTSDLNNLTIASKYAGDDQVS-

Query:  ---DKSTGKVLFQGPSINGLYPLSSI----------HSSTTPSCYVAHVATNKSYSLWHNRLGHPG----HS----------------------------
           D  +G+ L++G S +GLYP+  +          HSST P+   A + T  + S+WH+RLGHP     HS                            
Subjt:  ---DKSTGKVLFQGPSINGLYPLSSI----------HSSTTPSCYVAHVATNKSYSLWHNRLGHPG----HS----------------------------

Query:  -----------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLS
                         LVHSDVWGPAP TS++G  +YVSF+D   +FTWL+PI  KS V   FQ F   +EN+ +TRIK LRTD GGEY N    ++ S
Subjt:  -----------------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLS

Query:  SNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVP----------------------MKFCS--------------------------------
        + GILHQ SC +TP+QNGVAERKHRHIV+ AL+L+S++S+P                      +KF S                                
Subjt:  SNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVP----------------------MKFCS--------------------------------

Query:  --SPMASP----------------SYQGHNINVATDTA-----------------NVVTNAY---------------------APNISDVLPLSESL---
           P +SP                + Q H + ++   A                 +  +N +                      P +S   PLS SL   
Subjt:  --SPMASP----------------SYQGHNINVATDTA-----------------NVVTNAY---------------------APNISDVLPLSESL---

Query:  ---------------------PAPATSVDSIPRVQ--NAHSMQTRGKSGISKRK-------LTPL---PPSKSDIGCKW-------------VYRVKR-N
                             P P+ SV S P +   N+H MQTRGKSGISKRK       L PL   PPS   +  K+             + R +   
Subjt:  ---------------------PAPATSVDSIPRVQ--NAHSMQTRGKSGISKRK-------LTPL---PPSKSDIGCKW-------------VYRVKR-N

Query:  PDGSIARYKARLVAKGYHQQEGIDYDK---------------------------------------------------------HSV-------------
        PDGS+ARYKARLVAKGYHQQ G+DYD+                                                         H V             
Subjt:  PDGSIARYKARLVAKGYHQQEGIDYDK---------------------------------------------------------HSV-------------

Query:  ---------------RFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL---------------------------------HATDV
                        F  S AD SLF+ + +   I+LL+YVDDIIITG + + ++SL+ QL                                 +A+D+
Subjt:  ---------------RFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL---------------------------------HATDV

Query:  GSQCLAEDVQIFSSSCWCLTL---PHVFTPI--------------YFIFCQLVIPIHAVSSSCSFGGCQTG---SSVYCWYFIVWSSFRKG-----SSLH
          +    D +  S+ C C ++     + TP+              Y  F +  +  + V+S C F    T    S+       +  S   G      SL 
Subjt:  GSQCLAEDVQIFSSSCWCLTL---PHVFTPI--------------YFIFCQLVIPIHAVSSSCSFGGCQTG---SSVYCWYFIVWSSFRKG-----SSLH

Query:  LTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        L  +SD+DWAG    RRSTTG+++F+G NP++W +KKQSTVSRSSTEAEYRALAS AAE+
Subjt:  LTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

A0A2N9ER29 Integrase catalytic domain-containing protein; e value 1.2e-114; % identity 31.28
Query:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSS
        SI + + L   IDGST  P + +           A    TP      Q +  YK W  + QAL TL+NATLSP AL+  +  ++++  WEVLE+ Y+S S
Subjt:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSS

Query:  RTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDAL
        RT++++LK +L  I KK  ES++ ++ ++KEL+DKL+ V V +D+E++    + G P +++ FC +MRT  +S++ +ELHV+L +EE + +K ++   + 
Subjt:  RTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDAL

Query:  TQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGR-GRNQNRGGRGFNPSGRGNFNQGRGSFYS--PQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGR
          P    A+ +  +S   + P   F    + GR GR+QN  GR     GRGN+   RG F     Q+S  Q R +CQIC + GH A++C++RMN+ +QGR
Subjt:  TQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGR-GRNQNRGGRGFNPSGRGNFNQGRGSFYS--PQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGR

Query:  HPPTQLVAM------NALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSG-CNAHLTSDLNNL-----------T
        HPP +L A+      NA+ +    Q    S   S T  +        ++  C++  G+    V    S  +T SG    H +S L +L           +
Subjt:  HPPTQLVAM------NALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSG-CNAHLTSDLNNL-----------T

Query:  IASKY-------------AGDDQVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSYSLWHNRLGHP-----------------------
        + S Y             A   Q+ D  +GK+L+ G S +GLYP+      T+ S   A +++  S  LWH RLGHP                       
Subjt:  IASKY-------------AGDDQVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSYSLWHNRLGHP-----------------------

Query:  --GH----------------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEY
           H                       LVH+DVWGPAP TS +G  YYVSFIDD+ +FTW +P+  KS V   F+ FK  +EN+    IK LR+D GGEY
Subjt:  --GH----------------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEY

Query:  INKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF-----------------CSSPMASP---------------------
              ++ SSNGILHQ SC +T +QNG+AERKHRHIV +AL+L+SQ+S+P+ F                  S  + SP                     
Subjt:  INKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF-----------------CSSPMASP---------------------

Query:  -----SYQGHNI-----------NVATDT----------------ANVVTNAYAPNISDV--LPLSESLPAPATSVDSIPRVQ--NAHSMQTRGKSGISK
              Y  H +           ++  DT                  +     AP+IS    +P +  +PA A ++ + P V   N H M TR KSGI+K
Subjt:  -----SYQGHNI-----------NVATDT----------------ANVVTNAYAPNISDV--LPLSESLPAPATSVDSIPRVQ--NAHSMQTRGKSGISK

Query:  RK-------------------------------------------------LTPLPPSKSD--IGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY
        RK                                                  T +PPS S   IGC+WV+++KRN DGS+AR+KARLVAKG HQQ G+D+
Subjt:  RK-------------------------------------------------LTPLPPSKSD--IGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY

Query:  DK-------------------------------------------------------HS------------------------------VRFCGSQADSS
        D+                                                       HS                              V F  S AD S
Subjt:  DK-------------------------------------------------------HS------------------------------VRFCGSQADSS

Query:  LFVLKLNGDFIYLLLYVDDIIITGTNNALINSLIS--------------------QLHATDVG----SQCLAEDVQIFSSSCWCLTLPHVFTP-------
        LFV K     IYLLLYVDDII+TG+    I +LI                     Q+  TD G        A D+ +  +   C      F P       
Subjt:  LFVLKLNGDFIYLLLYVDDIIITGTNNALINSLIS--------------------QLHATDVG----SQCLAEDVQIFSSSCWCLTLPHVFTP-------

Query:  ------------------IYFIFCQLVIPIHAVSSSCSFGGCQTGSSV--------YCWYFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFL
                           Y  F +  +   AV+S C      T S +        Y    +      +   L LT F+DSDWAG+ +DRRSTTGF+IFL
Subjt:  ------------------IYFIFCQLVIPIHAVSSSCSFGGCQTGSSV--------YCWYFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFL

Query:  GPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        G N ++W +KKQ TVSRSSTEAEYRALA  AAEL
Subjt:  GPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

A0A2N9F9F8 Uncharacterized protein    e value: 7.4e-117    % identity: 29.88
Query:  ASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSS
        +SILKA+ +  ++DG+   P + + ++  + +T+V               NP ++ W  + Q L+ LIN+TLS + L+  VG  S+++ W+ LE  ++S+
Subjt:  ASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSS

Query:  SRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEE-AAIEKQTKHDD
        SR N++NLK +L ++ KK  ESIN Y++++K  +DKL  V  ++D E++    L G P ++  FC ++RT ++ VTF+E+ VLL+TEE +A E      D
Subjt:  SRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEE-AAIEKQTKHDD

Query:  ALTQPAAMFASQSTPNSSQRSNPYGNFGRGRSF-GRGRNQNRGGRG--FNPSGRGNFNQ-GRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHF
          + P AMFA  S PN+   ++    +G    F GRGRN ++ GRG  F  S +  F+Q  +G+   PQ  +G  R  CQIC +LGH A++CY+RM++ +
Subjt:  ALTQPAAMFASQSTPNSSQRSNPYGNFGRGRSF-GRGRNQNRGGRG--FNPSGRGNFNQ-GRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHF

Query:  QGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDDQVSDKS
        QGRHPP +L AM                           A+TS              NG  G   TWLTD+G   HLT++L NL  A+ Y G +Q  D  
Subjt:  QGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDDQVSDKS

Query:  TGKVLFQGPSINGLYPLSSIHSST--TPSC----YVAHVATNKSYSLWHNRLGHPG----------------------------------------HS--
        +GKVL++G S NGLYP+ ++ SS+  +PS       A +++   + LWH+RLGHP                                         HS  
Subjt:  TGKVLFQGPSINGLYPLSSIHSST--TPSC----YVAHVATNKSYSLWHNRLGHPG----------------------------------------HS--

Query:  -------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSC
               LVHSDVWGPAP  S +G+ YY+ F+DD  KF+WL+ +  KS+V   F+ FK  VEN  S  IK LRTD GGEY +   +++ S+ GI HQ SC
Subjt:  -------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSC

Query:  AYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCS--------------------------------------------------------SPMASP--
         +TP+QNG  ERKHRHI++ AL+L+S AS+P+   +                                                         P  +P  
Subjt:  AYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCS--------------------------------------------------------SPMASP--

Query:  --SYQGHN-----INVATD---------------------TANVV---------------------------TNAYAPNISDVLPLSESLPAPATSVDSI
           Y  H      +N AT                      ++N V                            +A AP+ S  +P S S  AP+ S  + 
Subjt:  --SYQGHN-----INVATD---------------------TANVV---------------------------TNAYAPNISDVLPLSESLPAPATSVDSI

Query:  -----------------------PRVQNAHSMQ-------TRGKSGISKRK--------------------------------------------LTPLP
                               P V+   S         TR K GI K K                                            L P P
Subjt:  -----------------------PRVQNAHSMQ-------TRGKSGISKRK--------------------------------------------LTPLP

Query:  PSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKH-------------------------------------------------------
          K+ +GCKWV+++KRN DGSI+RYKARLVAKG+HQQ GID+++                                                        
Subjt:  PSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKH-------------------------------------------------------

Query:  ------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL----HATDVGS----------
                                      S+ F  S ADSSLF  K   D  +LLLYVDDI++TG N++ I  LI  L       D+GS          
Subjt:  ------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL----HATDVGS----------

Query:  ---------------QCLAEDVQIFSSSCWCLTLPHVFTPIY--------FIFCQLVIPIH-----------AVSSSCSFGGCQTGSSV--------YCW
                         L +   +  S C    +PHV    +          +  LV  +H           AV   C F    +   +        Y  
Subjt:  ---------------QCLAEDVQIFSSSCWCLTLPHVFTPIY--------FIFCQLVIPIH-----------AVSSSCSFGGCQTGSSV--------YCW

Query:  YFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
          +    F     + L+ FSD+DWAG   DRRST+G +++LG NP++W AKKQ TVSRSSTEAEYRALAS +AE+
Subjt:  YFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

A0A2N9GCR2 Uncharacterized protein    e value: 9.9e-114    % identity: 28.26
Query:  ASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSS
        +SILKA+ +  F+DG+   P + ++                      S+ NP ++ W  + QAL+TLIN+TLSP  L+  VG  S++  W+ LE+ ++S+
Subjt:  ASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSS

Query:  SRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDA
        SR N++NLK +L ++ KK GESI+ Y++++K  +DKL  V +++D E++    L G P ++  FC ++RT ++ V+F+E+ VLL+TEE ++ + +     
Subjt:  SRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDA

Query:  LTQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFN----------------QGRGSF-YSPQSSDGQGRVSCQICQRLGHS
        L Q  A+FAS +  N +  S          S GRGRN ++ GRG    GR N N                QG+ +F    Q+     R  CQIC +LGH 
Subjt:  LTQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFN----------------QGRGSF-YSPQSSDGQGRVSCQICQRLGHS

Query:  AINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIAS
        A++CY+RM++ +QGRHPP +L AM                           A+TS              NG     S WLTD+G   HLT+++NNL + +
Subjt:  AINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIAS

Query:  KYAGDDQVS------------------------------------DKSTGKVLFQGPSINGLYPLSSIHSS----TTPSCYVAHVATNKSYSLWHNRLGH
         Y G+DQV+                                    D  +GKVL++G S NGLYP+ + H S    T      A +++   + LWH+RLGH
Subjt:  KYAGDDQVS------------------------------------DKSTGKVLFQGPSINGLYPLSSIHSS----TTPSCYVAHVATNKSYSLWHNRLGH

Query:  PG----------------------------------------HS---------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVF
        P                                         HS         L+HSDVWGPAP TS +G+ YY+ F+DD+ +F+WLY +  KSDV + F
Subjt:  PG----------------------------------------HS---------LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVF

Query:  QRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASV------------------------
        + FK  VEN  S +IK LRTD GGEY +   + +  SNGI H  SC +TP+QNG  ERKHRHI++ AL+L+S AS+                        
Subjt:  QRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASV------------------------

Query:  -------------------------------------------PMKFCSSPMASPSY-------------------------------------------
                                                   P  F   P  S  Y                                           
Subjt:  -------------------------------------------PMKFCSSPMASPSY-------------------------------------------

Query:  -------------------QGHNINVATDTANVVTNAYAPNI------------SDVLPLSES---------------LPA------PATSVDSIPRVQN
                              +I+ A     V ++A +PN             S   P+  S               +PA      P +S  ++P    
Subjt:  -------------------QGHNINVATDTANVVTNAYAPNI------------SDVLPLSES---------------LPA------PATSVDSIPRVQN

Query:  AHSMQTRGKSGISKRK--------------------------------------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYH
         H M TR K+GI K K                                            L PLPP K+ +GCKWV+++K+N DG+I+RYKARLVAKG+H
Subjt:  AHSMQTRGKSGISKRK--------------------------------------------LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYH

Query:  QQEGIDY------------------------------------------------------DKH-------------------------------SVRFC
        QQ GID+                                                      D H                               ++ F 
Subjt:  QQEGIDY------------------------------------------------------DKH-------------------------------SVRFC

Query:  GSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL----HATDVGSQCLAEDVQIFSSS-------------------------CWCLTLP
         + ADSSLF+ +      YLLLYVDDI++TG + + +  LIS L       D+G+      +QI  SS                         C    +P
Subjt:  GSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQL----HATDVGSQCLAEDVQIFSSS-------------------------CWCLTLP

Query:  HV---------FTPIYFIFCQLVIPIH-----------AVSSSCSFGGCQTGSSV--------YCWYFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRST
        H           T ++  +  LV  +H           AV   C F    T   +        Y    I    F     + L+ FSD+DWAG   DRRST
Subjt:  HV---------FTPIYFIFCQLVIPIH-----------AVSSSCSFGGCQTGSSV--------YCWYFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRST

Query:  TGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        +G ++FLG NP++W AKKQ TVSRSSTEAEY ALAS +AEL
Subjt:  TGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein    e value: 3.9e-14    % identity: 32.85
Query:  LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQN
        +VHSDV GP    ++D  NY+V F+D    +   Y I  KSDV ++FQ F    E  F+ ++  L  D G EY++  +  +    GI +  +  +TP+ N
Subjt:  LVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQN

Query:  GVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSY
        GV+ER  R I + A +++S A +   F    + + +Y
Subjt:  GVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSY

P04146 Copia protein    e value: 5.2e-03    % identity: 27.87
Query:  SSPMASPSYQGHNINVATDTANVVTNAYAPNISDVLPLSESLPAPATSVDSIPRVQNAHSMQTRGKSGISKRKLTPLPPSKSDIGCKWVYRVKRNPDGSI
        + P  S + + +++N     A+ + N   PN  D +   +     ++  ++I    NAH +        +   +T  P +K+ +  +WV+ VK N  G+ 
Subjt:  SSPMASPSYQGHNINVATDTANVVTNAYAPNISDVLPLSESLPAPATSVDSIPRVQNAHSMQTRGKSGISKRKLTPLPPSKSDIGCKWVYRVKRNPDGSI

Query:  ARYKARLVAKGYHQQEGIDYDK
         RYKARLVA+G+ Q+  IDY++
Subjt:  ARYKARLVAKGYHQQEGIDYDK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 3.6e-20    % identity: 23.64
Query:  KDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTL
        +DW    +   + I   LS   +   +   +++  W  LE  Y S + TN + LK  L ++    G +   ++     L  +LAN+ V ++EED  I  L
Subjt:  KDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTL

Query:  NGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALTQPAAMFASQSTPNSSQR-SNPYGNFG-RGRSFGRGRNQNRGGRGFNPSGRGN
        N  PS ++    ++     ++   ++   L   E   +K      AL        ++    S QR SN YG  G RG+S  + R+++R    +N +  G+
Subjt:  NGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALTQPAAMFASQSTPNSSQR-SNPYGNFG-RGRSFGRGRNQNRGGRGFNPSGRGN

Query:  FNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNV--AYCNSAPGS
        F +       P    G+G  S Q  +   ++A    N  N                 L  +  E+ +  S  ES+       ++ +  V   +C    G 
Subjt:  FNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNV--AYCNSAPGS

Query:  MHNGVSGSSS----------TWLTDSGCNAHL-----TSDLNNLTIA---------SKYAGDDQVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVA
              G++S             T+ GC   L       DL    I+           Y  + +        V+ +G +   LY  ++       +    
Subjt:  MHNGVSGSSS----------TWLTDSGCNAHL-----TSDLNNLTIA---------SKYAGDDQVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVA

Query:  HVATNKSYSLWHNRLGHPGH-----------------------------------------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFT
         +    S  LWH R+GH                                                   LV+SDV GP    S+ G  Y+V+FIDD  +  
Subjt:  HVATNKSYSLWHNRLGHPGH-----------------------------------------------SLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFT

Query:  WLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPM
        W+Y +  K  V  VFQ+F  LVE     ++K LR+D GGEY ++    Y SS+GI H+ +   TP+ NGVAER +R IV+   S++  A +P  F    +
Subjt:  WLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPM

Query:  ASPSY
         +  Y
Subjt:  ASPSY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 7.1e-08    % identity: 50
Query:  KLTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDK
        KL  LP  K  + CKWV+++K++ D  + RYKARLV KG+ Q++GID+D+
Subjt:  KLTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    e value: 1.8e-03    % identity: 38.46
Query:  GSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        GS   L  ++D+D AG   +R+S+TG++       +SW +K Q  V+ S+TEAEY A   T  E+
Subjt:  GSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

P92519 Uncharacterized mitochondrial protein AtMg00810    e value: 1.0e-14    % identity: 34.53
Query:  IYLLLYVDDIIITGTNNALINSLISQLHAT----DVG-------------------SQC-LAEDVQIFSSSCWCLTL-------------------PHVF
        +YLLLYVDDI++TG++N L+N LI QL +T    D+G                   SQ   AE +   +    C  +                   P  F
Subjt:  IYLLLYVDDIIITGTNNALINSLISQLHAT----DVG-------------------SQC-LAEDVQIFSSSCWCLTL-------------------PHVF

Query:  TPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFR-------------KGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKK
          I      L +    +S + +    +        + ++    R             K S L++  F DSDWAG +  RRSTTGF  FLG N +SW AK+
Subjt:  TPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFR-------------KGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKK

Query:  QSTVSRSSTEAEYRALASTAAEL
        Q TVSRSSTE EYRALA TAAEL
Subjt:  QSTVSRSSTEAEYRALASTAAEL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    e value: 2.0e-42    % identity: 26.22
Query:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAK----YQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHY
        ++   ++L GF+DGST  PP  I + A               AP   ++NP Y  W  +    Y A++  I+ ++ PA        T++ Q WE L K Y
Subjt:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAK----YQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHY

Query:  SSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHV-LLKTEEAAIEKQTK
        ++ S  ++  L++ L+  +K   ++I+DY++ +    D+LA +   MD ++     L   P ++      +       T  E+H  LL  E   +   + 
Subjt:  SSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHV-LLKTEEAAIEKQTK

Query:  HDDALTQPAAMFASQSTPNSS---QRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNY
            +T  A    + +T N++    R+N Y N          RN N   + +  S   NF+         QS    G+  CQIC   GHSA  C    ++
Subjt:  HDDALTQPAAMFASQSTPNSS---QRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNY

Query:  --HFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDD--
              + PP+         + W+ +                           N A GS +     SS+ WL DSG   H+TSD NNL++   Y G D  
Subjt:  --HFQGRHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDD--

Query:  -------------------------------------------------------------QVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHV
                                                                     QV D +TG  L QG + + LY      +S+ P    A  
Subjt:  -------------------------------------------------------------QVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHV

Query:  ATNKSYSLWHNRLGHPGHSL------------------------------------------------VHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTW
        ++  ++S WH RLGHP  S+                                                ++SDVW  +P  S D + YYV F+D   ++TW
Subjt:  ATNKSYSLWHNRLGHPGHSL------------------------------------------------VHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTW

Query:  LYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF
        LYP+ +KS V   F  FK L+EN F TRI T  +D GGE++   L  Y S +GI H  S  +TPE NG++ERKHRHIV+  L+L+S AS+P  +
Subjt:  LYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    e value: 7.3e-21    % identity: 25.84
Query:  SESLPAPATSVDSIPRVQNAHSMQTRGKSGISKRKLTPLPPSK-SDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY-----------------
        +ES P  A       R +NA   +   + G     L P PPS  + +GC+W++  K N DGS+ RYKARLVAKGY+Q+ G+DY                 
Subjt:  SESLPAPATSVDSIPRVQNAHSMQTRGKSGISKRKLTPLPPSK-SDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY-----------------

Query:  -------------------------------------DKH-------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYV
                                             DK                                ++ F  S +D+SLFVL+     +Y+L+YV
Subjt:  -------------------------------------DKH-------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYV

Query:  DDIIITGTNNALINSLISQL---------------------------------HATDVGSQC-----------LAEDVQIFSSSCWCLTLPHVFTPI---
        DDI+ITG +  L+++ +  L                                 +  D+ ++            +A   ++   S   LT P  +  I   
Subjt:  DDIIITGTNNALINSLISQL---------------------------------HATDVGSQC-----------LAEDVQIFSSSCWCLTLPHVFTPI---

Query:  --YFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIV---------WSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVS
          Y  F +  I  +AV+    F    T   +     I+             +KG++L L  +SD+DWAG   D  ST G++++LG +P+SW +KKQ  V 
Subjt:  --YFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIV---------WSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVS

Query:  RSSTEAEYRALASTAAEL
        RSSTEAEYR++A+T++E+
Subjt:  RSSTEAEYRALASTAAEL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    e value: 9.1e-40    % identity: 26.16
Query:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAK----YQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHY
        ++   ++L GF+DGST  PP  I + A                  V ++NP Y  W  +    Y A++  I+ ++ PA        T++ Q WE L K Y
Subjt:  SILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAK----YQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHY

Query:  SSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKH
        ++ S  ++  L+                ++ +     D+LA +   MD ++     L   P D+      +       +  E+H  L   E+ +      
Subjt:  SSSSRTNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKH

Query:  DDALTQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQG
             +   + A+  T  ++  +    N G  R++    N N     + PS  G+ +  R      Q     GR  CQIC   GHSA  C     + FQ 
Subjt:  DDALTQPAAMFASQSTPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQG

Query:  RHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDD--------
            T      +  + W                 Q  AN + N  Y              +++ WL DSG   H+TSD NNL+    Y G D        
Subjt:  RHPPTQLVAMNALLSDWREQILCCSKAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDD--------

Query:  -------------------------------------------------------QVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSY
                                                               QV D +TG  L QG + + LY    I SS   S + A   +  ++
Subjt:  -------------------------------------------------------QVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNKSY

Query:  SLWHNRLGHP----------GHSL--------------------------------------VHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIAR
        S WH+RLGHP           HSL                                      ++SDVW  +P  S+D + YYV F+D   ++TWLYP+ +
Subjt:  SLWHNRLGHP----------GHSL--------------------------------------VHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIAR

Query:  KSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF
        KS V   F  FK LVEN F TRI TL +D GGE++   L +YLS +GI H  S  +TPE NG++ERKHRHIV++ L+L+S ASVP  +
Subjt:  KSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAYTPEQNGVAERKHRHIVQVALSLMSQASVPMKF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    e value: 1.2e-20    % identity: 26.23
Query:  PLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY------------------------------------------------------D
        P PPS + +GC+W++  K N DGS+ RYKARLVAKGY+Q+ G+DY                                                      D
Subjt:  PLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY------------------------------------------------------D

Query:  KH-------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQLHATDVGSQCLA----EDV
        K                                +V F  S +D+SLFVL+     IY+L+YVDDI+ITG +  L+       H  D  SQ  +    ED+
Subjt:  KH-------------------------------SVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQLHATDVGSQCLA----EDV

Query:  QIF--------------SSSCWCLTL--------------PHVFTPIYFIFCQLVIP--------------IHAVSSSCSFGGCQTGSSVYCWYFIVWSS
          F              S   + L L              P   +P   +     +P              +       S+   +    ++      W++
Subjt:  QIF--------------SSSCWCLTL--------------PHVFTPIYFIFCQLVIP--------------IHAVSSSCSFGGCQTGSSVYCWYFIVWSS

Query:  -----------------FRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
                          +KG++L L  +SD+DWAG + D  ST G++++LG +P+SW +KKQ  V RSSTEAEYR++A+T++EL
Subjt:  -----------------FRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    e value: 1.5e-13    % identity: 68.75
Query:  KLTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY
        ++  LPP+K  IGCKWVY++K N DG+I RYKARLVAKGY QQEGID+
Subjt:  KLTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDY

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    e value: 4.3e-08    % identity: 45.31
Query:  SSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL
        + + L  FSD+ +      RRST G+ +FLG + +SW +KKQ  VS+SS EAEYRAL+    E+
Subjt:  SSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSSTEAEYRALASTAAEL

ATMG00810.1 DNA/RNA polymerases superfamily protein    e value: 7.2e-16    % identity: 34.53
Query:  IYLLLYVDDIIITGTNNALINSLISQLHAT----DVG-------------------SQC-LAEDVQIFSSSCWCLTL-------------------PHVF
        +YLLLYVDDI++TG++N L+N LI QL +T    D+G                   SQ   AE +   +    C  +                   P  F
Subjt:  IYLLLYVDDIIITGTNNALINSLISQLHAT----DVG-------------------SQC-LAEDVQIFSSSCWCLTL-------------------PHVF

Query:  TPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFR-------------KGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKK
          I      L +    +S + +    +        + ++    R             K S L++  F DSDWAG +  RRSTTGF  FLG N +SW AK+
Subjt:  TPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFR-------------KGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKK

Query:  QSTVSRSSTEAEYRALASTAAEL
        Q TVSRSSTE EYRALA TAAEL
Subjt:  QSTVSRSSTEAEYRALASTAAEL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    e value: 8.6e-09    % identity: 55.56
Query:  LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGI
        L P P +++ +GCKWV++ K + DG++ R KARLVAKG+HQ+EGI
Subjt:  LTPLPPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGI


Sequences
CDS sequence
ATGTCGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATAAGGATGTGTTTGTCAATGGATGTGGCAAGTCTAGTAGCCCATGAGACAACTGCAGTCAA
ATTGATGGAAGCGCTTACAAACAGGTATGAAAATCCTTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATACAAATGTCTGAGGATGCTTCTGTGAATT
CCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAGTTTTCTGATGAGGTGAATGTTATTCAGTTGTTAACGTCTTTACCTGATAGTTGG
GAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGTGCAGCTTCTGTTCA
CTTAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTC
TGAAGACAGAATGTGGAGGTAAATTGGTACTGCGAGATGTCAAGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAGTGGCAGTTGGTCACAGGAAATCTAC
ACTGTACAGATGCAGTTGAATGTTGTCAAAGGTTCAAAGAGACAGTGGATGTTGGTTAAAGCTGCAGATGGTAGTTGTAGAGGAGAGAAAGTTGATGGCTATCGTGAATC
CCCAGTTGTCAGACGCTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGACGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTAAGGTTCACGTAGCTA
GCCCATACCTCGGGCGAGCACAGATGATCTACAGTTATATTCTGATAGTTGCCTATTTCTGGGCTATTTGTCTGCCATTATTCATCAAGGAAATCAACATCATCTTTCAT
ATTGAAAAGTTTCATGGTATCAGAGCATCAATTTTGAAAGCCCACAAATTATTTGGGTTCATTGATGGATCTACAGTATGTCCTCCAAAGATGATTTCATCATCTGCTTT
GTCTTCCTCAACCTCTGTTGCTGCTGCAGCTGATACACCTCCTGCTCCTACTGTTTCTCAAATTAATCCCCTCTATAAAGATTGGGTTGCAAAATACCAAGCCTTAATGA
CGTTGATCAACGCCACACTCTCACCGGCAGCGTTGGCCTATGCTGTTGGTTGTACATCATCCAAACAAGCTTGGGAAGTCTTGGAGAAGCATTATTCCTCGAGTTCAAGA
ACCAACATTGTCAATCTAAAATCTGATCTTCAATCTATCTCTAAGAAACCGGGTGAGTCCATTAATGACTATGTTAAACAAATTAAGGAGCTTAAGGACAAATTAGCTAA
TGTCTCTGTTATTATGGATGAAGAGGATATTCAAATTTATACCCTAAATGGCTCACCCTCTGATTTTAATACATTTTGCAAGTCTATGAGAACCTGTTCACAGTCTGTTA
CTTTCGATGAGCTACATGTTTTATTGAAGACCGAAGAAGCTGCCATTGAAAAACAGACGAAACATGATGATGCCCTAACTCAACCCGCAGCTATGTTTGCATCGCAATCA
ACTCCTAACTCTTCTCAACGTTCAAATCCGTATGGAAATTTTGGTAGAGGAAGATCATTTGGTCGTGGGCGAAATCAAAACCGTGGTGGTCGTGGCTTCAATCCATCTGG
GCGAGGAAATTTCAATCAAGGTAGAGGATCCTTTTATTCTCCACAATCATCTGATGGACAGGGTCGTGTCTCCTGTCAAATATGTCAACGCCTTGGACATAGTGCCATCA
ATTGCTACAATAGAATGAATTACCATTTCCAAGGACGTCATCCACCCACACAATTGGTTGCCATGAATGCATTGCTAAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGC
AAAGCTGAGAGCAAAACTGCCACTTCACAGCTCGAAGCCAACACTTCTCAAAATGTAGCCTACTGTAACTCTGCACCTGGTAGTATGCATAATGGTGTTTCTGGTTCATC
TTCTACTTGGTTAACCGACTCAGGTTGTAATGCCCATCTTACTTCAGATCTCAATAATTTAACTATTGCTTCAAAGTATGCAGGTGATGATCAAGTCTCAGACAAATCTA
CGGGCAAGGTTTTGTTCCAAGGTCCTAGTATCAATGGACTCTATCCTTTATCATCCATTCATTCATCTACTACTCCATCATGTTACGTTGCTCATGTTGCTACAAATAAA
TCTTATTCTCTATGGCATAATCGTTTAGGACATCCTGGCCACTCTCTTGTACACAGTGATGTTTGGGGTCCTGCTCCTAAAACTTCTGTTGATGGCTTTAATTATTATGT
CTCTTTTATTGATGACCACTTTAAGTTTACTTGGTTGTATCCCATTGCTCGCAAGTCTGATGTCCCTACTGTTTTTCAACGCTTCAAACCTCTTGTTGAGAATTTATTTT
CCACTCGAATTAAAACACTTCGAACAGACGGTGGGGGTGAATATATAAATAAGAACCTTTCTAATTATCTTTCTAGTAATGGCATTCTTCACCAAAATTCATGTGCTTAC
ACCCCAGAACAAAATGGCGTTGCCGAACGGAAACACCGCCATATTGTTCAAGTTGCCTTATCTCTAATGTCTCAAGCCTCTGTTCCTATGAAATTTTGTTCTTCTCCAAT
GGCTTCACCATCTTACCAGGGTCATAATATTAATGTTGCCACTGATACCGCTAATGTCGTTACTAATGCTTATGCTCCCAATATTTCTGATGTTTTGCCTCTAAGTGAAT
CTTTACCAGCTCCTGCTACTAGTGTTGATTCGATTCCTCGAGTTCAGAATGCTCATTCAATGCAAACTCGTGGCAAATCAGGGATTTCTAAGCGGAAGCTTACCCCTCTT
CCTCCTAGCAAGAGTGACATTGGTTGTAAATGGGTTTATCGTGTGAAGCGCAATCCAGATGGTTCTATTGCTCGCTACAAGGCTCGTCTTGTTGCTAAGGGATATCATCA
ACAAGAAGGAATTGACTATGATAAACATTCAGTCCGTTTTTGTGGATCACAAGCCGATTCTTCCTTGTTTGTGTTGAAGCTGAATGGTGATTTTATATACCTTCTTCTGT
ATGTCGATGATATCATCATCACAGGCACTAATAATGCTTTGATTAACTCCTTAATCTCTCAATTACATGCTACTGATGTTGGTTCTCAGTGTTTGGCTGAGGATGTTCAA
ATTTTTTCGAGCTCTTGTTGGTGCCTTACACTACCTCACGTTTTCACGCCCATATATTTCATTTTCTGTCAGTTGGTTATCCCAATTCATGCAGTCTCCTCATCATGCTC
ATTTGGTGGCTGCCAAACAGGTTCTTCGGTATATTGTTGGTACTTTATCGTCTGGTCTTCCTTCAGGAAAGGCTCCTCCTTGCACCTCACAACCTTCTCTGATTCTGACT
GGGCTGGTAGTTCACTTGATCGGCGTTCCACCACTGGCTTTGTTATTTTCTTGGGGCCAAATCCAGTGTCTTGGGGTGCCAAAAAGCAGTCCACAGTATCTCGAAGTTCA
ACAGAGGCCGAGTATCGTGCTCTAGCTTCCACTGCTGCCGAGTTGTTTACGTGA
mRNA sequence
ATGTCGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATAAGGATGTGTTTGTCAATGGATGTGGCAAGTCTAGTAGCCCATGAGACAACTGCAGTCAA
ATTGATGGAAGCGCTTACAAACAGGTATGAAAATCCTTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATACAAATGTCTGAGGATGCTTCTGTGAATT
CCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAGTTTTCTGATGAGGTGAATGTTATTCAGTTGTTAACGTCTTTACCTGATAGTTGG
GAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGTGCAGCTTCTGTTCA
CTTAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTC
TGAAGACAGAATGTGGAGGTAAATTGGTACTGCGAGATGTCAAGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAGTGGCAGTTGGTCACAGGAAATCTAC
ACTGTACAGATGCAGTTGAATGTTGTCAAAGGTTCAAAGAGACAGTGGATGTTGGTTAAAGCTGCAGATGGTAGTTGTAGAGGAGAGAAAGTTGATGGCTATCGTGAATC
CCCAGTTGTCAGACGCTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGACGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTAAGGTTCACGTAGCTA
GCCCATACCTCGGGCGAGCACAGATGATCTACAGTTATATTCTGATAGTTGCCTATTTCTGGGCTATTTGTCTGCCATTATTCATCAAGGAAATCAACATCATCTTTCAT
ATTGAAAAGTTTCATGGTATCAGAGCATCAATTTTGAAAGCCCACAAATTATTTGGGTTCATTGATGGATCTACAGTATGTCCTCCAAAGATGATTTCATCATCTGCTTT
GTCTTCCTCAACCTCTGTTGCTGCTGCAGCTGATACACCTCCTGCTCCTACTGTTTCTCAAATTAATCCCCTCTATAAAGATTGGGTTGCAAAATACCAAGCCTTAATGA
CGTTGATCAACGCCACACTCTCACCGGCAGCGTTGGCCTATGCTGTTGGTTGTACATCATCCAAACAAGCTTGGGAAGTCTTGGAGAAGCATTATTCCTCGAGTTCAAGA
ACCAACATTGTCAATCTAAAATCTGATCTTCAATCTATCTCTAAGAAACCGGGTGAGTCCATTAATGACTATGTTAAACAAATTAAGGAGCTTAAGGACAAATTAGCTAA
TGTCTCTGTTATTATGGATGAAGAGGATATTCAAATTTATACCCTAAATGGCTCACCCTCTGATTTTAATACATTTTGCAAGTCTATGAGAACCTGTTCACAGTCTGTTA
CTTTCGATGAGCTACATGTTTTATTGAAGACCGAAGAAGCTGCCATTGAAAAACAGACGAAACATGATGATGCCCTAACTCAACCCGCAGCTATGTTTGCATCGCAATCA
ACTCCTAACTCTTCTCAACGTTCAAATCCGTATGGAAATTTTGGTAGAGGAAGATCATTTGGTCGTGGGCGAAATCAAAACCGTGGTGGTCGTGGCTTCAATCCATCTGG
GCGAGGAAATTTCAATCAAGGTAGAGGATCCTTTTATTCTCCACAATCATCTGATGGACAGGGTCGTGTCTCCTGTCAAATATGTCAACGCCTTGGACATAGTGCCATCA
ATTGCTACAATAGAATGAATTACCATTTCCAAGGACGTCATCCACCCACACAATTGGTTGCCATGAATGCATTGCTAAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGC
AAAGCTGAGAGCAAAACTGCCACTTCACAGCTCGAAGCCAACACTTCTCAAAATGTAGCCTACTGTAACTCTGCACCTGGTAGTATGCATAATGGTGTTTCTGGTTCATC
TTCTACTTGGTTAACCGACTCAGGTTGTAATGCCCATCTTACTTCAGATCTCAATAATTTAACTATTGCTTCAAAGTATGCAGGTGATGATCAAGTCTCAGACAAATCTA
CGGGCAAGGTTTTGTTCCAAGGTCCTAGTATCAATGGACTCTATCCTTTATCATCCATTCATTCATCTACTACTCCATCATGTTACGTTGCTCATGTTGCTACAAATAAA
TCTTATTCTCTATGGCATAATCGTTTAGGACATCCTGGCCACTCTCTTGTACACAGTGATGTTTGGGGTCCTGCTCCTAAAACTTCTGTTGATGGCTTTAATTATTATGT
CTCTTTTATTGATGACCACTTTAAGTTTACTTGGTTGTATCCCATTGCTCGCAAGTCTGATGTCCCTACTGTTTTTCAACGCTTCAAACCTCTTGTTGAGAATTTATTTT
CCACTCGAATTAAAACACTTCGAACAGACGGTGGGGGTGAATATATAAATAAGAACCTTTCTAATTATCTTTCTAGTAATGGCATTCTTCACCAAAATTCATGTGCTTAC
ACCCCAGAACAAAATGGCGTTGCCGAACGGAAACACCGCCATATTGTTCAAGTTGCCTTATCTCTAATGTCTCAAGCCTCTGTTCCTATGAAATTTTGTTCTTCTCCAAT
GGCTTCACCATCTTACCAGGGTCATAATATTAATGTTGCCACTGATACCGCTAATGTCGTTACTAATGCTTATGCTCCCAATATTTCTGATGTTTTGCCTCTAAGTGAAT
CTTTACCAGCTCCTGCTACTAGTGTTGATTCGATTCCTCGAGTTCAGAATGCTCATTCAATGCAAACTCGTGGCAAATCAGGGATTTCTAAGCGGAAGCTTACCCCTCTT
CCTCCTAGCAAGAGTGACATTGGTTGTAAATGGGTTTATCGTGTGAAGCGCAATCCAGATGGTTCTATTGCTCGCTACAAGGCTCGTCTTGTTGCTAAGGGATATCATCA
ACAAGAAGGAATTGACTATGATAAACATTCAGTCCGTTTTTGTGGATCACAAGCCGATTCTTCCTTGTTTGTGTTGAAGCTGAATGGTGATTTTATATACCTTCTTCTGT
ATGTCGATGATATCATCATCACAGGCACTAATAATGCTTTGATTAACTCCTTAATCTCTCAATTACATGCTACTGATGTTGGTTCTCAGTGTTTGGCTGAGGATGTTCAA
ATTTTTTCGAGCTCTTGTTGGTGCCTTACACTACCTCACGTTTTCACGCCCATATATTTCATTTTCTGTCAGTTGGTTATCCCAATTCATGCAGTCTCCTCATCATGCTC
ATTTGGTGGCTGCCAAACAGGTTCTTCGGTATATTGTTGGTACTTTATCGTCTGGTCTTCCTTCAGGAAAGGCTCCTCCTTGCACCTCACAACCTTCTCTGATTCTGACT
GGGCTGGTAGTTCACTTGATCGGCGTTCCACCACTGGCTTTGTTATTTTCTTGGGGCCAAATCCAGTGTCTTGGGGTGCCAAAAAGCAGTCCACAGTATCTCGAAGTTCA
ACAGAGGCCGAGTATCGTGCTCTAGCTTCCACTGCTGCCGAGTTGTTTACGTGA
Protein sequence
MSDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTNRYENPSANNKVYLVKKFFNIQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSW
ETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRSAASVHLASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVKYVPNIKMNLISIGSGSWSQEIY
TVQMQLNVVKGSKRQWMLVKAADGSCRGEKVDGYRESPVVRRSNELKKSLRRVDASKWKARAVAKVKGKVHVASPYLGRAQMIYSYILIVAYFWAICLPLFIKEINIIFH
IEKFHGIRASILKAHKLFGFIDGSTVCPPKMISSSALSSSTSVAAAADTPPAPTVSQINPLYKDWVAKYQALMTLINATLSPAALAYAVGCTSSKQAWEVLEKHYSSSSR
TNIVNLKSDLQSISKKPGESINDYVKQIKELKDKLANVSVIMDEEDIQIYTLNGSPSDFNTFCKSMRTCSQSVTFDELHVLLKTEEAAIEKQTKHDDALTQPAAMFASQS
TPNSSQRSNPYGNFGRGRSFGRGRNQNRGGRGFNPSGRGNFNQGRGSFYSPQSSDGQGRVSCQICQRLGHSAINCYNRMNYHFQGRHPPTQLVAMNALLSDWREQILCCS
KAESKTATSQLEANTSQNVAYCNSAPGSMHNGVSGSSSTWLTDSGCNAHLTSDLNNLTIASKYAGDDQVSDKSTGKVLFQGPSINGLYPLSSIHSSTTPSCYVAHVATNK
SYSLWHNRLGHPGHSLVHSDVWGPAPKTSVDGFNYYVSFIDDHFKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGEYINKNLSNYLSSNGILHQNSCAY
TPEQNGVAERKHRHIVQVALSLMSQASVPMKFCSSPMASPSYQGHNINVATDTANVVTNAYAPNISDVLPLSESLPAPATSVDSIPRVQNAHSMQTRGKSGISKRKLTPL
PPSKSDIGCKWVYRVKRNPDGSIARYKARLVAKGYHQQEGIDYDKHSVRFCGSQADSSLFVLKLNGDFIYLLLYVDDIIITGTNNALINSLISQLHATDVGSQCLAEDVQ
IFSSSCWCLTLPHVFTPIYFIFCQLVIPIHAVSSSCSFGGCQTGSSVYCWYFIVWSSFRKGSSLHLTTFSDSDWAGSSLDRRSTTGFVIFLGPNPVSWGAKKQSTVSRSS
TEAEYRALASTAAELFT