; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016922 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016922
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr12:42374393..42398602
RNA-Seq ExpressionLag0016922
SyntenyLag0016922
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR019734 - Tetratricopeptide repeat
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]0.0e+0042.87Show/hide
Query:  ATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVV
        A+   S+  + L + ++VKLD  NY LW+ +VL ++RG K+DGY+LGT+  P E    +  +  K    N  + EWQ  DQ L GW+  SMT  IA  ++
Subjt:  ATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVV

Query:  SFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSI
          +TS+++W   + + G+ ++++I  L+    + +KG MKM ++L  MK   + L+LAGNP+S  DL+   L GLDS+Y P+V  + D+   +W +L + 
Subjt:  SFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSI

Query:  LIQFEGTLARYSTIPN-----TRDLPDLAAHLAFNQQSQPSGQ--------RSLFQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSA
        L+ FE  + + + + N     T ++ + + H   +  +   G         R   + G++P +      ++ I+                     H    
Subjt:  LIQFEGTLARYSTIPN-----TRDLPDLAAHLAFNQQSQPSGQ--------RSLFQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSA

Query:  ATSNSNSQGGN--SASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIK
          S SN   G+    S +A++A+   + D  W  DSGA+NHV+          E++G  SLVVGNG KL I   GSS +      K + L+++L+VP+I 
Subjt:  ATSNSNSQGGN--SASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIK

Query:  RNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSIS---NKPFGLGLSTEEVSGLSSH--------VVDSASVFHVVKSSTSFLSK--
        +NL+S+++L ADN++ VEF    C VKDK + +V+L+G LKD    +S     P       E       H        V++S  V      + SF     
Subjt:  RNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSIS---NKPFGLGLSTEEVSGLSSH--------VVDSASVFHVVKSSTSFLSK--

Query:  -ALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPF
           +H     +S      PLEL++ D+WGP+P+ ++ G++YY+ FVD+F+R T+IYPLK KSE + +F QFK L EN+F   IK++Q D GGE++     
Subjt:  -ALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPF

Query:  LKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYN
            GI+FR +CP+TSQQNG  ERKHRHI E GL++LAQ+ MPLH+WWEAFS+AV++INRLP+ V +N+SP+     +  D   L+TFGCAC+P L+PYN
Subjt:  LKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYN

Query:  TQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHP-SDFQ
          K Q+H+T+CVFLGYS SHKGYKCL+S GRI+ISRHV+FNED FPF +GF  + S    + +V           P ++ P   + +   D + P  + +
Subjt:  TQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHP-SDFQ

Query:  GASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKE
          + +    SQD+N           S   +    P  +  +H    D+             ++SH + TR+KSGI +PK     L  +     EP + KE
Subjt:  GASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKE

Query:  ALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIR
        AL    WK+AM  E  AL  N TW LV      N++ SKWVFK K   DGS ER KARLVAKGF Q  GID+ ETFSPV+KA T+RIIL++AV   W +R
Subjt:  ALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIR

Query:  QLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPN
        QLD+NNAFLNGHL E VFM QPEG+V+   P+++CKL KA+ GLKQAPRAW+D L++ LL+WGF+N+++DSS+F       +  +LIYVDDI+VTG    
Subjt:  QLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPN

Query:  FISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDI
        F+  FI++LND F+LKD+G L Y+LG+EV R  SGM+L QSKYI DLL +  M +  PC TP +T ++F+ VEG  + +PT++R  IG LQYLTHT PDI
Subjt:  FISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDI

Query:  SYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLA
        ++ VNKLSQ++  P++ HWQG+KR+LRYL+GT+NY L +  S+ L +  ++DADWA  +DDRKS++  C+FLG +L+SWSS+KQ VV+RSS ESEYR+LA
Subjt:  SYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLA

Query:  QTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLL
           AE++W+++ L E+ +     PI+WCDN SA +LA+N V H+RSKHIE+DVH++RD+VLQ  + + YVP++DQ+AD LTKP+S  +F+ LR KLGV+L
Subjt:  QTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLL

Query:  SP
        SP
Subjt:  SP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]0.0e+0043.91Show/hide
Query:  LSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKA
        L ++++VKLD  NY LW+ +VL+++RG K+DGYILGT    +EC E    +  K    NP + +W   DQAL GWL  SM   IA  ++  +TS+++W  
Subjt:  LSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKA

Query:  LEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARY
         + + G+ +K+RI  L+    NT+KG MKM E+L  MK  S+ L+LAG+PIS  DL+   L GLD++Y P+V  + D+   +W ++ + L+ FE  L ++
Subjt:  LEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARY

Query:  S-----TIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQ-IGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAY
        +     T+  + +  +          S+ + +RS F+ +     KG +S     +      +A +    F+ P T          N +++     S SA+
Subjt:  S-----TIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQ-IGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAY

Query:  IATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFH
        IA+P    D +W  DSGA NHV+         NE+NG  SL+VGNG KL I   GS+ ++       + L+++L+VP I +NL+S+++LTADN++ VEF 
Subjt:  IATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFH

Query:  SCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEE-----VSGLSSHVVDSASVFHVVKSSTS-------FLSKALLHRRLGHASDRVLNNPLEL
        +  C VKDK + + +L+G LKD    +SNK   + +S +E     +   ++ V+D       VK S S             LH      S   +  PL L
Subjt:  SCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEE-----VSGLSSHVVDSASVFHVVKSSTS-------FLSKALLHRRLGHASDRVLNNPLEL

Query:  IYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVV
        I+ D+WGP+P+ S  G++YY+ F+D+F+R T+I+PLK KS+ + +F QFK L EN+F   IKI+Q D GGE+++        GI+FR +CP+TSQQNG  
Subjt:  IYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVV

Query:  ERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKG
        ERKHRH+ E+GL++LAQ+ MPL +WWEAFS+AV++INRLP+ V  N+SP+   F +  D ++L+ FGCAC+P L+PYN  K QFH+T+CVF+GYS SHKG
Subjt:  ERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKG

Query:  YKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDE
        YKC++S GRI++SRHV+FNE+ FPF  GF  + +      D         N S +  LPT  + +   D   P +   +  +  ++    N    E +D 
Subjt:  YKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDE

Query:  PLSAISEPTASPV-IEPVSHTSQPD-----LLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSA
            ++   +S   IE  +     D     +  +++  AQ    S++H M+TR+K GI +PK     +A +D    EP SVKEAL    WK+AMD E  A
Subjt:  PLSAISEPTASPV-IEPVSHTSQPD-----LLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSA

Query:  LRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDV
        L  N+TW LV      N+I SKW+FK K  SDGS ER KARLVAKGF Q  G+DF ETFSPVVK+ T+RIIL +AV   W +RQLD+NNAFLNG L E V
Subjt:  LRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDV

Query:  FMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKD
        FM QPEGY++ A P+++CKL KA+ GLKQAPRAWYD LRSTL++WGF+N++ D+S+FF         +LIYVDDI+VTG    F+  F  +LN  ++LKD
Subjt:  FMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKD

Query:  IGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVI
        +G L Y+LGVEVHR DSGM+L Q+KYI D+L +  M +   C TP VT ++F + EG  MSNPTLYR  IGALQYLT+TRPDI++ VNKLSQ++  PT+ 
Subjt:  IGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVI

Query:  HWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQ-------
        HWQG+KR+LRYL+GT N+ L +  S++L +  + DADWA   DDRKS    C+FLG +LVSW+S+KQ VV+RSS ESEYRSLA   AEVS          
Subjt:  HWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQ-------

Query:  ------------AFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGV
                      L+E+ +     P++WCDN SA +LA+N V H+RSKHIE+D+H++RD+VL+  + I YVP++DQ+AD LTKP+   +F  +R KLGV
Subjt:  ------------AFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGV

Query:  LLSPS
         +SPS
Subjt:  LLSPS

KAE8657094.1 hypothetical protein F3Y22_tig00116997pilonHSYRG00993 [Hibiscus syriacus]4.4e-30934.11Show/hide
Query:  SERMNQTLLERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEVWSGKPADYSLLRVFGCSVYYHV---NEGKLEPRAKNGIFVGFGDGVK
        +ERMN++L ERARC+  N GL + FW +AV+ ACYLINR P   +  K + EVW+G    +  LR+FGC  Y HV      KL+ ++K  IF+G+  GVK
Subjt:  SERMNQTLLERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEVWSGKPADYSLLRVFGCSVYYHV---NEGKLEPRAKNGIFVGFGDGVK

Query:  GYRIWSPSENRVILSRNVVFDENSMFNSVVKPAVAE-ESSDVDKQVELQFTSGQDESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIAL
        G++ W P   ++++SR+VVFDE SM        V + E    +K    Q TSG                                  S+ +  +  S+A 
Subjt:  GYRIWSPSENRVILSRNVVFDENSMFNSVVKPAVAE-ESSDVDKQVELQFTSGQDESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIAL

Query:  DGARRVGIRPPQRYGFEDMIAYALQVVEEVDPHESSTYREVVSSSESAQWLAAMGDEIESLDKNQTWELVRRPKERKIVTCKWVFKKKEGISPTEGIKFK
        D  RR  I+PP    FED++++AL +V   DP    T+ + ++S E+ +W+ AM +E+E+L+ NQT         RK                 +G KFK
Subjt:  DGARRVGIRPPQRYGFEDMIAYALQVVEEVDPHESSTYREVVSSSESAQWLAAMGDEIESLDKNQTWELVRRPKERKIVTCKWVFKKKEGISPTEGIKFK

Query:  ARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLELEQLDVKTVFLHGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFD
        AR+VA+GF+Q++GVDY+EIFSPVVRHTSIR +LA+VA  +  LE +DVKT F HG+LE++IYM QPEGF  PG E  VC+LKKSLYGLKQS RQWYKRFD
Subjt:  ARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLELEQLDVKTVFLHGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFD

Query:  GYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKDLLSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGM
         YMI++GYNR  YDCCVY   L  G  I+L+LYVDDMLIAAK   D++ LK LLS EF+MKDLG  +KILGMEI R RD +K +LSQ GY++K+L RF M
Subjt:  GYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKDLLSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGM

Query:  SSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPDLAHVVSVV-------------------------------------KC
        SS KP+ TP     +LSSE+ ++++ E + M++V Y++ VG LMYAMVC RPDL +VVS V                                       
Subjt:  SSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPDLAHVVSVV-------------------------------------KC

Query:  LVTGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLG---LHHDQAVSEDVNFEVLRGLTNALLA
        LV GY   DY GD+D+RRS  GYVFT GG  + WK+T+Q  V+LSTTEAEYM   EAAKEA+WL GLV +LG   L+ +Q+       +V R  T  +  
Subjt:  LVTGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLG---LHHDQAVSEDVNFEVLRGLTNALLA

Query:  AKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYLAKGIILKENGRAGDAERMFIQ
               V+F      +++ +    +A G              E+L  K ++D     +A  ++ + +TT  D F+  L                     
Subjt:  AKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYLAKGIILKENGRAGDAERMFIQ

Query:  ARFFAPENAKMLVDRASFKANAMGDE---ISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSEC
                   L++ +S      G E    S  S A ++ P   +                   L E   ++W G                GT   P + 
Subjt:  ARFFAPENAKMLVDRASFKANAMGDE---ISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSEC

Query:  NEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENL
              TG+   + N   ++  T                                      G   +AR+                               
Subjt:  NEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENL

Query:  QLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGIISK
        QL G    +G  V       D D+V I   + +   S ++E SS +              N  +L  +A     N+                        
Subjt:  QLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGIISK

Query:  VYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLH
                                +++    AATS++N++   +                +W+ DSGAT+HV+ DA ++    + +G  ++VVGNG+ L+
Subjt:  VYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLH

Query:  IDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEEVSGLSSHVVD
        +  +G S +   S ++ + L ++LHVP I ++L+S+++L  DN +F EFH+  C V+D+A+  ++L+G  K             GL T  +     +  +
Subjt:  IDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEEVSGLSSHVVD

Query:  SASVFHVVKSSTSFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSP-----VQSTLGYRYYLSFVDNFT------RLTYIYPLKSKSEALASFKQFKL
        +A  + V     SF    L HRRLGH +   L      I  D+  PS      V   LG  + L FVD+ T      +L + Y         A F+ FK 
Subjt:  SASVFHVVKSSTSFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSP-----VQSTLGYRYYLSFVDNFT------RLTYIYPLKSKSEALASFKQFKL

Query:  LVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWE
        +VE +F  +IK +QSDWGGE+R++   L   GI  R TCPH+S+QNGVVE KHRH++E+ L +LAQ+S+P+ FW  A ++  ++INRLP+ VL N SP+E
Subjt:  LVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWE

Query:  KAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPN
        K F++  D +++  FGC CFP+LR +   K    S +C FLGYS  HK                                                    
Subjt:  KAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPN

Query:  PSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSG
                     +A  D AH       S SL  V              P  +    T +P++E   H +QP +     P   PL+  ++HPM TR+K G
Subjt:  PSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSG

Query:  IFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHE
        IF+PK        + +P  E                M++E                                           RLVAK F Q+PG +F E
Subjt:  IFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHE

Query:  TFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGY--VNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSS
        TFSPVVK PT+ +IL++AV+  WP+R +DVNNAFL+G L+E+VFMQQP G+   +E     VCKL KAL GL+QAPR W ++L+  LL  GF  SRAD S
Subjt:  TFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGY--VNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSS

Query:  VFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLV
        +F   +   +  +L+YVDDI++TG     I R                           +  G+ L Q K+I++LL +  M      STP V S +    
Subjt:  VFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLV

Query:  EGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYG-LFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIF
         G  + +   +RS++GAL Y  HTRPDI Y V+K++QF+ +P   H Q  KR+LRYL GT  YG LF P   +L +VA+ D DW  C+DDR+S++  C+ 
Subjt:  EGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYG-LFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIF

Query:  LGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVP
         G +LVSWS+KK  VV+ S+ E+EYRSLA   AEV+W+ A L +IGV     PI+WCDNTSAISL+ N + H++SKH++LDVHFVR+KV    + + YVP
Subjt:  LGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVP

Query:  SS
        ++
Subjt:  SS

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.6e-30442.29Show/hide
Query:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID
        M     SS   A + P ST AT       H     L +KLD  NY+LWR  +  ++     + +I G K  P +     T +G      NP +  W+  D
Subjt:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID

Query:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV
        + +  W++ S+TP I   +V +++S   W ALE+++ ++S+AR+ QLR   Q T+KGS+ M+E++  +K  ++NL   G P++  D +  +L GL +DY 
Subjt:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV

Query:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV
         IV S+  + D  +   + SIL+  E  L+  +++        ++A+LA        N++S    ++S F        G S       +  L  +    V
Subjt:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV

Query:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC
        V        N    N + ++  T+  N++        A +A+P  ++D  W  D+GAT+H+S     L+    Y GN+ ++VGNG  L I H G+++   
Subjt:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC

Query:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------
         S +K  +L  +LHVP I  NLIS+++  ADN+ F EFH  F  VKD+ +K+++LQG+L+               A VS S ++   L L+T        
Subjt:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------

Query:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF
             + +  H++ S ++ H    +        A  H+   + S    ++PL L++ DLWGP+ + ST G RY++ FVD+F+R ++IYPL SK +AL+ F
Subjt:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF

Query:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN
         +FK LVEN+F S I+ L+SD GGEF++F  +L T GI+ + +CP+T +QNG  ERK RHI+E GL++LA +S+P  FW  AF +A+F+INRLPT VL  
Subjt:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN

Query:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH
        +SP++  F +  +    + FGC C+PY+RPYN  K  + S++CVFLGYS +HKGY CL+  +GR+Y++RHVVF+E  FPFQ+   QSSS         V 
Subjt:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH

Query:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM
         +P P   P S+ P +SL S     T+ P        ++P+ +    +SL + +  P + IS             TS+P               ++ HPM
Subjt:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM

Query:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI
         TRAK+GI + K + S        ++EP +  +A++ S W  AM+ E SAL+RNNTWHLV PP N N+IG KWV+K+K   DG+ +R KARLVA+GF Q 
Subjt:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI

Query:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS
         G+D+ ETFSPVVKA TIRIILA+A++  W + QLDV NAFL+G L E VFMQQP G++N  +P +VCKL+KAL GLKQAPRAWY++L ++LL WGF+ S
Subjt:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS

Query:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK
        RADSS+F   S H V+++LIYVDDILVTG     +S FI RLN  FAL+D+G ++Y+LG+EV R+ +   L Q KY  DLL+R  ML  KP +TP +  +
Subjt:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK

Query:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA
          S ++G P S+ TLYRS +GALQYLT TRPDIS+ VNK  QF+  PT  HW  VKR+LRYLKGTL+YG+ + +S+SL +  YTDADWA C DDR+S   
Subjt:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA

Query:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI
        + IFLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E+ W+Q  LQE+ + SSS P++WCDN SA  LA N VFH+R+KHIE+D+HF+RD VL+K + I
Subjt:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI

Query:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG
        +Y+PS++Q+AD  TK IS  QF + R+KL V+ SP  LRG
Subjt:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.8e-30242.15Show/hide
Query:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID
        M     SS   A + P ST AT       H     L +KLD  NY+LWR  +  ++     + +I G K  P +     T +G      NP +  W+  D
Subjt:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID

Query:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV
        + +  W++ S+TP I   +V +++S   W ALE+++ ++S+AR+ QLR   Q T+KGS+ M+E++  +K  ++NL   G P++  D +  +L GL +DY 
Subjt:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV

Query:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV
         IV S+  + D  +   + SIL+  E  L+  +++        ++A+LA        N++S    ++S F        G S       +  L  +    V
Subjt:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV

Query:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC
        V        N    N + ++  T+  N++        A +A+P  ++D  W  D+GAT+H+S     L+    Y GN+ ++VGNG  L I H G+++   
Subjt:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC

Query:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------
         S +K  +L  +LHVP I  NLIS+++  ADN+ F EFH  F  VKD+ +K+++LQG+L+               A VS S ++   L L+T        
Subjt:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------

Query:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF
               +  H++ S ++ H    +        A  H+   + S    ++PL L++ DLWGP+ + ST G RY++ FVD+F+R ++IYPL SK +AL+ F
Subjt:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF

Query:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN
         +FK LVEN+F S I+ L+SD GGEF++F  +L T GI+ + +CP+T +QNG  ERK RHI+E GL++LA +S+P  FW  AF + +F+INRLPT VL  
Subjt:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN

Query:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH
        +SP++  F +  +    + FGC C+PY+RPYN  K  + S++CVFLGYS +HKGY CL+  +GR+Y++RHVVF+E  FPFQ+   QSSS         V 
Subjt:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH

Query:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM
         +P P   P S+ P +SL S     T+ P        ++P+ +    +SL + +  P + IS             TS+P               ++ HPM
Subjt:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM

Query:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI
         TRAK+GI + K + S        ++EP +  +A++ S W  AM+ E SAL+RNNTWHLV PP N N+IG KWV+K+K   DG+ +R KARLVA+GF Q 
Subjt:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI

Query:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS
         G+D+ ETFSPVVKA TIRIILA+A++  W + QLDV NAFL+G L E VFMQQP G++N  +P +VCKL+KAL GLKQAPRAWY++L ++LL WGF+ S
Subjt:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS

Query:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK
        RADSS+F   S H V+++LIYVDDILVTG     +S FI RLN  FAL+D+G ++Y+LG+EV  + +   L Q KY  DLL+R  ML  KP +TP +  +
Subjt:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK

Query:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA
          S ++G P S+ TLYRS +GALQYLT TRPDIS+ VNK  QF+  PT  HW  VKR+LRYLKGTL+YG+ + +S+SL +  YTDADWA C DDR+S   
Subjt:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA

Query:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI
        + IFLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E+ W+Q  LQE+ + SSS P++WCDN SA  LA N VFH+R+KHIE+D+HF+RD VL+K + I
Subjt:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI

Query:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG
        +Y+PS++Q+AD  TK IS  QF + R+KL V+ SP  LRG
Subjt:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG

TrEMBL top hitse value%identityAlignment
A0A2N9H171 Uncharacterized protein0.0e+0037.63Show/hide
Query:  LSSSTSLANVTLANGSTLSVIGFDTVVSKTRFRLLICLESDMKR---SERMNQTLLERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEV
        L S   +  +   NG   + + FD+   +   +    +    ++   +ERMN+TLLER R +L   G+ R  W EAV TACY+INR P T ID KT  E+
Subjt:  LSSSTSLANVTLANGSTLSVIGFDTVVSKTRFRLLICLESDMKR---SERMNQTLLERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEV

Query:  WSGKPADYSLLRVFGCSVY--YHVNE-GKLEPRAKNGIFVGFGDGVKGYRIWSPSENRVILSRNVVFDENSMFNSVVKPAVAEESSDVDKQVELQFTSGQ
        W+GKPADYS L  FG  VY  Y+  E  KL+P+++  +F+G+ DGVKGYR+W P+  +V++SR+V+F E+ +          +E+    K+     T   
Subjt:  WSGKPADYSLLRVFGCSVY--YHVNE-GKLEPRAKNGIFVGFGDGVKGYRIWSPSENRVILSRNVVFDENSMFNSVVKPAVAEESSDVDKQVELQFTSGQ

Query:  DESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIALDGARRVGIRPPQRYGFEDMIAYALQVVEEVDPHESSTYREVVSSSESAQWLAAM
        +ESQ Q           +VP  D+     E +         P +      R        Y  E  +AY L + E+ +P   ST+ E   S E+  W+ AM
Subjt:  DESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIALDGARRVGIRPPQRYGFEDMIAYALQVVEEVDPHESSTYREVVSSSESAQWLAAM

Query:  GDEIESLDKNQTWELVRRPKERKIVTCKWVFK-KKEGISPTEGIKFKARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLELEQLDVKTVFL
         +EIE+L KN+TW+LV  P+ RK +  KWV+K K++G    E  +++AR+V +GF Q+EG+D+NEIFSPVVR T++R +LAM A  +L LEQLD+KT FL
Subjt:  GDEIESLDKNQTWELVRRPKERKIVTCKWVFK-KKEGISPTEGIKFKARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLELEQLDVKTVFL

Query:  HGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFDGYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKDL
        HGELEEEIYM QPEGF+  GKE+ VCKL KSLYGLKQ+ R WYKRFD +++ LGYNR   D CVY+   GDG  I L+LYVDDML+A   K  I  LK  
Subjt:  HGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFDGYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKDL

Query:  LSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGMSSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPD
        L+ EFEMKDLG A KILGM+IYR R+ +K +LSQK Y++KIL RF M   KPI TP   ++++SS  +  +EAE+  MS+VPYASAVGSLM+AMVC RPD
Subjt:  LSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGMSSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPD

Query:  LAHVVSV-----VKC------LVTGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLGLHHDQAV
        +A  V V       C       V GY   DY GD+D  +S +GYVFT  G  VSW + LQ  V+ STTEAEY+A T+A+KEAIWL+ L+ +L    ++  
Subjt:  LAHVVSV-----VKC------LVTGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLGLHHDQAV

Query:  SEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYLAKG
            +   L    N    +K     VQ+    RE+   ++ GS+   +++ TK ++  +  + +    +  W      +S Y + +      F     + 
Subjt:  SEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYLAKG

Query:  IILKENGRAGDAERMFIQARFFAPENAKMLVDR---ASFKANAMGDEISSSSTAAVSAPGSTAATVISSSFGHP-LSTVLTVKLDEKNYLLWRGMVLAIL
         ++KE       E++    +  A E A+M+  R   A   +     E SSS   +  A  ++ A    ++F  P +S ++++KLD  NYL W    L IL
Subjt:  IILKENGRAGDAERMFIQARFFAPENAKMLVDR---ASFKANAMGDEISSSSTAAVSAPGSTAATVISSSFGHP-LSTVLTVKLDEKNYLLWRGMVLAIL

Query:  RGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKK
        R   +   + G++  P++   +A +    K V NP Y  W   DQ +  WL  ++T  + + V    TSR+VW +L   Y S SK+RI  L+  LQ  ++
Subjt:  RGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKK

Query:  GSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAH--LAFNQQSQ
         S    ++L + K  ++ L   G P+   DL+S++++GL+  +                   ++ I       R S +P      +L  H  L  NQ S 
Subjt:  GSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAH--LAFNQQSQ

Query:  PSGQRSLFQIGESPIKGIISKVYLIIEVPIEVVAE-EVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGN
         + +   F +  +       +  L+    ++     +       P T L   +A  +++NSQ             PE  ++  W ADSGA  H++AD  N
Subjt:  PSGQRSLFQIGESPIKGIISKVYLIIEVPIEVVAE-EVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGN

Query:  LAVKNE-YNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSI
        L +  E Y GN  + VGNG+ L I + GS+  S  + N   KLN++LH P++  NL+SI +  ADN+   E  +   +VKD  + + +L G  +D    I
Subjt:  LAVKNE-YNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSI

Query:  SNKPFGLGLSTEEVSGLSSHVVDSASVFHVVKSSTSFLSKALLHRRLGHAS----DRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYP
            F     + + SGL++++         +++ST      + H RLGH +     R+LNN                      + L   ++  + ++  P
Subjt:  SNKPFGLGLSTEEVSGLSSHVVDSASVFHVVKSSTSFLSKALLHRRLGHAS----DRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYP

Query:  LK-SKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVF
         + +KS+ L   K           ST ++  S     FRS        GI  R +CPHTSQQNGV ERKHRH+V+ GL++LA S +   +W EAF +A++
Subjt:  LK-SKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVF

Query:  IINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLS-SSGRIYISRHVVFNEDDFPFQNGFSQSS
        +INRLPTP L N +P+ K F++  D   LRTFGCAC+P LRPYN  K  F S KCVFLGYS  H+GY+CL  S+ R+YISR VVFNE DFP +       
Subjt:  IINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLS-SSGRIYISRHVVFNEDDFPFQNGFSQSS

Query:  SSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPA
                                  T++S    A+ A+PS+      S+P       +SL  P D P+     P  +P + P   T+ P  +  +  PA
Subjt:  SSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPA

Query:  QPLVASSSHPMQTRAKSGIFRPKNWGSF------------LASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKV
         P ++  +    TR+ +G  RPK +                 ++DLP  EP +   A +   WK AMD E  AL  N TW L   PPN  +I +KWV+++
Subjt:  QPLVASSSHPMQTRAKSGIFRPKNWGSF------------LASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKV

Query:  KRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLD-KALSG
        K+ +DGS +R KARLVAKGF Q  GID+ ETFSP+   P+  +                +  +  N H++           +   F   +C +  K   G
Subjt:  KRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLD-KALSG

Query:  LKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKY
        LKQAPRAW+ RLR +L+  GFK S  D+S+F F    + + VL+YVDDILVTG   + IS  I+ L   F LKD+G LSY+LG+ V R   G+ L QSKY
Subjt:  LKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKY

Query:  ITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSS
        I +LL+R  M+  KPCSTP  +S + SL +G  +S  T YR ++GALQY T TRPDI++ VN+L QF+  PT  HW   KRVLRYLKGT+++GLF  + S
Subjt:  ITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSS

Query:  SLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWV
        SL L A+ D+DWA   DDR+S     IFLG  LVSWS+KKQSVVARSS E+EYR++A TTA++ W+
Subjt:  SLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWV

A0A2Z6MBG6 Integrase catalytic domain-containing protein0.0e+0042.87Show/hide
Query:  ATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVV
        A+   S+  + L + ++VKLD  NY LW+ +VL ++RG K+DGY+LGT+  P E    +  +  K    N  + EWQ  DQ L GW+  SMT  IA  ++
Subjt:  ATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVV

Query:  SFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSI
          +TS+++W   + + G+ ++++I  L+    + +KG MKM ++L  MK   + L+LAGNP+S  DL+   L GLDS+Y P+V  + D+   +W +L + 
Subjt:  SFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSI

Query:  LIQFEGTLARYSTIPN-----TRDLPDLAAHLAFNQQSQPSGQ--------RSLFQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSA
        L+ FE  + + + + N     T ++ + + H   +  +   G         R   + G++P +      ++ I+                     H    
Subjt:  LIQFEGTLARYSTIPN-----TRDLPDLAAHLAFNQQSQPSGQ--------RSLFQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSA

Query:  ATSNSNSQGGN--SASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIK
          S SN   G+    S +A++A+   + D  W  DSGA+NHV+          E++G  SLVVGNG KL I   GSS +      K + L+++L+VP+I 
Subjt:  ATSNSNSQGGN--SASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIK

Query:  RNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSIS---NKPFGLGLSTEEVSGLSSH--------VVDSASVFHVVKSSTSFLSK--
        +NL+S+++L ADN++ VEF    C VKDK + +V+L+G LKD    +S     P       E       H        V++S  V      + SF     
Subjt:  RNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVSIS---NKPFGLGLSTEEVSGLSSH--------VVDSASVFHVVKSSTSFLSK--

Query:  -ALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPF
           +H     +S      PLEL++ D+WGP+P+ ++ G++YY+ FVD+F+R T+IYPLK KSE + +F QFK L EN+F   IK++Q D GGE++     
Subjt:  -ALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPF

Query:  LKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYN
            GI+FR +CP+TSQQNG  ERKHRHI E GL++LAQ+ MPLH+WWEAFS+AV++INRLP+ V +N+SP+     +  D   L+TFGCAC+P L+PYN
Subjt:  LKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYN

Query:  TQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHP-SDFQ
          K Q+H+T+CVFLGYS SHKGYKCL+S GRI+ISRHV+FNED FPF +GF  + S    + +V           P ++ P   + +   D + P  + +
Subjt:  TQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHP-SDFQ

Query:  GASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKE
          + +    SQD+N           S   +    P  +  +H    D+             ++SH + TR+KSGI +PK     L  +     EP + KE
Subjt:  GASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKE

Query:  ALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIR
        AL    WK+AM  E  AL  N TW LV      N++ SKWVFK K   DGS ER KARLVAKGF Q  GID+ ETFSPV+KA T+RIIL++AV   W +R
Subjt:  ALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIR

Query:  QLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPN
        QLD+NNAFLNGHL E VFM QPEG+V+   P+++CKL KA+ GLKQAPRAW+D L++ LL+WGF+N+++DSS+F       +  +LIYVDDI+VTG    
Subjt:  QLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPN

Query:  FISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDI
        F+  FI++LND F+LKD+G L Y+LG+EV R  SGM+L QSKYI DLL +  M +  PC TP +T ++F+ VEG  + +PT++R  IG LQYLTHT PDI
Subjt:  FISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDI

Query:  SYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLA
        ++ VNKLSQ++  P++ HWQG+KR+LRYL+GT+NY L +  S+ L +  ++DADWA  +DDRKS++  C+FLG +L+SWSS+KQ VV+RSS ESEYR+LA
Subjt:  SYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLA

Query:  QTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLL
           AE++W+++ L E+ +     PI+WCDN SA +LA+N V H+RSKHIE+DVH++RD+VLQ  + + YVP++DQ+AD LTKP+S  +F+ LR KLGV+L
Subjt:  QTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLL

Query:  SP
        SP
Subjt:  SP

A0A2Z6P4D5 Integrase catalytic domain-containing protein0.0e+0043.91Show/hide
Query:  LSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKA
        L ++++VKLD  NY LW+ +VL+++RG K+DGYILGT    +EC E    +  K    NP + +W   DQAL GWL  SM   IA  ++  +TS+++W  
Subjt:  LSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKA

Query:  LEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARY
         + + G+ +K+RI  L+    NT+KG MKM E+L  MK  S+ L+LAG+PIS  DL+   L GLD++Y P+V  + D+   +W ++ + L+ FE  L ++
Subjt:  LEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARY

Query:  S-----TIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQ-IGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAY
        +     T+  + +  +          S+ + +RS F+ +     KG +S     +      +A +    F+ P T          N +++     S SA+
Subjt:  S-----TIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQ-IGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAY

Query:  IATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFH
        IA+P    D +W  DSGA NHV+         NE+NG  SL+VGNG KL I   GS+ ++       + L+++L+VP I +NL+S+++LTADN++ VEF 
Subjt:  IATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFH

Query:  SCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEE-----VSGLSSHVVDSASVFHVVKSSTS-------FLSKALLHRRLGHASDRVLNNPLEL
        +  C VKDK + + +L+G LKD    +SNK   + +S +E     +   ++ V+D       VK S S             LH      S   +  PL L
Subjt:  SCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEE-----VSGLSSHVVDSASVFHVVKSSTS-------FLSKALLHRRLGHASDRVLNNPLEL

Query:  IYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVV
        I+ D+WGP+P+ S  G++YY+ F+D+F+R T+I+PLK KS+ + +F QFK L EN+F   IKI+Q D GGE+++        GI+FR +CP+TSQQNG  
Subjt:  IYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVV

Query:  ERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKG
        ERKHRH+ E+GL++LAQ+ MPL +WWEAFS+AV++INRLP+ V  N+SP+   F +  D ++L+ FGCAC+P L+PYN  K QFH+T+CVF+GYS SHKG
Subjt:  ERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKG

Query:  YKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDE
        YKC++S GRI++SRHV+FNE+ FPF  GF  + +      D         N S +  LPT  + +   D   P +   +  +  ++    N    E +D 
Subjt:  YKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDE

Query:  PLSAISEPTASPV-IEPVSHTSQPD-----LLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSA
            ++   +S   IE  +     D     +  +++  AQ    S++H M+TR+K GI +PK     +A +D    EP SVKEAL    WK+AMD E  A
Subjt:  PLSAISEPTASPV-IEPVSHTSQPD-----LLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSA

Query:  LRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDV
        L  N+TW LV      N+I SKW+FK K  SDGS ER KARLVAKGF Q  G+DF ETFSPVVK+ T+RIIL +AV   W +RQLD+NNAFLNG L E V
Subjt:  LRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDV

Query:  FMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKD
        FM QPEGY++ A P+++CKL KA+ GLKQAPRAWYD LRSTL++WGF+N++ D+S+FF         +LIYVDDI+VTG    F+  F  +LN  ++LKD
Subjt:  FMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKD

Query:  IGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVI
        +G L Y+LGVEVHR DSGM+L Q+KYI D+L +  M +   C TP VT ++F + EG  MSNPTLYR  IGALQYLT+TRPDI++ VNKLSQ++  PT+ 
Subjt:  IGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVI

Query:  HWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQ-------
        HWQG+KR+LRYL+GT N+ L +  S++L +  + DADWA   DDRKS    C+FLG +LVSW+S+KQ VV+RSS ESEYRSLA   AEVS          
Subjt:  HWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQ-------

Query:  ------------AFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGV
                      L+E+ +     P++WCDN SA +LA+N V H+RSKHIE+D+H++RD+VL+  + I YVP++DQ+AD LTKP+   +F  +R KLGV
Subjt:  ------------AFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGV

Query:  LLSPS
         +SPS
Subjt:  LLSPS

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-943.2e-30442.29Show/hide
Query:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID
        M     SS   A + P ST AT       H     L +KLD  NY+LWR  +  ++     + +I G K  P +     T +G      NP +  W+  D
Subjt:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID

Query:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV
        + +  W++ S+TP I   +V +++S   W ALE+++ ++S+AR+ QLR   Q T+KGS+ M+E++  +K  ++NL   G P++  D +  +L GL +DY 
Subjt:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV

Query:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV
         IV S+  + D  +   + SIL+  E  L+  +++        ++A+LA        N++S    ++S F        G S       +  L  +    V
Subjt:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV

Query:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC
        V        N    N + ++  T+  N++        A +A+P  ++D  W  D+GAT+H+S     L+    Y GN+ ++VGNG  L I H G+++   
Subjt:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC

Query:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------
         S +K  +L  +LHVP I  NLIS+++  ADN+ F EFH  F  VKD+ +K+++LQG+L+               A VS S ++   L L+T        
Subjt:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------

Query:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF
             + +  H++ S ++ H    +        A  H+   + S    ++PL L++ DLWGP+ + ST G RY++ FVD+F+R ++IYPL SK +AL+ F
Subjt:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF

Query:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN
         +FK LVEN+F S I+ L+SD GGEF++F  +L T GI+ + +CP+T +QNG  ERK RHI+E GL++LA +S+P  FW  AF +A+F+INRLPT VL  
Subjt:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN

Query:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH
        +SP++  F +  +    + FGC C+PY+RPYN  K  + S++CVFLGYS +HKGY CL+  +GR+Y++RHVVF+E  FPFQ+   QSSS         V 
Subjt:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH

Query:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM
         +P P   P S+ P +SL S     T+ P        ++P+ +    +SL + +  P + IS             TS+P               ++ HPM
Subjt:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM

Query:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI
         TRAK+GI + K + S        ++EP +  +A++ S W  AM+ E SAL+RNNTWHLV PP N N+IG KWV+K+K   DG+ +R KARLVA+GF Q 
Subjt:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI

Query:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS
         G+D+ ETFSPVVKA TIRIILA+A++  W + QLDV NAFL+G L E VFMQQP G++N  +P +VCKL+KAL GLKQAPRAWY++L ++LL WGF+ S
Subjt:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS

Query:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK
        RADSS+F   S H V+++LIYVDDILVTG     +S FI RLN  FAL+D+G ++Y+LG+EV R+ +   L Q KY  DLL+R  ML  KP +TP +  +
Subjt:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK

Query:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA
          S ++G P S+ TLYRS +GALQYLT TRPDIS+ VNK  QF+  PT  HW  VKR+LRYLKGTL+YG+ + +S+SL +  YTDADWA C DDR+S   
Subjt:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA

Query:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI
        + IFLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E+ W+Q  LQE+ + SSS P++WCDN SA  LA N VFH+R+KHIE+D+HF+RD VL+K + I
Subjt:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI

Query:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG
        +Y+PS++Q+AD  TK IS  QF + R+KL V+ SP  LRG
Subjt:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-30242.15Show/hide
Query:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID
        M     SS   A + P ST AT       H     L +KLD  NY+LWR  +  ++     + +I G K  P +     T +G      NP +  W+  D
Subjt:  MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTID

Query:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV
        + +  W++ S+TP I   +V +++S   W ALE+++ ++S+AR+ QLR   Q T+KGS+ M+E++  +K  ++NL   G P++  D +  +L GL +DY 
Subjt:  QALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYV

Query:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV
         IV S+  + D  +   + SIL+  E  L+  +++        ++A+LA        N++S    ++S F        G S       +  L  +    V
Subjt:  PIVCSIEDK-DISTWQELSSILIQFEGTLARYSTIPNTRDLPDLAAHLAF-------NQQSQPSGQRSLFQI------GESPIKGIISKVYLIIEVPIEV

Query:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC
        V        N    N + ++  T+  N++        A +A+P  ++D  W  D+GAT+H+S     L+    Y GN+ ++VGNG  L I H G+++   
Subjt:  VAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSC

Query:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------
         S +K  +L  +LHVP I  NLIS+++  ADN+ F EFH  F  VKD+ +K+++LQG+L+               A VS S ++   L L+T        
Subjt:  ASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVMLQGTLKD--------------ANVSIS-NKPFGLGLST--------

Query:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF
               +  H++ S ++ H    +        A  H+   + S    ++PL L++ DLWGP+ + ST G RY++ FVD+F+R ++IYPL SK +AL+ F
Subjt:  --EEVSGLSSHVVDSASVFHVVKSST--SFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASF

Query:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN
         +FK LVEN+F S I+ L+SD GGEF++F  +L T GI+ + +CP+T +QNG  ERK RHI+E GL++LA +S+P  FW  AF + +F+INRLPT VL  
Subjt:  KQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLEN

Query:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH
        +SP++  F +  +    + FGC C+PY+RPYN  K  + S++CVFLGYS +HKGY CL+  +GR+Y++RHVVF+E  FPFQ+   QSSS         V 
Subjt:  KSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVH

Query:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM
         +P P   P S+ P +SL S     T+ P        ++P+ +    +SL + +  P + IS             TS+P               ++ HPM
Subjt:  WLPFPNPSPVSTLP-TSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPM

Query:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI
         TRAK+GI + K + S        ++EP +  +A++ S W  AM+ E SAL+RNNTWHLV PP N N+IG KWV+K+K   DG+ +R KARLVA+GF Q 
Subjt:  QTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQI

Query:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS
         G+D+ ETFSPVVKA TIRIILA+A++  W + QLDV NAFL+G L E VFMQQP G++N  +P +VCKL+KAL GLKQAPRAWY++L ++LL WGF+ S
Subjt:  PGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNS

Query:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK
        RADSS+F   S H V+++LIYVDDILVTG     +S FI RLN  FAL+D+G ++Y+LG+EV  + +   L Q KY  DLL+R  ML  KP +TP +  +
Subjt:  RADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSK

Query:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA
          S ++G P S+ TLYRS +GALQYLT TRPDIS+ VNK  QF+  PT  HW  VKR+LRYLKGTL+YG+ + +S+SL +  YTDADWA C DDR+S   
Subjt:  RFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAA

Query:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI
        + IFLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E+ W+Q  LQE+ + SSS P++WCDN SA  LA N VFH+R+KHIE+D+HF+RD VL+K + I
Subjt:  HCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDI

Query:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG
        +Y+PS++Q+AD  TK IS  QF + R+KL V+ SP  LRG
Subjt:  RYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.7e-11426.23Show/hide
Query:  DEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTS
        D + Y +W+  + A+L  Q V   + G                   L+PN + + W+  ++     +   ++ +      S  T+R++ + L+ VY   S
Subjt:  DEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGSTS

Query:  KARINQLRGVLQNTKKGS-MKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARYSTIPNTRD
         A    LR  L + K  S M ++    I  +    L  AG  I   D +S++L  L S Y  I+ +IE    +  +E  ++       L +   I N  +
Subjt:  KARINQLRGVLQNTKKGS-MKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSILIQFEGTLARYSTIPNTRD

Query:  LPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGII--SKVYLI------IEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEI
                A    +  + + +LF+   +  K I   +  Y +       E  I+      + I N        N      + S G   A     +    +
Subjt:  LPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGII--SKVYLI------IEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPEI

Query:  LNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASG------NKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFH
        +++  ++ DSGA++H+        + +E    +S+ V    K+ +   G    +   G      +  I L ++L       NL+S+ RL  +  + +EF 
Subjt:  LNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASG------NKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFH

Query:  SCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEEVSGL----SSHVVDSASVFHVVKSSTSFLSKALLHR-------------------RLGHA
             +       V   G L +  V I+ + + +    +    L      H+ D   +   +K    F  ++LL+                         
Subjt:  SCFCLVKDKASKRVMLQGTLKDANVSISNKPFGLGLSTEEVSGL----SSHVVDSASVFHVVKSSTSFLSKALLHR-------------------RLGHA

Query:  SDRV-LNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFD--PFLKTVGIEF
         D+  +  PL +++ D+ GP    +     Y++ FVD FT     Y +K KS+  + F+ F    E  F   +  L  D G E+ S +   F    GI +
Subjt:  SDRV-LNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFD--PFLKTVGIEF

Query:  RHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVL--ENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQ-KFQ
          T PHT Q NGV ER  R I E   +M++ + +   FW EA  +A ++INR+P+  L   +K+P+E   N+   L  LR FG   + +++  N Q KF 
Subjt:  RHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVL--ENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQ-KFQ

Query:  FHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDF------PFQNGF------SQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADT
          S K +F+GY     G+K   + + +  ++R VV +E +        F+  F      S++ + P  S  ++     FPN S        L  S  ++ 
Subjt:  FHSTKCVFLGYSFSHKGYKCLSS-SGRIYISRHVVFNEDDF------PFQNGF------SQSSSSPCVSPDVVVHWLPFPNPSPVSTLPTSLSSSAAADT

Query:  AH-PSD------------------FQGASPSLPAVSQDLNLSLGEPMDEPLS---AISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRA
         + P+D                   Q    S  +    LN S     D+ L+       P  S   E   H  +  + +  +     ++   S  ++T+ 
Subjt:  AH-PSD------------------FQGASPSLPAVSQDLNLSLGEPMDEPLS---AISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRA

Query:  KSGIFRPKN------WGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFH
        +       N        +    +D+P +    ++     S W++A++ E++A + NNTW +   P N N++ S+WVF VK +  G+  R KARLVA+GF 
Subjt:  KSGIFRPKN------WGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFH

Query:  QIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFK
        Q   ID+ ETF+PV +  + R IL+L +     + Q+DV  AFLNG L E+++M+ P+G         VCKL+KA+ GLKQA R W++     L +  F 
Subjt:  QIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFK

Query:  NSRADSSVFFFSSPHM--VVLVLIYVDDILV-TGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTP
        NS  D  ++     ++   + VL+YVDD+++ TGD    ++ F + L + F + D+  + +++G+ +   +  ++L QS Y+  +L++  M +    STP
Subjt:  NSRADSSVFFFSSPHM--VVLVLIYVDDILV-TGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTP

Query:  AVTSKRFSLVEGSPMSNPTLYRSVIGALQY-LTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSL--SLVAYTDADWACCV
          +   + L+      N T  RS+IG L Y +  TRPD++  VN LS++  +     WQ +KRVLRYLKGT++  L   ++ +    ++ Y D+DWA   
Subjt:  AVTSKRFSLVEGSPMSNPTLYRSVIGALQY-LTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSL--SLVAYTDADWACCV

Query:  DDRKSIAAHCI-FLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRD
         DRKS   +       +L+ W++K+Q+ VA SS E+EY +L +   E  W++  L  I +   +   ++ DN   IS+ANN   H R+KHI++  HF R+
Subjt:  DDRKSIAAHCI-FLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRD

Query:  KVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVL
        +V    I + Y+P+ +QLAD  TKP+   +F  LR KLG+L
Subjt:  KVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.7e-14142.98Show/hide
Query:  SERMNQTLLERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEVWSGKPADYSLLRVFGCSVYYHV---NEGKLEPRAKNGIFVGFGDGVK
        +ERMN+T++E+ R ML    L + FW EAV TACYLINR P   +  +    VW+ K   YS L+VFGC  + HV      KL+ ++   IF+G+GD   
Subjt:  SERMNQTLLERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEVWSGKPADYSLLRVFGCSVYYHV---NEGKLEPRAKNGIFVGFGDGVK

Query:  GYRIWSPSENRVILSRNVVFDENSMFNSVVKPAVAEESSDVDKQVELQFTSGQDESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIALD
        GYR+W P + +VI SR+VVF E+ +  +    +   ++  +   V +  TS    S     ++     E    V++ G ++DE     +E    P+   +
Subjt:  GYRIWSPSENRVILSRNVVFDENSMFNSVVKPAVAEESSDVDKQVELQFTSGQDESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIALD

Query:  GARRVGIRPPQRYGFEDMIAYALQVVEEVDPHESSTYREVVSSSESAQWLAAMGDEIESLDKNQTWELVRRPKERKIVTCKWVFK-KKEGISPTEGIKFK
          +   +R  +R   E     + + V   D  E  + +EV+S  E  Q + AM +E+ESL KN T++LV  PK ++ + CKWVFK KK+G    + +++K
Subjt:  GARRVGIRPPQRYGFEDMIAYALQVVEEVDPHESSTYREVVSSSESAQWLAAMGDEIESLDKNQTWELVRRPKERKIVTCKWVFK-KKEGISPTEGIKFK

Query:  ARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLELEQLDVKTVFLHGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFD
        AR+V +GF Q++G+D++EIFSPVV+ TSIR +L++ A  +LE+EQLDVKT FLHG+LEEEIYM QPEGF+  GK+  VCKL KSLYGLKQ+ RQWY +FD
Subjt:  ARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLELEQLDVKTVFLHGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFD

Query:  GYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKDLLSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGM
         +M    Y ++  D CVY     +   I L+LYVDDMLI  K K  I KLK  LS  F+MKDLG AQ+ILGM+I R+R  +K +LSQ+ YI+++L RF M
Subjt:  GYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKDLLSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGM

Query:  SSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPDLAHVVSVVK---------------------------CL--------V
         + KP+ TP    ++LS +    +  EK  M++VPY+SAVGSLMYAMVC RPD+AH V VV                            CL        +
Subjt:  SSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPDLAHVVSVVK---------------------------CL--------V

Query:  TGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLGLHHDQAV
         GY+  D  GD+D+R+S TGY+FT  G  +SW++ LQ  V+LSTTEAEY+A TE  KE IWLK  + +LGLH  + V
Subjt:  TGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLGLHHDQAV

P92519 Uncharacterized mitochondrial protein AtMg008103.4e-5346.26Show/hide
Query:  VLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYR
        +L+YVDDIL+TG     ++  I +L+  F++KD+G + Y+LG+++    SG+FL Q+KY   +LN  GML  KP STP +  K  S V  +   +P+ +R
Subjt:  VLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYR

Query:  SVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQ
        S++GALQYLT TRPDISY VN + Q + +PT+  +  +KRVLRY+KGT+ +GL++ ++S L++ A+ D+DWA C   R+S    C FLG +++SWS+K+Q
Subjt:  SVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQ

Query:  SVVARSSAESEYRSLAQTTAEVSWVQA
          V+RSS E+EYR+LA T AE++W  A
Subjt:  SVVARSSAESEYRSLAQTTAEVSWVQA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-25337.63Show/hide
Query:  KLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGS
        KL   NYL+W   V A+  G ++ G++ G+   P     + T+   +    NP Y  W+  D+ +   + G+++ ++   V    T+ ++W+ L K+Y +
Subjt:  KLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGS

Query:  TSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDI-STWQELSSILIQFEGTLARYST---I
         S   + QLR  L+   KG+  + +++  +    + L L G P+   + V  VL  L  +Y P++  I  KD   T  E+   L+  E  +   S+   I
Subjt:  TSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDI-STWQELSSILIQFEGTLARYST---I

Query:  P-----------------NTRDLPDLAAHLAFNQQSQPSGQRSL-FQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNL-HANSAATSNSNSQ
        P                 N  +  +   +   N  S+P  Q S  F    +  K  + K  +            V+    +  + L H  S+  S     
Subjt:  P-----------------NTRDLPDLAAHLAFNQQSQPSGQRSL-FQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNL-HANSAATSNSNSQ

Query:  GGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLT
                A +A     +   WL DSGAT+H+++D  NL++   Y G + ++V +G+ + I H GS+ +S  S  + + L+N+L+VP+I +NLIS+ RL 
Subjt:  GGNSASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLT

Query:  ADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVS---ISNKPFGLGLSTEEVSGLSS-HVVDSASVFHVVKSSTSFLSKALLH--RRLGHASDRVLN-
          N V VEF      VKD  +   +LQG  KD        S++P  L  S    +  SS H         ++ S  S  S ++L+   +    SD ++N 
Subjt:  ADNDVFVEFHSCFCLVKDKASKRVMLQGTLKDANVS---ISNKPFGLGLSTEEVSGLSS-HVVDSASVFHVVKSSTSFLSKALLH--RRLGHASDRVLN-

Query:  --------------NPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKT
                       PLE IY D+W  SP+ S   YRYY+ FVD+FTR T++YPLK KS+   +F  FK L+ENRF + I    SD GGEF +   +   
Subjt:  --------------NPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKT

Query:  VGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQK
         GI    + PHT + NG+ ERKHRHIVE GL++L+ +S+P  +W  AF+ AV++INRLPTP+L+ +SP++K F    +   LR FGCAC+P+LRPYN  K
Subjt:  VGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQK

Query:  FQFHSTKCVFLGYSFSHKGYKCLS-SSGRIYISRHVVFNEDDFPFQNGFS-------QSSSSPCV-SPDVVV---------------HWLPFPNPSPVST
            S +CVFLGYS +   Y CL   + R+YISRHV F+E+ FPF N  +       Q   S CV SP   +               H    P  SP + 
Subjt:  FQFHSTKCVFLGYSFSHKGYKCLS-SSGRIYISRHVVFNEDDFPFQNGFS-------QSSSSPCV-SPDVVV---------------HWLPFPNPSPVST

Query:  LPTSLSSSAAADTAHPSDFQGA-SPSLP----------------AVSQDLNLSLGEPMDEPLSAISE--------------PTASPVIEPVSHTSQPDLL
           S  SS+  D++  S F  +  P+ P                      N S   P +E  S +++              PT S      S T    L+
Subjt:  LPTSLSSSAAADTAHPSDFQGA-SPSLP----------------AVSQDLNLSLGEPMDEPLSAISE--------------PTASPVIEPVSHTSQPDLL

Query:  HSVEPPAQPL-----VASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPP-NVNLIGSKWVFK
        H   P AQ +        ++H M TRAK+GI +P N    LA S    +EP +  +AL+   W+ AM +E++A   N+TW LV PPP +V ++G +W+F 
Subjt:  HSVEPPAQPL-----VASSSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPP-NVNLIGSKWVFK

Query:  VKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSG
         K +SDGS  R KARLVAKG++Q PG+D+ ETFSPV+K+ +IRI+L +AV + WPIRQLDVNNAFL G L++DV+M QP G++++  P+YVCKL KAL G
Subjt:  VKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSG

Query:  LKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKY
        LKQAPRAWY  LR+ LL  GF NS +D+S+F       +V +L+YVDDIL+TG+ P  +   +  L+  F++KD   L Y+LG+E  R  +G+ L Q +Y
Subjt:  LKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKY

Query:  ITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSS
        I DLL R  M+  KP +TP   S + SL  G+ +++PT YR ++G+LQYL  TRPDISY VN+LSQF+  PT  H Q +KR+LRYL GT N+G+FL + +
Subjt:  ITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSS

Query:  SLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFH
        +LSL AY+DADWA   DD  S   + ++LGH  +SWSSKKQ  V RSS E+EYRS+A T++E+ W+ + L E+G+  +  P+++CDN  A  L  N VFH
Subjt:  SLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFH

Query:  SRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLLSP
        SR KHI +D HF+R++V   A+ + +V + DQLAD LTKP+S   F    SK+GV   P
Subjt:  SRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLLSP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-24937.17Show/hide
Query:  KLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGS
        KL   NYL+W   V A+  G ++ G++ G+   P     + T+   +    NP Y  W+  D+ +   + G+++ ++   V    T+ ++W+ L K+Y +
Subjt:  KLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGSMTPAIAADVVSFKTSREVWKALEKVYGS

Query:  TSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDI-STWQELSSILIQFEGTL---------
         S   + QLR + +                    + L L G P+   + V  VL  L  DY P++  I  KD   +  E+   LI  E  L         
Subjt:  TSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDI-STWQELSSILIQFEGTL---------

Query:  ---ARYSTIPNT---RDLPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGIISKVYL----IIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGN
           A   T  NT   R+  +   +  +N  +  S        G S       K YL    I  V     A+   ++    +T     S +         N
Subjt:  ---ARYSTIPNT---RDLPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGIISKVYL----IIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGN

Query:  SASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADN
         A +S Y       N   WL DSGAT+H+++D  NL+    Y G + +++ +G+ + I H GS+  S  + ++ + LN +L+VP+I +NLIS+ RL   N
Subjt:  SASSSAYIATPEILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADN

Query:  DVFVEFHSCFCLVKDKASKRVMLQGTLKD----------ANVSISNKPFG----------LGLSTEEV--SGLSSHVVDSASVFH-VVKSSTSFLSKALL
         V VEF      VKD  +   +LQG  KD            VS+   P            LG  +  +  S +S+H +   +  H ++  S  F++K+  
Subjt:  DVFVEFHSCFCLVKDKASKRVMLQGTLKD----------ANVSISNKPFG----------LGLSTEEV--SGLSSHVVDSASVFH-VVKSSTSFLSKALL

Query:  HRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTV
        H+     S    + PLE IY D+W  SP+ S   YRYY+ FVD+FTR T++YPLK KS+   +F  FK LVENRF + I  L SD GGEF     +L   
Subjt:  HRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLKSKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTV

Query:  GIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKF
        GI    + PHT + NG+ ERKHRHIVEMGL++L+ +S+P  +W  AFS AV++INRLPTP+L+ +SP++K F Q  +   L+ FGCAC+P+LRPYN  K 
Subjt:  GIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENKSPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKF

Query:  QFHSTKCVFLGYSFSHKGYKCLS-SSGRIYISRHVVFNEDDFPFQ-NGFSQSSSS--------------------------PCVSPDVVVHWLPFPNPSP
        +  S +C F+GYS +   Y CL   +GR+Y SRHV F+E  FPF    F  S+S                           PC+ P +     P  +PSP
Subjt:  QFHSTKCVFLGYSFSHKGYKCLS-SSGRIYISRHVVFNEDDFPFQ-NGFSQSSSS--------------------------PCVSPDVVVHWLPFPNPSP

Query:  VSTLPTSLSSSAAADTAHPSDFQGASPS--------LPAVSQDLNLS---LGEPMDE-------------PLSAISEPTASPVIEPVSHTSQPDLLHSVE
        + T   S S+  ++  + PS  +  +PS         P  +Q+ N +   L  P                P S IS P        +S  + P    +  
Subjt:  VSTLPTSLSSSAAADTAHPSDFQGASPS--------LPAVSQDLNLS---LGEPMDE-------------PLSAISEPTASPVIEPVSHTSQPDLLHSVE

Query:  PPAQPLVAS------------SSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLV-APPPNVNLIGSKW
        PP  P++ +            ++H M TRAK GI +P    S+ A+S    +EP +  +A++   W+QAM +E++A   N+TW LV  PPP+V ++G +W
Subjt:  PPAQPLVAS------------SSHPMQTRAKSGIFRPKNWGSFLASSDLPMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLV-APPPNVNLIGSKW

Query:  VFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKA
        +F  K +SDGS  R KARLVAKG++Q PG+D+ ETFSPV+K+ +IRI+L +AV + WPIRQLDVNNAFL G L+++V+M QP G+V++  P YVC+L KA
Subjt:  VFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKA

Query:  LSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQ
        + GLKQAPRAWY  LR+ LL  GF NS +D+S+F       ++ +L+YVDDIL+TG+    +   +  L+  F++K+   L Y+LG+E  R   G+ L Q
Subjt:  LSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQ

Query:  SKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLP
         +Y  DLL R  ML  KP +TP  TS + +L  G+ + +PT YR ++G+LQYL  TRPD+SY VN+LSQ++  PT  HW  +KRVLRYL GT ++G+FL 
Subjt:  SKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLP

Query:  RSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNL
        + ++LSL AY+DADWA   DD  S   + ++LGH  +SWSSKKQ  V RSS E+EYRS+A T++E+ W+ + L E+G+  S  P+++CDN  A  L  N 
Subjt:  RSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNL

Query:  VFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRGIL
        VFHSR KHI LD HF+R++V   A+ + +V + DQLAD LTKP+S V F     K+GV+  P    G+L
Subjt:  VFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRGIL

Arabidopsis top hitse value%identityAlignment
AT1G78915.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-4767.39Show/hide
Query:  QAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYL
        + VS+ ++ EV RGL NA LAAKKPDEAV+FLLD+RERL + K+ +      +  + ++DPIQVELLLGK+YSDWGH+ DAI+VYDQLI+ HP+DFRGYL
Subjt:  QAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYL

Query:  AKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRAS
        AKGIIL+ENG  GDAERMFIQARFFAP  AK LVDR S
Subjt:  AKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRAS

AT1G78915.2 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-4767.39Show/hide
Query:  QAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYL
        + VS+ ++ EV RGL NA LAAKKPDEAV+FLLD+RERL + K+ +      +  + ++DPIQVELLLGK+YSDWGH+ DAI+VYDQLI+ HP+DFRGYL
Subjt:  QAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYL

Query:  AKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRAS
        AKGIIL+ENG  GDAERMFIQARFFAP  AK LVDR S
Subjt:  AKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRAS

AT1G78915.3 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-4767.39Show/hide
Query:  QAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYL
        + VS+ ++ EV RGL NA LAAKKPDEAV+FLLD+RERL + K+ +      +  + ++DPIQVELLLGK+YSDWGH+ DAI+VYDQLI+ HP+DFRGYL
Subjt:  QAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLLDSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYL

Query:  AKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRAS
        AKGIIL+ENG  GDAERMFIQARFFAP  AK LVDR S
Subjt:  AKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRAS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.9e-11441.51Show/hide
Query:  EPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAV
        EP +  EA +   W  AMD+E+ A+   +TW +   PPN   IG KWV+K+K +SDG+ ER KARLVAKG+ Q  GIDF ETFSPV K  ++++ILA++ 
Subjt:  EPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAV

Query:  TKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYV----NEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYV
           + + QLD++NAFLNG L E+++M+ P GY     +   P+ VC L K++ GLKQA R W+ +   TL+ +GF  S +D + F   +  + + VL+YV
Subjt:  TKGWPIRQLDVNNAFLNGHLSEDVFMQQPEGYV----NEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYV

Query:  DDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGA
        DDI++  +    +     +L   F L+D+G L Y+LG+E+ R+ +G+ + Q KY  DLL+  G+L  KP S P   S  FS   G    +   YR +IG 
Subjt:  DDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGA

Query:  LQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVAR
        L YL  TR DIS+ VNKLSQF + P + H Q V ++L Y+KGT+  GLF    + + L  ++DA +  C D R+S   +C+FLG SL+SW SKKQ VV++
Subjt:  LQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVAR

Query:  SSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDK-VLQKAIDIRYVPSSDQLADALTKPISEVQ
        SSAE+EYR+L+  T E+ W+  F +E+ +  S   +++CDNT+AI +A N VFH R+KHIE D H VR++ V Q  +   +    +Q  D  T+ +S + 
Subjt:  SSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDNTSAISLANNLVFHSRSKHIELDVHFVRDK-VLQKAIDIRYVPSSDQLADALTKPISEVQ

Query:  FTALRSKLGVLLSPSRLRGILEHKVIPLFL
           LR  +  ++S   L G LE  +  +FL
Subjt:  FTALRSKLGVLLSPSRLRGILEHKVIPLFL

ATMG00810.1 DNA/RNA polymerases superfamily protein2.4e-5446.26Show/hide
Query:  VLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYR
        +L+YVDDIL+TG     ++  I +L+  F++KD+G + Y+LG+++    SG+FL Q+KY   +LN  GML  KP STP +  K  S V  +   +P+ +R
Subjt:  VLIYVDDILVTGDCPNFISRFIQRLNDVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYR

Query:  SVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQ
        S++GALQYLT TRPDISY VN + Q + +PT+  +  +KRVLRY+KGT+ +GL++ ++S L++ A+ D+DWA C   R+S    C FLG +++SWS+K+Q
Subjt:  SVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQGVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQ

Query:  SVVARSSAESEYRSLAQTTAEVSWVQA
          V+RSS E+EYR+LA T AE++W  A
Subjt:  SVVARSSAESEYRSLAQTTAEVSWVQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCTTTAGAAGATCCAAATTGGAAAGTAGCAGTTATGGAGGAGATGAATGCTCTAAAACAAAATGGAACTTGGGAAATAGTAGAGTTACCAAAAGATAAGAAACC
AGTGGGTTGTAAATGGGTGTTCACCATAAAATGCAATATAGATGGTAGTATTGAAAGATATAAGGCCAGACTAGTAGCTAAAGGCTTCACCCAAACATATGGGATTGATT
ATCGGGAAACGTTTGCCTCAGTTGCTAAAATTAACTCTATTCGAGTGCTTTTGTCTCTTGCAGCTAATTCAGAGTGGCCTCTTTACCAACTTGATGTGAAAAATGCTTTT
CTCAATGGCGACCTTGAGGAAGAAGTTTTTATGCATTTACCACCAGGTTTTGAAAAGGAACTTGGAAGTAATAAAATTTGCAGATTAAAGAAATTTCTATATGGCCTTAA
ACAGTCTCCAAGAGCTTGGTTCGAATGGTCTGGAAAAGTTGTGTCTAGTTATGGGTTTCTTCAAAGTCAGGCTGAACATACTATATTTTATAAACACTCTGAGAATAATA
AAATCTCTATTTTAATTGTCTATGTAGATGATATTATTCTCACAGGTGATGATGAAGTAGGTCTAGTTGATCTCAAGAAAAAACTTGCAACTGAGTTTCAAATCAAGGAT
TTGGGAACGTTAAAATATTTCCTTGGGATGGAATTTGCAAGATCAAGGAAGGGTATCTTTCTAAATCAAAGAAAATATGTACTTGACTTACTTGAAGAAACAAGATTACT
TGGATGCAGAGTTGTTGAAACTCCTATTGAGCCCAATTTGAAATTAGAAGCAGCAAAAGAAGAAGAAATAAAAGATAAGGAGAGATACCCAAGACTTTTGAGGAGGCTAA
TTTATTTGTCTCATACACGTTTGGACATTGCTTTTGCTGTAAGCATGGTAAGTAAGTTTATGCATGCACCTGGACCAACTCATTTTGAAGCTGCCTATAGGATTTTAAGA
TATTTAAAAGGAGCTCCTGGTAAAGACTCTGATGCTACGGATCATATGATGGGTAATCCTAATTTGTTTTCCACTCTTTCTTCATCTACATCCCTTGCTAATGTTACTTT
GGCACAGATGATGGACTCTGATGCTACGGATCATATGATGGGTAATCCTAATTTGTTTTCCACTCTTTCTTCATCTACATCCCTTGCTAATGTTACTTTGGCAAATGGAT
CAACATTGTCTGTTATTGGATTTGACACGGTTGTCTCCAAAACTAGATTTCGATTGTTGATATGCCTTGAATCTGATATGAAGCGCTCCGAACGAATGAATCAAACACTT
CTTGAGAGAGCAAGATGCATGCTTTCTAATGTTGGGTTACTTAGAAGATTTTGGATAGAAGCAGTAAGCACAGCGTGTTATTTGATTAACCGAGGACCTCATACAGGTAT
TGATTGCAAAACATCTTATGAGGTGTGGTCTGGTAAGCCTGCAGATTACTCTTTGCTGAGAGTTTTTGGTTGTTCAGTTTATTATCATGTGAATGAGGGTAAATTAGAAC
CTAGAGCTAAAAATGGTATATTTGTGGGATTTGGTGATGGGGTCAAAGGTTATAGGATTTGGTCACCTTCTGAAAATAGAGTTATACTCAGTAGAAATGTTGTGTTTGAT
GAAAATTCTATGTTTAACTCTGTTGTGAAGCCCGCTGTTGCAGAAGAAAGTAGTGATGTTGATAAACAGGTGGAGCTGCAATTCACTTCTGGTCAGGATGAATCGCAGCA
CCAAGGTGGAGAAGATCAACATGCTGCTATAGAAACTGATGTGCCAGTTGTTGATACAGGTACAAGAATAGATGAGCCAAAGTCTTCTAGTTTAGAAACAGCACGTCAGC
CAAGTATAGCTCTTGATGGAGCTAGGAGAGTTGGCATTCGACCCCCACAGAGGTATGGTTTTGAGGATATGATTGCTTATGCACTACAGGTTGTTGAAGAGGTGGATCCC
CATGAGTCGTCCACCTATAGAGAAGTTGTTTCGAGTAGTGAGTCTGCTCAATGGCTTGCTGCAATGGGAGATGAGATTGAGTCACTTGATAAAAATCAGACTTGGGAGCT
AGTCAGACGACCTAAGGAAAGAAAGATTGTTACTTGCAAATGGGTCTTTAAGAAAAAGGAAGGGATCTCACCTACAGAGGGTATTAAGTTTAAAGCTCGAGTTGTTGCAA
GAGGGTTTACACAGAGAGAAGGAGTTGACTACAATGAAATATTTTCACCAGTGGTCAGACATACTTCCATCAGGGTGCTTCTAGCCATGGTTGCACATCAGAATTTAGAA
CTTGAGCAACTAGATGTGAAGACAGTTTTCTTACATGGAGAGTTGGAAGAGGAGATTTATATGACCCAACCAGAGGGGTTTCAGTGTCCTGGTAAGGAAGACTATGTTTG
CAAGTTGAAGAAGTCTTTATATGGGCTCAAGCAGTCTCTACGACAGTGGTATAAGAGGTTTGACGGTTATATGATTGAGTTAGGCTACAACAGAAGTCCGTATGATTGTT
GTGTGTATCACAACGAGCTTGGTGATGGATTGAAGATTTATCTGGTTCTGTATGTAGATGACATGTTGATAGCTGCTAAGTTCAAGTCTGATATTCTGAAGTTGAAAGAC
CTTCTCAGTGCTGAATTTGAGATGAAAGATTTGGGTGATGCTCAGAAAATTTTGGGTATGGAGATTTATAGGAAAAGAGATAAAAAGAAGTTTTTCTTGTCACAAAAAGG
ATATATTCAGAAGATATTGTCCAGGTTTGGGATGTCTTCAACTAAGCCTATAGATACTCCTAGTGTTGCAAGTGTTCGTTTGTCTTCTGAACGTGCATCACAATCTGAAG
CTGAGAAGCAATACATGTCTCGAGTCCCATATGCTAGTGCAGTAGGAAGTTTGATGTATGCTATGGTTTGCAATAGGCCTGATCTTGCACATGTTGTGAGTGTTGTCAAA
TGTTTAGTGACTGGTTATTCTTATTTTGACTATGTTGGAGATGTTGACAGTAGAAGGTCCATGACTGGTTATGTGTTTACCCCTGGTGGTTTTGTTGTTAGTTGGAAGGC
AACCCTACAGCCTACAGTTAGTTTGTCTACTACTGAGGCTGAGTATATGGCACTAACAGAAGCTGCTAAAGAAGCAATATGGTTGAAAGGACTAGTTGGTGACCTTGGTC
TTCATCATGATCAGGCGGTATCTGAAGATGTCAATTTTGAGGTTCTGCGTGGCCTTACAAATGCGTTACTTGCTGCTAAGAAACCAGATGAGGCTGTTCAATTCCTTTTG
GACTCCCGTGAACGTCTTAAAAGTATGAAATCAGGAAGTATGGCTGGGGGCAAGGAGATGGAAACAAAATTATCGATAGATCCTATTCAAGTTGAGTTACTACTTGGAAA
ATCGTACTCAGACTGGGGACATGTTGGTGATGCTATTTCTGTTTATGATCAACTTATCACCACCCACCCTGATGACTTCCGTGGTTACTTGGCTAAGGGAATTATTCTAA
AGGAAAATGGCAGAGCTGGAGACGCTGAGAGGATGTTCATCCAAGCCCGGTTCTTTGCTCCGGAGAATGCCAAGATGCTTGTAGATCGAGCCTCTTTCAAGGCAAACGCC
ATGGGTGACGAAATTTCTTCATCCTCCACTGCTGCAGTATCAGCTCCAGGATCAACAGCTGCTACAGTCATCAGTTCGTCGTTTGGCCATCCGTTGAGCACTGTTCTTAC
CGTAAAGCTCGACGAGAAGAACTATCTGTTGTGGAGAGGGATGGTTTTAGCCATCCTTCGTGGTCAGAAAGTTGATGGTTATATTCTTGGCACTAAAGCCCAACCATCGG
AATGCAATGAAGTTGCTACTGAGACAGGAACGAAGAAGCTTGTTCCAAATCCACTGTATGAAGAGTGGCAGACAATAGACCAGGCGCTATGCGGTTGGCTCTTTGGATCA
ATGACACCAGCGATTGCAGCTGATGTAGTGAGCTTCAAGACCTCCAGAGAAGTGTGGAAGGCATTAGAGAAGGTTTATGGTTCAACTAGCAAGGCAAGAATAAATCAGCT
CCGCGGGGTTCTTCAGAACACGAAAAAGGGATCAATGAAGATGGTTGAATTCCTAGCCATCATGAAGCAAGCCTCGGAAAATCTTCAACTAGCAGGTAACCCTATTTCCT
TGGGTGATTTGGTGTCCTACGTTTTAGCGGGTCTCGATTCAGACTATGTCCCAATAGTCTGCTCAATTGAAGACAAGGATATCTCGACATGGCAAGAACTATCGTCTATT
TTGATTCAATTCGAGGGCACTCTGGCTCGGTACAGTACAATACCTAATACCAGAGATTTACCAGACCTAGCAGCTCATCTGGCTTTCAATCAACAAAGCCAACCTTCTGG
GCAAAGATCTTTGTTCCAAATCGGGGAGTCTCCAATCAAGGGCATAATCAGCAAAGTTTATCTAATAATAGAGGTTCCAATAGAGGTCGTGGCCGAGGAGGTAGAGGAAA
TTTTCAACGAACCAACAACAAACCTACATGCCAACTCTGCGGCAACAAGTAACAGTAACAGTCAAGGGGGAAATTCTGCTTCATCCTCCGCCTATATTGCAACTCCAGAA
ATTCTGAATGACCCTAAGTGGCTTGCTGATAGTGGAGCCACCAACCACGTCTCGGCAGATGCTGGGAACCTTGCAGTGAAAAATGAGTATAATGGTAACGAATCCTTAGT
TGTTGGTAATGGAACTAAGCTACATATAGATCATATTGGTAGCAGTTATGTTTCTTGTGCTTCTGGAAACAAGATGATTAAATTAAATAACATGCTTCATGTGCCACATA
TTAAGAGGAATTTGATTAGCATTGCCCGATTAACTGCTGATAATGATGTTTTCGTGGAATTTCATTCTTGTTTTTGTCTTGTGAAGGACAAGGCTTCAAAGAGGGTGATG
CTCCAAGGAACGCTTAAGGATGCCAATGTGTCTATTTCAAATAAGCCGTTTGGTCTTGGTCTGTCCACTGAGGAAGTCTCTGGTTTATCGAGTCATGTTGTGGATTCTGC
TTCTGTTTTTCATGTTGTTAAGTCTAGCACGTCTTTTCTGTCTAAAGCTTTATTACATAGGCGTCTTGGTCATGCATCTGATCGAGTTCTCAACAATCCACTTGAATTGA
TCTACTGTGATCTTTGGGGGCCTTCCCCTGTTCAGTCCACATTAGGCTATCGTTATTACCTGAGTTTTGTTGACAACTTCACTCGTCTCACATACATCTACCCTTTAAAA
TCTAAGAGTGAAGCTTTGGCCTCTTTTAAACAATTCAAACTCCTTGTTGAAAATAGATTCGGTTCAACCATTAAAATTCTTCAATCTGATTGGGGAGGGGAGTTTCGCTC
TTTTGATCCTTTTCTTAAAACTGTTGGCATTGAATTTCGGCATACTTGTCCTCACACTAGTCAACAAAACGGTGTTGTAGAACGCAAACATCGCCATATTGTTGAGATGG
GTCTATCCATGCTTGCTCAATCTTCCATGCCCTTACATTTCTGGTGGGAAGCCTTTTCTTCGGCTGTCTTCATTATTAATAGGTTGCCAACTCCTGTTCTTGAGAACAAA
TCTCCTTGGGAAAAAGCCTTTAATCAAATTTCGGACTTGAGCTCCTTAAGGACCTTTGGCTGTGCATGCTTTCCCTATTTACGGCCTTATAACACTCAAAAATTTCAATT
CCATAGCACCAAGTGTGTTTTCCTTGGCTATAGTTTTTCACACAAGGGTTATAAGTGCCTCAGCTCTAGTGGTCGCATATATATATCTCGTCATGTTGTCTTCAATGAGG
ATGATTTCCCTTTCCAAAATGGTTTTTCCCAGTCTTCTTCTTCTCCATGTGTATCTCCTGATGTTGTTGTGCACTGGCTTCCCTTCCCTAATCCGTCTCCTGTGTCCACC
CTGCCTACTTCTTTATCTTCCTCTGCAGCTGCTGATACTGCTCATCCTTCTGATTTTCAAGGAGCTTCTCCCTCTCTACCAGCCGTGTCTCAAGACTTGAACTTGTCGCT
TGGTGAGCCTATGGATGAGCCGTTATCTGCCATCTCTGAGCCTACTGCTTCTCCTGTAATTGAGCCCGTTTCTCATACCTCTCAGCCTGATCTGCTTCACTCGGTTGAAC
CTCCCGCACAGCCTCTTGTTGCTAGCAGCAGTCATCCTATGCAAACAAGGGCTAAAAGTGGTATTTTTCGTCCTAAAAACTGGGGCTCGTTTCTTGCTTCTTCTGATTTG
CCTATGGCTGAACCATTATCTGTAAAAGAAGCTCTCCAATCCTCCTTTTGGAAACAGGCCATGGACAATGAAATGAGTGCTCTTCGACGTAATAATACTTGGCATCTTGT
GGCCCCTCCTCCGAATGTGAATCTTATTGGCAGCAAGTGGGTCTTCAAAGTTAAGCGTCATTCTGATGGATCCTTTGAGCGGTGTAAAGCTCGTCTTGTGGCCAAGGGGT
TTCATCAGATTCCGGGAATTGATTTTCATGAAACCTTTAGTCCTGTTGTGAAAGCTCCAACTATTCGTATTATTCTTGCTCTTGCAGTTACAAAGGGGTGGCCTATCCGA
CAGCTTGATGTAAACAATGCGTTTTTAAATGGCCACTTGTCTGAAGACGTATTTATGCAGCAACCCGAAGGATATGTGAATGAAGCCTTTCCTCATTATGTCTGTAAGCT
TGACAAAGCCCTCTCTGGTTTAAAACAGGCTCCTCGGGCTTGGTATGACAGACTTCGTTCTACTCTGTTGGATTGGGGTTTTAAAAATTCTAGGGCTGATAGTTCGGTTT
TCTTTTTCTCATCACCACACATGGTTGTTCTTGTTCTCATCTATGTAGATGATATTTTGGTAACAGGGGACTGTCCAAATTTTATCTCTCGGTTTATTCAGCGGCTTAAT
GATGTCTTTGCCCTGAAAGACATTGGTCTGTTAAGCTACTACCTTGGAGTTGAAGTTCATCGAACTGACAGTGGTATGTTTCTTCTGCAATCAAAGTATATTACTGATCT
GCTTAATAGAATTGGTATGCTGCACTTGAAGCCTTGCTCCACACCCGCCGTCACTAGCAAACGCTTCTCTTTAGTTGAAGGTTCGCCCATGTCAAATCCTACGTTGTATC
GCAGTGTTATTGGCGCCTTACAGTACCTCACTCACACGCGTCCAGACATATCTTACATTGTTAATAAACTTAGCCAGTTTCTCAAGCAACCGACGGTGATTCACTGGCAG
GGAGTTAAACGTGTTCTCAGGTACTTAAAAGGAACTTTGAACTATGGCCTGTTCCTCCCTCGATCGTCTTCTCTGTCCTTAGTTGCTTATACTGATGCTGACTGGGCATG
CTGTGTGGATGACCGTAAATCTATTGCTGCTCACTGCATCTTTTTAGGCCATTCCTTAGTTTCATGGTCCTCCAAAAAGCAAAGTGTAGTAGCTCGATCGAGTGCTGAGT
CCGAATACCGCTCCCTCGCCCAAACCACTGCTGAAGTATCATGGGTTCAGGCTTTTCTCCAAGAGATAGGAGTGTGCTCTTCCTCAGTTCCTATTGTCTGGTGTGACAAT
ACTAGTGCAATTTCTTTGGCAAATAATCTTGTATTTCACTCTCGAAGCAAACATATTGAGTTAGACGTCCACTTTGTCAGGGATAAAGTTCTTCAAAAGGCTATAGATAT
TCGATATGTACCTTCATCAGATCAGCTTGCAGATGCTCTCACCAAACCCATCTCTGAAGTTCAGTTTACTGCCTTACGTTCCAAACTCGGGGTTCTTCTCTCACCCTCTC
GTTTGAGGGGGATATTAGAGCACAAAGTAATTCCATTATTTCTGAAGAATTACCAGAGGAAGAATCAGCTGCTCAACCATATCATGGCGTTCGACATGGAAGGCTTTATT
AATGGCACTCCAGCTCCGCCGAAGTTCTTAGACGCCGCTCAAGCTCAGGTTAGTCCTCGATATCTGCAAAATGGTAATGTTGAACGGAAAAATAAACATGTTGCCAAAAA
CTATCCTTTCTTCGACAGAATGTCTAAATTCAAAGCTGGTTGTGGTAAAAAGATTCAGTTCTGGGAAGACGTGTGGTTGGATCATGTTCCCCTGAAAAATCTCTTCCCAG
AAATATTAGAATATTTAGCTCATTTCAAATTGGGAAATGGGAAAAGGATTTCATTTTGGAATGACCCGTGGATTGCAAATCGTCTTTTGAAGCTGGAATTCCCTCAGTTA
TTTTCTCTATCTTCTTCTCCTAATGGTTCTGTTTCAGATTATTGGGATGCTTCTCACTTATCTTGGAATGTTATATTTCGTAGATTCTTGAAGGAGGAGGAAATTCCAGA
TTTTCAATCCTTATTGGAAAGTTTATCTTCTATTTTCATTGCTCCTTTTGATGATAAACGTATATGGTCTTTGGAACCTTCAGGGTGTTTTTCTGTGAAGTCTCTTACCC
GTCATTTGGCTTCCTCACTTCCTTTGGAGAAAGAGTTTTTTAATGCTCTGTGGAAAACTAAGAGCCCGAAAAGAACTTCCATCCTTGTGTGGATCATGCTTAATGGTTCC
TTAAATTCTTCAGATGTAGTTATCCTTAGACTACTTATCCTGATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTTTAGAAGATCCAAATTGGAAAGTAGCAGTTATGGAGGAGATGAATGCTCTAAAACAAAATGGAACTTGGGAAATAGTAGAGTTACCAAAAGATAAGAAACC
AGTGGGTTGTAAATGGGTGTTCACCATAAAATGCAATATAGATGGTAGTATTGAAAGATATAAGGCCAGACTAGTAGCTAAAGGCTTCACCCAAACATATGGGATTGATT
ATCGGGAAACGTTTGCCTCAGTTGCTAAAATTAACTCTATTCGAGTGCTTTTGTCTCTTGCAGCTAATTCAGAGTGGCCTCTTTACCAACTTGATGTGAAAAATGCTTTT
CTCAATGGCGACCTTGAGGAAGAAGTTTTTATGCATTTACCACCAGGTTTTGAAAAGGAACTTGGAAGTAATAAAATTTGCAGATTAAAGAAATTTCTATATGGCCTTAA
ACAGTCTCCAAGAGCTTGGTTCGAATGGTCTGGAAAAGTTGTGTCTAGTTATGGGTTTCTTCAAAGTCAGGCTGAACATACTATATTTTATAAACACTCTGAGAATAATA
AAATCTCTATTTTAATTGTCTATGTAGATGATATTATTCTCACAGGTGATGATGAAGTAGGTCTAGTTGATCTCAAGAAAAAACTTGCAACTGAGTTTCAAATCAAGGAT
TTGGGAACGTTAAAATATTTCCTTGGGATGGAATTTGCAAGATCAAGGAAGGGTATCTTTCTAAATCAAAGAAAATATGTACTTGACTTACTTGAAGAAACAAGATTACT
TGGATGCAGAGTTGTTGAAACTCCTATTGAGCCCAATTTGAAATTAGAAGCAGCAAAAGAAGAAGAAATAAAAGATAAGGAGAGATACCCAAGACTTTTGAGGAGGCTAA
TTTATTTGTCTCATACACGTTTGGACATTGCTTTTGCTGTAAGCATGGTAAGTAAGTTTATGCATGCACCTGGACCAACTCATTTTGAAGCTGCCTATAGGATTTTAAGA
TATTTAAAAGGAGCTCCTGGTAAAGACTCTGATGCTACGGATCATATGATGGGTAATCCTAATTTGTTTTCCACTCTTTCTTCATCTACATCCCTTGCTAATGTTACTTT
GGCACAGATGATGGACTCTGATGCTACGGATCATATGATGGGTAATCCTAATTTGTTTTCCACTCTTTCTTCATCTACATCCCTTGCTAATGTTACTTTGGCAAATGGAT
CAACATTGTCTGTTATTGGATTTGACACGGTTGTCTCCAAAACTAGATTTCGATTGTTGATATGCCTTGAATCTGATATGAAGCGCTCCGAACGAATGAATCAAACACTT
CTTGAGAGAGCAAGATGCATGCTTTCTAATGTTGGGTTACTTAGAAGATTTTGGATAGAAGCAGTAAGCACAGCGTGTTATTTGATTAACCGAGGACCTCATACAGGTAT
TGATTGCAAAACATCTTATGAGGTGTGGTCTGGTAAGCCTGCAGATTACTCTTTGCTGAGAGTTTTTGGTTGTTCAGTTTATTATCATGTGAATGAGGGTAAATTAGAAC
CTAGAGCTAAAAATGGTATATTTGTGGGATTTGGTGATGGGGTCAAAGGTTATAGGATTTGGTCACCTTCTGAAAATAGAGTTATACTCAGTAGAAATGTTGTGTTTGAT
GAAAATTCTATGTTTAACTCTGTTGTGAAGCCCGCTGTTGCAGAAGAAAGTAGTGATGTTGATAAACAGGTGGAGCTGCAATTCACTTCTGGTCAGGATGAATCGCAGCA
CCAAGGTGGAGAAGATCAACATGCTGCTATAGAAACTGATGTGCCAGTTGTTGATACAGGTACAAGAATAGATGAGCCAAAGTCTTCTAGTTTAGAAACAGCACGTCAGC
CAAGTATAGCTCTTGATGGAGCTAGGAGAGTTGGCATTCGACCCCCACAGAGGTATGGTTTTGAGGATATGATTGCTTATGCACTACAGGTTGTTGAAGAGGTGGATCCC
CATGAGTCGTCCACCTATAGAGAAGTTGTTTCGAGTAGTGAGTCTGCTCAATGGCTTGCTGCAATGGGAGATGAGATTGAGTCACTTGATAAAAATCAGACTTGGGAGCT
AGTCAGACGACCTAAGGAAAGAAAGATTGTTACTTGCAAATGGGTCTTTAAGAAAAAGGAAGGGATCTCACCTACAGAGGGTATTAAGTTTAAAGCTCGAGTTGTTGCAA
GAGGGTTTACACAGAGAGAAGGAGTTGACTACAATGAAATATTTTCACCAGTGGTCAGACATACTTCCATCAGGGTGCTTCTAGCCATGGTTGCACATCAGAATTTAGAA
CTTGAGCAACTAGATGTGAAGACAGTTTTCTTACATGGAGAGTTGGAAGAGGAGATTTATATGACCCAACCAGAGGGGTTTCAGTGTCCTGGTAAGGAAGACTATGTTTG
CAAGTTGAAGAAGTCTTTATATGGGCTCAAGCAGTCTCTACGACAGTGGTATAAGAGGTTTGACGGTTATATGATTGAGTTAGGCTACAACAGAAGTCCGTATGATTGTT
GTGTGTATCACAACGAGCTTGGTGATGGATTGAAGATTTATCTGGTTCTGTATGTAGATGACATGTTGATAGCTGCTAAGTTCAAGTCTGATATTCTGAAGTTGAAAGAC
CTTCTCAGTGCTGAATTTGAGATGAAAGATTTGGGTGATGCTCAGAAAATTTTGGGTATGGAGATTTATAGGAAAAGAGATAAAAAGAAGTTTTTCTTGTCACAAAAAGG
ATATATTCAGAAGATATTGTCCAGGTTTGGGATGTCTTCAACTAAGCCTATAGATACTCCTAGTGTTGCAAGTGTTCGTTTGTCTTCTGAACGTGCATCACAATCTGAAG
CTGAGAAGCAATACATGTCTCGAGTCCCATATGCTAGTGCAGTAGGAAGTTTGATGTATGCTATGGTTTGCAATAGGCCTGATCTTGCACATGTTGTGAGTGTTGTCAAA
TGTTTAGTGACTGGTTATTCTTATTTTGACTATGTTGGAGATGTTGACAGTAGAAGGTCCATGACTGGTTATGTGTTTACCCCTGGTGGTTTTGTTGTTAGTTGGAAGGC
AACCCTACAGCCTACAGTTAGTTTGTCTACTACTGAGGCTGAGTATATGGCACTAACAGAAGCTGCTAAAGAAGCAATATGGTTGAAAGGACTAGTTGGTGACCTTGGTC
TTCATCATGATCAGGCGGTATCTGAAGATGTCAATTTTGAGGTTCTGCGTGGCCTTACAAATGCGTTACTTGCTGCTAAGAAACCAGATGAGGCTGTTCAATTCCTTTTG
GACTCCCGTGAACGTCTTAAAAGTATGAAATCAGGAAGTATGGCTGGGGGCAAGGAGATGGAAACAAAATTATCGATAGATCCTATTCAAGTTGAGTTACTACTTGGAAA
ATCGTACTCAGACTGGGGACATGTTGGTGATGCTATTTCTGTTTATGATCAACTTATCACCACCCACCCTGATGACTTCCGTGGTTACTTGGCTAAGGGAATTATTCTAA
AGGAAAATGGCAGAGCTGGAGACGCTGAGAGGATGTTCATCCAAGCCCGGTTCTTTGCTCCGGAGAATGCCAAGATGCTTGTAGATCGAGCCTCTTTCAAGGCAAACGCC
ATGGGTGACGAAATTTCTTCATCCTCCACTGCTGCAGTATCAGCTCCAGGATCAACAGCTGCTACAGTCATCAGTTCGTCGTTTGGCCATCCGTTGAGCACTGTTCTTAC
CGTAAAGCTCGACGAGAAGAACTATCTGTTGTGGAGAGGGATGGTTTTAGCCATCCTTCGTGGTCAGAAAGTTGATGGTTATATTCTTGGCACTAAAGCCCAACCATCGG
AATGCAATGAAGTTGCTACTGAGACAGGAACGAAGAAGCTTGTTCCAAATCCACTGTATGAAGAGTGGCAGACAATAGACCAGGCGCTATGCGGTTGGCTCTTTGGATCA
ATGACACCAGCGATTGCAGCTGATGTAGTGAGCTTCAAGACCTCCAGAGAAGTGTGGAAGGCATTAGAGAAGGTTTATGGTTCAACTAGCAAGGCAAGAATAAATCAGCT
CCGCGGGGTTCTTCAGAACACGAAAAAGGGATCAATGAAGATGGTTGAATTCCTAGCCATCATGAAGCAAGCCTCGGAAAATCTTCAACTAGCAGGTAACCCTATTTCCT
TGGGTGATTTGGTGTCCTACGTTTTAGCGGGTCTCGATTCAGACTATGTCCCAATAGTCTGCTCAATTGAAGACAAGGATATCTCGACATGGCAAGAACTATCGTCTATT
TTGATTCAATTCGAGGGCACTCTGGCTCGGTACAGTACAATACCTAATACCAGAGATTTACCAGACCTAGCAGCTCATCTGGCTTTCAATCAACAAAGCCAACCTTCTGG
GCAAAGATCTTTGTTCCAAATCGGGGAGTCTCCAATCAAGGGCATAATCAGCAAAGTTTATCTAATAATAGAGGTTCCAATAGAGGTCGTGGCCGAGGAGGTAGAGGAAA
TTTTCAACGAACCAACAACAAACCTACATGCCAACTCTGCGGCAACAAGTAACAGTAACAGTCAAGGGGGAAATTCTGCTTCATCCTCCGCCTATATTGCAACTCCAGAA
ATTCTGAATGACCCTAAGTGGCTTGCTGATAGTGGAGCCACCAACCACGTCTCGGCAGATGCTGGGAACCTTGCAGTGAAAAATGAGTATAATGGTAACGAATCCTTAGT
TGTTGGTAATGGAACTAAGCTACATATAGATCATATTGGTAGCAGTTATGTTTCTTGTGCTTCTGGAAACAAGATGATTAAATTAAATAACATGCTTCATGTGCCACATA
TTAAGAGGAATTTGATTAGCATTGCCCGATTAACTGCTGATAATGATGTTTTCGTGGAATTTCATTCTTGTTTTTGTCTTGTGAAGGACAAGGCTTCAAAGAGGGTGATG
CTCCAAGGAACGCTTAAGGATGCCAATGTGTCTATTTCAAATAAGCCGTTTGGTCTTGGTCTGTCCACTGAGGAAGTCTCTGGTTTATCGAGTCATGTTGTGGATTCTGC
TTCTGTTTTTCATGTTGTTAAGTCTAGCACGTCTTTTCTGTCTAAAGCTTTATTACATAGGCGTCTTGGTCATGCATCTGATCGAGTTCTCAACAATCCACTTGAATTGA
TCTACTGTGATCTTTGGGGGCCTTCCCCTGTTCAGTCCACATTAGGCTATCGTTATTACCTGAGTTTTGTTGACAACTTCACTCGTCTCACATACATCTACCCTTTAAAA
TCTAAGAGTGAAGCTTTGGCCTCTTTTAAACAATTCAAACTCCTTGTTGAAAATAGATTCGGTTCAACCATTAAAATTCTTCAATCTGATTGGGGAGGGGAGTTTCGCTC
TTTTGATCCTTTTCTTAAAACTGTTGGCATTGAATTTCGGCATACTTGTCCTCACACTAGTCAACAAAACGGTGTTGTAGAACGCAAACATCGCCATATTGTTGAGATGG
GTCTATCCATGCTTGCTCAATCTTCCATGCCCTTACATTTCTGGTGGGAAGCCTTTTCTTCGGCTGTCTTCATTATTAATAGGTTGCCAACTCCTGTTCTTGAGAACAAA
TCTCCTTGGGAAAAAGCCTTTAATCAAATTTCGGACTTGAGCTCCTTAAGGACCTTTGGCTGTGCATGCTTTCCCTATTTACGGCCTTATAACACTCAAAAATTTCAATT
CCATAGCACCAAGTGTGTTTTCCTTGGCTATAGTTTTTCACACAAGGGTTATAAGTGCCTCAGCTCTAGTGGTCGCATATATATATCTCGTCATGTTGTCTTCAATGAGG
ATGATTTCCCTTTCCAAAATGGTTTTTCCCAGTCTTCTTCTTCTCCATGTGTATCTCCTGATGTTGTTGTGCACTGGCTTCCCTTCCCTAATCCGTCTCCTGTGTCCACC
CTGCCTACTTCTTTATCTTCCTCTGCAGCTGCTGATACTGCTCATCCTTCTGATTTTCAAGGAGCTTCTCCCTCTCTACCAGCCGTGTCTCAAGACTTGAACTTGTCGCT
TGGTGAGCCTATGGATGAGCCGTTATCTGCCATCTCTGAGCCTACTGCTTCTCCTGTAATTGAGCCCGTTTCTCATACCTCTCAGCCTGATCTGCTTCACTCGGTTGAAC
CTCCCGCACAGCCTCTTGTTGCTAGCAGCAGTCATCCTATGCAAACAAGGGCTAAAAGTGGTATTTTTCGTCCTAAAAACTGGGGCTCGTTTCTTGCTTCTTCTGATTTG
CCTATGGCTGAACCATTATCTGTAAAAGAAGCTCTCCAATCCTCCTTTTGGAAACAGGCCATGGACAATGAAATGAGTGCTCTTCGACGTAATAATACTTGGCATCTTGT
GGCCCCTCCTCCGAATGTGAATCTTATTGGCAGCAAGTGGGTCTTCAAAGTTAAGCGTCATTCTGATGGATCCTTTGAGCGGTGTAAAGCTCGTCTTGTGGCCAAGGGGT
TTCATCAGATTCCGGGAATTGATTTTCATGAAACCTTTAGTCCTGTTGTGAAAGCTCCAACTATTCGTATTATTCTTGCTCTTGCAGTTACAAAGGGGTGGCCTATCCGA
CAGCTTGATGTAAACAATGCGTTTTTAAATGGCCACTTGTCTGAAGACGTATTTATGCAGCAACCCGAAGGATATGTGAATGAAGCCTTTCCTCATTATGTCTGTAAGCT
TGACAAAGCCCTCTCTGGTTTAAAACAGGCTCCTCGGGCTTGGTATGACAGACTTCGTTCTACTCTGTTGGATTGGGGTTTTAAAAATTCTAGGGCTGATAGTTCGGTTT
TCTTTTTCTCATCACCACACATGGTTGTTCTTGTTCTCATCTATGTAGATGATATTTTGGTAACAGGGGACTGTCCAAATTTTATCTCTCGGTTTATTCAGCGGCTTAAT
GATGTCTTTGCCCTGAAAGACATTGGTCTGTTAAGCTACTACCTTGGAGTTGAAGTTCATCGAACTGACAGTGGTATGTTTCTTCTGCAATCAAAGTATATTACTGATCT
GCTTAATAGAATTGGTATGCTGCACTTGAAGCCTTGCTCCACACCCGCCGTCACTAGCAAACGCTTCTCTTTAGTTGAAGGTTCGCCCATGTCAAATCCTACGTTGTATC
GCAGTGTTATTGGCGCCTTACAGTACCTCACTCACACGCGTCCAGACATATCTTACATTGTTAATAAACTTAGCCAGTTTCTCAAGCAACCGACGGTGATTCACTGGCAG
GGAGTTAAACGTGTTCTCAGGTACTTAAAAGGAACTTTGAACTATGGCCTGTTCCTCCCTCGATCGTCTTCTCTGTCCTTAGTTGCTTATACTGATGCTGACTGGGCATG
CTGTGTGGATGACCGTAAATCTATTGCTGCTCACTGCATCTTTTTAGGCCATTCCTTAGTTTCATGGTCCTCCAAAAAGCAAAGTGTAGTAGCTCGATCGAGTGCTGAGT
CCGAATACCGCTCCCTCGCCCAAACCACTGCTGAAGTATCATGGGTTCAGGCTTTTCTCCAAGAGATAGGAGTGTGCTCTTCCTCAGTTCCTATTGTCTGGTGTGACAAT
ACTAGTGCAATTTCTTTGGCAAATAATCTTGTATTTCACTCTCGAAGCAAACATATTGAGTTAGACGTCCACTTTGTCAGGGATAAAGTTCTTCAAAAGGCTATAGATAT
TCGATATGTACCTTCATCAGATCAGCTTGCAGATGCTCTCACCAAACCCATCTCTGAAGTTCAGTTTACTGCCTTACGTTCCAAACTCGGGGTTCTTCTCTCACCCTCTC
GTTTGAGGGGGATATTAGAGCACAAAGTAATTCCATTATTTCTGAAGAATTACCAGAGGAAGAATCAGCTGCTCAACCATATCATGGCGTTCGACATGGAAGGCTTTATT
AATGGCACTCCAGCTCCGCCGAAGTTCTTAGACGCCGCTCAAGCTCAGGTTAGTCCTCGATATCTGCAAAATGGTAATGTTGAACGGAAAAATAAACATGTTGCCAAAAA
CTATCCTTTCTTCGACAGAATGTCTAAATTCAAAGCTGGTTGTGGTAAAAAGATTCAGTTCTGGGAAGACGTGTGGTTGGATCATGTTCCCCTGAAAAATCTCTTCCCAG
AAATATTAGAATATTTAGCTCATTTCAAATTGGGAAATGGGAAAAGGATTTCATTTTGGAATGACCCGTGGATTGCAAATCGTCTTTTGAAGCTGGAATTCCCTCAGTTA
TTTTCTCTATCTTCTTCTCCTAATGGTTCTGTTTCAGATTATTGGGATGCTTCTCACTTATCTTGGAATGTTATATTTCGTAGATTCTTGAAGGAGGAGGAAATTCCAGA
TTTTCAATCCTTATTGGAAAGTTTATCTTCTATTTTCATTGCTCCTTTTGATGATAAACGTATATGGTCTTTGGAACCTTCAGGGTGTTTTTCTGTGAAGTCTCTTACCC
GTCATTTGGCTTCCTCACTTCCTTTGGAGAAAGAGTTTTTTAATGCTCTGTGGAAAACTAAGAGCCCGAAAAGAACTTCCATCCTTGTGTGGATCATGCTTAATGGTTCC
TTAAATTCTTCAGATGTAGTTATCCTTAGACTACTTATCCTGATGTAG
Protein sequenceShow/hide protein sequence
MKALEDPNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNIDGSIERYKARLVAKGFTQTYGIDYRETFASVAKINSIRVLLSLAANSEWPLYQLDVKNAF
LNGDLEEEVFMHLPPGFEKELGSNKICRLKKFLYGLKQSPRAWFEWSGKVVSSYGFLQSQAEHTIFYKHSENNKISILIVYVDDIILTGDDEVGLVDLKKKLATEFQIKD
LGTLKYFLGMEFARSRKGIFLNQRKYVLDLLEETRLLGCRVVETPIEPNLKLEAAKEEEIKDKERYPRLLRRLIYLSHTRLDIAFAVSMVSKFMHAPGPTHFEAAYRILR
YLKGAPGKDSDATDHMMGNPNLFSTLSSSTSLANVTLAQMMDSDATDHMMGNPNLFSTLSSSTSLANVTLANGSTLSVIGFDTVVSKTRFRLLICLESDMKRSERMNQTL
LERARCMLSNVGLLRRFWIEAVSTACYLINRGPHTGIDCKTSYEVWSGKPADYSLLRVFGCSVYYHVNEGKLEPRAKNGIFVGFGDGVKGYRIWSPSENRVILSRNVVFD
ENSMFNSVVKPAVAEESSDVDKQVELQFTSGQDESQHQGGEDQHAAIETDVPVVDTGTRIDEPKSSSLETARQPSIALDGARRVGIRPPQRYGFEDMIAYALQVVEEVDP
HESSTYREVVSSSESAQWLAAMGDEIESLDKNQTWELVRRPKERKIVTCKWVFKKKEGISPTEGIKFKARVVARGFTQREGVDYNEIFSPVVRHTSIRVLLAMVAHQNLE
LEQLDVKTVFLHGELEEEIYMTQPEGFQCPGKEDYVCKLKKSLYGLKQSLRQWYKRFDGYMIELGYNRSPYDCCVYHNELGDGLKIYLVLYVDDMLIAAKFKSDILKLKD
LLSAEFEMKDLGDAQKILGMEIYRKRDKKKFFLSQKGYIQKILSRFGMSSTKPIDTPSVASVRLSSERASQSEAEKQYMSRVPYASAVGSLMYAMVCNRPDLAHVVSVVK
CLVTGYSYFDYVGDVDSRRSMTGYVFTPGGFVVSWKATLQPTVSLSTTEAEYMALTEAAKEAIWLKGLVGDLGLHHDQAVSEDVNFEVLRGLTNALLAAKKPDEAVQFLL
DSRERLKSMKSGSMAGGKEMETKLSIDPIQVELLLGKSYSDWGHVGDAISVYDQLITTHPDDFRGYLAKGIILKENGRAGDAERMFIQARFFAPENAKMLVDRASFKANA
MGDEISSSSTAAVSAPGSTAATVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYILGTKAQPSECNEVATETGTKKLVPNPLYEEWQTIDQALCGWLFGS
MTPAIAADVVSFKTSREVWKALEKVYGSTSKARINQLRGVLQNTKKGSMKMVEFLAIMKQASENLQLAGNPISLGDLVSYVLAGLDSDYVPIVCSIEDKDISTWQELSSI
LIQFEGTLARYSTIPNTRDLPDLAAHLAFNQQSQPSGQRSLFQIGESPIKGIISKVYLIIEVPIEVVAEEVEEIFNEPTTNLHANSAATSNSNSQGGNSASSSAYIATPE
ILNDPKWLADSGATNHVSADAGNLAVKNEYNGNESLVVGNGTKLHIDHIGSSYVSCASGNKMIKLNNMLHVPHIKRNLISIARLTADNDVFVEFHSCFCLVKDKASKRVM
LQGTLKDANVSISNKPFGLGLSTEEVSGLSSHVVDSASVFHVVKSSTSFLSKALLHRRLGHASDRVLNNPLELIYCDLWGPSPVQSTLGYRYYLSFVDNFTRLTYIYPLK
SKSEALASFKQFKLLVENRFGSTIKILQSDWGGEFRSFDPFLKTVGIEFRHTCPHTSQQNGVVERKHRHIVEMGLSMLAQSSMPLHFWWEAFSSAVFIINRLPTPVLENK
SPWEKAFNQISDLSSLRTFGCACFPYLRPYNTQKFQFHSTKCVFLGYSFSHKGYKCLSSSGRIYISRHVVFNEDDFPFQNGFSQSSSSPCVSPDVVVHWLPFPNPSPVST
LPTSLSSSAAADTAHPSDFQGASPSLPAVSQDLNLSLGEPMDEPLSAISEPTASPVIEPVSHTSQPDLLHSVEPPAQPLVASSSHPMQTRAKSGIFRPKNWGSFLASSDL
PMAEPLSVKEALQSSFWKQAMDNEMSALRRNNTWHLVAPPPNVNLIGSKWVFKVKRHSDGSFERCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILALAVTKGWPIR
QLDVNNAFLNGHLSEDVFMQQPEGYVNEAFPHYVCKLDKALSGLKQAPRAWYDRLRSTLLDWGFKNSRADSSVFFFSSPHMVVLVLIYVDDILVTGDCPNFISRFIQRLN
DVFALKDIGLLSYYLGVEVHRTDSGMFLLQSKYITDLLNRIGMLHLKPCSTPAVTSKRFSLVEGSPMSNPTLYRSVIGALQYLTHTRPDISYIVNKLSQFLKQPTVIHWQ
GVKRVLRYLKGTLNYGLFLPRSSSLSLVAYTDADWACCVDDRKSIAAHCIFLGHSLVSWSSKKQSVVARSSAESEYRSLAQTTAEVSWVQAFLQEIGVCSSSVPIVWCDN
TSAISLANNLVFHSRSKHIELDVHFVRDKVLQKAIDIRYVPSSDQLADALTKPISEVQFTALRSKLGVLLSPSRLRGILEHKVIPLFLKNYQRKNQLLNHIMAFDMEGFI
NGTPAPPKFLDAAQAQVSPRYLQNGNVERKNKHVAKNYPFFDRMSKFKAGCGKKIQFWEDVWLDHVPLKNLFPEILEYLAHFKLGNGKRISFWNDPWIANRLLKLEFPQL
FSLSSSPNGSVSDYWDASHLSWNVIFRRFLKEEEIPDFQSLLESLSSIFIAPFDDKRIWSLEPSGCFSVKSLTRHLASSLPLEKEFFNALWKTKSPKRTSILVWIMLNGS
LNSSDVVILRLLILM