; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G19970 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G19970
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr5:21152082..21155390
RNA-Seq ExpressionCSPI05G19970
SyntenyCSPI05G19970
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81016.1 hypothetical protein VITISV_025518 [Vitis vinifera]4.2e-20145.44Show/hide
Query:  LLSIFKLIETQFSKVIKVFRSDNAPKL-NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLS
        L S    +ETQF++ IK  R+DN  ++ + +      G  +  SCAYTPQQN VVERK +H LN+ RAL FQ+ +PL FWG+ + +  YLINR P  LLS
Subjt:  LLSIFKLIETQFSKVIKVFRSDNAPKL-NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLS

Query:  NNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFI
        + +P+  L  K   Y+ ++TFGCL YA T  +   KFD RA+ C+F+G+P G KGYR+YD+   KFF S DV+F E +FPFH+  +++    HD +   +
Subjt:  NNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFI

Query:  VPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH-VSNSEETSN-------TDQAPIPIVTRKSSRPHHPSSYLKDFHCNLT---------SQ
        +P P              T   P T +T +     DDQ P  +S+ E TSN       T  +P P  TR+S R   P+ +L++FH   T         S 
Subjt:  VPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH-VSNSEETSN-------TDQAPIPIVTRKSSRPHHPSSYLKDFHCNLT---------SQ

Query:  NSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAK
        + T  PL +Y+SY   S  ++N++  +T++ EPT Y QAV    W++AMA E+ A+E+ +TWT+  +   H  +G KWVYK+K   DGT++RYK RLVAK
Subjt:  NSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAK

Query:  GYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL---------------------------------AAISSH
        G+ Q+EGI++ +TFSPVAK +TV+  LA+A   +W + QMD+ NAFL+GDL EEV+M LPL                                 A I   
Subjt:  GYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL---------------------------------AAISSH

Query:  GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAP
        GF QS+ADYSLFTK +G++F A+L+YVDD+++ G   + I ++K+SL   F++KDLGQ RYFLG+E++RS  G+ +SQRKY L IL++ G L +KP+  P
Subjt:  GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAP

Query:  MDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFD
        M+ N KL  + G+ L  ++ + YRRL+G+LIYL I+RP+I +SVH LSQF+ +P K HL A HHLL+YLKG+PGQG+      +  L+ F DADW  C  
Subjt:  MDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFD

Query:  TRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKI
        TRRSVTG+CIFL  + ISWK++KQ TVSRSS E+EY+A+AS+T EL W+  LL D KV+      +FCD++AA+ IA+NP +HE+TKHIEIDCH VR++I
Subjt:  TRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKI

Query:  VEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT
          G +    + +S QLAD+FTK L SS  +  +SK G+ DIH PT
Subjt:  VEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.6e-20346.63Show/hide
Query:  PTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSI----FKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-L
        P  T    E+   + I +D      +++   K +++SI    FKLIETQ+ K IK  RSDNA KL F   F + G  HQ+SC   PQQNSVVE+K QH L
Subjt:  PTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSI----FKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-L

Query:  NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAK
        N ARAL FQS+VPL FWG C+++  YLI+RTP  LL    PF  L     DYN +K FG L YAS+   N SKF PRA P VF+G+P G+KGY+LYDI  
Subjt:  NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAK

Query:  RKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIVTRKS
        +K FISRD +  E +                               +  DI + T                +  Q+   + +EET+           R+S
Subjt:  RKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIVTRKS

Query:  SRPHHPSSYLKDFHCNL------TSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTV
        +R   P SYL+ +HC+L       +Q ST +P+NQYLSY A    +K  +   ++  E ++YH+AV  Q W +AM  E+EAME   TW+IV + K  +++
Subjt:  SRPHHPSSYLKDFHCNL------TSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTV

Query:  GNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL-------------
        G +WVYK+K K DG+I+RYKTRLVAKGY QQEG+++ +TFS V K  TVK  L +A S  W + Q+D+NNAFL+G+LFEEV+M LPL             
Subjt:  GNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL-------------

Query:  -----------------------AAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSE
                                 + S GF QSKA+YSLF +G   +F+ALLVYVDDI++ G +   I  +K +L   F LKDLG  ++FLGLEL+R+ 
Subjt:  -----------------------AAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSE

Query:  RGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKG
         GL LSQ+ Y LQ++EDTG L +KP   PMDP  KL  S+ + L   DAT YRRLIGRL+YL ISRPDI F+VH+LSQF+ KPT +H++AA+HL+KYLKG
Subjt:  RGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKG

Query:  SPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQ
        SPG+G+++  +  F + AF DA+WGS  DTRRSVTGFC+FLG+S++SWKS+KQ TV+RSSAEAEY+AL   T E++W+  LL + K+K+     +FCDNQ
Subjt:  SPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQ

Query:  AAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQL
        AAI IA+NP FHE+TKHIE+DCHFVRD+I++G +K+LP+   + L
Subjt:  AAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQL

KAG7578768.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]2.7e-20345.6Show/hide
Query:  IETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVAL
        +ETQ++  +K  RSDNAP+L+F  L    G  H FSC  TPQQNSVVERK QH LN+ARAL+FQS +P+ +W  C+ +  YLINRTP  LL+N TPF  L
Subjt:  IETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVAL

Query:  FKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDI-PISHDFLEQFIVPCPLFD
          KK  Y+ +K+FGCL Y ST   +R KF PRA+  VF+G+P G KGY++ ++      ISR+V+F E++FPFHS    D+ P + D     I+P PL  
Subjt:  FKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDI-PISHDFLEQFIVPCPLFD

Query:  CLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNL-----TSQNSTPFPLNQYLSYNAYSQH
                D T+ + P     P ++  + D +    +S ++ NT    IP+ T +  R     SYL D+HCNL     T   +T  PL+  L Y   + H
Subjt:  CLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNL-----TSQNSTPFPLNQYLSYNAYSQH

Query:  HKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAK
        ++ ++ N+++  EP  + +AV+ + W   M EE++    T T+++VS+      +G +WVYK+K   DGTIDRY+ RLVAKGY QQEG++++DTFSPVAK
Subjt:  HKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAK

Query:  KSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPLA-------------------------------------AISSHGFIQSKADYSLFTKG
          TVK+ L L+    W ++QMD+ NAFL+GDL EE++M LP                                        + + GF QS++D++LF K 
Subjt:  KSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPLA-------------------------------------AISSHGFIQSKADYSLFTKG

Query:  NGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQL
          + F+ALLVYVDDIL+   S ++++ +K  L A FKLKDLGQA+YFLGLE++R++ G+ +SQRKY L +LE  G L  KPV  PMD  ++L    G+ L
Subjt:  NGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQL

Query:  TEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDS
           DAT YR LIG+L+YL I+R DI F+VH+LSQFL +P   HL+AAH +++YLKG PG+G+         L+AF DADWG+C DTRRS TGFC+FLG S
Subjt:  TEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDS

Query:  IISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQ
        ++SWKS+KQ T SRSSAE+EY+ALA  T EL+W+++LL D  V+      ++CD+ AAI IASN  FHE+TKHIEIDCH VRDK+  GFLK++ ++T  Q
Subjt:  IISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQ

Query:  LADMFTKALPSSTLNRHISKLGMKDIHRP
        L D+FTKALP  T    +S+     +  P
Subjt:  LADMFTKALPSSTLNRHISKLGMKDIHRP

KAG7578768.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]5.5e-0738.2Show/hide
Query:  IAGTCLSNSLNDPLTWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSIFKLIE
        +  T L + LN    WIIDSGA++H+C +  +F D+ S     V LP  T++ +   G V +S+ L+L++VL+IP F  NL+S+  L++
Subjt:  IAGTCLSNSLNDPLTWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSIFKLIE

KAG7578768.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]1.1e-20145.25Show/hide
Query:  LYIPDFKYNLLSIF----KLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVA
        +Y+   K ++LSIF    +++ TQF   +K  RSDNAP+L F D FAK G TH  SC   PQQNSVVERK QH LN+ARAL+FQS +PL +W  C+ +  
Subjt:  LYIPDFKYNLLSIF----KLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVA

Query:  YLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKD
        YLINRTP  +L++ TPF  L  K   Y+ +K FGCL YAST   +R KF PRA  CVF+G+PPG KGY+L ++   + FISRDV+F E  FP+ +     
Subjt:  YLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKD

Query:  IPISHDFLEQFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNLTS---QNS
         P+S                     + D T +  P+++ TP             +++++ S T            SRPH+  S+L+D+HC   S     S
Subjt:  IPISHDFLEQFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNLTS---QNS

Query:  TPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGY
        T  P++  ++Y+  S  H+ ++ N++SI EPT + QAV    WR+AM EE++A+E  +TW+IVS+ +    VG +WVYK K   DG++ RYK RLVAKGY
Subjt:  TPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGY

Query:  NQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP-----------------------------------LAAISSH
         QQEG+++L+TFSPVAK  TV+  LALA    WF+ Q+D+NNAFL+GDL EEV+MTLP                                    + + S 
Subjt:  NQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP-----------------------------------LAAISSH

Query:  GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAP
        GFIQS AD SLF + + + F+AL+VYVDDI++     ++ + +KD L + FKLKDLG  +YFLG+E++RS RG+ + QR Y + +L + G L  KP   P
Subjt:  GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAP

Query:  MDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFD
        M+ N KL +  GE L+  D   YRRLIGRL+YL I+RPD+ F+V++LSQ++  P   H++AA ++LKY+KG+ GQG+         L+AF DADWG+C D
Subjt:  MDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFD

Query:  TRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKI
        TRRSVTG+C+FLG+S+ISW+++KQ TVSRSSAEAEY++LA+ T E++WI QLL D  V     T +FCD+QAA+ IASNP FHE+TKHI+IDCH VR+K+
Subjt:  TRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKI

Query:  VEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIH
         +  +K++ +++  QLAD+FTK L  S  +  + K+G+ +IH
Subjt:  VEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIH

RVW89581.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.3e-20145.33Show/hide
Query:  LLSIFKLIETQFSKVIKVFRSDNAPKL-NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLS
        L S    +ETQF++ IK  R+DN  ++ + +      G  +  SCAYTPQQN VVERK +H LN+ RAL FQ+ +PL FWG+ + +  YLINR P  LLS
Subjt:  LLSIFKLIETQFSKVIKVFRSDNAPKL-NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLS

Query:  NNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFI
        + +P+  L  K   Y+ ++TFGCL YA T  +   KFD RA+ C+F+G+P G KGYR+YD+   KFF S+DV+F E +FPFH+  +++    HD +   +
Subjt:  NNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFI

Query:  VPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH-VSNSEETSN-------TDQAPIPIVTRKSSRPHHPSSYLKDFHCNLT---------SQ
        +P P              T   P T +T +     DDQ P  +S+ E TSN       T  +P P  TR+S R   P+ +L++FH   T         S 
Subjt:  VPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH-VSNSEETSN-------TDQAPIPIVTRKSSRPHHPSSYLKDFHCNLT---------SQ

Query:  NSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAK
        + T  PL +Y+SY   S  ++N++  +T++ EPT Y QAV    W++AMA E+ A+E+ +TWT+  +   H  +G KWVYK+K   DGT++RYK RLVAK
Subjt:  NSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAK

Query:  GYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL---------------------------------AAISSH
        G+ Q+EGI++ +TFSPVAK +TV+  LA+A   +W + QMD+ NAFL+GDL EEV+M LPL                                 A I   
Subjt:  GYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL---------------------------------AAISSH

Query:  GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAP
        GF QS+ADYSLFTK +G++F A+L+YVDD+++ G   + I ++K+SL   F++KDLGQ RYFLG+E++RS  G+ +SQRKY L IL++ G L +KP+  P
Subjt:  GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAP

Query:  MDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFD
        M+ N KL  + G+ L  ++ + YRRL+G+LIYL I+RP+I +SVH LSQF+ +P K HL A HHLL+YLKG+PGQG+      +  L+ F D+DW  C  
Subjt:  MDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFD

Query:  TRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKI
        TRRSVTG+CIFL  ++ISWK++KQ TVSRSS E+EY+A+AS+T EL W+  LL D KV+      +FCD++AA+ IA+NP +HE+TKHIEIDCH VR++I
Subjt:  TRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKI

Query:  VEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT
          G +    + +S QLAD+FTK L SS  +  +SK G+ DIH PT
Subjt:  VEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT

TrEMBL top hitse value%identityAlignment
A0A2N9EL12 Integrase catalytic domain-containing protein4.7e-22247.7Show/hide
Query:  LLSIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSN
        LLS F +IETQF+  IK  RSDN  +    D F+  G  HQ SC  TPQQNSVVERK QH LN+ARAL FQS VPL FWG  +L  AYLINR P  LL N
Subjt:  LLSIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSN

Query:  NTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIP-----------
         TPF  L      Y+ +K FGCLAYAS  S +++KFD RA PCVF+G+P G+KGY+L+D++ +KF +SRDV+F E  FPF+S      P           
Subjt:  NTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIP-----------

Query:  --ISHDFLE-----------------QFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDS------HG---------------------VDDQNPHVSN
          ++H  L                   F    PL  CL+     +  +    + E  P+ S      HG                      D   P  + 
Subjt:  --ISHDFLE-----------------QFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDS------HG---------------------VDDQNPHVSN

Query:  SEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNL--TSQNSTPF------PLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEI
        SE   +T  +      RKSSRP    SYL+D+HCNL  ++  S PF      P+   LSY+  S  HK +   +++  EP +YH+A+    W +AM++E+
Subjt:  SEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNL--TSQNSTPF------PLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEI

Query:  EAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFE
         A+E  +TW + S+S   H +G KWVYK+K K DG+I+RYK RLVAKGYNQQEGI++ +TFSPVAK  TV+ F+A+A +  W ++Q+D+NNAFL+GDL E
Subjt:  EAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFE

Query:  EVHMTLPL----------------------------------AAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFK
        EV+M+LPL                                  + +  HGFIQSK DYSLFTK  GSTF+ALLVYVDDIL+   +P+++  +   L   FK
Subjt:  EVHMTLPL----------------------------------AAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFK

Query:  LKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLH
        LKDLG A+YFLGLEL+RS +G+ L QRKY L ILED GFL SKPV  PM+ ++KL + +G  LT  D T YRRL+G+L+YL ++RPDI +SV RLSQF+ 
Subjt:  LKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLH

Query:  KPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQL
        +P   HL AAH +L+YLKGSPGQG+     +S  LKAF D+DW  C DTRRSVTGFC+FLGDS+ISW+S+KQ+ VSRSSAEAEY+A+A  T E+ W+  L
Subjt:  KPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQL

Query:  LTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT
        L DF++   M+  +FC+NQAA+ IA+NP FHE+TKHIE+DCHF+RDKI  G LK L I ++ QLAD+FTK L   + +  +SKLG+ +IH PT
Subjt:  LTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT

A0A2N9ERT7 Integrase catalytic domain-containing protein8.3e-22745.58Show/hide
Query:  TWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLL--------------------------------
        +WI+D+GA+ H+ H    FT + S     V LP    + V HIG V +S+ L+L DVL +P F +NL+                                
Subjt:  TWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLL--------------------------------

Query:  -----------------------------SIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSK
                                     S F ++ETQF+  IK  RSDN  +    D F+  G  HQ SC  TPQQNSVVERK QH LN+ARA+ FQS 
Subjt:  -----------------------------SIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSK

Query:  VPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLF
        +PL FWG+C+L  AYLINR P  +L   TP+  L  K   Y  +K FGCLAYAS  S +++KFD +A PCVF+G+P G KGY+L D++  + F+SRDV+F
Subjt:  VPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLF

Query:  FEELFPFHSIKEKDIPISHDFLEQFIV-PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH---------------VSNSEETSNTDQAPIP-
         E +FPFH+      P  H  L+      C  F  +   +I   +T    +T  TP + H +    PH               VS+S+ + +   +P+  
Subjt:  FEELFPFHSIKEKDIPISHDFLEQFIV-PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH---------------VSNSEETSNTDQAPIP-

Query:  -IVTRKSSRPHHPSSYLKDFHCNLTS--------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIV
         +  R+SSR   P SYL+D+HC+L S          +T +P+   LSY+  S  HK +   +++  EP +YH+AVK   W  AM++E+EA+E  +TW + 
Subjt:  -IVTRKSSRPHHPSSYLKDFHCNLTS--------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIV

Query:  SISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP-----
        S+      +G KWVYK+K K DGTI+RYK RLVAKGYNQ+EGI++ +TFSPVAK  TV+ F+ALA +  W I+Q+D+NNAFL+GDL EEV M+LP     
Subjt:  SISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP-----

Query:  -----------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLG
                                      + + +HGFIQSK DYSLFTK  G  F+ALLVYVDDIL+      S+ ++   L  HFKLKDLG A+YFLG
Subjt:  -----------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLG

Query:  LELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHH
        LEL+R+ +G+ L QRKY L IL+DTGFL SKPV  PM+ +LKL K EG  L   D T YRRLIG+L+YL ++RPDI +SV RLSQF+ +P   HL AAH 
Subjt:  LELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHH

Query:  LLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMST
        +L+YLKGSPGQG+      S  LKAF D+DW  C DTRRSVTGFCIFLGDS+ISW+S+KQ+ VSRSSAEAEY+A+A  T E+ W+  LL DF +   M+ 
Subjt:  LLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMST

Query:  TVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKL
         +FCDNQAA+ IASNP FHE+TKHIE+DCHF+RDKI  G LK L + ++ QLAD+FTK L     +  ISKL
Subjt:  TVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKL

A0A2N9FI02 Integrase catalytic domain-containing protein2.8e-22745.59Show/hide
Query:  SGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLL-------------------------------------
        S  + H+ H    FT + S     V LP    + V HIG V +S+ L+L DVL +P F +NL+                                     
Subjt:  SGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLL-------------------------------------

Query:  ------------------------SIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIF
                                S F ++ETQF+  IK  RSDN  +    D F+  G  HQ SC  TPQQNSVVERK QH LN+ARA+ FQS +PL F
Subjt:  ------------------------SIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIF

Query:  WGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELF
        WG+C+L  AYLINR P  +L   TP+  L  K   Y  +K FGCLAYAS  S +++KFD +A PCVF+G+P G KGY+L D++  + F+SRDV+F E +F
Subjt:  WGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELF

Query:  PFHSIKEKDIPISHDFLEQFIV-PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH---------------VSNSEETSNTDQAPIP--IVTR
        PFH+      P  H  L+      C  F  +   +I   +T    +T  TP + H +    PH               VS+S+ + +   +P+   +  R
Subjt:  PFHSIKEKDIPISHDFLEQFIV-PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH---------------VSNSEETSNTDQAPIP--IVTR

Query:  KSSRPHHPSSYLKDFHCNLTS--------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKD
        +SSR   P SYL+D+HC+L S          +T +P+   LSY+  S  HK +   +++  EP +YH+AVK   W  AM++E+EA+E  +TW + S+   
Subjt:  KSSRPHHPSSYLKDFHCNLTS--------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKD

Query:  HHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP----------
           +G KWVYK+K K DGTI+RYK RLVAKGYNQ+EGI++ +TFSPVAK  TV+ F+ALA +  W I+Q+D+NNAFL+GDL EEV M+LP          
Subjt:  HHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP----------

Query:  ------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSR
                                 + + +HGFIQSK DYSLFTK  G  F+ALLVYVDDIL+      S+ ++   L  HFKLKDLG A+YFLGLEL+R
Subjt:  ------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSR

Query:  SERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYL
        + +G+ L QRKY L IL+DTGFL SKPV  PM+ +LKL K EG  L   D T YRRLIG+L+YL ++RPDI +SV RLSQF+ +P   HL AAH +L+YL
Subjt:  SERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYL

Query:  KGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCD
        KGSPGQG+      S  LKAF D+DW  C DTRRSVTGFCIFLGDS+ISW+S+KQ+ VSRSSAEAEY+A+A  T E+ W+  LL DF +   M+  +FCD
Subjt:  KGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCD

Query:  NQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT
        NQAA+ IASNP FHE+TKHIE+DCHF+RDKI  G LK L + ++ QLAD+FTK L     +  ISKLG+ DIH PT
Subjt:  NQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGMKDIHRPT

A0A2N9GXF7 Integrase catalytic domain-containing protein1.7e-21949.23Show/hide
Query:  LLSIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSN
        L+S F ++ETQF+  IK  RSDN  +    D F+  G  HQ SC  TPQQNSVVERK QH LN+ARA+ FQS +PL FWG+C+L  AYLINR P  +L  
Subjt:  LLSIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSN

Query:  NTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIV
         TP+  L  K   Y  +K FGCLAYAS  S +++KFD +A PCVF+G+P G KGY+L D++  + F+SRDV+F E +FPFH+      P  H  L+    
Subjt:  NTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIV

Query:  -PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH---------------VSNSEETSNTDQAPIP--IVTRKSSRPHHPSSYLKDFHCNLTS-
          C  F  +   +I   +T    +T  TP + H +    PH               VS+S+ + +   +P+   +  R+SSR   P SYL+D+HC+L S 
Subjt:  -PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH---------------VSNSEETSNTDQAPIP--IVTRKSSRPHHPSSYLKDFHCNLTS-

Query:  -------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDR
                 +T +P+   LSY+  S  HK +   +++  EP +YH+AVK   W  AM++E+EA+E  +TW + S+      +G KWVYK+K K DGTI+R
Subjt:  -------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDR

Query:  YKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP--------------------------------
        YK RLVAKGYNQ+EGI++ +TFSPVAK  TV+ F+ALA +  W I+Q+D+NNAFL+GDL EEV M+LP                                
Subjt:  YKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP--------------------------------

Query:  --LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGF
           + + +HGFIQSK DYSLFTK  G  F+ALLVYVDDIL+      S+ ++   L  HFKLKDLG A+YFLGLEL+R+ +G+ L QRKY L IL+DTGF
Subjt:  --LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGF

Query:  LDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFV
        L SKPV  PM+ +LKL K EG  L   D T YRRLIG+L+YL ++RPDI +SV RLSQF+ +P   HL AAH +L+YLKGSPGQG+      S  LKAF 
Subjt:  LDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFV

Query:  DADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEI
        D+DW  C DTRRSVTGFCIFLGDS+ISW+S+KQ+ VSRSSAEAEY+A+A  T E+ W+  LL DF +   M+  +FCDNQAA+ IASNP FHE+TKHIE+
Subjt:  DADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEI

Query:  DCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKL
        DCHF+RDKI  G LK L + ++ QLAD+FTK L     +  ISKL
Subjt:  DCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKL

A0A2N9IF64 Integrase catalytic domain-containing protein2.6e-22844.51Show/hide
Query:  TWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLL--------------------------------
        +WI+D+GA+ H+ H    FT + S     V LP    + V HIG V +S+ L+L DVL +P F +NL+                                
Subjt:  TWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLL--------------------------------

Query:  ---------------------------------------------------SIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQN
                                                           S F ++ETQF+  IK  RSDN  +    D F+  G  HQ SC  TPQQN
Subjt:  ---------------------------------------------------SIFKLIETQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQN

Query:  SVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPG
        SVVERK QH LN+ARA+ FQS +PL FWG+C+L  AYLINR P  +L   TP+  L  K   Y  +K FGCLAYAS  S +++KFD +A PCVF+G+P G
Subjt:  SVVERKQQH-LNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPG

Query:  IKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIV-PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH----------
         KGY+L D++  + F+SRDV+F E +FPFH+      P  H  L+      C  F  +   +I   +T    +T  TP + H +  Q PH          
Subjt:  IKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIV-PCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPH----------

Query:  -------------VSNSEETSNTDQAPIP--IVTRKSSRPHHPSSYLKDFHCNLTS--------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTY
                     VS+S+ + ++  +P+   +  R+SSR   P SYL+D+HC+L S          +T +P+   LSY+  S  HK +   +++  EP +
Subjt:  -------------VSNSEETSNTDQAPIP--IVTRKSSRPHHPSSYLKDFHCNLTS--------QNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTY

Query:  YHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNW
        YH+AVK   W  AM++E+EA+E  +TW + S+      +G KWVYK+K K DGTI+RYK RLVAKGYNQ+EGI++ +TFSPVAK  TV+ F+ALA +  W
Subjt:  YHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNW

Query:  FISQMDINNAFLNGDLFEEVHMTLP----------------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIG
         I+Q+D+NNAFL+GDL EEV M+LP                                   + + +HGFIQSK DYSLFTK  G  F+ALLVYVDDIL+  
Subjt:  FISQMDINNAFLNGDLFEEVHMTLP----------------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIG

Query:  PSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQ
            S+ ++   L  HFKLKDLG A+YFLGLEL+R+ +G+ L QRKY L IL+DTGFL SKPV  PM+ +LKL K EG  L   D T YRRLIG+L+YL 
Subjt:  PSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQ

Query:  ISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEA
        ++RPDI +SV RLSQF+ +P   HL AAH +L+YLKGSPGQG+      S  LKAF D+DW  C DTRRSVTGFCIFLGDS+ISW+S+KQ+ VSRSSAEA
Subjt:  ISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEA

Query:  EYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHIS
        EY+A+A  T E+ W+  LL DF +   M+  +FCDNQAA+ IASNP FHE+TKHIE+DCHF+RDKI  G LK L + ++ QLAD+FTK L     +  IS
Subjt:  EYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHIS

Query:  KLGMKDIHRPT
        KLG+ DIH PT
Subjt:  KLGMKDIHRPT

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-8227.63Show/hide
Query:  VLYIPDFKYNLLSIFK----LIETQFSKVIKVFRSDNAPKL---NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHL-NIARALMFQSKVPLIFWGKCV
        V Y+  +K ++ S+F+      E  F+  +     DN  +      R    K G ++  +  +TPQ N V ER  + +   AR ++  +K+   FWG+ V
Subjt:  VLYIPDFKYNLLSIFK----LIETQFSKVIKVFRSDNAPKL---NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHL-NIARALMFQSKVPLIFWGKCV

Query:  LSVAYLINRTPMVLL--SNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLF--------
        L+  YLINR P   L  S+ TP+     KK     ++ FG   Y    +  + KFD ++   +F+G+ P   G++L+D    KF ++RDV+         
Subjt:  LSVAYLINRTPMVLL--SNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLF--------

Query:  ----FEELFPFHS--IKEKDIPISHDFLEQFIVPCPLFDC--------LKKEDIIDATTDARPTTE-DTPEDSHGVD---------DQNPHVSNSEETSN
            FE +F   S   + K+ P     + Q   P    +C         K+ +  +   D+R   + + P +S   D         + N +  N  +   
Subjt:  ----FEELFPFHS--IKEKDIPISHDFLEQFIVPCPLFDC--------LKKEDIIDATTDARPTTE-DTPEDSHGVD---------DQNPHVSNSEETSN

Query:  TDQ-------APIPIVTRKSSRPHHPSSY-----LKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYE--PTYYHQAV---KHQTWRKAMA
         D        +  P  +R+S    H          K+    + ++ S        +SYN         + N  +I+   P  + +        +W +A+ 
Subjt:  TDQ-------APIPIVTRKSSRPHHPSSY-----LKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYE--PTYYHQAV---KHQTWRKAMA

Query:  EEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGD
         E+ A +  NTWTI    ++ + V ++WV+ VK    G   RYK RLVA+G+ Q+  I++ +TF+PVA+ S+ +  L+L   YN  + QMD+  AFLNG 
Subjt:  EEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGD

Query:  LFEEVHMTLPLA-------------------------------AISSHGFIQSKADYSLF--TKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAH
        L EE++M LP                                 A+    F+ S  D  ++   KGN +  + +L+YVDD+++     + +N+ K  L   
Subjt:  LFEEVHMTLPLA-------------------------------AISSHGFIQSKADYSLF--TKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAH

Query:  FKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDP--NLKLCKSEGEQLTEEDATCYRRLIGRLIYLQI-SRPDICFSVHRL
        F++ DL + ++F+G+ +   E  + LSQ  Y  +IL      +   V  P+    N +L  S+     E+  T  R LIG L+Y+ + +RPD+  +V+ L
Subjt:  FKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDP--NLKLCKSEGEQLTEEDATCYRRLIGRLIYLQI-SRPDICFSVHRL

Query:  SQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLK--AFVDADWGSCFDTRRSVTGFCIFLGD-SIISWKSRKQATVSRSSAEAEYKALASVTS
        S++  K           +L+YLKG+    ++ K   +F  K   +VD+DW      R+S TG+   + D ++I W +++Q +V+ SS EAEY AL     
Subjt:  SQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLK--AFVDADWGSCFDTRRSVTGFCIFLGD-SIISWKSRKQATVSRSSAEAEYKALASVTS

Query:  ELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM
        E +W+  LLT   +K      ++ DNQ  I+IA+NP+ H++ KHI+I  HF R+++    + +  I T  QLAD+FTK LP++       KLG+
Subjt:  ELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-10230.11Show/hide
Query:  LKVEHIGD----VFISNDLVLKDVLYIPDFKYNLLSIFK----LIETQFSKVIKVFRSDNAPKL---NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH
        +++E +G     V   +D   K  +YI   K  +  +F+    L+E +  + +K  RSDN  +     F +  +  G  H+ +   TPQ N V ER  + 
Subjt:  LKVEHIGD----VFISNDLVLKDVLYIPDFKYNLLSIFK----LIETQFSKVIKVFRSDNAPKL---NFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQH

Query:  L-NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDI
        +    R+++  +K+P  FWG+ V +  YLINR+P V L+   P      K+  Y+ +K FGC A+A  P   R+K D ++ PC+F+G+     GYRL+D 
Subjt:  L-NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDI

Query:  AKRKFFISRDVLFFE-ELFPFHSIKEKDIPISHDFLEQFI-VPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIV
         K+K   SRDV+F E E+     + EK   + +  +  F+ +P    +    E   D  ++      +  E    +D+    V   E  +  ++   P+ 
Subjt:  AKRKFFISRDVLFFE-ELFPFHSIKEKDIPISHDFLEQFI-VPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNSEETSNTDQAPIPIV

Query:  TRKSSRPHHPSSYLKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGN
         R+S RP   S         L S +  P  L + LS+       KN +                      KAM EE+E++++  T+ +V + K    +  
Subjt:  TRKSSRPHHPSSYLKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGN

Query:  KWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL---------------
        KWV+K+K   D  + RYK RLV KG+ Q++GI+F + FSPV K ++++  L+LA S +  + Q+D+  AFL+GDL EE++M  P                
Subjt:  KWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPL---------------

Query:  ------------------AAISSHGFIQSKADYSL-FTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLEL--SRSERG
                          + + S  ++++ +D  + F + + + F+ LL+YVDD+L++G     I  +K  L   F +KDLG A+  LG+++   R+ R 
Subjt:  ------------------AAISSHGFIQSKADYSL-FTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLEL--SRSERG

Query:  LMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATC----YRRLIGRLIYLQI-SRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKY
        L LSQ KY  ++LE     ++KPV  P+  +LKL K       EE        Y   +G L+Y  + +RPDI  +V  +S+FL  P K H +A   +L+Y
Subjt:  LMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATC----YRRLIGRLIYLQI-SRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKY

Query:  LKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFC
        L+G+ G  +     D   LK + DAD     D R+S TG+        ISW+S+ Q  V+ S+ EAEY A      E++W+ + L +  +       V+C
Subjt:  LKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFC

Query:  DNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM
        D+Q+AI ++ N  +H +TKHI++  H++R+ + +  LKVL I+T+   ADM TK +P +        +GM
Subjt:  DNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM

P92519 Uncharacterized mitochondrial protein AtMg008105.1e-4843.81Show/hide
Query:  LLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATC
        LL+YVDDILL G S + +N +   L + F +KDLG   YFLG+++     GL LSQ KY  QIL + G LD KP+  P+   L    S  +     D + 
Subjt:  LLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATC

Query:  YRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSR
        +R ++G L YL ++RPDI ++V+ + Q +H+PT    D    +L+Y+KG+   G+ I      +++AF D+DW  C  TRRS TGFC FLG +IISW ++
Subjt:  YRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSR

Query:  KQATVSRSSAEAEYKALASVTSELVW
        +Q TVSRSS E EY+ALA   +EL W
Subjt:  KQATVSRSSAEAEYKALASVTSELVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-12832.4Show/hide
Query:  KYNLLSIFKLIETQFSKVIKVFRSDNAPK-LNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHL-NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMV
        K   ++   L+E +F   I  F SDN  + +   + F++ G +H  S  +TP+ N + ERK +H+      L+  + +P  +W        YLINR P  
Subjt:  KYNLLSIFKLIETQFSKVIKVFRSDNAPK-LNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHL-NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMV

Query:  LLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLE
        LL   +PF  LF    +Y+ ++ FGC  Y      N+ K D +++ CVF+G+      Y    +   + +ISR V F E  FPF +      P+     E
Subjt:  LLSNNTPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLE

Query:  QFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNS--------------EETSNTDQAPIPIV--TRKSSRPH-------------H
           V  P      +  ++ A + + P    TP  S     +N  VS+S              E T+     P P    T+  ++ H              
Subjt:  QFIVPCPLFDCLKKEDIIDATTDARPTTEDTPEDSHGVDDQNPHVSNS--------------EETSNTDQAPIPIV--TRKSSRPH-------------H

Query:  PSSYLKDFHCNLTSQNSTPFPL-----------------------------NQYLSYNAYSQHHK------------NYMFNVTSIYEPTYYHQAVKHQT
        PS   +       S +S+P P                              N     N +S   +            +   ++ +  EP    QA+K + 
Subjt:  PSSYLKDFHCNLTSQNSTPFPL-----------------------------NQYLSYNAYSQHHK------------NYMFNVTSIYEPTYYHQAVKHQT

Query:  WRKAMAEEIEAMERTNTWTIVSISKDHHT-VGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDIN
        WR AM  EI A    +TW +V     H T VG +W++  K   DG+++RYK RLVAKGYNQ+ G+++ +TFSPV K ++++I L +A   +W I Q+D+N
Subjt:  WRKAMAEEIEAMERTNTWTIVSISKDHHT-VGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDIN

Query:  NAFLNGDLFEEVHMTLPLAAISSH---------------------------------GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSI
        NAFL G L ++V+M+ P   I                                    GF+ S +D SLF    G + V +LVYVDDIL+ G  P+ +++ 
Subjt:  NAFLNGDLFEEVHMTLPLAAISSH---------------------------------GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSI

Query:  KDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFS
         D+L   F +KD  +  YFLG+E  R   GL LSQR+Y L +L  T  + +KPV  PM P+ KL    G +LT  D T YR ++G L YL  +RPDI ++
Subjt:  KDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFS

Query:  VHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVT
        V+RLSQF+H PT+ HL A   +L+YL G+P  G+ +K  ++  L A+ DADW    D   S  G+ ++LG   ISW S+KQ  V RSS EAEY+++A+ +
Subjt:  VHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVT

Query:  SELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM
        SE+ WI  LLT+  ++      ++CDN  A  + +NP FH + KHI ID HF+R+++  G L+V+ ++T  QLAD  TK L  +      SK+G+
Subjt:  SELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-12031.99Show/hide
Query:  IFK-LIETQFSKVIKVFRSDNAPK-LNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHL-NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNN
        IFK L+E +F   I    SDN  + +  RD  ++ G +H  S  +TP+ N + ERK +H+  +   L+  + VP  +W        YLINR P  LL   
Subjt:  IFK-LIETQFSKVIKVFRSDNAPK-LNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHL-NIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNN

Query:  TPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSI----------KEKDIP--
        +PF  LF +  +Y  +K FGC  Y      NR K + +++ C FMG+      Y    I   + + SR V F E  FPF +           +    P  
Subjt:  TPFVALFKKKADYNIIKTFGCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSI----------KEKDIP--

Query:  ISHDFLEQFIVPCPLFDCL--------------------------KKEDIIDATTDARPTTED--------TPEDSHGVDDQNPHVSNSEETSNTDQAP-
         SH  L    +  P   CL                               I + + + PT            P  +   +  +P ++N    S +  +P 
Subjt:  ISHDFLEQFIVPCPLFDCL--------------------------KKEDIIDATTDARPTTED--------TPEDSHGVDDQNPHVSNSEETSNTDQAP-

Query:  --IPIVTRKSSRPHHPSSYLKDFHCNLTSQNSTPFP-------------LNQYLSYNAYSQHHK------------NYMFNVTSIYEPTYYHQAVKHQTW
           P+     S PH P+        N  S +ST  P             +N     N +S   +            +Y  ++ +  EP    QA+K   W
Subjt:  --IPIVTRKSSRPHHPSSYLKDFHCNLTSQNSTPFP-------------LNQYLSYNAYSQHHK------------NYMFNVTSIYEPTYYHQAVKHQTW

Query:  RKAMAEEIEAMERTNTWTIVSISKDHHT-VGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINN
        R+AM  EI A    +TW +V       T VG +W++  K   DG+++RYK RLVAKGYNQ+ G+++ +TFSPV K ++++I L +A   +W I Q+D+NN
Subjt:  RKAMAEEIEAMERTNTWTIVSISKDHHT-VGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINN

Query:  AFLNGDLFEEVHMTLPLAAISSH---------------------------------GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIK
        AFL G L +EV+M+ P   +                                    GF+ S +D SLF    G + + +LVYVDDIL+ G     +    
Subjt:  AFLNGDLFEEVHMTLPLAAISSH---------------------------------GFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIK

Query:  DSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSV
        D+L   F +K+     YFLG+E  R  +GL LSQR+Y L +L  T  L +KPV  PM  + KL    G +L   D T YR ++G L YL  +RPD+ ++V
Subjt:  DSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSV

Query:  HRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTS
        +RLSQ++H PT  H +A   +L+YL G+P  G+ +K  ++  L A+ DADW    D   S  G+ ++LG   ISW S+KQ  V RSS EAEY+++A+ +S
Subjt:  HRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTS

Query:  ELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM
        EL WI  LLT+  ++      ++CDN  A  + +NP FH + KHI +D HF+R+++  G L+V+ ++T  QLAD  TK L          K+G+
Subjt:  ELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNRHISKLGM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.9e-12343.2Show/hide
Query:  DQAPIPIVTRKSSRPHHPSSYLKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSIS
        +  P P V     R   P +YL+D++C+  + + T   ++Q+LSY   S  + +++  +    EP+ Y++A +   W  AM +EI AME T+TW I ++ 
Subjt:  DQAPIPIVTRKSSRPHHPSSYLKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSIS

Query:  KDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP--------
         +   +G KWVYK+K   DGTI+RYK RLVAKGY QQEGI+F++TFSPV K ++VK+ LA++  YN+ + Q+DI+NAFLNGDL EE++M LP        
Subjt:  KDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLP--------

Query:  -----------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLG
                                        +   GF+QS +D++ F K   + F+ +LVYVDDI++   + ++++ +K  LK+ FKL+DLG  +YFLG
Subjt:  -----------------------------LAAISSHGFIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLG

Query:  LELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHH
        LE++RS  G+ + QRKY L +L++TG L  KP   PMDP++      G      DA  YRRLIGRL+YLQI+R DI F+V++LSQF   P   H  A   
Subjt:  LELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHH

Query:  LLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMST
        +L Y+KG+ GQG+         L+ F DA + SC DTRRS  G+C+FLG S+ISWKS+KQ  VS+SSAEAEY+AL+  T E++W+ Q   + ++     T
Subjt:  LLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSRKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMST

Query:  TVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIV
         +FCDN AAI IA+N  FHE+TKHIE DCH VR++ V
Subjt:  TVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIV

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.7e-1750.63Show/hide
Query:  IYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFC
        +YL I+RPD+ F+V+RLSQF        + A + +L Y+KG+ GQG+         LKAF D+DW SC DTRRSVTGFC
Subjt:  IYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFC

ATMG00810.1 DNA/RNA polymerases superfamily protein3.6e-4943.81Show/hide
Query:  LLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATC
        LL+YVDDILL G S + +N +   L + F +KDLG   YFLG+++     GL LSQ KY  QIL + G LD KP+  P+   L    S  +     D + 
Subjt:  LLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSEGEQLTEEDATC

Query:  YRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSR
        +R ++G L YL ++RPDI ++V+ + Q +H+PT    D    +L+Y+KG+   G+ I      +++AF D+DW  C  TRRS TGFC FLG +IISW ++
Subjt:  YRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKSR

Query:  KQATVSRSSAEAEYKALASVTSELVW
        +Q TVSRSS E EY+ALA   +EL W
Subjt:  KQATVSRSSAEAEYKALASVTSELVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.9e-1942.73Show/hide
Query:  EPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALA-
        EP     A+K   W +AM EE++A+ R  TW +V    + + +G KWV+K K   DGT+DR K RLVAKG++Q+EGI F++T+SPV + +T++  L +A 
Subjt:  EPTYYHQAVKHQTWRKAMAEEIEAMERTNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALA-

Query:  -----TSYNW
              S NW
Subjt:  -----TSYNW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCAAACTCATCTCAACACACCTCAAAATGGTGAGAATTTAAAAAATGAGACTACACACATAGCAGGTACTTGCCTCTCTAACTCACTCAATGATCCCTTAACATG
GATCATTGACTCTGGTGCTTCCTCACACATTTGCCATGACAAGTTTATGTTTACAGATCTCTATAGCGCCCAGAATATGTTTGTTATCTTGCCCACTAAGACTCGTCTAA
AGGTTGAGCATATAGGAGATGTTTTCATATCAAATGATCTAGTCCTGAAAGATGTACTTTATATTCCTGACTTCAAGTATAACCTGCTGTCAATTTTCAAGTTAATTGAA
ACCCAATTTTCAAAAGTCATCAAGGTCTTTCGATCTGACAATGCTCCTAAGTTGAATTTCAGGGATCTTTTTGCCAAAATTGGAACAACTCATCAATTCTCGTGTGCTTA
CACTCCTCAGCAAAATTCAGTAGTGGAAAGAAAACAACAACACCTTAACATAGCAAGAGCATTGATGTTCCAATCAAAGGTTCCTCTTATCTTTTGGGGAAAATGTGTTC
TAAGTGTTGCATACTTGATCAACAGAACACCTATGGTATTACTATCAAATAACACTCCCTTTGTTGCTCTATTCAAGAAAAAAGCAGATTACAACATCATTAAGACCTTC
GGGTGTCTTGCCTATGCCTCTACCCCCTCAGTAAACAGATCTAAGTTTGATCCTAGAGCACAACCTTGTGTTTTTATGGGGTTCCCACCAGGCATAAAAGGATACAGATT
ATATGACATAGCCAAGAGAAAGTTCTTTATATCTAGGGATGTCCTATTCTTTGAAGAACTATTTCCCTTTCATTCTATCAAAGAAAAGGACATTCCCATCTCCCATGACT
TCCTTGAGCAATTCATCGTACCATGCCCCCTATTTGATTGCCTAAAAAAGGAAGATATTATTGATGCAACTACTGATGCAAGACCTACGACAGAGGATACCCCTGAAGAC
AGCCACGGTGTTGATGATCAAAACCCACATGTCAGTAACTCAGAAGAAACCAGTAACACTGATCAAGCACCAATTCCCATCGTGACCAGAAAATCCTCCCGACCACACCA
CCCATCATCTTACCTAAAAGACTTCCATTGCAACCTCACCTCCCAAAATTCAACTCCCTTTCCCCTTAACCAATACCTCTCCTATAATGCCTATTCCCAACACCATAAGA
ACTATATGTTCAATGTTACCTCCATCTATGAACCCACATATTATCACCAAGCTGTGAAACATCAGACTTGGAGAAAAGCTATGGCTGAGGAAATAGAAGCCATGGAAAGA
ACCAATACATGGACAATTGTATCCATTTCAAAGGATCATCACACCGTTGGCAACAAATGGGTGTACAAAGTAAAGTGCAAACCGGATGGTACCATTGATAGATACAAGAC
AAGACTTGTAGCAAAGGGCTATAACCAACAAGAGGGAATCAATTTTTTGGATACCTTTTCACCAGTGGCGAAAAAAAGCACTGTGAAGATATTCTTAGCTCTTGCCACAT
CCTATAATTGGTTCATTAGCCAAATGGACATAAATAATGCCTTCCTCAATGGAGACTTATTTGAAGAAGTGCACATGACCCTACCATTGGCAGCAATATCCTCACATGGT
TTCATCCAATCCAAGGCTGATTACTCCTTATTTACTAAGGGGAATGGAAGCACCTTTGTAGCATTATTAGTATATGTTGATGACATATTACTAATAGGACCATCTCCTTC
GAGTATCAACTCAATCAAAGATTCTTTGAAGGCACACTTCAAATTAAAGGACCTAGGACAAGCAAGATACTTCTTGGGTCTAGAATTATCAAGATCTGAACGAGGACTTA
TGCTCTCCCAAAGAAAATATTGTCTTCAAATCCTAGAAGATACTGGTTTTCTTGATTCTAAACCAGTTGTAGCACCTATGGATCCTAATCTGAAGCTATGTAAATCTGAA
GGAGAACAACTGACTGAGGAAGACGCCACTTGCTATAGAAGATTAATTGGCAGACTGATATACTTACAAATATCCAGACCTGATATTTGCTTCTCTGTTCACCGCTTAAG
CCAATTTTTGCACAAGCCTACTAAACATCACCTAGATGCTGCTCATCACCTATTGAAGTACCTCAAAGGTTCCCCAGGACAAGGTGTTTTAATAAAACCTATTGATTCGT
TTCACTTAAAAGCCTTCGTTGATGCTGATTGGGGATCGTGCTTTGACACTAGAAGATCGGTCACAGGGTTCTGTATCTTCCTAGGGGATTCCATCATCTCCTGGAAATCT
AGGAAACAAGCAACCGTCTCAAGGTCCTCTGCAGAAGCTGAATATAAGGCCTTAGCATCAGTCACCAGTGAGCTAGTATGGATCACACAGCTCCTTACTGATTTTAAAGT
AAAGTCCTTGATGTCAACCACTGTTTTCTGTGATAATCAAGCAGCCATTGCTATTGCTTCGAATCCGACATTCCATGAACAGACAAAACACATAGAAATTGATTGTCATT
TTGTTCGAGACAAAATAGTTGAAGGATTTCTAAAGGTTTTACCTATCAACACTAGCCTACAACTAGCTGATATGTTCACTAAAGCACTACCTTCATCTACCTTAAACAGA
CATATATCCAAGTTGGGAATGAAAGACATTCATCGTCCAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCAAACTCATCTCAACACACCTCAAAATGGTGAGAATTTAAAAAATGAGACTACACACATAGCAGGTACTTGCCTCTCTAACTCACTCAATGATCCCTTAACATG
GATCATTGACTCTGGTGCTTCCTCACACATTTGCCATGACAAGTTTATGTTTACAGATCTCTATAGCGCCCAGAATATGTTTGTTATCTTGCCCACTAAGACTCGTCTAA
AGGTTGAGCATATAGGAGATGTTTTCATATCAAATGATCTAGTCCTGAAAGATGTACTTTATATTCCTGACTTCAAGTATAACCTGCTGTCAATTTTCAAGTTAATTGAA
ACCCAATTTTCAAAAGTCATCAAGGTCTTTCGATCTGACAATGCTCCTAAGTTGAATTTCAGGGATCTTTTTGCCAAAATTGGAACAACTCATCAATTCTCGTGTGCTTA
CACTCCTCAGCAAAATTCAGTAGTGGAAAGAAAACAACAACACCTTAACATAGCAAGAGCATTGATGTTCCAATCAAAGGTTCCTCTTATCTTTTGGGGAAAATGTGTTC
TAAGTGTTGCATACTTGATCAACAGAACACCTATGGTATTACTATCAAATAACACTCCCTTTGTTGCTCTATTCAAGAAAAAAGCAGATTACAACATCATTAAGACCTTC
GGGTGTCTTGCCTATGCCTCTACCCCCTCAGTAAACAGATCTAAGTTTGATCCTAGAGCACAACCTTGTGTTTTTATGGGGTTCCCACCAGGCATAAAAGGATACAGATT
ATATGACATAGCCAAGAGAAAGTTCTTTATATCTAGGGATGTCCTATTCTTTGAAGAACTATTTCCCTTTCATTCTATCAAAGAAAAGGACATTCCCATCTCCCATGACT
TCCTTGAGCAATTCATCGTACCATGCCCCCTATTTGATTGCCTAAAAAAGGAAGATATTATTGATGCAACTACTGATGCAAGACCTACGACAGAGGATACCCCTGAAGAC
AGCCACGGTGTTGATGATCAAAACCCACATGTCAGTAACTCAGAAGAAACCAGTAACACTGATCAAGCACCAATTCCCATCGTGACCAGAAAATCCTCCCGACCACACCA
CCCATCATCTTACCTAAAAGACTTCCATTGCAACCTCACCTCCCAAAATTCAACTCCCTTTCCCCTTAACCAATACCTCTCCTATAATGCCTATTCCCAACACCATAAGA
ACTATATGTTCAATGTTACCTCCATCTATGAACCCACATATTATCACCAAGCTGTGAAACATCAGACTTGGAGAAAAGCTATGGCTGAGGAAATAGAAGCCATGGAAAGA
ACCAATACATGGACAATTGTATCCATTTCAAAGGATCATCACACCGTTGGCAACAAATGGGTGTACAAAGTAAAGTGCAAACCGGATGGTACCATTGATAGATACAAGAC
AAGACTTGTAGCAAAGGGCTATAACCAACAAGAGGGAATCAATTTTTTGGATACCTTTTCACCAGTGGCGAAAAAAAGCACTGTGAAGATATTCTTAGCTCTTGCCACAT
CCTATAATTGGTTCATTAGCCAAATGGACATAAATAATGCCTTCCTCAATGGAGACTTATTTGAAGAAGTGCACATGACCCTACCATTGGCAGCAATATCCTCACATGGT
TTCATCCAATCCAAGGCTGATTACTCCTTATTTACTAAGGGGAATGGAAGCACCTTTGTAGCATTATTAGTATATGTTGATGACATATTACTAATAGGACCATCTCCTTC
GAGTATCAACTCAATCAAAGATTCTTTGAAGGCACACTTCAAATTAAAGGACCTAGGACAAGCAAGATACTTCTTGGGTCTAGAATTATCAAGATCTGAACGAGGACTTA
TGCTCTCCCAAAGAAAATATTGTCTTCAAATCCTAGAAGATACTGGTTTTCTTGATTCTAAACCAGTTGTAGCACCTATGGATCCTAATCTGAAGCTATGTAAATCTGAA
GGAGAACAACTGACTGAGGAAGACGCCACTTGCTATAGAAGATTAATTGGCAGACTGATATACTTACAAATATCCAGACCTGATATTTGCTTCTCTGTTCACCGCTTAAG
CCAATTTTTGCACAAGCCTACTAAACATCACCTAGATGCTGCTCATCACCTATTGAAGTACCTCAAAGGTTCCCCAGGACAAGGTGTTTTAATAAAACCTATTGATTCGT
TTCACTTAAAAGCCTTCGTTGATGCTGATTGGGGATCGTGCTTTGACACTAGAAGATCGGTCACAGGGTTCTGTATCTTCCTAGGGGATTCCATCATCTCCTGGAAATCT
AGGAAACAAGCAACCGTCTCAAGGTCCTCTGCAGAAGCTGAATATAAGGCCTTAGCATCAGTCACCAGTGAGCTAGTATGGATCACACAGCTCCTTACTGATTTTAAAGT
AAAGTCCTTGATGTCAACCACTGTTTTCTGTGATAATCAAGCAGCCATTGCTATTGCTTCGAATCCGACATTCCATGAACAGACAAAACACATAGAAATTGATTGTCATT
TTGTTCGAGACAAAATAGTTGAAGGATTTCTAAAGGTTTTACCTATCAACACTAGCCTACAACTAGCTGATATGTTCACTAAAGCACTACCTTCATCTACCTTAAACAGA
CATATATCCAAGTTGGGAATGAAAGACATTCATCGTCCAACTTAA
Protein sequenceShow/hide protein sequence
MLQTHLNTPQNGENLKNETTHIAGTCLSNSLNDPLTWIIDSGASSHICHDKFMFTDLYSAQNMFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIPDFKYNLLSIFKLIE
TQFSKVIKVFRSDNAPKLNFRDLFAKIGTTHQFSCAYTPQQNSVVERKQQHLNIARALMFQSKVPLIFWGKCVLSVAYLINRTPMVLLSNNTPFVALFKKKADYNIIKTF
GCLAYASTPSVNRSKFDPRAQPCVFMGFPPGIKGYRLYDIAKRKFFISRDVLFFEELFPFHSIKEKDIPISHDFLEQFIVPCPLFDCLKKEDIIDATTDARPTTEDTPED
SHGVDDQNPHVSNSEETSNTDQAPIPIVTRKSSRPHHPSSYLKDFHCNLTSQNSTPFPLNQYLSYNAYSQHHKNYMFNVTSIYEPTYYHQAVKHQTWRKAMAEEIEAMER
TNTWTIVSISKDHHTVGNKWVYKVKCKPDGTIDRYKTRLVAKGYNQQEGINFLDTFSPVAKKSTVKIFLALATSYNWFISQMDINNAFLNGDLFEEVHMTLPLAAISSHG
FIQSKADYSLFTKGNGSTFVALLVYVDDILLIGPSPSSINSIKDSLKAHFKLKDLGQARYFLGLELSRSERGLMLSQRKYCLQILEDTGFLDSKPVVAPMDPNLKLCKSE
GEQLTEEDATCYRRLIGRLIYLQISRPDICFSVHRLSQFLHKPTKHHLDAAHHLLKYLKGSPGQGVLIKPIDSFHLKAFVDADWGSCFDTRRSVTGFCIFLGDSIISWKS
RKQATVSRSSAEAEYKALASVTSELVWITQLLTDFKVKSLMSTTVFCDNQAAIAIASNPTFHEQTKHIEIDCHFVRDKIVEGFLKVLPINTSLQLADMFTKALPSSTLNR
HISKLGMKDIHRPT