; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006924 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006924
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr6:47147681..47155934
RNA-Seq ExpressionLag0006924
SyntenyLag0006924
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000086 - NUDIX hydrolase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR015375 - NADH pyrophosphatase-like, N-terminal
IPR015376 - Zinc ribbon, NADH pyrophosphatase
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR020084 - NUDIX hydrolase, conserved site
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG93649.1 wall-associated kinase 2 [Prunus dulcis]2.5e-22337.07Show/hide
Query:  SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEE
        + P I+Q++K + +  QGT+S+  YY K+K LW EL  Y L   C+CGG K+     + + IM FLMGLN+S   VR +ILLMNPLP++ KA+SLV QEE
Subjt:  SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEE

Query:  RQRSISTLPSVESVTLLAAAEASKRVN---DQSKKRNNQRPPPVCSHCN-----------LKGHAP---------KAADAKPNSAFFASLSNDQYNQLMT
        +QR I +  + E  ++ AA   +K  N   ++S+  +    P  CS+C+           L G+ P          AA +   ++    +S   Y    +
Subjt:  RQRSISTLPSVESVTLLAAAEASKRVN---DQSKKRNNQRPPPVCSHCN-----------LKGHAP---------KAADAKPNSAFFASLSNDQYNQLMT

Query:  MLQTHVA-----------------ATKPE---------------SGVKVE-----TNHVAGTCSSSTSDN--------------------FCSKSWVIDS
          QT  A                 +T P                SG+  E     T  +    + S+SDN                      S  W++DS
Subjt:  MLQTHVA-----------------ATKPE---------------SGVKVE-----TNHVAGTCSSSTSDN--------------------FCSKSWVIDS

Query:  GASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS---------------------DKLQLRKIV-------
        GA+ HI  D + FV+   +   SV LP  +   +   G +   +D+ L++VL++P F  +LLS                       L   K++       
Subjt:  GASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS---------------------DKLQLRKIV-------

Query:  ----------------------------------KVECIDSLYLL------------------------STSTAKVTTAAL-------------------
                                          +++ +  L+                          S  T+K     L                   
Subjt:  ----------------------------------KVECIDSLYLL------------------------STSTAKVTTAAL-------------------

Query:  --TLDDCSRYT-------------------------------SIALDNAPE-LAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKV
           +DD SR T                               +I  DN  E L+   FF S G   Q SCV TPQQN VVERKH+H+LN+ARAL FQS +
Subjt:  --TLDDCSRYT-------------------------------SIALDNAPE-LAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKV

Query:  PLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFF
        PL FWGDCVLTA ++INR P  LL+NK+PF +LY +    S LRVFGCL YA T+     KFD R   CVFIGYPTG KGYKLYDM   +F++SRDV FF
Subjt:  PLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFF

Query:  EDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVD
        E++FP+ + S   SS         VLP P +D P     PS+               +  V  P  S+T      T L +  S+ P +T+   A   ++ 
Subjt:  EDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVD

Query:  TTPTHFGPRR-TARNRHPPTYLKDFHCNLLAGSNPCLSKPFSI--------DKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAM
          P    P R + R +H P +  D+H +    S    S+P SI          ++SY+ FSP+HR FL ++S   EP  Y QAV  P W  AM  EL A+
Subjt:  TTPTHFGPRR-TARNRHPPTYLKDFHCNLLAGSNPCLSKPFSI--------DKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAM

Query:  ERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVH
            TW++V +P G   IGCKWVYR+KY  DGT++R+K RLVAKG+ Q EG+D+L+TFSP AK+ T++ LL +A S +W L Q+DV NAFL+GDL EEV+
Subjt:  ERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVH

Query:  MALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFL
        M  P G +      +GE + CRLNKS+Y LKQASR WF+KFS+A+ + GF QSK+DYSLFT+  G SF+ +L+YVDDIL+TG    E+  +K  L   F 
Subjt:  MALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFL

Query:  LKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQ
        +KDLGN++YFLG+E++RS++G+F SQRKY L IL+D GLL A+P   PM+ N+KLS   G  L  ++ T YRRL+G+L+Y+ ++RPD+V+SV  LSQF+ 
Subjt:  LKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQ

Query:  KPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQL
        +P   HL AA  +LR++K +PGQG+L    S+ +LKAF D+DW                                       EYRA+ T   EL W+  +
Subjt:  KPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQL

Query:  LKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL
        L+DLQV    PA  +CDNQAA+ IA+NP FHERTKHIEIDCH VR+K++ G +    + S++QLAD+FTKAL  +       KL
Subjt:  LKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL

KAG7578768.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]4.4e-22835.8Show/hide
Query:  WSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGG-----SKSIITFLNSEFIMIFLMGLNESSANVRTRILLMN
        W K ++     + P +Y++ ++LA   QG+L + ++Y K+ T+W+E+   +    C CGG     ++  +      F++ FL GLN+S  NVR  I++++
Subjt:  WSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGG-----SKSIITFLNSEFIMIFLMGLNESSANVRTRILLMN

Query:  PLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAE-----------ASKRVNDQSKKRNN----------QRPPPVCSHCNLKGH-------------
        PLP + K  ++VIQ+E  + I  +P   SV    +++           +S  + D    +++           R  PVC++C L+GH             
Subjt:  PLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAE-----------ASKRVNDQSKKRNN----------QRPPPVCSHCNLKGH-------------

Query:  ---------------------APKAADAKP--------NSAF---------------------------------------FASLSNDQYNQLMTMLQTH
                             APK    +P        N  F                                        A LS DQ  QL + L + 
Subjt:  ---------------------APKAADAKP--------NSAF---------------------------------------FASLSNDQYNQLMTMLQTH

Query:  VAATKPESGVKVETNHVAGT-----CSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDF
             P        N+ + +     CS+         +W+IDSGA++H+C +L++F +I   +  +V LPN T+  +   G VKL + LLL NVL+IP F
Subjt:  VAATKPESGVKVETNHVAGT-----CSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDF

Query:  TYHLLSDKLQLR----------------------KIVKVECIDSLYLL---STSTAKVTTAALT------------------------------------
          +L+S    L+                       I K    + LY L   S S+   ++ +L                                     
Subjt:  TYHLLSDKLQLR----------------------KIVKVECIDSLYLL---STSTAKVTTAALT------------------------------------

Query:  ---------------------------------------------------------LDDCSRYTSIAL-------------------------------
                                                                 +DDC+R T I L                               
Subjt:  ---------------------------------------------------------LDDCSRYTSIAL-------------------------------

Query:  DNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFG
        DNAPEL+FT    S G  H FSCV TPQQNSVVERKHQH+LNVARAL+FQS +P+ +W DC+ T+ +LINRTP  LL NKTPF +L  K+  YS L+ FG
Subjt:  DNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFG

Query:  CLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTST
        CL Y ST    R KF PRA   VF+GYP+G KGYK+ ++      ISR+V+F ED+FPFHS   A S+   D+ N  +LP PL D               
Subjt:  CLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTST

Query:  TVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPCLS--KPFSIDKY
        T +  +P      V  P  +N ++             +  S + +++D TI  TT       R  RN   P+YL D+HCNL+    P +S      +   
Subjt:  TVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPCLS--KPFSIDKY

Query:  ISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDF
        + YT  +P ++ F+LN+S E EP  + +AV+   W   M++EL+    T T+++VS+P G+  IGC+WVY++K+  DGT+DR++ RLVAKG+ QQEG+D+
Subjt:  ISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDF

Query:  LDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKI----ACRLNKSIYDLKQASRQWFTKFSSALISHGF
        +DTFSPVAK+VTVK+LL L+    W L QMDV NAFL+GDL EE++M LP GY     PP GE +      RL+KS+Y LKQASRQW  KFS  L++ GF
Subjt:  LDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKI----ACRLNKSIYDLKQASRQWFTKFSSALISHGF

Query:  AQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMD
         QS+SD++LF +     FIALLVYVDDIL+   S   V+ +K++L A F LKDLG  +YFLGLE++R+K G+  SQRKY L +LE VGLL  KPV+TPMD
Subjt:  AQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMD

Query:  PNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW-------
          ++L+ E+G  L   +AT YR LIG+LLY+ I+R DI F+VH LSQFL +P + HL+AAH ++RYLK  PG+G+     S  +L+AF DADW       
Subjt:  PNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW-------

Query:  --------------------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVE
                                        EYRAL   T ELLW+++LLKDL V    P   YCD+ AAI IASN  FHERTKHIEIDCH VRDK+  
Subjt:  --------------------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVE

Query:  GFLKLLPIRSQSQLADMFTKALPSSTLSFFMGK
        GFLKL+ + +++QL D+FTKALP  T    M +
Subjt:  GFLKLLPIRSQSQLADMFTKALPSSTLSFFMGK

KAG7587171.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]2.0e-22535Show/hide
Query:  WSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEF-----IMIFLMGLNESSANVRTRILLMN
        W+  M        P +Y++ + L++  QG+L I +YY  + TLW+E   Y  +  CTCG  +     L  +      +  FLMGLNE+    R  IL++ 
Subjt:  WSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEF-----IMIFLMGLNESSANVRTRILLMN

Query:  PLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVN--DQSK---KRNNQRP---PPVCSHCNLKGH--------------------------
        P+P+I +A+++V Q+ERQR++  +   ++V   A +   +  +  DQS+     N  RP    P+C+HC   GH                          
Subjt:  PLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVN--DQSK---KRNNQRP---PPVCSHCNLKGH--------------------------

Query:  ---APKAADA-------KPNSAFF------------------------------------------ASLSNDQYNQLMTMLQTHVAATKPE---------
           AP+           +P + F                                           + L+ +Q   L+  L +H     PE         
Subjt:  ---APKAADA-------KPNSAFF------------------------------------------ASLSNDQYNQLMTMLQTHVAATKPE---------

Query:  --------------------SGVKVETNHVA--GTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLL
                            S ++ E N++     C SS      + +W+IDSGA+SH+C DL+ F  I   +GI+V LPN T   +   G ++L E L+
Subjt:  --------------------SGVKVETNHVA--GTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLL

Query:  LSNVLYIPDFTYHLLSDKLQLR----------------------KIVKVECIDSLYLLSTSTAKVTTAA-------------------------------
        L NVL++P F ++L+S    L                        I     + +LY+L+  + K  T A                               
Subjt:  LSNVLYIPDFTYHLLSDKLQLR----------------------KIVKVECIDSLYLLSTSTAKVTTAA-------------------------------

Query:  ----------------------------------------------------------------LTL-DDCSRYT-------------------------
                                                                        LT+ DDC+R T                         
Subjt:  ----------------------------------------------------------------LTL-DDCSRYT-------------------------

Query:  ------SIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKE
              +I  DNAPEL F E     G +H FSC  TPQQNSVVERKHQHLLNVARAL+FQSKVPL +W DCVLTA FLINR P  +L N TP+  L  K+
Subjt:  ------SIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKE

Query:  FDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHP---LSDLP
         DY+ L+ FGCL YASTL   R KF PRA PCVF+GYP+G KGYK+ ++      I+R+V+F E  FPFH+  Q    +  D     +LP+P   ++DLP
Subjt:  FDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHP---LSDLP

Query:  DSNKLPSAAST------------------STTVAQFVP------YGDAEIVHSPIDSN---TNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFG
          + +   +++                  S T ++ +P        DA   H  IDSN    +L    + +  + +++    T    + T+   T  +  
Subjt:  DSNKLPSAAST------------------STTVAQFVP------YGDAEIVHSPIDSN---TNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFG

Query:  P-RRTARNRHPPTYLKDFHCNLLAGSNPC---LSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIP
        P  R  R    P+YL  +HC L   S P     + P+ +   + Y  F+P +  ++L+ S E EP  + QA+    W+ A+++EL+AME+ KTW+I S+P
Subjt:  P-RRTARNRHPPTYLKDFHCNLLAGSNPC---LSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIP

Query:  VGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGY---KS
         G++++GCKWV+ +KY  DGT++R+K RLVAKGF QQEG+DF +TFSPVAK+ +VK++L LA    W L QMDV+NAFL+  L EE++M+LP GY    S
Subjt:  VGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGY---KS

Query:  SFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRY
          +PP      CRL+KSIY LKQASRQW   F+S L+  GF QSKSD +LF +    SFIALLVYVDDI +   + +++ ++KA+L   F +KDLG +R+
Subjt:  SFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRY

Query:  FLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQA
        FLGLE++R+ +G+   QRKY L +L+D GLL  KP   PMDP + LSK+TG  L     T YR LIGRLLY+ I+RPDI ++VH LSQF+Q PT  H+QA
Subjt:  FLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQA

Query:  AHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQVVCS
        A  +L+YLK SPGQG+     +   L AF DADW                                       EYR+L  VT E+LWI QLLKD +V  +
Subjt:  AHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQVVCS

Query:  SPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKAL
        + A  +CDN++AI IA+NP FHERTKHIEIDCH  RD+I  G LKLL + S++QLAD+ TK L
Subjt:  SPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKAL

KAG7588551.1 Ribonuclease H domain [Arabidopsis suecica]7.7e-22533.83Show/hide
Query:  STSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------------------
        STSN       LD   NP+ LH S  +   LV+ +L   +++ SW +++RMAL+V                                             
Subjt:  STSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------------------

Query:  ----------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEF-----IMIFLMGLNESS
                                P +Y++ + L++  QG++ + +YY ++ T+W+E   Y  +  CTCG  +     L  +      +  FLMGLNES 
Subjt:  ----------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEF-----IMIFLMGLNESS

Query:  ANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVN-DQSKKR-------------------NNQRPPPVCSHCNLKG----
           R  IL++ P+P+I   F++V Q+ERQ+SI   PS     +  A   S + N D S  +                     ++P P+C+HC   G    
Subjt:  ANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVN-DQSKKR-------------------NNQRPPPVCSHCNLKG----

Query:  -----------------------HAPKAADAKPNSAF-----------------------------------FASLSNDQYNQLMTMLQTH---------
                               H+ +     P   F                                   F+ +S DQ   L+  L  H         
Subjt:  -----------------------HAPKAADAKPNSAF-----------------------------------FASLSNDQYNQLMTMLQTH---------

Query:  -----------VAATKPESG-------------------VKVETNHVA--GTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQT
                   V A +  SG                   ++ E N +     C SS   N    SW+IDSGA+SH+C DL++F      +G++V LPN T
Subjt:  -----------VAATKPESG-------------------VKVETNHVA--GTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQT

Query:  RFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS----------------------DKLQLRKIVKVECIDSLYLLS------------------------
        R  +   G + +   L+L +VL++P F ++L+S                      +  Q   I K   + +LY+L                         
Subjt:  RFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS----------------------DKLQLRKIVKVECIDSLYLLS------------------------

Query:  ------------------TSTAKVTTAAL------------------------------------------------------TLDDCSRYTSIAL----
                          +ST  +  ++L                                                       +DDC+R T I L    
Subjt:  ------------------TSTAKVTTAAL------------------------------------------------------TLDDCSRYTSIAL----

Query:  ---------------------------DNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTP
                                   DNAPELAFTE     G +HQFSC  TPQQNSVVERKHQHLLNVARAL+FQS VPL++W DC+ TA FLINR P
Subjt:  ---------------------------DNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTP

Query:  MTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDV
          LL N +P+ +L  K  DYSLLR FGCL YASTL   R KF PRA  CVFIGY +G KGYKL  +      +SR+V+F+E +FPFH ++ A    SP +
Subjt:  MTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDV

Query:  LNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTY
         +  +LP P+S                     V +   + V  P  S  +   H +     + + P +   + A+ +IV     +  P+R+A+    P+Y
Subjt:  LNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTY

Query:  LKDFHCNLL-AGSNPCLS--------KPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGC
        L D+HC+L+   S+P LS         P+ +   +SY+     ++  +L+ S E EP+ + QA+    W +AMD EL+AME   TW+IV +P G++I+G 
Subjt:  LKDFHCNLL-AGSNPCLS--------KPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGC

Query:  KWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGY---KSSFIPPKGE
        KWVY +KY  DGT++R+K RLVAKGF QQEG+DF DTFSPVAK+ +VK++L LA +  W L QMDV+NAFL+ +L EE++M+LP GY       +PP   
Subjt:  KWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGY---KSSFIPPKGE

Query:  KIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSR
           CRL+KSIY LKQASRQW+  FS  L+ + F QS SD +LF +  G SFI LLVYVDDI++   SV+ V+S+KA+L   F +KDLG VR+FLGLE++R
Subjt:  KIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSR

Query:  SKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYL
        +K+G+  SQRKYCL +L+D G L  KP   PMDP + L K+TG  L   +   YR LIGRLLY+ I+RPDI F+V+ LSQFL  PT  HLQAA+H+L+YL
Subjt:  SKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYL

Query:  KKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCD
        K +PGQG+     +   L  F DADW                                       EYRA+   + EL+W+TQLLKDL+V   SPA  YCD
Subjt:  KKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCD

Query:  NQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKAL
        +++A+ +A+NP FHERTKH+EIDCH  RD++  G  ++  I +++QLAD+ TK L
Subjt:  NQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKAL

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]4.2e-23135.84Show/hide
Query:  MAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV------------------------------------
        MA  G    +   P + +  L+   +P++LH+      TLV+  L+G SNY +W +AM +AL+                                     
Subjt:  MAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV------------------------------------

Query:  -------------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGL
                                       + P IYQ++K L+   QG++ + SYY K++TLW EL +Y+    CTCG  +    + N E +M FLMGL
Subjt:  -------------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGL

Query:  NESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPS---------VESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHA---------
        N+S A VR ++L++ PLPTI K F+LVIQEERQRSI    S         + +V   A    S R +  SK     R   +CSHC+ + H          
Subjt:  NESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPS---------VESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHA---------

Query:  -----PK-----------AADAKPNSAFFA---------SLSNDQYNQLMTMLQTHVAATK---PESGVKVETNHVAGTCSSSTS-DNFCSKSWVIDSGA
             PK           A  A  +S             SL+  Q  QL+  L + +   +    E   +   + + G CS+++       K W++D+GA
Subjt:  -----PK-----------AADAKPNSAFFA---------SLSNDQYNQLMTMLQTHVAATK---PESGVKVETNHVAGTCSSSTS-DNFCSKSWVIDSGA

Query:  SSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS----------------------DKLQLRKIVKVECIDSL
        + HIC  L+MF + R A    V+LPN     V + G V +  +L+L NVLY+P F ++LLS                      D  Q+R I   + I +L
Subjt:  SSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS----------------------DKLQLRKIVKVECIDSL

Query:  YLL----------------------------------------------------------------------------------------STSTAKVTT
        Y+L                                                                                         TS      
Subjt:  YLL----------------------------------------------------------------------------------------STSTAKVTT

Query:  AALTLDDCSRYT-------------------------------SIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKV
            +DD SRYT                               S+  DNAPEL F +FFA  G  H  SCV  PQQNSVVERKHQH+LNVARAL+FQS +
Subjt:  AALTLDDCSRYT-------------------------------SIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKV

Query:  PLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFF
        PL +W DC+ T+ +LINRTP  +LA+KTPF +L+GK   YS L+VFGCL YASTL   R KF PRA  CVFIGYP G KGYKL ++   +  ISRDV+F 
Subjt:  PLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFF

Query:  EDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVD
        E+ FP+ + S                  P+S                                           ++D+  ++S +   T  + AD     
Subjt:  EDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVD

Query:  TTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPC-LSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTI
                 RT+R  + P++L+D+HC  +  S PC  S    I   ++Y+  S +HR F+ N+S+  EPT + QAV  P WRQAMD+ELKA+E   TW+I
Subjt:  TTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPC-LSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTI

Query:  VSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYK
        VS+P G+  +GC+WVY+ K+  DG++ R+K RLVAKG+ QQEG+D+L+TFSPVAK+VTV+ LL LA    W L Q+DVNNAFL+GDL EEV+M LP G+ 
Subjt:  VSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYK

Query:  SSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVR
        S    P   +  C+L+KSIY LKQASRQWF KFSS L+S GF QS +D SLF R     F+AL+VYVDDI++     +  + +K  L + F LKDLGN++
Subjt:  SSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVR

Query:  YFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQ
        YFLG+E++RS +G+   QR Y + +L + GLL  KP  TPM+ N KL++++G+ L+  +  SYRRLIGRLLY+ I+RPD+VF+V+ LSQ++  P   H++
Subjt:  YFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQ

Query:  AAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQVVC
        AA ++L+Y+K + GQG+     S  +L+AF DADW                                       EYR+L   T E+LWI QLL DL V  
Subjt:  AAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQVVC

Query:  SSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL
        + P V +CD+QAA+ IASNP FHERTKHI+IDCH VR+K+ +  +KL+ + S+ QLAD+FTK L  S     + K+
Subjt:  SSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL

TrEMBL top hitse value%identityAlignment
A0A2N9EQM3 Integrase catalytic domain-containing protein1.0e-24340.41Show/hide
Query:  DVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------------------------------
        DV  N ++LHH  +  A LV+Q LVG  NY +WS++M MAL+                                                          
Subjt:  DVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------------------------------

Query:  -----------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGLNESSANVRTRILLMNPLPTI
                   +GP I++++K ++   Q   S+ SYY ++K+LW EL  +R + DC+CG  K ++     E++M FLMGLN+S ++VR +IL+ +PLP+I
Subjt:  -----------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGLNESSANVRTRILLMNPLPTI

Query:  NKAFSLVIQEERQRSI---STLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKAA------------DAKPNSAFFAS--------
         KAF LVIQEERQR+I   S  P+ +SV      EA  R N   + ++ ++  P+CSHC + GH                  AK +SA  +S        
Subjt:  NKAFSLVIQEERQRSI---STLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKAA------------DAKPNSAFFAS--------

Query:  -LSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCED
          +  Q  QL++ML +  +    +S V   TN V    S STS             A+S I H ++                          G   L ED
Subjt:  -LSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCED

Query:  LLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTAKVTTAALTLDDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKH
         L S                                                           DN PE     F+A  GTLHQ SCV TPQQN+ VERKH
Subjt:  LLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTAKVTTAALTLDDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKH

Query:  QHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLY
        QHLL VARAL FQ+ +PL FWG CVLTAT LINR P  LL NK+PF +L+ K  +YS LRVFGCL YA+TLS +R KF PR+  CV +GYP GIKGY+L 
Subjt:  QHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLY

Query:  DMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLP---------DSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHI
        D+   Q  +SRDV+F+E+ FPFHS SQ +   +P      VLP P++DLP          +N  PS  STS              +HSP+  +T      
Subjt:  DMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLP---------DSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHI

Query:  TDLAIDISTAPTSTTCLEADGTIVDTTPTHFGP-----RRTARNRHPPTYLKDFHCNLLAG------SNPCLSK-----PFSIDKYISYTTFSPTHRHFL
           +  +S +P  +T L+      D     F P     R++ R   PP+YL+ FHCN  +       S+P   +      F +  YISY+  +P +  F+
Subjt:  TDLAIDISTAPTSTTCLEADGTIVDTTPTHFGP-----RRTARNRHPPTYLKDFHCNLLAG------SNPCLSK-----PFSIDKYISYTTFSPTHRHFL

Query:  LNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVK
        LN S   EPT +H+A K P+W QAM  EL A+E   TW++ S+P G+  IG KWV++VK + DG+++R+K RLVAKG+NQQEG D+ +TFSPVAK VTV+
Subjt:  LNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVK

Query:  VLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGE--KIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGC
         LL +A   SW L Q+DVNNAFL+G+L EEV+M LP G+ S     KGE     C+LNKS+Y LKQASRQWF+KFS+ L++HGF QSK+DYSLFTR  G 
Subjt:  VLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGE--KIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGC

Query:  SFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVD
        SFIALLVYVDDIL+       V  +K  L A F LKDLG VRYFLGLE++RS QG+  SQRKY L+ILED GLL  KPV  PMD N+KLSK  G  L   
Subjt:  SFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVD

Query:  EATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW-----------------------
        + T+YRRLIGRL+Y+ ++RPDIVF+VH LSQF++ P   H +AA H+L+Y+K +P QG+L    S   +KAF D+DW                       
Subjt:  EATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW-----------------------

Query:  ----------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLAD
                        EYRA+ +   E++W+  LL DLQ+   + A+ + D+QAAI IA+NP FHERTKHIEIDCH VRDKI EG ++ L + S+ Q+AD
Subjt:  ----------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLAD

Query:  MFTKALPSSTLSFFMGKLVGPNNM
        + TKAL  +  S  + K+ G +N+
Subjt:  MFTKALPSSTLSFFMGKLVGPNNM

A0A2N9ERT7 Integrase catalytic domain-containing protein5.9e-24738.71Show/hide
Query:  SSAMAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------
        +S ++ T S S++++  AI+ P   +  +P++LH S  S+  LVT+ L G  N+ SW ++M MAL++                                 
Subjt:  SSAMAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------

Query:  ------------------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEY------RLVDDCTCGGSKSIITFLN
                                            +GP I+Q+ KE+ +  Q   S+  YY  ++ LW+EL+ Y           C+CG     +    
Subjt:  ------------------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEY------RLVDDCTCGGSKSIITFLN

Query:  SEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKA-ADA
           +M FLMGLNES A VR +ILLM+P+P INK   LV + + QR ++      +  ++    A+  +      R   + P V +  +L      A A A
Subjt:  SEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKA-ADA

Query:  KPNSAFFASLSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSDNF--------------CS-------------------------KSWVI
        +  S    S    Q  QL+ ++      T   S   V   H   T ++++S  F              CS                          SW++
Subjt:  KPNSAFFASLSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSDNF--------------CS-------------------------KSWVI

Query:  DSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLST-------------
        D+GA+ H+ H L+ F  +      +V LPN     V  +G VKL   L+L++VL +P F ++L+S     R I   +  + LY+L +             
Subjt:  DSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLST-------------

Query:  -----------STAKVTTAALTL----------DDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLS
                   S + +T A  +L             ++  +I  DN  E   ++FF+S G +HQ SCV TPQQNSVVERKHQHLLNVARA+ FQS +PLS
Subjt:  -----------STAKVTTAALTL----------DDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLS

Query:  FWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDL
        FWG+C+L A +LINR P  +L  KTP+ VL  K   Y+ L+VFGCLAYAS LS H+ KFD +A PCVF+GYP G KGYKL D++  Q  +SRDV+F E +
Subjt:  FWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDL

Query:  FPFH-SVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTT
        FPFH S S  N   S D ++     HP S +   N +PS ++        +P         P  S  +   HI+  A+ ++  PTS + + +  + V  +
Subjt:  FPFH-SVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTT

Query:  PT----HFGPRRTARNRHPPTYLKDFHCNLLAGSNP-----CLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMER
         +    H   R+++R   PP+YL+D+HC+ LA S P       +  + I   +SY+  S  H+ F L +ST  EP +YH+AVK PHW  AM  EL+A+E 
Subjt:  PT----HFGPRRTARNRHPPTYLKDFHCNLLAGSNP-----CLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMER

Query:  TKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMA
          TW + S+P G+  IGCKWVY++K+K DGT++R+K RLVAKG+NQ+EGID+ +TFSPVAK+VTV+  + LA +  WP+ Q+DVNNAFL+GDL EEV M+
Subjt:  TKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMA

Query:  LPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLK
        LP G+ +    P      CRL KS+Y LKQASRQWF+KFSS L++HGF QSK DYSLFT+  G +F+ALLVYVDDIL+       VT++ A L  HF LK
Subjt:  LPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLK

Query:  DLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKP
        DLG  +YFLGLEL+R+ +G+   QRKY L IL+D G L +KPV  PM+ ++KLSK+ G AL   + T YRRLIG+LLY+ ++RPDI +SV  LSQF+ +P
Subjt:  DLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKP

Query:  TSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLK
         + HL AAH +L+YLK SPGQG+     +  QLKAF D+DW                                       EYRA+   T E+ W+  LLK
Subjt:  TSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLK

Query:  DLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL
        D  +     AV +CDNQAA+ IASNP FHERTKHIE+DCHF+RDKI  G LK L + S  QLAD+FTK L     S  + KL
Subjt:  DLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL

A0A2N9GKP6 Ribonuclease H7.5e-24238.05Show/hide
Query:  PFPSSAMAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV------------------------------
        P+ S +MA     + S ++ +++ P      +P++LH S  S+  LV + L G  N+ SW ++M M LS+                              
Subjt:  PFPSSAMAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV------------------------------

Query:  ---------------------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYR------LVDDCTCGGSKSIIT
                                               +GP I+QV + + +  Q  +S+ +YY K++  W+EL+ YR       +  C+CG  + ++ 
Subjt:  ---------------------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYR------LVDDCTCGGSKSIIT

Query:  FLNSEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTL--PSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGH---
          +   +M FLMGLNE+   VR +ILLM+P+P I+K FSL+ QEERQRSI  L  P VES  LL  +E  +     +K+   +R  P C+HC   GH   
Subjt:  FLNSEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTL--PSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGH---

Query:  ------------------APKAADAKPN-------------SAFFASLSNDQYNQLMTMLQTH-VAATKPES-GVKVETNHVAGTCSSS---------TS
                           P A     N             S F  S    Q  QL+ +L T  + +  PE+    V    +A T SSS         T+
Subjt:  ------------------APKAADAKPN-------------SAFFASLSNDQYNQLMTMLQTH-VAATKPES-GVKVETNHVAGTCSSS---------TS

Query:  DNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTA
        +N     W+ID+GA+ H+ H L+    I      SV LPN     V  +G V++   + L+NVL      Y L+      R I   +    LY+L TS  
Subjt:  DNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTA

Query:  KVTTAAL-------------------------------------------------------------TLDDC------------------------SRY
          ++A+L                                                             TL+                          ++ 
Subjt:  KVTTAAL-------------------------------------------------------------TLDDC------------------------SRY

Query:  TSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSL
         S+  D   E   ++FF+  G +HQ SC  TPQQNSVVERKHQHLLNVARA+ FQS +P  FWG+CVLTA ++INR P  +L +KTPF +L  K   YS 
Subjt:  TSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSL

Query:  LRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSA
        L+VFGCLAYASTL  HR KFD RA  CVF+GYP G KGYKL+++   QF +SRDV+F E +FPF             +   FV   P SD   ++  PS 
Subjt:  LRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSA

Query:  ASTSTTVAQFVPYGDAEIVHSPIDSNTNLGD---------HITDLAIDISTAPTSTTCLEADGTIVD-TTPTHFGP------RRTARNRHPPTYLKDFHC
        +S         P+      HSP D   ++ D         H + L+I + + P     L +D ++ D  TP    P      R ++R +H  +YL+D+HC
Subjt:  ASTSTTVAQFVPYGDAEIVHSPIDSNTNLGD---------HITDLAIDISTAPTSTTCLEADGTIVD-TTPTHFGP------RRTARNRHPPTYLKDFHC

Query:  NLLAG-SNPCLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTV
        +L++    P  S  + I   ISY+  SP H+ + L +ST  EP +YH+ V  P W +AMD EL A+E   TW + ++P+G+H IGCKWVY++K+K DG++
Subjt:  NLLAG-SNPCLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTV

Query:  DRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQAS
        +RHK RLVAKG+NQ EGID+ +TFSPVAK+VTV+  + LA +  W L Q+DVNNAFL+GDL EEV+M LP G+KS         + CRL KS+Y LKQAS
Subjt:  DRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQAS

Query:  RQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQIL
        RQWF+KFSS LI  GF QSK DYSLF +  G  FI +LVYVDDIL+T      V ++   L   F LKDLG  +YFLGLEL+RSK+G+   QRKY L IL
Subjt:  RQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQIL

Query:  EDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQ
        ED G L +KPV  PM+ ++KLSK+ G  L   + + YRRLIGRLLY+ ++RPDI +SV CLSQF+ +P   HLQAA+ +L+YLK SPGQG+     +  Q
Subjt:  EDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQ

Query:  LKAFVDADW-----------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRS
        LK F D+DW           EYRA+   T E+ W+  LL+D  +     +V YCDNQA + IA+NP FHERTKHIE+DCHF+RDKI  G LK L + S
Subjt:  LKAFVDADW-----------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRS

A0A2N9GZW3 Integrase catalytic domain-containing protein2.0e-24736.54Show/hide
Query:  SSSTSTSNPAIESPDL--DVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------------
        ++S S  +   ++P+   DV  + ++LHH  +  A LV+Q LVG  NY +WS++M MAL+                                        
Subjt:  SSSTSTSNPAIESPDL--DVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSV---------------------------------------

Query:  -----------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGLNE
                                     +GP I++++K ++   Q   S+ SYY ++K+LW EL  +R + DC+CG  K ++     E++M FLMGLN+
Subjt:  -----------------------------SGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGGSKSIITFLNSEFIMIFLMGLNE

Query:  SSANVRTRILLMNPLPTINKAFSLVIQEERQRSI---STLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKAA------------D
        S ++VR +IL+ +PLP+I KAF+LVIQEERQR+I   S  P+ +SV L    EA++  ++  K ++ ++  P+CSHC + GH                  
Subjt:  SSANVRTRILLMNPLPTINKAFSLVIQEERQRSI---STLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKAA------------D

Query:  AKPNSAFFAS---------LSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSD-------------------------------------N
        AK +SA  +S          +  Q  QL++ML +  +    +S      N V    S+ TS                                       
Subjt:  AKPNSAFFAS---------LSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSD-------------------------------------N

Query:  FCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS----------------------DKLQL
        F   +W++D+GA+ H+ H L  F +I  +    + LPN  +     +G V++   LLL++VL +P F+++L+S                      D +  
Subjt:  FCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS----------------------DKLQL

Query:  RKIVKVECIDSLYLLSTSTAKVTTAALTL-----------------------------------------------------------------------
        ++I      + LY L  ST  V +++  L                                                                       
Subjt:  RKIVKVECIDSLYLLSTSTAKVTTAALTL-----------------------------------------------------------------------

Query:  --------------------------DDCSRYTSIAL-------------------------------DNAPELAFTEFFASTGTLHQFSCVATPQQNSV
                                  DDC+R T + L                               DN PE     F+A  GTLHQ SCV TPQQN+ 
Subjt:  --------------------------DDCSRYTSIAL-------------------------------DNAPELAFTEFFASTGTLHQFSCVATPQQNSV

Query:  VERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIK
        VERKHQHLL VARAL FQ+ +PL FWG CVLTAT LINR P  LL NK+PF +L+ K  +YS LRVFGCL YA+TLS +R KF PR+  CV +GYP GIK
Subjt:  VERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIK

Query:  GYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLA
        GY+L D+   Q  +SRDVLF+E+ FPFH++  +  + S       VLP P++DLP S   P    T+T+ +       + + +SP+ S  +     T   
Subjt:  GYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLA

Query:  IDISTAPTSTTCLEADGTIVDTTPTHFGP-----RRTARNRHPPTYLKDFHCNLLAGSNPCLSKP------------FSIDKYISYTTFSPTHRHFLLNV
           S  P ++T L+    + D T   F P     R++ R   PP+YL+ FHCN  A S P  S              F +  YISY+  +P +  F+L+ 
Subjt:  IDISTAPTSTTCLEADGTIVDTTPTHFGP-----RRTARNRHPPTYLKDFHCNLLAGSNPCLSKP------------FSIDKYISYTTFSPTHRHFLLNV

Query:  STEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLL
        S   EPT +H+A K P+W QAM  EL A+E   TW++  +P G+  IG KWV++VK + DG+++R+K RLVAKG+NQQEG D+ +TFSPVAK VTV+ LL
Subjt:  STEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLL

Query:  TLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIAL
         +A    W L Q+DVNNAFL+G+L EEV+M LP G  S   P     I C+L KS+Y LKQASRQWF+KFS+ L++HGF QSK+DYSLFTR  G SFIAL
Subjt:  TLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIAL

Query:  LVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSY
        LVYVDDIL+       V  +K  L A F LKDLG VRYFLGLE++RS QG+  SQRKY L+ILED GLL  KPV  PMD N+KLSK  G  L   + T Y
Subjt:  LVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSY

Query:  RRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW----------------------------
        RRLIGRL+Y+ ++RPDIVF+VH LSQF++ P   H +AA H+L+Y+K +P QG+     S   +KAF D+DW                            
Subjt:  RRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW----------------------------

Query:  -----------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKA
                   EYRA+ +   E++W+  LL DLQ+   + A+ + D+QAAI IA+NP FHERTKHIEIDCH VRDKI EG ++ + + S+ Q+AD+ TKA
Subjt:  -----------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKA

Query:  LPSSTLSFFMGKLVGPNNM
        L  +  S  + K+ G +N+
Subjt:  LPSSTLSFFMGKLVGPNNM

A0A2N9H2Y3 Integrase catalytic domain-containing protein1.0e-24337.37Show/hide
Query:  WSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGG------SKSIITFLNSEFIMIFLMGLNESSANVRTRILLM
        W    +     +G  ++ +++ +A+  Q  + +  Y+ ++K LW E + YR +  CTCG       S++++ + + +++  FLMGLN+S A VR +ILLM
Subjt:  WSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGG------SKSIITFLNSEFIMIFLMGLNESSANVRTRILLM

Query:  NPLPTINKAFSLVIQEERQR--SISTLPSVESVTLLAAAEASKRV---------------NDQSKKR----NNQRPPPVCSHCNLKGHAP----------
         PLP INK FSL+  +E+QR   +  LP+V+S  LL+  E                     D  K+        +PP +CSHC  KGH            
Subjt:  NPLPTINKAFSLVIQEERQR--SISTLPSVESVTLLAAAEASKRV---------------NDQSKKR----NNQRPPPVCSHCNLKGHAP----------

Query:  ----------KAADAKPNSAFFASLSND-------------QYNQLMTMLQTHVAATKPES---------GVKVETNH--VAG--TCSSSTSD-------
                    A+   +SA   S S D             Q  QL+ ML        P S          + V  +H  +AG  TC S+ S+       
Subjt:  ----------KAADAKPNSAFFASLSND-------------QYNQLMTMLQTHVAATKPES---------GVKVETNH--VAG--TCSSSTSD-------

Query:  ---------NFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS-----------------
                  F S  WVID+GA  H+      +        ISV LPN     V  +G V+L   LLL+NVL +P F ++L+S                 
Subjt:  ---------NFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCEDLLLSNVLYIPDFTYHLLS-----------------

Query:  -----DKLQLRKIVKVECIDSLYLLSTSTAKVTTAA----------------------------------------------------------------
             D +  R I      + LYLL +S+   TTAA                                                                
Subjt:  -----DKLQLRKIVKVECIDSLYLLSTSTAKVTTAA----------------------------------------------------------------

Query:  ------------------------------------------LTL-DDCSRYTSIAL-------------------------------DNAPELAFTEFF
                                                  LTL DDC+R T I L                               DN  E    +F+
Subjt:  ------------------------------------------LTL-DDCSRYTSIAL-------------------------------DNAPELAFTEFF

Query:  ASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHR
        AS G +HQ SCV TPQQNSVVERKHQH+LNVARAL FQS +PL +WG C+ TA +LINR P  +L+NK+PF  L  K   Y+ L+VFGCL +ASTLS HR
Subjt:  ASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHR

Query:  KKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFV--LPHPLSD----LPDSNKLPSAASTSTTVAQFV
         KFDPRA  C F+GYP+G+KGYKL ++   + +ISRDV+F E +FPF      N +  PD  + F+   P PLS     +P S+ +    S ++  A   
Subjt:  KKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFV--LPHPLSD----LPDSNKLPSAASTSTTVAQFV

Query:  PYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLL----AGSNPCLS---KPFSIDKYI
        P   A +  SP+D+++ L DH +  +  +    T +        +          RR+ R   PPTYL+D+HC L     + S+P L+   KP+ +   +
Subjt:  PYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLL----AGSNPCLS---KPFSIDKYI

Query:  SYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFL
        SY   SPTHR+F L+V+   EP+++HQA + PHW++AM  EL A+E   TWT+  +P+G+H IGCKWVY+VK K DG+++R+K RLVAKG+ QQEG+D+ 
Subjt:  SYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFL

Query:  DTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGE--KIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQS
        +TFSPVAK  TV+ LL +A+   W L Q+DVNNAFL+GDL EEV+MALP G+     P KGE   + C+LNKS+Y LKQASRQWF KFSS +I  GF QS
Subjt:  DTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGE--KIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQS

Query:  KSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNI
        KSDYSLFTR  G +FIALLVYVDDIL+     +++ S+K  L A F LKDLGN++YFLGLE++RS +G+   QRKY L IL D G+L +KPV TPM+ N+
Subjt:  KSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNI

Query:  KLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW----------
        K+S+ TG+ L  D+ + YRRL+GRLLY+ ++RPDI +SV  LSQF+ KPTS HL AA+ +LRY+K + GQG+     S+ QLKAF D+DW          
Subjt:  KLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW----------

Query:  -----------------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFL
                                     EYRA+ +V  EL+W+  LL +LQ      A+ +CD+QAA+ IA+NP +HERTKHIE+DCH +R+KI  G +
Subjt:  -----------------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFL

Query:  KLLPIRSQSQLADMFTKALPSSTLSFFMGKLVGPNNMPCQLMVGFFAYAKSLDINVDKGEL
        + L + SQ+QLAD+ TKAL S+     +GK+    N P    V    Y     +  D+ ++
Subjt:  KLLPIRSQSQLADMFTKALPSSTLSFFMGKLVGPNNMPCQLMVGFFAYAKSLDINVDKGEL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-8727.77Show/hide
Query:  VLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTAKVTTAALTLDDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLN
        V+++  FT++ ++  ++ +  V           S +   +    L +D+   Y S  +         +F    G  +  +   TPQ N V ER  + +  
Subjt:  VLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTAKVTTAALTLDDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLN

Query:  VARALMFQSKVPLSFWGDCVLTATFLINRTPMTLL--ANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMT
         AR ++  +K+  SFWG+ VLTAT+LINR P   L  ++KTP+ + + K+     LRVFG   Y   +   + KFD ++   +F+GY     G+KL+D  
Subjt:  VARALMFQSKVPLSFWGDCVLTATFLINRTPMTLL--ANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMT

Query:  KGQFIISRDVLF------------FEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHI
          +FI++RDV+             FE +F   S    N +F  D         P ++  + + +     +  +  +  P    +I+ +   + +   D+I
Subjt:  KGQFIISRDVLF------------FEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHI

Query:  TDLAIDISTAP---TSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPC-----LSKPFSIDKYISYTTFSPTHRHFLLNVSTEYE
          L     +       +   + D  + ++  +  G    +R      +LK+   +    ++        S+       ISY     +    +LN  T + 
Subjt:  TDLAIDISTAP---TSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPC-----LSKPFSIDKYISYTTFSPTHRHFLLNVSTEYE

Query:  --PTYYHQAVKF----PHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVL
          P  + + +++      W +A++ EL A +   TWTI   P  ++I+  +WV+ VKY   G   R+K RLVA+GF Q+  ID+ +TF+PVA+I + + +
Subjt:  --PTYYHQAVKF----PHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVL

Query:  LTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLF--TRGHGCSF
        L+L   Y+  + QMDV  AFLNG L EE++M LP G   +          C+LNK+IY LKQA+R WF  F  AL    F  S  D  ++   +G+    
Subjt:  LTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLF--TRGHGCSF

Query:  IALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDE-
        I +L+YVDD+++    +  + + K  L   F + DL  +++F+G+ +   +  ++ SQ  Y  +IL    + +   V+TP+      SK   + L  DE 
Subjt:  IALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDE-

Query:  -ATSYRRLIGRLLYMQI-SRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLK--AFVDADW--------------------
          T  R LIG L+Y+ + +RPD+  +V+ LS++  K  S   Q    +LRYLK +    ++ K    F+ K   +VD+DW                    
Subjt:  -ATSYRRLIGRLLYMQI-SRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLK--AFVDADW--------------------

Query:  --------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQS
                            EY AL     E LW+  LL  + +   +P   Y DNQ  ISIA+NP+ H+R KHI+I  HF R+++    + L  I +++
Subjt:  --------------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQS

Query:  QLADMFTKALPSSTLSFFMGKL
        QLAD+FTK LP++       KL
Subjt:  QLADMFTKALPSSTLSFFMGKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-10631.83Show/hide
Query:  FTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYAST
        F E+ +S G  H+ +   TPQ N V ER ++ ++   R+++  +K+P SFWG+ V TA +LINR+P   LA + P  V   KE  YS L+VFGC A+A  
Subjt:  FTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYAST

Query:  LSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVP
            R K D ++ PC+FIGY     GY+L+D  K + I SRDV+F E      S  +  +  S  V N  ++P+ ++    SN   SA ST+  V++   
Subjt:  LSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVP

Query:  YGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPCL-SKPFSIDKYISYTTFSP
            E     I+    L + + ++                                       PT  ++ H  L     P + S+ +   +Y+       
Subjt:  YGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPCL-SKPFSIDKYISYTTFSP

Query:  THRHFLLNVSTEYEPTYYHQAVKFPHWRQ---AMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFS
                +S + EP    + +  P   Q   AM +E++++++  T+ +V +P G+  + CKWV+++K   D  + R+K RLV KGF Q++GIDF + FS
Subjt:  THRHFLLNVSTEYEPTYYHQAVKFPHWRQ---AMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFS

Query:  PVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEK-IACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYS
        PV K+ +++ +L+LA S    + Q+DV  AFL+GDL EE++M  P G++ +     G+K + C+LNKS+Y LKQA RQW+ KF S + S  + ++ SD  
Subjt:  PVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEK-IACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYS

Query:  L-FTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLEL--SRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKL
        + F R    +FI LL+YVDD+L+ G     +  +K  L   F +KDLG  +  LG+++   R+ + L+ SQ KY  ++LE   + +AKPV+TP+  ++KL
Subjt:  L-FTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLEL--SRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKL

Query:  SKETGKALAVDEA----TSYRRLIGRLLY-MQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDAD--------
        SK+       ++       Y   +G L+Y M  +RPDI  +V  +S+FL+ P   H +A   +LRYL+ + G   L    S   LK + DAD        
Subjt:  SKETGKALAVDEA----TSYRRLIGRLLY-MQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDAD--------

Query:  ----------------W---------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVE
                        W               EY A T    E++W+ + L++L  +     V YCD+Q+AI ++ N  +H RTKHI++  H++R+ + +
Subjt:  ----------------W---------------EYRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVE

Query:  GFLKLLPIRSQSQLADMFTKALP
          LK+L I +    ADM TK +P
Subjt:  GFLKLLPIRSQSQLADMFTKALP

Q94A82 Nudix hydrolase 19, chloroplastic1.1e-10964.69Show/hide
Query:  PEKSFNSRNPFNSTAPSISNSTSIRP-TMSISLQSHAFAGNPIKLKTPKPEDPFSPSSALESLNSQL-LDNTHLASSINFNVLPFRNGRPLARSSARPND
        P  SF SR    S   +++ +T++   TMS++L++HAFAGNP+K KTPK  DPFSP+SA ESL + + +   H   S +F VLPF  GRPL  SS    +
Subjt:  PEKSFNSRNPFNSTAPSISNSTSIRP-TMSISLQSHAFAGNPIKLKTPKPEDPFSPSSALESLNSQL-LDNTHLASSINFNVLPFRNGRPLARSSARPND

Query:  AASSWHLGWIGLADFKALFANSAVELAGDSFIYLGSSDEENAVYWAIDVSSEDGLVSEFGSKLLCFVEVRTLMVAADWSDARAMGELAIAGHARALLEWH
            WHLGW+ LAD K L A+  V+L  DS +YLG   EE+ VYWA+D+ +EDG VSE G + LCFVE+RTLMVAADW+D RAM ELAIAG+ARALLEWH
Subjt:  AASSWHLGWIGLADFKALFANSAVELAGDSFIYLGSSDEENAVYWAIDVSSEDGLVSEFGSKLLCFVEVRTLMVAADWSDARAMGELAIAGHARALLEWH

Query:  NVSKFCGHCGGKTVPMEAGRRKQCSNPSCKKRVYPRVDPVVIMLVIDKENDRALLSKQSRFVPRMWSCLAGFIEAGESLEEAVRRETWEETGIEVGEVVY
        NVS+FCG CG KT P EAGRRKQCS+ +C+KRVYPRVDPVVIMLVID+ENDRALLS+QSR+VPRMWSCLAGFIE GESLEEAVRRETWEETGIEVG+VVY
Subjt:  NVSKFCGHCGGKTVPMEAGRRKQCSNPSCKKRVYPRVDPVVIMLVIDKENDRALLSKQSRFVPRMWSCLAGFIEAGESLEEAVRRETWEETGIEVGEVVY

Query:  HSSQPWP-GILPFPSSAMAG
        HSSQPWP G    P   M G
Subjt:  HSSQPWP-GILPFPSSAMAG

Q94A82 Nudix hydrolase 19, chloroplastic3.9e-4673.6Show/hide
Query:  VGPNNMPCQLMVGFFAYAKSLDINVDKGELEDAQWHTREDVKNALTFAEYKKAQRTAAAKVEQMCKGVERQQSLSSDFNVESGELAPMFVPGPFAIAHHL
        VGP++MPCQLM+GFFA+AK+LDINVDK ELEDAQWH+RE+VK AL  AEY+KAQRTAAAKVEQ+CKGVER QSLS+DFN+ESGELAPMF+PGPFAIAHHL
Subjt:  VGPNNMPCQLMVGFFAYAKSLDINVDKGELEDAQWHTREDVKNALTFAEYKKAQRTAAAKVEQMCKGVERQQSLSSDFNVESGELAPMFVPGPFAIAHHL

Query:  ISSWVYNEGSELNKPRS---SLSSL
        IS+WV     +++  +    SLSSL
Subjt:  ISSWVYNEGSELNKPRS---SLSSL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-12832.62Show/hide
Query:  DNAPE-LAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVF
        DN  E +A  E+F+  G  H  S   TP+ N + ERKH+H++     L+  + +P ++W      A +LINR P  LL  ++PF  L+G   +Y  LRVF
Subjt:  DNAPE-LAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVF

Query:  GCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTS
        GC  Y      ++ K D ++  CVF+GY      Y    +   +  ISR V F E+ FPF +     S           +  P + LP    +  A S S
Subjt:  GCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTS

Query:  -------TTVAQFVPYGDAEIVHSPIDS-----------------------------------------NTNLGDHITDLAIDISTAPTSTTCLEADGTI
                  +   P+ ++++  S +DS                                         N    +  + LA  +ST   S++   +  T 
Subjt:  -------TTVAQFVPYGDAEIVHSPIDS-----------------------------------------NTNLGDHITDLAIDISTAPTSTTCLEADGTI

Query:  VDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPCLSKPFSIDKYISYTTFSPTHRHFL-LNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTW
          ++ T   P   +   HPP  L      ++  +N       S+          P  ++ L ++++ E EP    QA+K   WR AM  E+ A     TW
Subjt:  VDTTPTHFGPRRTARNRHPPTYLKDFHCNLLAGSNPCLSKPFSIDKYISYTTFSPTHRHFL-LNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTW

Query:  TIVSIPVGQ-HIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPL
         +V  P     I+GC+W++  KY  DG+++R+K RLVAKG+NQ+ G+D+ +TFSPV K  +++++L +A   SWP+ Q+DVNNAFL G L ++V+M+ P 
Subjt:  TIVSIPVGQ-HIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPL

Query:  GYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLG
        G    FI        C+L K++Y LKQA R W+ +  + L++ GF  S SD SLF    G S + +LVYVDDIL+TG     + +    L   F +KD  
Subjt:  GYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLG

Query:  NVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTST
         + YFLG+E  R   GL  SQR+Y L +L    ++ AKPV TPM P+ KLS  +G  L   + T YR ++G L Y+  +RPDI ++V+ LSQF+  PT  
Subjt:  NVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTST

Query:  HLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQ
        HLQA   +LRYL  +P  GI LK  +   L A+ DADW                                       EYR++   +SE+ WI  LL +L 
Subjt:  HLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTSELLWITQLLKDLQ

Query:  VVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKLVGPNNMP
        +  + P V YCDN  A  + +NP FH R KHI ID HF+R+++  G L+++ + +  QLAD  TK L  +    F  K +G   +P
Subjt:  VVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKLVGPNNMP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-12232.06Show/hide
Query:  SRYTSIALDNAPE-LAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEF
        +R  ++  DN  E +   ++ +  G  H  S   TP+ N + ERKH+H++ +   L+  + VP ++W      A +LINR P  LL  ++PF  L+G+  
Subjt:  SRYTSIALDNAPE-LAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARALMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEF

Query:  DYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHS----VSQANSSFSPDVLN----------QF
        +Y  L+VFGC  Y      +R K + ++  C F+GY      Y    +  G+   SR V F E  FPF +    VS +    S    N            
Subjt:  DYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDLFPFHS----VSQANSSFSPDVLN----------QF

Query:  VLPHP------LSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPP
        VLP P      L   P     PS   T+   +  +P   +  + SP  S      H        +  P  T    ++  I++  P    P   + N++ P
Subjt:  VLPHP------LSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTARNRHPP

Query:  TYLKDFHCNLLAGSNPCLSKPFS----------------IDKYISYTTFSPTHRH------------------FLLNVSTEYEPTYYHQAVKFPHWRQAM
                  +   +  +S+P S                    I     +P + H                  +  +++   EP    QA+K   WRQAM
Subjt:  TYLKDFHCNLLAGSNPCLSKPFS----------------IDKYISYTTFSPTHRH------------------FLLNVSTEYEPTYYHQAVKFPHWRQAM

Query:  DDELKAMERTKTWTIV-SIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLN
          E+ A     TW +V   P    I+GC+W++  K+  DG+++R+K RLVAKG+NQ+ G+D+ +TFSPV K  +++++L +A   SWP+ Q+DVNNAFL 
Subjt:  DDELKAMERTKTWTIV-SIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLN

Query:  GDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVK
        G L +EV+M+ P G    F+        CRL K+IY LKQA R W+ +  + L++ GF  S SD SLF    G S I +LVYVDDIL+TG     +    
Subjt:  GDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVK

Query:  ALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSV
          L   F +K+  ++ YFLG+E  R  QGL  SQR+Y L +L    +L AKPVATPM  + KL+  +G  L   + T YR ++G L Y+  +RPD+ ++V
Subjt:  ALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSV

Query:  HCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTS
        + LSQ++  PT  H  A   +LRYL  +P  GI LK  +   L A+ DADW                                       EYR++   +S
Subjt:  HCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------EYRALTTVTS

Query:  ELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL
        EL WI  LL +L +  S P V YCDN  A  + +NP FH R KHI +D HF+R+++  G L+++ + +  QLAD  TK L       F  K+
Subjt:  ELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKL

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.1e-1031.11Show/hide
Query:  TATLVTQQLVGASNYISWSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGG-----SKSIITFLNSEFIMIFLM
        T  L+   +   + +  W    R+ +      IYQ+R+ LA   QG  S+E Y+ K+  +W EL EY  + +C CGG     +K        E    FLM
Subjt:  TATLVTQQLVGASNYISWSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDDCTCGG-----SKSIITFLNSEFIMIFLM

Query:  G--LNESSANVRTRILLMNPLPTINKAFSLVIQEE
        G  LN+    V T+I+   P P++++AF++V   E
Subjt:  G--LNESSANVRTRILLMNPLPTINKAFSLVIQEE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-12442.76Show/hide
Query:  DISTAPTSTTCLEADGTIVDTTP---THFGPRRTARNRHPPTYLKDFHCNLLAGSNPCLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKF
        D ST+ +S   + +   I +  P    H   RRT +    P YL+D++C+ +A           I +++SY   SP +  FL+ ++   EP+ Y++A +F
Subjt:  DISTAPTSTTCLEADGTIVDTTP---THFGPRRTARNRHPPTYLKDFHCNLLAGSNPCLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKF

Query:  PHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDV
          W  AMDDE+ AME T TW I ++P  +  IGCKWVY++KY  DGT++R+K RLVAKG+ QQEGIDF++TFSPV K+ +VK++L ++  Y++ L Q+D+
Subjt:  PHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDV

Query:  NNAFLNGDLFEEVHMALPLGY---KSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGP
        +NAFLNGDL EE++M LP GY   +   +PP      C L KSIY LKQASRQWF KFS  LI  GF QS SD++ F +     F+ +LVYVDDI++   
Subjt:  NNAFLNGDLFEEVHMALPLGY---KSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFSSALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGP

Query:  SVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQI
        +   V  +K+ LK+ F L+DLG ++YFLGLE++RS  G+   QRKY L +L++ GLL  KP + PMDP++  S  +G      +A +YRRLIGRL+Y+QI
Subjt:  SVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATSYRRLIGRLLYMQI

Query:  SRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------E
        +R DI F+V+ LSQF + P   H QA   +L Y+K + GQG+     +  QL+ F DA +                                       E
Subjt:  SRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW---------------------------------------E

Query:  YRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIV
        YRAL+  T E++W+ Q  ++LQ+  S P + +CDN AAI IA+N  FHERTKHIE DCH VR++ V
Subjt:  YRALTTVTSELLWITQLLKDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIV

AT5G20070.1 nudix hydrolase homolog 197.9e-11164.69Show/hide
Query:  PEKSFNSRNPFNSTAPSISNSTSIRP-TMSISLQSHAFAGNPIKLKTPKPEDPFSPSSALESLNSQL-LDNTHLASSINFNVLPFRNGRPLARSSARPND
        P  SF SR    S   +++ +T++   TMS++L++HAFAGNP+K KTPK  DPFSP+SA ESL + + +   H   S +F VLPF  GRPL  SS    +
Subjt:  PEKSFNSRNPFNSTAPSISNSTSIRP-TMSISLQSHAFAGNPIKLKTPKPEDPFSPSSALESLNSQL-LDNTHLASSINFNVLPFRNGRPLARSSARPND

Query:  AASSWHLGWIGLADFKALFANSAVELAGDSFIYLGSSDEENAVYWAIDVSSEDGLVSEFGSKLLCFVEVRTLMVAADWSDARAMGELAIAGHARALLEWH
            WHLGW+ LAD K L A+  V+L  DS +YLG   EE+ VYWA+D+ +EDG VSE G + LCFVE+RTLMVAADW+D RAM ELAIAG+ARALLEWH
Subjt:  AASSWHLGWIGLADFKALFANSAVELAGDSFIYLGSSDEENAVYWAIDVSSEDGLVSEFGSKLLCFVEVRTLMVAADWSDARAMGELAIAGHARALLEWH

Query:  NVSKFCGHCGGKTVPMEAGRRKQCSNPSCKKRVYPRVDPVVIMLVIDKENDRALLSKQSRFVPRMWSCLAGFIEAGESLEEAVRRETWEETGIEVGEVVY
        NVS+FCG CG KT P EAGRRKQCS+ +C+KRVYPRVDPVVIMLVID+ENDRALLS+QSR+VPRMWSCLAGFIE GESLEEAVRRETWEETGIEVG+VVY
Subjt:  NVSKFCGHCGGKTVPMEAGRRKQCSNPSCKKRVYPRVDPVVIMLVIDKENDRALLSKQSRFVPRMWSCLAGFIEAGESLEEAVRRETWEETGIEVGEVVY

Query:  HSSQPWP-GILPFPSSAMAG
        HSSQPWP G    P   M G
Subjt:  HSSQPWP-GILPFPSSAMAG

AT5G20070.1 nudix hydrolase homolog 192.8e-4773.6Show/hide
Query:  VGPNNMPCQLMVGFFAYAKSLDINVDKGELEDAQWHTREDVKNALTFAEYKKAQRTAAAKVEQMCKGVERQQSLSSDFNVESGELAPMFVPGPFAIAHHL
        VGP++MPCQLM+GFFA+AK+LDINVDK ELEDAQWH+RE+VK AL  AEY+KAQRTAAAKVEQ+CKGVER QSLS+DFN+ESGELAPMF+PGPFAIAHHL
Subjt:  VGPNNMPCQLMVGFFAYAKSLDINVDKGELEDAQWHTREDVKNALTFAEYKKAQRTAAAKVEQMCKGVERQQSLSSDFNVESGELAPMFVPGPFAIAHHL

Query:  ISSWVYNEGSELNKPRS---SLSSL
        IS+WV     +++  +    SLSSL
Subjt:  ISSWVYNEGSELNKPRS---SLSSL

ATMG00810.1 DNA/RNA polymerases superfamily protein9.9e-2938.15Show/hide
Query:  LLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATS
        LL+YVDDILLTG S   +  +   L + F +KDLG V YFLG+++     GLF SQ KY  QIL + G+LD KP++TP+   +  S  T K     + + 
Subjt:  LLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPNIKLSKETGKALAVDEATS

Query:  YRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW
        +R ++G L Y+ ++RPDI ++V+ + Q + +PT         +LRY+K +   G+ +   S   ++AF D+DW
Subjt:  YRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.7e-2549.06Show/hide
Query:  LNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVK
        +  + + EP     A+K P W QAM +EL A+ R KTW +V  PV Q+I+GCKWV++ K   DGT+DR K RLVAKGF+Q+EGI F++T+SPV +  T++
Subjt:  LNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDGTVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVK

Query:  VLLTLA
         +L +A
Subjt:  VLLTLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATCAGTTCCAATTTCCCCACTCATTTCCACAGCTCCTCTGCATTACATCCCTGAAAAATCCTTCAATTCCCGCAATCCCTTCAATTCAACAGCCCCATCAATCTC
TAATTCCACCTCAATTCGTCCAACAATGTCCATTAGCTTACAGTCTCACGCCTTCGCCGGCAACCCCATCAAGTTGAAAACCCCAAAACCAGAAGACCCGTTTTCCCCTT
CTTCAGCTCTCGAATCCCTCAATTCCCAACTTTTGGACAACACCCACTTGGCTTCTTCAATCAATTTCAACGTCTTGCCCTTCAGGAACGGCAGGCCCTTGGCCCGTTCG
AGTGCGCGGCCTAATGATGCCGCCTCGAGCTGGCATCTGGGTTGGATTGGCTTGGCTGATTTCAAGGCTCTGTTTGCCAATTCGGCGGTTGAATTGGCTGGGGATTCGTT
CATCTATTTGGGTTCGTCGGATGAAGAGAATGCTGTTTATTGGGCTATTGATGTTTCGAGTGAGGATGGTTTGGTTTCTGAATTTGGGAGCAAGCTACTCTGCTTTGTGG
AGGTGAGAACTCTCATGGTCGCTGCTGATTGGTCCGATGCTCGAGCTATGGGAGAATTGGCTATTGCTGGTCATGCTAGAGCATTGCTAGAATGGCACAATGTGTCAAAA
TTTTGTGGACACTGTGGAGGAAAAACAGTACCCATGGAAGCAGGGAGACGGAAGCAATGTTCTAATCCCTCGTGCAAGAAGAGGGTTTATCCTCGTGTTGATCCGGTTGT
CATTATGCTAGTTATTGATAAAGAAAATGATCGTGCACTCTTGAGTAAACAATCGAGATTTGTTCCACGAATGTGGAGTTGTCTAGCTGGTTTCATAGAGGCTGGAGAAA
GCTTAGAAGAGGCAGTAAGAAGAGAAACATGGGAGGAGACGGGCATTGAAGTAGGGGAAGTTGTCTACCATAGTTCTCAGCCTTGGCCTGGAATTTTACCTTTTCCTTCT
TCTGCGATGGCTGGAACTGGTAGCTCTTCTACGTCTACTTCAAATCCTGCAATCGAATCTCCAGACCTAGATGTGCAGCTGAATCCTTTTCATCTGCATCACTCCGTCAC
TTCCACTGCCACGCTAGTTACTCAACAGCTTGTTGGTGCTAGCAATTATATATCATGGAGCAAGGCGATGCGTATGGCGCTTTCTGTTAGCGGCCCTCACATCTATCAGG
TACGTAAAGAACTCGCCAATACACTACAAGGTACTCTATCCATTGAGTCTTACTATGTGAAGGTGAAAACTCTCTGGCAAGAATTAATCGAATACCGACTTGTTGATGAT
TGCACTTGTGGCGGATCAAAATCGATTATAACCTTCTTAAACTCTGAATTCATCATGATCTTTTTAATGGGACTGAACGAGTCGTCTGCCAATGTTAGAACTCGGATTCT
TCTTATGAATCCTCTGCCTACTATAAACAAAGCCTTCTCCCTTGTAATTCAAGAAGAACGACAACGGTCAATCAGTACTTTGCCTTCAGTAGAATCTGTCACATTACTTG
CAGCAGCAGAGGCTTCTAAGAGAGTGAACGATCAGTCCAAGAAAAGAAACAACCAGCGTCCACCACCAGTGTGCTCTCATTGCAATTTAAAAGGGCATGCTCCTAAGGCT
GCTGATGCGAAGCCAAATTCAGCCTTCTTTGCCAGCCTGAGTAATGACCAGTACAACCAATTGATGACCATGTTGCAGACTCATGTTGCAGCTACAAAACCAGAATCTGG
TGTCAAGGTTGAGACAAATCACGTCGCAGGTACTTGTTCTTCCTCGACTAGTGATAATTTCTGTTCAAAGTCTTGGGTGATTGATTCTGGTGCTTCTAGTCATATCTGCC
ATGACCTGAATATGTTTGTCAATATACGTGATGCTACTGGAATTTCGGTTATTCTGCCTAATCAGACCCGTTTCCATGTTAAATTGGTTGGTGATGTTAAATTGTGTGAA
GACTTGTTGTTGTCCAATGTTTTGTACATCCCTGACTTCACCTATCACTTGCTATCGGACAAGTTACAATTGAGGAAGATTGTCAAGGTTGAATGCATAGATAGTCTTTA
CTTGCTTAGTACATCTACTGCTAAAGTGACTACTGCTGCTTTGACCTTGGATGATTGCTCTAGGTACACATCGATTGCTCTAGACAATGCTCCTGAACTAGCCTTTACTG
AATTTTTTGCCTCCACTGGAACCTTACACCAGTTTTCATGTGTTGCCACACCACAACAAAATTCAGTCGTCGAACGAAAGCACCAACACCTTCTTAACGTGGCTAGAGCC
TTGATGTTCCAATCTAAAGTTCCCCTATCCTTTTGGGGTGATTGTGTTCTAACAGCCACTTTTTTGATCAATCGTACGCCCATGACATTGTTGGCAAACAAAACCCCTTT
CGTTGTTCTTTATGGAAAGGAATTTGATTACTCTCTGCTACGAGTCTTTGGTTGTTTGGCGTATGCCTCAACTTTATCCATACATCGAAAGAAATTTGATCCTAGAGCGT
GTCCATGCGTCTTCATTGGTTATCCAACAGGGATAAAAGGCTATAAACTATACGATATGACAAAGGGGCAATTTATCATCTCGAGGGATGTCCTTTTCTTTGAGGACTTG
TTCCCTTTTCATTCTGTTTCACAGGCCAACAGTTCATTCTCACCAGATGTTTTGAATCAGTTTGTTCTACCTCACCCCTTGAGTGATCTACCTGATTCAAACAAACTGCC
TTCCGCTGCCAGCACGTCTACAACAGTTGCTCAGTTTGTTCCCTATGGAGATGCTGAGATTGTTCATTCTCCCATAGATAGTAACACTAACCTGGGAGATCATATCACTG
ATTTGGCCATTGATATTTCTACTGCTCCTACTTCTACTACTTGTCTTGAGGCTGATGGTACGATTGTTGATACAACCCCTACACATTTTGGCCCACGAAGGACTGCTAGA
AACCGCCATCCGCCCACCTACTTGAAAGATTTTCATTGCAACTTATTGGCTGGTTCTAATCCTTGTCTTTCAAAGCCTTTTTCAATTGACAAGTATATCTCCTACACAAC
CTTTTCTCCTACTCATAGACATTTTCTTTTAAATGTTTCTACCGAGTATGAGCCCACTTATTACCATCAGGCCGTGAAATTCCCTCATTGGCGGCAAGCCATGGATGATG
AGTTAAAGGCTATGGAACGAACTAAAACGTGGACTATTGTTTCTATTCCTGTTGGACAACACATTATTGGTTGTAAATGGGTTTATAGAGTTAAGTATAAACCTGATGGC
ACCGTTGATCGTCATAAAACCCGATTAGTGGCAAAAGGCTTCAACCAACAGGAAGGAATTGATTTCTTAGACACGTTTTCACCGGTCGCCAAAATTGTCACAGTGAAGGT
TCTCTTAACTTTGGCAACATCTTACAGCTGGCCTTTGGCTCAGATGGACGTGAATAACGCGTTTCTGAACGGTGATCTCTTTGAGGAAGTTCATATGGCTCTTCCATTGG
GATATAAGTCATCCTTTATTCCTCCTAAGGGGGAGAAAATTGCTTGCCGATTAAATAAATCCATATATGACTTAAAACAGGCTTCTAGGCAATGGTTCACGAAGTTCTCA
TCTGCCTTGATTTCACATGGGTTCGCTCAATCTAAATCAGATTATTCCTTGTTCACTCGTGGGCATGGCTGCTCTTTTATTGCTTTACTCGTCTATGTTGACGACATATT
ATTGACTGGACCTTCGGTTGATGAAGTTACGTCTGTAAAGGCTTTACTCAAAGCCCATTTCTTGTTGAAAGATTTAGGAAATGTGAGGTATTTCCTTGGCCTTGAGCTTT
CACGGTCTAAGCAGGGTCTTTTCTTTTCTCAAAGAAAATACTGCCTCCAAATCCTTGAGGATGTTGGTCTCTTGGATGCAAAGCCTGTTGCTACCCCTATGGACCCCAAC
ATCAAACTCTCAAAAGAAACTGGCAAAGCCTTAGCTGTTGATGAGGCTACTTCTTATCGGAGGCTTATAGGACGTTTGTTGTATATGCAAATATCACGTCCTGACATTGT
TTTTTCTGTACATTGTCTAAGCCAATTTCTTCAAAAACCCACTTCAACACACCTTCAAGCAGCTCATCATCTTCTTCGGTATTTGAAGAAGTCACCTGGGCAGGGAATAC
TTTTGAAACCAGTCTCTCATTTTCAGCTTAAGGCTTTTGTAGATGCTGACTGGGAATATCGAGCATTAACAACAGTTACCAGTGAACTTCTATGGATTACTCAGCTTCTT
AAAGATTTGCAAGTTGTTTGTTCCTCACCGGCTGTAACATACTGCGATAACCAAGCTGCAATTTCCATTGCTTCCAATCCAACTTTCCACGAACGGACCAAGCATATTGA
GATCGATTGCCACTTCGTACGTGACAAAATTGTTGAAGGCTTCCTGAAATTACTTCCGATACGTTCACAATCACAGCTAGCTGACATGTTTACTAAAGCTCTTCCATCCT
CAACGTTATCTTTTTTCATGGGCAAGTTGGTTGGACCAAATAATATGCCATGTCAGTTGATGGTTGGTTTCTTTGCATATGCAAAATCACTTGATATAAATGTGGACAAG
GGAGAGTTAGAAGATGCTCAATGGCACACTAGAGAAGATGTGAAGAACGCTTTGACGTTTGCCGAGTACAAGAAGGCTCAAAGAACTGCAGCAGCCAAGGTGGAGCAAAT
GTGTAAAGGAGTTGAGAGACAACAGAGCTTATCTTCGGATTTCAACGTCGAAAGCGGCGAACTCGCTCCAATGTTTGTTCCGGGGCCCTTCGCAATCGCTCATCATCTCA
TCTCTTCATGGGTCTATAATGAAGGATCAGAATTGAACAAACCTAGAAGTTCTTTGTCCAGCTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGATCAGTTCCAATTTCCCCACTCATTTCCACAGCTCCTCTGCATTACATCCCTGAAAAATCCTTCAATTCCCGCAATCCCTTCAATTCAACAGCCCCATCAATCTC
TAATTCCACCTCAATTCGTCCAACAATGTCCATTAGCTTACAGTCTCACGCCTTCGCCGGCAACCCCATCAAGTTGAAAACCCCAAAACCAGAAGACCCGTTTTCCCCTT
CTTCAGCTCTCGAATCCCTCAATTCCCAACTTTTGGACAACACCCACTTGGCTTCTTCAATCAATTTCAACGTCTTGCCCTTCAGGAACGGCAGGCCCTTGGCCCGTTCG
AGTGCGCGGCCTAATGATGCCGCCTCGAGCTGGCATCTGGGTTGGATTGGCTTGGCTGATTTCAAGGCTCTGTTTGCCAATTCGGCGGTTGAATTGGCTGGGGATTCGTT
CATCTATTTGGGTTCGTCGGATGAAGAGAATGCTGTTTATTGGGCTATTGATGTTTCGAGTGAGGATGGTTTGGTTTCTGAATTTGGGAGCAAGCTACTCTGCTTTGTGG
AGGTGAGAACTCTCATGGTCGCTGCTGATTGGTCCGATGCTCGAGCTATGGGAGAATTGGCTATTGCTGGTCATGCTAGAGCATTGCTAGAATGGCACAATGTGTCAAAA
TTTTGTGGACACTGTGGAGGAAAAACAGTACCCATGGAAGCAGGGAGACGGAAGCAATGTTCTAATCCCTCGTGCAAGAAGAGGGTTTATCCTCGTGTTGATCCGGTTGT
CATTATGCTAGTTATTGATAAAGAAAATGATCGTGCACTCTTGAGTAAACAATCGAGATTTGTTCCACGAATGTGGAGTTGTCTAGCTGGTTTCATAGAGGCTGGAGAAA
GCTTAGAAGAGGCAGTAAGAAGAGAAACATGGGAGGAGACGGGCATTGAAGTAGGGGAAGTTGTCTACCATAGTTCTCAGCCTTGGCCTGGAATTTTACCTTTTCCTTCT
TCTGCGATGGCTGGAACTGGTAGCTCTTCTACGTCTACTTCAAATCCTGCAATCGAATCTCCAGACCTAGATGTGCAGCTGAATCCTTTTCATCTGCATCACTCCGTCAC
TTCCACTGCCACGCTAGTTACTCAACAGCTTGTTGGTGCTAGCAATTATATATCATGGAGCAAGGCGATGCGTATGGCGCTTTCTGTTAGCGGCCCTCACATCTATCAGG
TACGTAAAGAACTCGCCAATACACTACAAGGTACTCTATCCATTGAGTCTTACTATGTGAAGGTGAAAACTCTCTGGCAAGAATTAATCGAATACCGACTTGTTGATGAT
TGCACTTGTGGCGGATCAAAATCGATTATAACCTTCTTAAACTCTGAATTCATCATGATCTTTTTAATGGGACTGAACGAGTCGTCTGCCAATGTTAGAACTCGGATTCT
TCTTATGAATCCTCTGCCTACTATAAACAAAGCCTTCTCCCTTGTAATTCAAGAAGAACGACAACGGTCAATCAGTACTTTGCCTTCAGTAGAATCTGTCACATTACTTG
CAGCAGCAGAGGCTTCTAAGAGAGTGAACGATCAGTCCAAGAAAAGAAACAACCAGCGTCCACCACCAGTGTGCTCTCATTGCAATTTAAAAGGGCATGCTCCTAAGGCT
GCTGATGCGAAGCCAAATTCAGCCTTCTTTGCCAGCCTGAGTAATGACCAGTACAACCAATTGATGACCATGTTGCAGACTCATGTTGCAGCTACAAAACCAGAATCTGG
TGTCAAGGTTGAGACAAATCACGTCGCAGGTACTTGTTCTTCCTCGACTAGTGATAATTTCTGTTCAAAGTCTTGGGTGATTGATTCTGGTGCTTCTAGTCATATCTGCC
ATGACCTGAATATGTTTGTCAATATACGTGATGCTACTGGAATTTCGGTTATTCTGCCTAATCAGACCCGTTTCCATGTTAAATTGGTTGGTGATGTTAAATTGTGTGAA
GACTTGTTGTTGTCCAATGTTTTGTACATCCCTGACTTCACCTATCACTTGCTATCGGACAAGTTACAATTGAGGAAGATTGTCAAGGTTGAATGCATAGATAGTCTTTA
CTTGCTTAGTACATCTACTGCTAAAGTGACTACTGCTGCTTTGACCTTGGATGATTGCTCTAGGTACACATCGATTGCTCTAGACAATGCTCCTGAACTAGCCTTTACTG
AATTTTTTGCCTCCACTGGAACCTTACACCAGTTTTCATGTGTTGCCACACCACAACAAAATTCAGTCGTCGAACGAAAGCACCAACACCTTCTTAACGTGGCTAGAGCC
TTGATGTTCCAATCTAAAGTTCCCCTATCCTTTTGGGGTGATTGTGTTCTAACAGCCACTTTTTTGATCAATCGTACGCCCATGACATTGTTGGCAAACAAAACCCCTTT
CGTTGTTCTTTATGGAAAGGAATTTGATTACTCTCTGCTACGAGTCTTTGGTTGTTTGGCGTATGCCTCAACTTTATCCATACATCGAAAGAAATTTGATCCTAGAGCGT
GTCCATGCGTCTTCATTGGTTATCCAACAGGGATAAAAGGCTATAAACTATACGATATGACAAAGGGGCAATTTATCATCTCGAGGGATGTCCTTTTCTTTGAGGACTTG
TTCCCTTTTCATTCTGTTTCACAGGCCAACAGTTCATTCTCACCAGATGTTTTGAATCAGTTTGTTCTACCTCACCCCTTGAGTGATCTACCTGATTCAAACAAACTGCC
TTCCGCTGCCAGCACGTCTACAACAGTTGCTCAGTTTGTTCCCTATGGAGATGCTGAGATTGTTCATTCTCCCATAGATAGTAACACTAACCTGGGAGATCATATCACTG
ATTTGGCCATTGATATTTCTACTGCTCCTACTTCTACTACTTGTCTTGAGGCTGATGGTACGATTGTTGATACAACCCCTACACATTTTGGCCCACGAAGGACTGCTAGA
AACCGCCATCCGCCCACCTACTTGAAAGATTTTCATTGCAACTTATTGGCTGGTTCTAATCCTTGTCTTTCAAAGCCTTTTTCAATTGACAAGTATATCTCCTACACAAC
CTTTTCTCCTACTCATAGACATTTTCTTTTAAATGTTTCTACCGAGTATGAGCCCACTTATTACCATCAGGCCGTGAAATTCCCTCATTGGCGGCAAGCCATGGATGATG
AGTTAAAGGCTATGGAACGAACTAAAACGTGGACTATTGTTTCTATTCCTGTTGGACAACACATTATTGGTTGTAAATGGGTTTATAGAGTTAAGTATAAACCTGATGGC
ACCGTTGATCGTCATAAAACCCGATTAGTGGCAAAAGGCTTCAACCAACAGGAAGGAATTGATTTCTTAGACACGTTTTCACCGGTCGCCAAAATTGTCACAGTGAAGGT
TCTCTTAACTTTGGCAACATCTTACAGCTGGCCTTTGGCTCAGATGGACGTGAATAACGCGTTTCTGAACGGTGATCTCTTTGAGGAAGTTCATATGGCTCTTCCATTGG
GATATAAGTCATCCTTTATTCCTCCTAAGGGGGAGAAAATTGCTTGCCGATTAAATAAATCCATATATGACTTAAAACAGGCTTCTAGGCAATGGTTCACGAAGTTCTCA
TCTGCCTTGATTTCACATGGGTTCGCTCAATCTAAATCAGATTATTCCTTGTTCACTCGTGGGCATGGCTGCTCTTTTATTGCTTTACTCGTCTATGTTGACGACATATT
ATTGACTGGACCTTCGGTTGATGAAGTTACGTCTGTAAAGGCTTTACTCAAAGCCCATTTCTTGTTGAAAGATTTAGGAAATGTGAGGTATTTCCTTGGCCTTGAGCTTT
CACGGTCTAAGCAGGGTCTTTTCTTTTCTCAAAGAAAATACTGCCTCCAAATCCTTGAGGATGTTGGTCTCTTGGATGCAAAGCCTGTTGCTACCCCTATGGACCCCAAC
ATCAAACTCTCAAAAGAAACTGGCAAAGCCTTAGCTGTTGATGAGGCTACTTCTTATCGGAGGCTTATAGGACGTTTGTTGTATATGCAAATATCACGTCCTGACATTGT
TTTTTCTGTACATTGTCTAAGCCAATTTCTTCAAAAACCCACTTCAACACACCTTCAAGCAGCTCATCATCTTCTTCGGTATTTGAAGAAGTCACCTGGGCAGGGAATAC
TTTTGAAACCAGTCTCTCATTTTCAGCTTAAGGCTTTTGTAGATGCTGACTGGGAATATCGAGCATTAACAACAGTTACCAGTGAACTTCTATGGATTACTCAGCTTCTT
AAAGATTTGCAAGTTGTTTGTTCCTCACCGGCTGTAACATACTGCGATAACCAAGCTGCAATTTCCATTGCTTCCAATCCAACTTTCCACGAACGGACCAAGCATATTGA
GATCGATTGCCACTTCGTACGTGACAAAATTGTTGAAGGCTTCCTGAAATTACTTCCGATACGTTCACAATCACAGCTAGCTGACATGTTTACTAAAGCTCTTCCATCCT
CAACGTTATCTTTTTTCATGGGCAAGTTGGTTGGACCAAATAATATGCCATGTCAGTTGATGGTTGGTTTCTTTGCATATGCAAAATCACTTGATATAAATGTGGACAAG
GGAGAGTTAGAAGATGCTCAATGGCACACTAGAGAAGATGTGAAGAACGCTTTGACGTTTGCCGAGTACAAGAAGGCTCAAAGAACTGCAGCAGCCAAGGTGGAGCAAAT
GTGTAAAGGAGTTGAGAGACAACAGAGCTTATCTTCGGATTTCAACGTCGAAAGCGGCGAACTCGCTCCAATGTTTGTTCCGGGGCCCTTCGCAATCGCTCATCATCTCA
TCTCTTCATGGGTCTATAATGAAGGATCAGAATTGAACAAACCTAGAAGTTCTTTGTCCAGCTTGTAG
Protein sequenceShow/hide protein sequence
MRSVPISPLISTAPLHYIPEKSFNSRNPFNSTAPSISNSTSIRPTMSISLQSHAFAGNPIKLKTPKPEDPFSPSSALESLNSQLLDNTHLASSINFNVLPFRNGRPLARS
SARPNDAASSWHLGWIGLADFKALFANSAVELAGDSFIYLGSSDEENAVYWAIDVSSEDGLVSEFGSKLLCFVEVRTLMVAADWSDARAMGELAIAGHARALLEWHNVSK
FCGHCGGKTVPMEAGRRKQCSNPSCKKRVYPRVDPVVIMLVIDKENDRALLSKQSRFVPRMWSCLAGFIEAGESLEEAVRRETWEETGIEVGEVVYHSSQPWPGILPFPS
SAMAGTGSSSTSTSNPAIESPDLDVQLNPFHLHHSVTSTATLVTQQLVGASNYISWSKAMRMALSVSGPHIYQVRKELANTLQGTLSIESYYVKVKTLWQELIEYRLVDD
CTCGGSKSIITFLNSEFIMIFLMGLNESSANVRTRILLMNPLPTINKAFSLVIQEERQRSISTLPSVESVTLLAAAEASKRVNDQSKKRNNQRPPPVCSHCNLKGHAPKA
ADAKPNSAFFASLSNDQYNQLMTMLQTHVAATKPESGVKVETNHVAGTCSSSTSDNFCSKSWVIDSGASSHICHDLNMFVNIRDATGISVILPNQTRFHVKLVGDVKLCE
DLLLSNVLYIPDFTYHLLSDKLQLRKIVKVECIDSLYLLSTSTAKVTTAALTLDDCSRYTSIALDNAPELAFTEFFASTGTLHQFSCVATPQQNSVVERKHQHLLNVARA
LMFQSKVPLSFWGDCVLTATFLINRTPMTLLANKTPFVVLYGKEFDYSLLRVFGCLAYASTLSIHRKKFDPRACPCVFIGYPTGIKGYKLYDMTKGQFIISRDVLFFEDL
FPFHSVSQANSSFSPDVLNQFVLPHPLSDLPDSNKLPSAASTSTTVAQFVPYGDAEIVHSPIDSNTNLGDHITDLAIDISTAPTSTTCLEADGTIVDTTPTHFGPRRTAR
NRHPPTYLKDFHCNLLAGSNPCLSKPFSIDKYISYTTFSPTHRHFLLNVSTEYEPTYYHQAVKFPHWRQAMDDELKAMERTKTWTIVSIPVGQHIIGCKWVYRVKYKPDG
TVDRHKTRLVAKGFNQQEGIDFLDTFSPVAKIVTVKVLLTLATSYSWPLAQMDVNNAFLNGDLFEEVHMALPLGYKSSFIPPKGEKIACRLNKSIYDLKQASRQWFTKFS
SALISHGFAQSKSDYSLFTRGHGCSFIALLVYVDDILLTGPSVDEVTSVKALLKAHFLLKDLGNVRYFLGLELSRSKQGLFFSQRKYCLQILEDVGLLDAKPVATPMDPN
IKLSKETGKALAVDEATSYRRLIGRLLYMQISRPDIVFSVHCLSQFLQKPTSTHLQAAHHLLRYLKKSPGQGILLKPVSHFQLKAFVDADWEYRALTTVTSELLWITQLL
KDLQVVCSSPAVTYCDNQAAISIASNPTFHERTKHIEIDCHFVRDKIVEGFLKLLPIRSQSQLADMFTKALPSSTLSFFMGKLVGPNNMPCQLMVGFFAYAKSLDINVDK
GELEDAQWHTREDVKNALTFAEYKKAQRTAAAKVEQMCKGVERQQSLSSDFNVESGELAPMFVPGPFAIAHHLISSWVYNEGSELNKPRSSLSSL