; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017518 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017518
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:4750767..4758518
RNA-Seq ExpressionLag0017518
SyntenyLag0017518
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterraneum]1.5e-26441.57Show/hide
Query:  VPTNHSSATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLND-NPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEV
        V T  +S  E  LTIQSFHQCSSLISIKL+T+N+LLW+SQ++PL+RS+G+EHH+  D + PD +I D  GT   NP  + W+ NDGLLTSWLLG + EE 
Subjt:  VPTNHSSATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLND-NPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEV

Query:  LAMIEGLETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTF
        ++MI G +TA  +W SL E LL  T++ E  + +SL+ L K                    A+ KP+ DV KVF +++GLG KYK+FR A+LSK PYP+F
Subjt:  LAMIEGLETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTF

Query:  NQFVLALKAHEQMNQFELEEENITLPNANQAYYSQRGRGRG-RGRHFSSKGRGFQPTRQ--ATSPQTEAHNISLPRLQSPSKHTTGNQIL--CQICGKAN
        NQF+++L+  EQ+    L EE   + + NQA++ QRGRGR  RG   + +GRG   + Q    +    +++ SLP+  +  + +   Q    CQICG+ N
Subjt:  NQFVLALKAHEQMNQFELEEENITLPNANQAYYSQRGRGRG-RGRHFSSKGRGFQPTRQ--ATSPQTEAHNISLPRLQSPSKHTTGNQIL--CQICGKAN

Query:  HIALECWKRFDHSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVP
        HIA  C+ R++++ ++E   +ALAA+ + EE+DP  Y DSGAT H+    G + SLK Y G + ++VGNG+ L I+H G+  LKT               
Subjt:  HIALECWKRFDHSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVP

Query:  RIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKN
                                       +  Q+LA G K+  LYAL+  ++               AL+A                           
Subjt:  RIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKN

Query:  EVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKSYINQWESPNQGDTKTP----HLVNNLPMENNIDILHSNCENDSCWSHGSNNQ
        E     WH RLGH N   +  L N  +I VS WK               I  ++S   G+  +     HL + + + N + IL +  E       G N  
Subjt:  EVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKSYINQWESPNQGDTKTP----HLVNNLPMENNIDILHSNCENDSCWSHGSNNQ

Query:  SATTLTSHIEDQDGHGDQNTDAI---TSTIANQ------LSNQRLTELCQDSSPIGSPTNSE--PTSTRPTTNSSTLPDISEYLFVDLPFDAATSPSSLR
         +    S +       D +T  +      + N+        N  +T    DSS    P   E      +  T      D +   ++D  F  AT P +L 
Subjt:  SATTLTSHIEDQDGHGDQNTDAI---TSTIANQ------LSNQRLTELCQDSSPIGSPTNSE--PTSTRPTTNSSTLPDISEYLFVDLPFDAATSPSSLR

Query:  PNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPM-------------------------------TTRL----KAKQLANQQMKTFLASE-------
        P+  + S  Q +    +S   + +  +    + S P+                               TT L      +   +++   +L  +       
Subjt:  PNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPM-------------------------------TTRL----KAKQLANQQMKTFLASE-------

Query:  --NGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSP
          +   + L+   E EP +++ A+    W+ AM++E+ AL  N TW LV RP + N++GSKW+FRTK  EDG++DR+KARLVA+GY+Q+ G+D+GETFSP
Subjt:  --NGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSP

Query:  VIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQ
        VIK  TIR++L+LAV + WPLKQLDVKNAFLHG L E VYM QPPGF HP+  N+VC+L KSLYGLKQAPRAWF+KLS+ L+ +GF+CSKADPSLFIHR 
Subjt:  VIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQ

Query:  NTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVD
        +T  TL+LVYVDDIILTGN  +FI++LV  L  +FALKDLG LHYFLG+E+++   G  I+QTKY  D+LK+A ML  S I TP+A+   + P D   VD
Subjt:  NTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVD

Query:  STQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISW
        +T+Y+ + GSLQYLT TRPD+T++VN VCQH Q P  ++L+ VKRILRY+K T+ HG+ +L  SSLNL AFCDADW GCP TRRSTTGFC++LGS+ ISW
Subjt:  STQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISW

Query:  SSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADI
        +SKKQPT+SRSS EAEY+A+A T AE+TW+ YLL D+G+ L + P ++CDN SA+ MS NPVFHARTKHI +DYHF+REKV  G L  RY+ + +Q+AD+
Subjt:  SSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADI

Query:  LTKPLSKEMHRELTSKLGVRANPTLSLR
         TK L K+       KLGV      SL+
Subjt:  LTKPLSKEMHRELTSKLGVRANPTLSLR

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.2e-23439.58Show/hide
Query:  LTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQV
        L+IQ+FHQCSSL+SIKLN +N LLWRSQV+PLVRS+G+ HHL  +     + +  E   T +    TW  NDGLLTSWLLG+++EEV+ +++G ETA  V
Subjt:  LTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQV

Query:  WKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM
        W SL E LL +TKE E+ +++ L  +KK                    A++KP+ D+ KVF LA+GLG KY DFR AMLSK PYP++NQFVLAL+ HEQM
Subjt:  WKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM

Query:  NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTE-AHNISLPRLQSPSKHTTGNQILCQICGKANHIALECWKRFDHS
           E  EEN    N  QAY++QRGRGR +G  F S+GRGF P       ATS Q +  H+ +     S + H   N+   +I      IA       + +
Subjt:  NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTE-AHNISLPRLQSPSKHTTGNQILCQICGKANHIALECWKRFDHS

Query:  YQMEEIPKALAAMAL-TEENDPALYVDSGATAHIVKNSGKISS------------LKTYQGKEKLYV-------------GNGEGLNISHIGQGYLKTQN
        ++  +IP+ALAAMAL  EE DP  YVDSGATAHI  N  + SS            L     K  LY               +     + H   G+ +T++
Subjt:  YQMEEIPKALAAMAL-TEENDPALYVDSGATAHIVKNSGKISS------------LKTYQGKEKLYV-------------GNGEGLNISHIGQGYLKTQN

Query:  NNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQV------LAMGTKRHGL-------YALKQQEVYTALAG-------FTNSAIG
           L  K  + V    K+            C + F     +  N   ++       A      G          LK  +++ +  G       F N  + 
Subjt:  NNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQV------LAMGTKRHGL-------YALKQQEVYTALAG-------FTNSAIG

Query:  STALSAFTNSDITLNTKSVASDFNKNSATN-----FKNEVSLSTWHKR-------LGHLNFASISHLSNTNMI--QVSAWKKLHVV-CSVCQMSKSYINQ
           +    +   T     VA   +++         F  ++ LS W          +  L    +   S   M+  Q   ++ L +  C      + Y   
Subjt:  STALSAFTNSDITLNTKSVASDFNKNSATN-----FKNEVSLSTWHKR-------LGHLNFASISHLSNTNMI--QVSAWKKLHVV-CSVCQMSKSYINQ

Query:  WESPN-------------------QGDTK----TPHLV---------NNLPMENNIDI---------LHSNCEND------SCWSHGSNNQSA-------
          SP                       TK    + H++         N+L  EN++ I           ++CE          W   + NQS+       
Subjt:  WESPN-------------------QGDTK----TPHLV---------NNLPMENNIDI---------LHSNCEND------SCWSHGSNNQSA-------

Query:  ---TTLTSHIEDQDG------HGDQNTDA--ITST-----IANQLSNQR----------LTELCQDSSPIGSPTNSEPTSTRPT------TNS------S
           TT   +    +G      + +QNT+A  + ST     +A ++S              TE   D +   S  NS   +T  T      TNS      S
Subjt:  ---TTLTSHIEDQDG------HGDQNTDA--ITST-----IANQLSNQR----------LTELCQDSSPIGSPTNSEPTSTRPT------TNS------S

Query:  TLPDISEYLFVDLPF---------------DAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMT-TRLKAKQLANQQMKTFLASEN
          PDIS++L VDL F               + +TS S+    + S     V+  PP          + T    +H +T ++LK       QM TF A+ +
Subjt:  TLPDISEYLFVDLPF---------------DAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMT-TRLKAKQLANQQMKTFLASEN

Query:  GSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIK
                    EP ++R A+    W +AM+EE++ALI N+TW+LVPRP  TNIVGSKW+F+TK KEDGT+DRYKARLVA+G+SQ+ G+D+GETFSPVIK
Subjt:  GSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIK

Query:  PTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTI
         TTIR++ +LAV+  W ++QLDVKNAFLHG LKEEV+M QPPGF + +  N+VC+L +SLYGLKQAPRAWFD+LS+                        
Subjt:  PTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTI

Query:  LTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQ
                    + GN+ N I++L+++L +EF+LKDLG+LHYFLG+EV+ +PNG  ++QTKY RD+L+  KM+E +PI TPMA        DE+ +D TQ
Subjt:  LTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQ

Query:  YKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSK
        Y+ +VGSLQYLT TRPDI ++VNK CQH Q P   +LR VKRILRYLK T+ HGI F K SSL L  FCDADW GC  TRRST+G+C+FLG+N ISWSSK
Subjt:  YKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSK

Query:  KQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTK
        +QPT+SRSS EAEYR++A++ AEITW+T+LLRDIG+ L + P+L CDN+SAL M++NPVFHAR+KHIELDYHFVREKVA G L+TR++PS  Q+ADI TK
Subjt:  KQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTK

Query:  PLSKEMHRELTSKLGVRANPTLSLR
         L K   +    KLGV   P  SLR
Subjt:  PLSKEMHRELTSKLGVRANPTLSLR

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.7e-30043.72Show/hide
Query:  LTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQV
        L+IQ+FHQCSSL+SIKLN +N LLWRSQV+PLVRS+G+ HHL  +     + +  E   T +    TW  NDGLLTSWLLG+++EEV+ +++G ETA  V
Subjt:  LTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQV

Query:  WKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM
        W SL E LL +TKE E+ +++ L  +KK                    A++KP+ D+ KVF LA+GLG KY DFR AMLSK PYP++NQFVLAL+ HEQM
Subjt:  WKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM

Query:  NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTE-----------AHNISLPRLQSPS---KHTTGNQ----------
           E  EEN    N  QAY++QRGRGR RG  F S+GRGF P       ATS Q +           + N   P  Q  +   +++  NQ          
Subjt:  NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTE-----------AHNISLPRLQSPS---KHTTGNQ----------

Query:  ---ILCQICGKANHIALECWKRFDHSYQMEEIPKALAAMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQ
           ++CQIC KANH AL+CW RFD+ YQ+EEIP+ALAAMAL  EE DP  YVDSGATAHI  + GK+S +  Y+G + ++VGNGE L ISHIG+  LKT+
Subjt:  ---ILCQICGKANHIALECWKRFDHSYQMEEIPKALAAMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQ

Query:  NNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSV
        + + L LK +LVVP IKKNL+SI +LTSDN CSI F++ GFV+K++  QVLA GTK+ GLYAL++                   + A T   +T ++K+ 
Subjt:  NNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSV

Query:  ASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS-------------------YINQW-ESPN---QG----
                        S   WH+R+GH    SI  L +   I+VS+W KL  VC  CQ+ KS                   + + W  +PN   QG    
Subjt:  ASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS-------------------YINQW-ESPN---QG----

Query:  ----DTKTPH---------------------LVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIE-----DQDGHGDQNTDAITSTIANQLSN
            D  T +                     LV N  +E  I I  S+   +       N+ S   +   +      +Q+G  ++    I       L N
Subjt:  ----DTKTPH---------------------LVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIE-----DQDGHGDQNTDAITSTIANQLSN

Query:  QRL---------------------TELCQDS----------------------------------SPIGSPT----------------------------
         +L                     T L  +S                                  SP   P                             
Subjt:  QRL---------------------TELCQDS----------------------------------SPIGSPT----------------------------

Query:  -----------------NSEPTSTRPT--------------TNSSTLPDISEYLFVDLPFDAATS--PSSLRPNVTSLSLPQVEFEPPSSRKSELRTDER
                         N+  T+  PT              T + TL  +S  +   LP  +A    P + +  V  +S PQ                + 
Subjt:  -----------------NSEPTSTRPT--------------TNSSTLPDISEYLFVDLPFDAATS--PSSLRPNVTSLSLPQVEFEPPSSRKSELRTDER

Query:  TMTHSSHPMT-TRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKED
        T    +H +T ++LK       QM TF A+ +            EP ++R  +    W +AM+EE++ALI N+TW+LVPRP  TNIVGSKW+F+TK KED
Subjt:  TMTHSSHPMT-TRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKED

Query:  GTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPR
        GT+DRYKARLVA+G+SQ+ G+D+GETFSPVIK TTIR++ +LAV+  W ++QLDVKNAFLHG LKEEV+M QPPGF + +  N+VC+L +SLYGLKQAPR
Subjt:  GTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPR

Query:  AWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILK
        AWFD+LS  LLH+GF C KAD SLFI R+   + L+L+YVDDII+TGN+ N I++L+++L +EF+LKDLG+LHYFLG+EV+ +PNG  ++QTKY RD+L+
Subjt:  AWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILK

Query:  KAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAF
          KM+E +PI TPMA        DE+ +D TQY+ +VGSLQYLT TRPDI ++VNK CQH Q P   +LR VKRILRYLK T+ HGI F K SSL L  F
Subjt:  KAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAF

Query:  CDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIE
        CDADW GC  TRRST+G+C+FLG+N ISWSSK+QPT+SRSS EAEYR++A++ AEITW+T+LLRDIG+ L + P+L CDN+SAL M +N VFHAR+KHIE
Subjt:  CDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIE

Query:  LDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR
        LDYHFVREKVA G L+TR++PS  Q+ADI TK L K   +    KLGV   P  SLR
Subjt:  LDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR

RVX04589.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.0e-23843.4Show/hide
Query:  ITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQMNQFELEEENI
        +TKE E+ +++ L  +KK                    A++KP+ D+ KVF LA+GLG KY DFR AMLSK PYP++NQFVLAL+ HEQM   E  EEN 
Subjt:  ITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQMNQFELEEENI

Query:  TLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQMEEIPKALA
           N  QAY++QRGRGR RG  F S+GRGF P       ATS Q + + I  P +Q  +  T+ +    +   +   IA       + +++  +IP+ALA
Subjt:  TLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQMEEIPKALA

Query:  AMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFN
        AMAL  EE DP  YVDSGATAHI                                                                  T+D +    F+
Subjt:  AMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFN

Query:  ADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLS
        + GFV K++  QVLA GTK+ GLYAL++                   + A T   +T ++K+                 S   WH+R+GH    SI  L 
Subjt:  ADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLS

Query:  NTNMIQVSAWKKLHVVCSVCQMSKSYINQWESPNQGDTKTPHLVNNLPMENNIDI---------LHSNCENDSCWSHGSNNQSATTLTSHIEDQDGHGDQ
        +   I+VS+W KL         + S+     + +    K  H        N   +          + N E     S G+    AT +++    ++ H   
Subjt:  NTNMIQVSAWKKLHVVCSVCQMSKSYINQWESPNQGDTKTPHLVNNLPMENNIDI---------LHSNCENDSCWSHGSNNQSATTLTSHIEDQDGHGDQ

Query:  NTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPFDAATSPS-----SLRPNVTSLSLPQVEFEPPSSRKSEL-
         T    + +A   S+         ++  G+ +  E T++      S  PDIS++L VDL F      +     SL+   TS S    + E P S K  + 
Subjt:  NTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPFDAATSPS-----SLRPNVTSLSLPQVEFEPPSSRKSEL-

Query:  ----RTDERTMTHSSHPMT-TRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKW
               + T    +H +T ++LK       QM TF A+ +            EP ++R  +    W +AM+EE++ALI N+TW+LVPRP  TNIVGSKW
Subjt:  ----RTDERTMTHSSHPMT-TRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKW

Query:  IFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKS
        +F+TK KEDGT+DRYKARLVA+G+SQ+ G+D+GETFSPVIK TTIR++ +LAV+  W ++QLDVKNAFLHG LKEEV+M QPPGF + +  N+VC+L +S
Subjt:  IFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKS

Query:  LYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQ
        LYGLKQAPRAWFD+LS  LLH+GF C KAD SLFI R+   + L+L+YVDDII+TGN+ N I++L+ +L +EF+LKDLG+LHYFLG+EV+ +PNG  ++Q
Subjt:  LYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQ

Query:  TKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLK
        TKY RD+L+  KM+E +PI TPMA        DE+ +D TQY+ +VGSLQYLT TRPDI ++VNK CQH Q P   +LR VKRILRYLK T+ HGI F K
Subjt:  TKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLK

Query:  NSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPV
         SSL L  FCDADW GC  TRRST+G+C+FLG+N ISWSSK+QPT+SRSS EAEYR++A++ AEITW+T+LLRDIG+ L + P+L CDN+SAL M++NPV
Subjt:  NSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPV

Query:  FHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR
        FHAR+KHIELDYHFVREKVA G L+TR++PS  Q+ADI TK L K   +    KLGV   P  SLR
Subjt:  FHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR

RWR75576.1 Zinc finger, CCCH-type [Cinnamomum micranthum f. kanehirae]3.2e-22736.1Show/hide
Query:  SLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDK----EGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEE
        + +S+KL  TN+LLW++Q++ L+ S  +   +  + P   K +      +  +  N  +  W  +D LL  W+ G +SEEVL ++ GLET+  VW++LE 
Subjt:  SLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDK----EGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEE

Query:  TLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQMNQFELE
        +    ++E E ++   L   +K                    A+ KPI D  KVF+L RGLG  Y+ F T+ML K P P++ + +  L+ HE M    + 
Subjt:  TLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQMNQFELE

Query:  EENITLPNANQAYYSQRGRGRGRGRHFSSKGR---GFQPTRQATSPQTEAHNISLPRL-QSPSKHTTGNQ---------ILCQICGKANHIALECWKRFD
         +    PN N A++ Q+  G   G+ F+ KG+    F  TR+  SP  +  N  L  L   P    T  Q          +CQIC K NH A+ CW RF+
Subjt:  EENITLPNANQAYYSQRGRGRGRGRHFSSKGR---GFQPTRQATSPQTEAHNISLPRL-QSPSKHTTGNQ---------ILCQICGKANHIALECWKRFD

Query:  HSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISK
         +YQ  +  +ALAA  L + +DP+ + D+GATAH+  + GK+ SL  YQG +K+ VGNG  L+I+HIG   +K    + L L NVLVVP IKKNL+S+S+
Subjt:  HSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISK

Query:  LTSDNDCSIIFNADGFVVKNK-QGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEVSLSTWHKR
        +TS+      F+ DGFVVK++  GQ++  G +  GLY+L+            N A+ ST   A             ASD                 WH+R
Subjt:  LTSDNDCSIIFNADGFVVKNK-QGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEVSLSTWHKR

Query:  LGHLNFASISHLSNTNMIQVSAWKKLHV-VCSVCQMSKS----YINQWESPNQG------DTKTPHLVNNLPMENNIDILHSNCENDSCW----------
        LGH     +  L   N+I +S  K  +V VC  CQM+K+    ++N     N        D   P  V +        I    C   + +          
Subjt:  LGHLNFASISHLSNTNMIQVSAWKKLHV-VCSVCQMSKS----YINQWESPNQG------DTKTPHLVNNLPMENNIDILHSNCENDSCW----------

Query:  ----------------------SHGSNNQSATTLTS--------------HIEDQDGHGDQNTDAITST--------------------IANQLSNQRLT
                              S G    S T   S              H   Q+G  ++    +T                       A  L N++L+
Subjt:  ----------------------SHGSNNQSATTLTS--------------HIEDQDGHGDQNTDAITST--------------------IANQLSNQRLT

Query:  ELCQDSSPIGSPTNSEP------------------------------------------------------TSTRPTTNSSTLP----------------
         +   +SP    T   P                                                      TS     +  T P                
Subjt:  ELCQDSSPIGSPTNSEP------------------------------------------------------TSTRPTTNSSTLP----------------

Query:  --DISEYL--------------FVDLP--FDAATSPSSLRPNVTSLSLPQVEF------------------------------------------EPPSS
            SE++                 LP   DA ++ SS  P+  S++   V F                                          + PS+
Subjt:  --DISEYL--------------FVDLP--FDAATSPSSLRPNVTSLSLPQVEF------------------------------------------EPPSS

Query:  RK------------------------------------------------------SELRTDERTMTHS---------SHPMTTRLKAKQLANQQMKTFL
         +                                                      S L  D   + HS         +HPM TR K+  +  +    + 
Subjt:  RK------------------------------------------------------SELRTDERTMTHS---------SHPMTTRLKAKQLANQQMKTFL

Query:  ASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFS
        +S+    ++L+ K   EP S + A+  + W  AM+EE+ AL  N TW LVPR SN N+VG KW+++TK + DG+L+R KARLVA+G++QVEGID+ ETFS
Subjt:  ASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFS

Query:  PVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHR
        PV+KP TIR+VL +A+++NW ++QLDVKNAFL+G L E V+M QPP FAHP + ++VC LKK+LYGL+QAPRAWFD+ S +LL +GF CS AD SLFI  
Subjt:  PVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHR

Query:  QNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENV
         +     +L+YVDDIILTGNN  F+T ++  L  EFA+KDLG +HYFLG++VQ  P G  + QTKY  ++L+KA M    PI TPM       P    N 
Subjt:  QNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENV

Query:  ---DSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSN
           D   Y+S+VG L YLT TRPDI+YSVN VCQ M  P   + + VKRILRY++ TI +GI  L NSSL LYAF DADW GCP+TRRSTTG+C +LGSN
Subjt:  ---DSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSN

Query:  IISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQ
         ISWSSK+QPT++RSSTEAEYRA+A+T AEITW+TY+LRDIG+ L + P L+CDN+SAL M++NPVFHARTKHIELDYHFVREKVALGSLVTR+VPS  Q
Subjt:  IISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQ

Query:  LADILTKPLSKEMHRELTSKLGVRANPTLSLR
        +ADILTKPLS+     L  KLGV+     SL+
Subjt:  LADILTKPLSKEMHRELTSKLGVRANPTLSLR

TrEMBL top hitse value%identityAlignment
A0A2N9EWB3 Integrase catalytic domain-containing protein7.7e-24336.26Show/hide
Query:  SATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGL
        S+T   L+  + H   +++S+KLN+TNYL+WR Q+ PL++S+ +  HL  + P   K V + G   +NP+Y  W   D LL SW+ G +SEE L  + G+
Subjt:  SATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGL

Query:  ETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLAL
         TAR+VW+ LE   L  TKE E+ +   L   KK                    A++KP+ D  K   L+  LG KY  F T MLSK P+PTFNQFV AL
Subjt:  ETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLAL

Query:  KAHE-QMNQFELEEENITLPNANQAYYSQR---GRGRGRGRH----FSSKGRGFQPTR------------------------QATSPQT-EAHNISLPRL
        + +  +    + +++  +  N N A+ + R   GRGRGRGRH    F S+GRGF P                             +PQ  +   +S    
Subjt:  KAHE-QMNQFELEEENITLPNANQAYYSQR---GRGRGRGRH----FSSKGRGFQPTR------------------------QATSPQT-EAHNISLPRL

Query:  QSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQM-EEIPKALAAMALTEENDPA-LYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISH
         S +   + N   CQICG+  H AL+CW R+D++Y+  E I +ALA   L++ +D    Y D+GAT+H+   SG + S   Y G + + VGNG  L ISH
Subjt:  QSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQM-EEIPKALAAMALTEENDPA-LYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISH

Query:  IGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNK-QGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTN
        +G   L +++ + L+L +VLVVP+++KNLIS+ KLT DN C     A  F +K+K  G+++A G K  GLYAL       ALA                 
Subjt:  IGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNK-QGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTN

Query:  SDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS----------------------------
                                +  +  WH+RLGH +   +  L++ N+I VS W K   VCS CQM KS                            
Subjt:  SDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS----------------------------

Query:  ---------YI---------------------------------NQWE------------------------------------SPNQ---GDTKTPH--
                 Y+                                 NQ+E                                    +P Q    + K  H  
Subjt:  ---------YI---------------------------------NQWE------------------------------------SPNQ---GDTKTPH--

Query:  --------------------------LVNNLPME------------------NNIDILHSNC--------------ENDSCWSHGSN-----------NQ
                                  L+N LP                    N++ +    C              ++  C   G +             
Subjt:  --------------------------LVNNLPME------------------NNIDILHSNC--------------ENDSCWSHGSN-----------NQ

Query:  SATTLTSHIEDQDG----------HGDQNTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPF--DAATSPSSL
            L+ H+   +G              +TD   ST A  ++        Q + P     +S P +T P +++S+L D        +P   + ++SP   
Subjt:  SATTLTSHIEDQDG----------HGDQNTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPF--DAATSPSSL

Query:  RPNVTSLSLPQV-EFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQL---------------------------------
         P  T+LS P       PS+  S +  ++ T   +  P+        +A     T  A  + S S L                                 
Subjt:  RPNVTSLSLPQV-EFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQL---------------------------------

Query:  -------------VNKTE---------------VEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARL
                     V +++                EP S + A+    W+QAM +EL AL  NKTW LVPR ++ NIVGS+W+F+TK K DG+++R+KARL
Subjt:  -------------VNKTE---------------VEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARL

Query:  VAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYL
        VA+GY+Q+EG+D+ ETFSPVIKPTTIRLVL+LA++  W L+QLDVKNAFLHG LKE VYM QPPGF+ P    +VC L K++YGLKQAPRAWFD+ SS+L
Subjt:  VAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYL

Query:  LHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPI
        L +GF CS AD SLF+ R ++   L+LVYVDDII+T ++ + ++ L+  L +EFA+KDLG L+YFLGV+V +   G  ++Q KY ++IL KA M +  PI
Subjt:  LHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPI

Query:  ATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPI
         TP+A  +    +    V++T Y+SIVG+LQYLTLTRPD+T++VN VCQ M +P   + + VKRILRYL+ T+ +GI  L +SSL LY F DADW GCP 
Subjt:  ATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPI

Query:  TRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKV
        TRRSTTG+C++LG+N ISW+SKKQ T+SRSS EAEYRAMA+  AE+TW+TYLLRD+G+  H +P L+CDN SAL M++NPVFHARTKHIELDYHFVREKV
Subjt:  TRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKV

Query:  ALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR
        A G+L TRYVPS  Q+AD+ TK +SK++     SKLGV   P  SLR
Subjt:  ALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR

A0A2N9FIQ8 Integrase catalytic domain-containing protein6.7e-24738.42Show/hide
Query:  SLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIV--DKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLE---
        + +SIKL   NYLLW++Q+  L+ S  +   +  D P  +K +  D   T  +NP+++ W  +D LL  W+ G +SEE+L +  GLET+  VW++LE   
Subjt:  SLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIV--DKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLE---

Query:  -----------ETLLAITKENEIHISDSLHNLKK------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQMNQFELEEE
                   E+ L++ ++ ++ ++D L   K+      A+ KPI D  KVF L +GLG +Y+ F T+ML K P P+++  +  L++HE +        
Subjt:  -----------ETLLAITKENEIHISDSLHNLKK------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQMNQFELEEE

Query:  NITLPNANQAYYSQR---GRGRGRGR----HFSSKGRGFQPTRQATS-PQT-EAHNI--------------SLPRLQSPSKHTTGNQILCQICGKANHIA
          T P+  +A+  QR   GRGRGRGR     FSS GRGF  T Q +S P T   HN                +   ++      G ++ CQICGK  H A
Subjt:  NITLPNANQAYYSQR---GRGRGRGR----HFSSKGRGFQPTRQATS-PQT-EAHNI--------------SLPRLQSPSKHTTGNQILCQICGKANHIA

Query:  LECWKRFDHSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIK
        L+CW RF+H++Q  +IP+ALAA+ L    +     D+GA+AHI  + G + ++  Y G +K+ VG+G  L+ISHIG   +K   +  + LKNVL+VP +K
Subjt:  LECWKRFDHSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIK

Query:  KNLISISKLTSDNDCSIIFNADGFVVKN-KQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEV
        KNLISIS+LT D  C + F++ GF++K+ K  ++LA+GTK+ GLY L   + + AL                                    +T F+   
Subjt:  KNLISISKLTSDNDCSIIFNADGFVVKN-KQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEV

Query:  SLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS-------------------------------------YIN--------QWESP-
        S + WHKRLGH     +  L+N  +I V  W + H VC  CQMSKS                                     Y++         W  P 
Subjt:  SLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS-------------------------------------YIN--------QWESP-

Query:  ------------------NQGDTK----------------------------------TPH---------------------------------------
                          NQ D K                                  TP                                        
Subjt:  ------------------NQGDTK----------------------------------TPH---------------------------------------

Query:  LVNNLPMEN------------------NIDILHSNC--------------------------------------------------EN-------DSCWS
        L+N LP  N                  ++ +    C                                                  EN       +S +S
Subjt:  LVNNLPMEN------------------NIDILHSNC--------------------------------------------------EN-------DSCWS

Query:  HGSNNQSATTLT------------SHIEDQ-DGHGDQNTDAITSTIANQLSNQRLTELCQDS--SPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPF
        +G      T++             S  +DQ   H     D  T   +  L N+ L  L  D   S    P  + P    P   SS L   ++ L  +   
Subjt:  HGSNNQSATTLT------------SHIEDQ-DGHGDQNTDAITSTIANQLSNQRLTELCQDS--SPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPF

Query:  DAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAME
         + TS  S     T+LS       PP        T  + M  ++H M TR K++        + +AS N   S        EP + + A+    W QAM 
Subjt:  DAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAME

Query:  EELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQ
        +EL AL  N+TW+LVPRP N N+VG +W+F+TK K +G++DR KARLVA+G+SQVEG+D+ ETFSPVIKP TIRLVL++A+++ W +KQLDVKNAFLHG 
Subjt:  EELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQ

Query:  LKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTE
        L+E+VYM QPPGF  P    +VC+LKK+LYGLKQAPRAWFD+ S YLL +GF+CS AD SLFI R +  + L+L+YVDD+I+TG   +F+   + SL  E
Subjt:  LKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTE

Query:  FALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQE
        FA+KDLG+LHYFLG+EV N+P+G  ++Q+KY  ++L KA+ML+   I+TPMA      P D    D T Y+SIVG L YLT+TRPD++YSVN VCQHMQ 
Subjt:  FALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQE

Query:  PKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLL
        P   + R VKRILRYL  T   G+  L  SSL LYAF DADW GCP+TRRSTTGFC FLGSN+ISWS+KKQ TI+RSSTEAEYR+MA+TTAE+TW+++LL
Subjt:  PKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLL

Query:  RDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANP
        RDIGV L   P L+CDNMSAL MSINPVFHARTKH+ELDYHFVREKVALGSL+TRYV +  QLADILTKPL K       +KLG++  P
Subjt:  RDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANP

A0A2N9FTN5 Integrase catalytic domain-containing protein7.7e-24336.26Show/hide
Query:  SATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGL
        S+T   L+  + H   +++S+KLN+TNYL+WR Q+ PL++S+ +  HL  + P   K V + G   +NP+Y  W   D LL SW+ G +SEE L  + G+
Subjt:  SATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGL

Query:  ETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLAL
         TAR+VW+ LE   L  TKE E+ +   L   KK                    A++KP+ D  K   L+  LG KY  F T MLSK P+PTFNQFV AL
Subjt:  ETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLAL

Query:  KAHE-QMNQFELEEENITLPNANQAYYSQR---GRGRGRGRH----FSSKGRGFQPTR------------------------QATSPQT-EAHNISLPRL
        + +  +    + +++  +  N N A+ + R   GRGRGRGRH    F S+GRGF P                             +PQ  +   +S    
Subjt:  KAHE-QMNQFELEEENITLPNANQAYYSQR---GRGRGRGRH----FSSKGRGFQPTR------------------------QATSPQT-EAHNISLPRL

Query:  QSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQM-EEIPKALAAMALTEENDPA-LYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISH
         S +   + N   CQICG+  H AL+CW R+D++Y+  E I +ALA   L++ +D    Y D+GAT+H+   SG + S   Y G + + VGNG  L ISH
Subjt:  QSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQM-EEIPKALAAMALTEENDPA-LYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISH

Query:  IGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNK-QGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTN
        +G   L +++ + L+L +VLVVP+++KNLIS+ KLT DN C     A  F +K+K  G+++A G K  GLYAL       ALA                 
Subjt:  IGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNK-QGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTN

Query:  SDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS----------------------------
                                +  +  WH+RLGH +   +  L++ N+I VS W K   VCS CQM KS                            
Subjt:  SDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS----------------------------

Query:  ---------YI---------------------------------NQWE------------------------------------SPNQ---GDTKTPH--
                 Y+                                 NQ+E                                    +P Q    + K  H  
Subjt:  ---------YI---------------------------------NQWE------------------------------------SPNQ---GDTKTPH--

Query:  --------------------------LVNNLPME------------------NNIDILHSNC--------------ENDSCWSHGSN-----------NQ
                                  L+N LP                    N++ +    C              ++  C   G +             
Subjt:  --------------------------LVNNLPME------------------NNIDILHSNC--------------ENDSCWSHGSN-----------NQ

Query:  SATTLTSHIEDQDG----------HGDQNTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPF--DAATSPSSL
            L+ H+   +G              +TD   ST A  ++        Q + P     +S P +T P +++S+L D        +P   + ++SP   
Subjt:  SATTLTSHIEDQDG----------HGDQNTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPF--DAATSPSSL

Query:  RPNVTSLSLPQV-EFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQL---------------------------------
         P  T+LS P       PS+  S +  ++ T   +  P+        +A     T  A  + S S L                                 
Subjt:  RPNVTSLSLPQV-EFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQL---------------------------------

Query:  -------------VNKTE---------------VEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARL
                     V +++                EP S + A+    W+QAM +EL AL  NKTW LVPR ++ NIVGS+W+F+TK K DG+++R+KARL
Subjt:  -------------VNKTE---------------VEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARL

Query:  VAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYL
        VA+GY+Q+EG+D+ ETFSPVIKPTTIRLVL+LA++  W L+QLDVKNAFLHG LKE VYM QPPGF+ P    +VC L K++YGLKQAPRAWFD+ SS+L
Subjt:  VAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYL

Query:  LHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPI
        L +GF CS AD SLF+ R ++   L+LVYVDDII+T ++ + ++ L+  L +EFA+KDLG L+YFLGV+V +   G  ++Q KY ++IL KA M +  PI
Subjt:  LHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPI

Query:  ATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPI
         TP+A  +    +    V++T Y+SIVG+LQYLTLTRPD+T++VN VCQ M +P   + + VKRILRYL+ T+ +GI  L +SSL LY F DADW GCP 
Subjt:  ATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPI

Query:  TRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKV
        TRRSTTG+C++LG+N ISW+SKKQ T+SRSS EAEYRAMA+  AE+TW+TYLLRD+G+  H +P L+CDN SAL M++NPVFHARTKHIELDYHFVREKV
Subjt:  TRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKV

Query:  ALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR
        A G+L TRYVPS  Q+AD+ TK +SK++     SKLGV   P  SLR
Subjt:  ALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR

A0A2Z6P7T0 Reverse transcriptase Ty1/copia-type domain-containing protein7.1e-26541.57Show/hide
Query:  VPTNHSSATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLND-NPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEV
        V T  +S  E  LTIQSFHQCSSLISIKL+T+N+LLW+SQ++PL+RS+G+EHH+  D + PD +I D  GT   NP  + W+ NDGLLTSWLLG + EE 
Subjt:  VPTNHSSATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLND-NPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEV

Query:  LAMIEGLETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTF
        ++MI G +TA  +W SL E LL  T++ E  + +SL+ L K                    A+ KP+ DV KVF +++GLG KYK+FR A+LSK PYP+F
Subjt:  LAMIEGLETARQVWKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTF

Query:  NQFVLALKAHEQMNQFELEEENITLPNANQAYYSQRGRGRG-RGRHFSSKGRGFQPTRQ--ATSPQTEAHNISLPRLQSPSKHTTGNQIL--CQICGKAN
        NQF+++L+  EQ+    L EE   + + NQA++ QRGRGR  RG   + +GRG   + Q    +    +++ SLP+  +  + +   Q    CQICG+ N
Subjt:  NQFVLALKAHEQMNQFELEEENITLPNANQAYYSQRGRGRG-RGRHFSSKGRGFQPTRQ--ATSPQTEAHNISLPRLQSPSKHTTGNQIL--CQICGKAN

Query:  HIALECWKRFDHSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVP
        HIA  C+ R++++ ++E   +ALAA+ + EE+DP  Y DSGAT H+    G + SLK Y G + ++VGNG+ L I+H G+  LKT               
Subjt:  HIALECWKRFDHSYQMEEIPKALAAMALTEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVP

Query:  RIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKN
                                       +  Q+LA G K+  LYAL+  ++               AL+A                           
Subjt:  RIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKN

Query:  EVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKSYINQWESPNQGDTKTP----HLVNNLPMENNIDILHSNCENDSCWSHGSNNQ
        E     WH RLGH N   +  L N  +I VS WK               I  ++S   G+  +     HL + + + N + IL +  E       G N  
Subjt:  EVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKSYINQWESPNQGDTKTP----HLVNNLPMENNIDILHSNCENDSCWSHGSNNQ

Query:  SATTLTSHIEDQDGHGDQNTDAI---TSTIANQ------LSNQRLTELCQDSSPIGSPTNSE--PTSTRPTTNSSTLPDISEYLFVDLPFDAATSPSSLR
         +    S +       D +T  +      + N+        N  +T    DSS    P   E      +  T      D +   ++D  F  AT P +L 
Subjt:  SATTLTSHIEDQDGHGDQNTDAI---TSTIANQ------LSNQRLTELCQDSSPIGSPTNSE--PTSTRPTTNSSTLPDISEYLFVDLPFDAATSPSSLR

Query:  PNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPM-------------------------------TTRL----KAKQLANQQMKTFLASE-------
        P+  + S  Q +    +S   + +  +    + S P+                               TT L      +   +++   +L  +       
Subjt:  PNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPM-------------------------------TTRL----KAKQLANQQMKTFLASE-------

Query:  --NGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSP
          +   + L+   E EP +++ A+    W+ AM++E+ AL  N TW LV RP + N++GSKW+FRTK  EDG++DR+KARLVA+GY+Q+ G+D+GETFSP
Subjt:  --NGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSP

Query:  VIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQ
        VIK  TIR++L+LAV + WPLKQLDVKNAFLHG L E VYM QPPGF HP+  N+VC+L KSLYGLKQAPRAWF+KLS+ L+ +GF+CSKADPSLFIHR 
Subjt:  VIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQ

Query:  NTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVD
        +T  TL+LVYVDDIILTGN  +FI++LV  L  +FALKDLG LHYFLG+E+++   G  I+QTKY  D+LK+A ML  S I TP+A+   + P D   VD
Subjt:  NTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVD

Query:  STQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISW
        +T+Y+ + GSLQYLT TRPD+T++VN VCQH Q P  ++L+ VKRILRY+K T+ HG+ +L  SSLNL AFCDADW GCP TRRSTTGFC++LGS+ ISW
Subjt:  STQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISW

Query:  SSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADI
        +SKKQPT+SRSS EAEY+A+A T AE+TW+ YLL D+G+ L + P ++CDN SA+ MS NPVFHARTKHI +DYHF+REKV  G L  RY+ + +Q+AD+
Subjt:  SSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADI

Query:  LTKPLSKEMHRELTSKLGVRANPTLSLR
         TK L K+       KLGV      SL+
Subjt:  LTKPLSKEMHRELTSKLGVRANPTLSLR

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-30043.72Show/hide
Query:  LTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQV
        L+IQ+FHQCSSL+SIKLN +N LLWRSQV+PLVRS+G+ HHL  +     + +  E   T +    TW  NDGLLTSWLLG+++EEV+ +++G ETA  V
Subjt:  LTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQV

Query:  WKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM
        W SL E LL +TKE E+ +++ L  +KK                    A++KP+ D+ KVF LA+GLG KY DFR AMLSK PYP++NQFVLAL+ HEQM
Subjt:  WKSLEETLLAITKENEIHISDSLHNLKK--------------------AMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM

Query:  NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTE-----------AHNISLPRLQSPS---KHTTGNQ----------
           E  EEN    N  QAY++QRGRGR RG  F S+GRGF P       ATS Q +           + N   P  Q  +   +++  NQ          
Subjt:  NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQP----TRQATSPQTE-----------AHNISLPRLQSPS---KHTTGNQ----------

Query:  ---ILCQICGKANHIALECWKRFDHSYQMEEIPKALAAMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQ
           ++CQIC KANH AL+CW RFD+ YQ+EEIP+ALAAMAL  EE DP  YVDSGATAHI  + GK+S +  Y+G + ++VGNGE L ISHIG+  LKT+
Subjt:  ---ILCQICGKANHIALECWKRFDHSYQMEEIPKALAAMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQ

Query:  NNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSV
        + + L LK +LVVP IKKNL+SI +LTSDN CSI F++ GFV+K++  QVLA GTK+ GLYAL++                   + A T   +T ++K+ 
Subjt:  NNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLAMGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSV

Query:  ASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS-------------------YINQW-ESPN---QG----
                        S   WH+R+GH    SI  L +   I+VS+W KL  VC  CQ+ KS                   + + W  +PN   QG    
Subjt:  ASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS-------------------YINQW-ESPN---QG----

Query:  ----DTKTPH---------------------LVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIE-----DQDGHGDQNTDAITSTIANQLSN
            D  T +                     LV N  +E  I I  S+   +       N+ S   +   +      +Q+G  ++    I       L N
Subjt:  ----DTKTPH---------------------LVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIE-----DQDGHGDQNTDAITSTIANQLSN

Query:  QRL---------------------TELCQDS----------------------------------SPIGSPT----------------------------
         +L                     T L  +S                                  SP   P                             
Subjt:  QRL---------------------TELCQDS----------------------------------SPIGSPT----------------------------

Query:  -----------------NSEPTSTRPT--------------TNSSTLPDISEYLFVDLPFDAATS--PSSLRPNVTSLSLPQVEFEPPSSRKSELRTDER
                         N+  T+  PT              T + TL  +S  +   LP  +A    P + +  V  +S PQ                + 
Subjt:  -----------------NSEPTSTRPT--------------TNSSTLPDISEYLFVDLPFDAATS--PSSLRPNVTSLSLPQVEFEPPSSRKSELRTDER

Query:  TMTHSSHPMT-TRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKED
        T    +H +T ++LK       QM TF A+ +            EP ++R  +    W +AM+EE++ALI N+TW+LVPRP  TNIVGSKW+F+TK KED
Subjt:  TMTHSSHPMT-TRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKED

Query:  GTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPR
        GT+DRYKARLVA+G+SQ+ G+D+GETFSPVIK TTIR++ +LAV+  W ++QLDVKNAFLHG LKEEV+M QPPGF + +  N+VC+L +SLYGLKQAPR
Subjt:  GTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPR

Query:  AWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILK
        AWFD+LS  LLH+GF C KAD SLFI R+   + L+L+YVDDII+TGN+ N I++L+++L +EF+LKDLG+LHYFLG+EV+ +PNG  ++QTKY RD+L+
Subjt:  AWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILK

Query:  KAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAF
          KM+E +PI TPMA        DE+ +D TQY+ +VGSLQYLT TRPDI ++VNK CQH Q P   +LR VKRILRYLK T+ HGI F K SSL L  F
Subjt:  KAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAF

Query:  CDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIE
        CDADW GC  TRRST+G+C+FLG+N ISWSSK+QPT+SRSS EAEYR++A++ AEITW+T+LLRDIG+ L + P+L CDN+SAL M +N VFHAR+KHIE
Subjt:  CDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIE

Query:  LDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR
        LDYHFVREKVA G L+TR++PS  Q+ADI TK L K   +    KLGV   P  SLR
Subjt:  LDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-9434.43Show/hide
Query:  PPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEP-TSWRDAIN----------KEVWKQAMEEELQALIDNKTWE
        P  SR+SE     + +   +      ++     ++++KT        E   +NK  +   T + D  N          K  W++A+  EL A   N TW 
Subjt:  PPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEP-TSWRDAIN----------KEVWKQAMEEELQALIDNKTWE

Query:  LVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGF
        +  RP N NIV S+W+F  K+ E G   RYKARLVA+G++Q   IDY ETF+PV + ++ R +L+L + YN  + Q+DVK AFL+G LKEE+YM  P G 
Subjt:  LVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGF

Query:  AHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTI--LTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHY
        +  +  + VC+L K++YGLKQA R WF+     L    FV S  D  ++I  +  I     +L+YVDD+++   ++  + N    L  +F + DL  + +
Subjt:  AHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTI--LTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHY

Query:  FLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTL-TRPDITYSVNKVCQHMQEPKMQNLREVK
        F+G+ ++   +  +++Q+ Y + IL K  M   + ++TP+ +       + +   +T  +S++G L Y+ L TRPD+T +VN + ++  +   +  + +K
Subjt:  FLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTL-TRPDITYSVNKVCQHMQEPKMQNLREVK

Query:  RILRYLKHTIHHGITFLKNSSL--NLYAFCDADWGGCPITRRSTTGFCV-FLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPL
        R+LRYLK TI   + F KN +    +  + D+DW G  I R+STTG+       N+I W++K+Q +++ SSTEAEY A+     E  W+ +LL  I + L
Subjt:  RILRYLKHTIHHGITFLKNSSL--NLYAFCDADWGGCPITRRSTTGFCV-FLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPL

Query:  HKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGV
            ++Y DN   + ++ NP  H R KHI++ YHF RE+V    +   Y+P+  QLADI TKPL      EL  KLG+
Subjt:  HKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-8933.66Show/hide
Query:  FVDLPFDAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAIN---
        FV +P   + +P+S       +S    + E P     +    +  +    HP     + + L   +     +    S   ++   + EP S ++ ++   
Subjt:  FVDLPFDAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAIN---

Query:  KEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLD
        K    +AM+EE+++L  N T++LV  P     +  KW+F+ K   D  L RYKARLV +G+ Q +GID+ E FSPV+K T+IR +L+LA S +  ++QLD
Subjt:  KEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLD

Query:  VKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHR--QNTILTLMLVYVDDIILTGNNINF
        VK AFLHG L+EE+YM QP GF      + VC+L KSLYGLKQAPR W+ K  S++    ++ + +DP ++  R  +N  + ++L+YVDD+++ G +   
Subjt:  VKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHR--QNTILTLMLVYVDDIILTGNNINF

Query:  ITNLVASLGTEFALKDLGALHYFLGVEV--QNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPK-------DEENVDSTQYKSIVGSLQY-
        I  L   L   F +KDLG     LG+++  +       ++Q KY   +L++  M    P++TP+A     + K       ++ N+    Y S VGSL Y 
Subjt:  ITNLVASLGTEFALKDLGALHYFLGVEV--QNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPK-------DEENVDSTQYKSIVGSLQY-

Query:  LTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSST
        +  TRPDI ++V  V + ++ P  ++   VK ILRYL+ T    + F   S   L  + DAD  G    R+S+TG+        ISW SK Q  ++ S+T
Subjt:  LTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSST

Query:  EAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHREL
        EAEY A   T  E+ W+   L+++G+   K   +YCD+ SA+ +S N ++HARTKHI++ YH++RE V   SL    + + +  AD+LTK + +      
Subjt:  EAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHREL

Query:  TSKLGVRAN
           +G+ +N
Subjt:  TSKLGVRAN

P92519 Uncharacterized mitochondrial protein AtMg008101.2e-6754.26Show/hide
Query:  MLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKS
        +L+YVDDI+LTG++   +  L+  L + F++KDLG +HYFLG++++  P+G  ++QTKY   IL  A ML+  P++TP+      +    +  D + ++S
Subjt:  MLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKS

Query:  IVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQP
        IVG+LQYLTLTRPDI+Y+VN VCQ M EP + +   +KR+LRY+K TI HG+   KNS LN+ AFCD+DW GC  TRRSTTGFC FLG NIISWS+K+QP
Subjt:  IVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQP

Query:  TISRSSTEAEYRAMANTTAEITW
        T+SRSSTE EYRA+A T AE+TW
Subjt:  TISRSSTEAEYRAMANTTAEITW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-15231.03Show/hide
Query:  KLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEETLLAITKEN
        KL +TNYL+W  QV  L     +   L          +  +    +NP Y  W   D L+ S +LG IS  V   +    TA Q+W++L +     +  +
Subjt:  KLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEETLLAITKEN

Query:  EIHISDSLHNLKKAMK--------------------KPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQ------------MNQF
           +   L    K  K                    KP+D   +V  +   L  +YK     + +K   PT  +    L  HE             +   
Subjt:  EIHISDSLHNLKKAMK--------------------KPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQ------------MNQF

Query:  ELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQPTRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALEC--WKRFDHSYQMEEI
         +   N T  N N    ++  R   R  + +SK     P +Q+++     +N S P L             CQICG   H A  C   + F  S   ++ 
Subjt:  ELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQPTRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALEC--WKRFDHSYQMEEI

Query:  PKAL------AAMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKL
        P         A +AL +  +     +DSGAT HI  +   +S  + Y G + + V +G  + ISH G   L T+ +  L+L N+L VP I KNLIS+ +L
Subjt:  PKAL------AAMAL-TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKL

Query:  TSDNDCSIIFNADGFVVKN-KQGQVLAMGTKRHGLY-----------------------------------------------ALKQQEVYTALAG----
         + N  S+ F    F VK+   G  L  G  +  LY                                                L     + + +     
Subjt:  TSDNDCSIIFNADGFVVKN-KQGQVLAMGTKRHGLY-----------------------------------------------ALKQQEVYTALAG----

Query:  ------FTNSAIGSTALSAFTNSDITLNTKSVASDFN------------------------KNSATNFKN------EVSLSTWHKRLG--------HLNF
              F+ S I ST    +  SD+  +      ++                         K +   FKN      +  + T++   G        + + 
Subjt:  ------FTNSAIGSTALSAFTNSDITLNTKSVASDFN------------------------KNSATNFKN------EVSLSTWHKRLG--------HLNF

Query:  ASISHLSNTNMIQ----VSAWKKLHVV------CSVCQMSKSY-----------INQWESP-NQGDTKTPHLVNNLPMENNIDILHSNC-----------
          ISHL++         +S  K  H+V       S   + K+Y           IN+  +P  Q ++    L    P  + + +    C           
Subjt:  ASISHLSNTNMIQ----VSAWKKLHVV------CSVCQMSKSY-----------INQWESP-NQGDTKTPHLVNNLPMENNIDILHSNC-----------

Query:  ---ENDSCWSHGSNNQSATTLTSHIEDQDGHGDQNT--DAITSTIANQLSN-QRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPFDA
           ++  C   G +   +  L  H++    +  ++   D      +N L+    + E  ++SS + SP  + PT T P   + +  D         P  A
Subjt:  ---ENDSCWSHGSNNQSATTLTSHIEDQDGHGDQNT--DAITSTIANQLSN-QRLTELCQDSSPIGSPTNSEPTSTRPTTNSSTLPDISEYLFVDLPFDA

Query:  ATSPSS----LRPNVTSLSLPQVEF--------EPPSSRKSELR-----TDERTMTHSSHPMT----TRLKAKQLANQQMKTFLASENGSES--------
        AT PSS     R +  S S     F        EP + R++  +     T  +T THSS   +    T     QLA Q + T   S + S S        
Subjt:  ATSPSS----LRPNVTSLSLPQVEF--------EPPSSRKSELR-----TDERTMTHSSHPMT----TRLKAKQLANQQMKTFLASENGSES--------

Query:  ------------------QLVNK---------------------------------TEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELV-PRPSNT
                          Q+VN                                   E EP +   A+  E W+ AM  E+ A I N TW+LV P PS+ 
Subjt:  ------------------QLVNK---------------------------------TEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELV-PRPSNT

Query:  NIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNY
         IVG +WIF  K+  DG+L+RYKARLVA+GY+Q  G+DY ETFSPVIK T+IR+VL +AV  +WP++QLDV NAFL G L ++VYM+QPPGF   +  NY
Subjt:  NIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNY

Query:  VCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVP
        VC+L+K+LYGLKQAPRAW+ +L +YLL +GFV S +D SLF+ ++   +  MLVYVDDI++TGN+   + N + +L   F++KD   LHYFLG+E + VP
Subjt:  VCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVP

Query:  NGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPK-----DEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYL
         G H++Q +Y  D+L +  M+   P+ TPMA S    PK       +  D T+Y+ IVGSLQYL  TRPDI+Y+VN++ Q M  P  ++L+ +KRILRYL
Subjt:  NGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPK-----DEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYL

Query:  KHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCD
          T +HGI   K ++L+L+A+ DADW G      ST G+ V+LG + ISWSSKKQ  + RSSTEAEYR++ANT++E+ WI  LL ++G+ L + P +YCD
Subjt:  KHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCD

Query:  NMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANP
        N+ A  +  NPVFH+R KHI +DYHF+R +V  G+L   +V ++ QLAD LTKPLS+   +   SK+GV   P
Subjt:  NMSALQMSINPVFHARTKHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-14930.12Show/hide
Query:  KLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEETLL-----A
        KL +TNYL+W  QV  L     +   L    P     +  +    +NP Y  W   D L+ S +LG IS  V   +    TA Q+W++L +         
Subjt:  KLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLTNDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEETLL-----A

Query:  ITKENEIHISDSLHNLKKAMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQ----MNQFELEEENITLPNANQAYYSQRGRG
        +T+   I   D L      + KP+D   +V  +   L   YK     + +K   P+  +    L   E     +N  E+      +        ++    
Subjt:  ITKENEIHISDSLHNLKKAMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQ----MNQFELEEENITLPNANQAYYSQRGRG

Query:  RGRGRHF---SSKGRGFQPTRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALEC--WKRFDHSYQMEEI--------PKALAAMALTEE
        RG  R++   +++   +QP+   +          L R              CQIC    H A  C    +F  +   ++         P+A  A+  +  
Subjt:  RGRGRHF---SSKGRGFQPTRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALEC--WKRFDHSYQMEEI--------PKALAAMALTEE

Query:  NDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKN
        N     +DSGAT HI  +   +S  + Y G + + + +G  + I+H G   L T ++ SL L  VL VP I KNLIS+ +L + N  S+ F    F VK+
Subjt:  NDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKN

Query:  -KQGQVLAMGTKRHGLY--ALKQQEVYTALA--------------------GFTNSAIGSTALSA----------------------FTNSDITLN----
           G  L  G  +  LY   +   +  +  A                       NS I + +L                        F+NS IT +    
Subjt:  -KQGQVLAMGTKRHGLY--ALKQQEVYTALA--------------------GFTNSAIGSTALSA----------------------FTNSDITLN----

Query:  -------TKSVASDFNKNSATNFKNEVSLSTW--------------------------------------------------------------------
               +  + S  N      F +  +  TW                                                                    
Subjt:  -------TKSVASDFNKNSATNFKNEVSLSTW--------------------------------------------------------------------

Query:  ---HKRLGHLNFASISHLS--------------------NTNMIQVSA-----------WKKLHVVCSVCQMSKSYINQWESPNQGD---------TKTP
           H+ +  +    +SH S                     T ++Q+ +           ++KL V    C       N+ +  ++           T++ 
Subjt:  ---HKRLGHLNFASISHLS--------------------NTNMIQVSA-----------WKKLHVVCSVCQMSKSYINQWESPNQGD---------TKTP

Query:  HLVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIEDQDGHGDQNTDAITSTIA---------------------NQLSNQRLTELCQDSSPIG
        +L  ++P        H   + + C+   + N   +T      D   +   +T   T+ +                      + L   +++     SS I 
Subjt:  HLVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIEDQDGHGDQNTDAITSTIA---------------------NQLSNQRLTELCQDSSPIG

Query:  SPTNSEPTS-----TRPTTN-SSTLPDISEYLFVDLPFDAATSPSSLRPNVTSLSLPQ-----VEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLA
        SP++SEPT+      +PT     T    S    ++ P   + SP+S  PN  S  LPQ          PS+  SE  +   + T S+ P+   L A  + 
Subjt:  SPTNSEPTS-----TRPTTN-SSTLPDISEYLFVDLPFDAATSPSSLRPNVTSLSLPQ-----VEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLA

Query:  N-------------QQMKTFLASENGSESQLVN-KTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELV-PRPSNTNIVGSKWIFRTKFKEDGTLDR
                       + K  +   N   S   +     EP +   A+  + W+QAM  E+ A I N TW+LV P P +  IVG +WIF  KF  DG+L+R
Subjt:  N-------------QQMKTFLASENGSESQLVN-KTEVEPTSWRDAINKEVWKQAMEEELQALIDNKTWELV-PRPSNTNIVGSKWIFRTKFKEDGTLDR

Query:  YKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDK
        YKARLVA+GY+Q  G+DY ETFSPVIK T+IR+VL +AV  +WP++QLDV NAFL G L +EVYM+QPPGF   +  +YVCRL+K++YGLKQAPRAW+ +
Subjt:  YKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKEEVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDK

Query:  LSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKML
        L +YLL +GFV S +D SLF+ ++   +  MLVYVDDI++TGN+   + + + +L   F++K+   LHYFLG+E + VP G H++Q +YT D+L +  ML
Subjt:  LSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKML

Query:  EVSPIATPMATSYTKNPKDEENV-DSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDAD
           P+ATPMATS          + D T+Y+ IVGSLQYL  TRPD++Y+VN++ Q+M  P   +   +KR+LRYL  T  HGI   K ++L+L+A+ DAD
Subjt:  EVSPIATPMATSYTKNPKDEENV-DSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDAD

Query:  WGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYH
        W G      ST G+ V+LG + ISWSSKKQ  + RSSTEAEYR++ANT++E+ WI  LL ++G+ L   P +YCDN+ A  +  NPVFH+R KHI LDYH
Subjt:  WGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYH

Query:  FVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGV
        F+R +V  G+L   +V ++ QLAD LTKPLS+   +  + K+GV
Subjt:  FVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-11441.44Show/hide
Query:  DAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGS---ESQLVNKTEV-EPTSWRDAINKEVWK
        DA+TS SS+  ++   +  Q +   PS   S  RT +       +  +       L    +  FL+ E  S    S LV   +  EP+++ +A    VW 
Subjt:  DAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGS---ESQLVNKTEV-EPTSWRDAINKEVWK

Query:  QAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAF
         AM++E+ A+    TWE+   P N   +G KW+++ K+  DGT++RYKARLVA+GY+Q EGID+ ETFSPV K T+++L+LA++  YN+ L QLD+ NAF
Subjt:  QAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAF

Query:  LHGQLKEEVYMTQPPGFAHPNF----LNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITN
        L+G L EE+YM  PPG+A         N VC LKKS+YGLKQA R WF K S  L+  GFV S +D + F+    T+   +LVYVDDII+  NN   +  
Subjt:  LHGQLKEEVYMTQPPGFAHPNF----LNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITN

Query:  LVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNP-KDEENVDSTQYKSIVGSLQYLTLTRPDITYSV
        L + L + F L+DLG L YFLG+E+     G +I Q KY  D+L +  +L   P + PM  S T +     + VD+  Y+ ++G L YL +TR DI+++V
Subjt:  LVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNP-KDEENVDSTQYKSIVGSLQYLTLTRPDITYSV

Query:  NKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTA
        NK+ Q  + P++ + + V +IL Y+K T+  G+ +   + + L  F DA +  C  TRRST G+C+FLG+++ISW SKKQ  +S+SS EAEYRA++  T 
Subjt:  NKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTA

Query:  EITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREK
        E+ W+    R++ +PL K   L+CDN +A+ ++ N VFH RTKHIE D H VRE+
Subjt:  EITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHARTKHIELDYHFVREK

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.4e-1544.87Show/hide
Query:  YLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFC
        YLT+TRPD+T++VN++ Q     +   ++ V ++L Y+K T+  G+ +   S L L AF D+DW  CP TRRS TGFC
Subjt:  YLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFC

ATMG00810.1 DNA/RNA polymerases superfamily protein8.5e-6954.26Show/hide
Query:  MLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKS
        +L+YVDDI+LTG++   +  L+  L + F++KDLG +HYFLG++++  P+G  ++QTKY   IL  A ML+  P++TP+      +    +  D + ++S
Subjt:  MLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFLGVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKS

Query:  IVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQP
        IVG+LQYLTLTRPDI+Y+VN VCQ M EP + +   +KR+LRY+K TI HG+   KNS LN+ AFCD+DW GC  TRRSTTGFC FLG NIISWS+K+QP
Subjt:  IVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHGITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQP

Query:  TISRSSTEAEYRAMANTTAEITW
        T+SRSSTE EYRA+A T AE+TW
Subjt:  TISRSSTEAEYRAMANTTAEITW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.0e-2553.98Show/hide
Query:  EPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALA-
        EP S   A+    W QAM+EEL AL  NKTW LVP P N NI+G KW+F+TK   DGTLDR KARLVA+G+ Q EGI + ET+SPV++  TIR +L +A 
Subjt:  EPTSWRDAINKEVWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALA-

Query:  -----VSYNWPLK
              S NW  K
Subjt:  -----VSYNWPLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGATTCTTTCAGCAAGAGATCAAGAAGATATGACTGAAACACATTCTGTTGAGCAAACTTCAAAAACCAAAGTTGTTCCTACAAACCATTCCTCTGCCAC
TGAAGTTGGCCTTACAATTCAATCCTTCCATCAATGTTCTAGCCTGATCTCAATAAAGTTGAATACTACAAATTATCTCCTTTGGCGGTCGCAAGTTGTTCCTCTTGTTC
GAAGTATGGGAATCGAACACCATCTATTAAATGACAATCCTCCAGATTCAAAGATAGTTGACAAAGAAGGAACCAACACCATCAATCCTCAATACCTCACTTGGCTCACT
AACGATGGCCTACTTACCTCTTGGTTATTGGGAATCATCAGTGAGGAAGTGTTGGCTATGATTGAAGGATTGGAGACAGCTCGACAAGTCTGGAAGTCACTAGAAGAAAC
ACTTCTTGCTATTACAAAGGAAAATGAAATTCACATCAGTGATTCGCTGCATAATCTCAAGAAAGCCATGAAGAAACCCATTGATGATGTGAGCAAGGTTTTCACCTTGG
CTAGAGGATTAGGATTAAAATACAAGGACTTTAGAACAGCAATGTTGTCTAAGGCCCCTTATCCAACGTTCAATCAATTTGTGCTCGCTTTAAAGGCTCATGAGCAGATG
AATCAATTCGAATTGGAGGAAGAAAATATCACTTTACCCAATGCAAATCAAGCTTATTATAGTCAAAGAGGAAGAGGAAGAGGAAGAGGTAGACACTTCAGTTCCAAGGG
AAGAGGCTTTCAACCTACTCGACAAGCCACCTCACCACAAACTGAAGCTCACAATATTTCCCTTCCAAGACTGCAATCACCTTCCAAACACACGACAGGTAATCAAATTC
TGTGTCAAATTTGTGGAAAAGCTAATCACATTGCTCTTGAATGCTGGAAAAGGTTTGACCACTCATACCAAATGGAGGAAATCCCAAAAGCACTAGCTGCTATGGCTTTG
ACTGAAGAAAATGATCCTGCTCTTTATGTTGATTCTGGTGCCACAGCACACATTGTCAAAAATTCAGGTAAAATTTCTTCTTTAAAAACTTATCAAGGGAAAGAAAAACT
ATATGTAGGAAATGGTGAAGGCCTAAATATATCTCACATTGGTCAAGGATACTTGAAAACACAAAATAACAACTCTTTACACCTTAAGAATGTGTTGGTTGTGCCTCGTA
TCAAGAAAAATCTAATATCAATAAGTAAATTAACAAGTGATAATGATTGTTCTATTATATTTAATGCTGATGGTTTTGTTGTAAAGAACAAACAGGGGCAGGTACTAGCA
ATGGGAACTAAACGACATGGACTTTATGCCTTGAAACAACAAGAAGTTTACACTGCACTTGCTGGTTTTACAAATTCTGCTATTGGATCTACTGCCTTGTCTGCATTTAC
CAATTCTGATATTACTTTAAATACAAAGTCTGTTGCTTCTGATTTCAATAAAAATTCTGCTACTAATTTTAAAAATGAAGTGTCTTTATCTACATGGCACAAGCGTTTAG
GGCATTTAAATTTTGCTTCCATTTCTCACCTTAGTAATACAAATATGATTCAAGTATCTGCTTGGAAGAAGTTGCATGTTGTTTGTTCAGTATGTCAAATGAGTAAAAGC
TATATTAACCAATGGGAATCACCAAATCAAGGGGACACAAAAACACCTCATTTGGTGAACAATCTACCTATGGAGAACAACATAGATATTCTTCACTCAAATTGTGAAAA
TGATAGTTGTTGGTCCCATGGTAGTAATAATCAATCTGCAACCACATTAACGAGTCACATTGAAGATCAAGATGGTCATGGTGATCAAAATACAGATGCAATCACTTCAA
CAATAGCAAATCAACTTTCCAACCAAAGACTTACTGAATTGTGTCAAGACTCATCACCAATTGGCTCCCCCACCAATTCAGAGCCAACTTCAACCAGACCTACCACAAAC
TCTTCAACATTACCTGATATAAGTGAATATCTTTTTGTCGACTTACCTTTTGATGCAGCTACAAGTCCCTCCAGCTTGAGACCGAATGTCACTTCTCTCAGTCTACCACA
AGTTGAGTTTGAACCACCAAGTAGCAGGAAGTCAGAATTGAGAACGGATGAGCGTACCATGACACACTCCTCACATCCTATGACCACAAGACTCAAAGCAAAACAACTTG
CAAACCAACAAATGAAAACATTCCTAGCCTCTGAAAATGGGAGTGAGTCTCAACTTGTTAACAAAACTGAAGTTGAACCCACAAGTTGGCGAGATGCTATAAATAAAGAA
GTATGGAAACAGGCTATGGAAGAAGAACTCCAAGCTCTCATTGACAACAAAACATGGGAGCTAGTTCCTAGACCATCCAACACAAATATCGTGGGTTCAAAATGGATATT
CAGAACAAAATTCAAGGAGGATGGAACACTCGATAGATACAAAGCAAGACTAGTTGCTCAAGGTTACTCTCAAGTTGAAGGAATTGACTATGGTGAAACCTTCAGCCCAG
TTATTAAACCTACAACTATTCGTCTAGTTCTTGCTCTTGCGGTATCCTACAATTGGCCACTCAAACAATTAGATGTGAAGAATGCCTTTCTCCATGGCCAACTCAAAGAA
GAGGTATATATGACTCAACCTCCAGGTTTTGCTCATCCTAACTTCCTTAACTATGTTTGCAGGTTGAAAAAGTCACTTTATGGACTCAAACAAGCACCCAGAGCATGGTT
TGATAAACTCTCCTCCTACCTTCTTCATATGGGGTTTGTATGTAGTAAGGCTGATCCATCACTCTTTATCCATAGACAAAACACTATACTCACCCTTATGCTTGTATATG
TCGATGATATTATACTTACAGGAAACAATATAAACTTTATTACAAACTTAGTTGCAAGTCTAGGAACTGAATTTGCCTTAAAAGACCTAGGTGCTCTACACTATTTCTTA
GGCGTTGAAGTACAAAATGTTCCAAATGGCTACCACATCACTCAAACCAAGTATACTAGAGACATACTTAAAAAGGCCAAAATGTTGGAAGTTTCACCCATAGCTACACC
AATGGCAACAAGCTACACCAAGAATCCTAAGGATGAAGAAAATGTGGACTCAACACAATACAAGAGTATAGTGGGTTCGCTCCAGTACCTCACCTTAACAAGACCAGATA
TTACATACTCAGTCAACAAAGTATGTCAACACATGCAGGAGCCTAAGATGCAAAATTTAAGGGAAGTCAAACGAATTCTTCGATATCTTAAACACACAATTCATCATGGC
ATTACTTTTCTTAAAAATAGTTCACTAAATCTTTATGCCTTCTGTGATGCAGATTGGGGTGGTTGTCCTATCACACGTAGAAGCACTACAGGATTTTGCGTGTTTCTTGG
ATCCAACATCATCTCTTGGTCTTCTAAAAAGCAACCAACAATCTCTCGCTCAAGCACTGAAGCGGAGTATAGAGCCATGGCTAATACGACAGCAGAAATCACATGGATAA
CGTATCTTCTTCGGGACATAGGAGTACCCCTTCACAAAGCACCAGAACTCTACTGTGACAACATGAGTGCACTACAAATGTCCATTAACCCAGTATTTCATGCTAGAACC
AAACATATCGAGTTGGACTATCATTTTGTTAGAGAAAAAGTTGCTCTTGGCTCATTGGTGACACGTTATGTACCATCATACAAACAACTCGCAGATATCTTAACCAAGCC
TCTATCCAAGGAGATGCATCGTGAGTTAACATCCAAACTTGGTGTTCGGGCAAACCCTACCCTCAGTTTGAGAACGGATGGGTCGTTGTTGGTGCAAGGTTTTGGAGATG
AGAATGTGCAAGAGTTTCTGATACCCTTGTTTGGTACAAGAAATAAGGCTGGGAGATGGACCCTAGAGGCGAAACCGGCAAGCAGGACGGGCCAAGATCGAAGGGGTCGG
GCTTTTGGCCCACCCCCTGCTCGGCCTCGGCCCAAGGCTGAGCCAAGGCCCATGGTCAGCCCGCTTGCGCGGGTCGAGCCCCTTCGGCTCCGTTTAATCCCTACCGCCTC
TGGCCACCCCGGTTCGGCCTGGTTTGTCCCGAAACGTCTTCAAATTCCTAAAAACCCTAGGAGCACGAGCAGCATCGGAGGCGGTGTGGCTAGCACCACACCAGTGTGCA
GGTTTTCCTTGATTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCATTGGTGGCACGTGAAGGTCAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGATTCTTTCAGCAAGAGATCAAGAAGATATGACTGAAACACATTCTGTTGAGCAAACTTCAAAAACCAAAGTTGTTCCTACAAACCATTCCTCTGCCAC
TGAAGTTGGCCTTACAATTCAATCCTTCCATCAATGTTCTAGCCTGATCTCAATAAAGTTGAATACTACAAATTATCTCCTTTGGCGGTCGCAAGTTGTTCCTCTTGTTC
GAAGTATGGGAATCGAACACCATCTATTAAATGACAATCCTCCAGATTCAAAGATAGTTGACAAAGAAGGAACCAACACCATCAATCCTCAATACCTCACTTGGCTCACT
AACGATGGCCTACTTACCTCTTGGTTATTGGGAATCATCAGTGAGGAAGTGTTGGCTATGATTGAAGGATTGGAGACAGCTCGACAAGTCTGGAAGTCACTAGAAGAAAC
ACTTCTTGCTATTACAAAGGAAAATGAAATTCACATCAGTGATTCGCTGCATAATCTCAAGAAAGCCATGAAGAAACCCATTGATGATGTGAGCAAGGTTTTCACCTTGG
CTAGAGGATTAGGATTAAAATACAAGGACTTTAGAACAGCAATGTTGTCTAAGGCCCCTTATCCAACGTTCAATCAATTTGTGCTCGCTTTAAAGGCTCATGAGCAGATG
AATCAATTCGAATTGGAGGAAGAAAATATCACTTTACCCAATGCAAATCAAGCTTATTATAGTCAAAGAGGAAGAGGAAGAGGAAGAGGTAGACACTTCAGTTCCAAGGG
AAGAGGCTTTCAACCTACTCGACAAGCCACCTCACCACAAACTGAAGCTCACAATATTTCCCTTCCAAGACTGCAATCACCTTCCAAACACACGACAGGTAATCAAATTC
TGTGTCAAATTTGTGGAAAAGCTAATCACATTGCTCTTGAATGCTGGAAAAGGTTTGACCACTCATACCAAATGGAGGAAATCCCAAAAGCACTAGCTGCTATGGCTTTG
ACTGAAGAAAATGATCCTGCTCTTTATGTTGATTCTGGTGCCACAGCACACATTGTCAAAAATTCAGGTAAAATTTCTTCTTTAAAAACTTATCAAGGGAAAGAAAAACT
ATATGTAGGAAATGGTGAAGGCCTAAATATATCTCACATTGGTCAAGGATACTTGAAAACACAAAATAACAACTCTTTACACCTTAAGAATGTGTTGGTTGTGCCTCGTA
TCAAGAAAAATCTAATATCAATAAGTAAATTAACAAGTGATAATGATTGTTCTATTATATTTAATGCTGATGGTTTTGTTGTAAAGAACAAACAGGGGCAGGTACTAGCA
ATGGGAACTAAACGACATGGACTTTATGCCTTGAAACAACAAGAAGTTTACACTGCACTTGCTGGTTTTACAAATTCTGCTATTGGATCTACTGCCTTGTCTGCATTTAC
CAATTCTGATATTACTTTAAATACAAAGTCTGTTGCTTCTGATTTCAATAAAAATTCTGCTACTAATTTTAAAAATGAAGTGTCTTTATCTACATGGCACAAGCGTTTAG
GGCATTTAAATTTTGCTTCCATTTCTCACCTTAGTAATACAAATATGATTCAAGTATCTGCTTGGAAGAAGTTGCATGTTGTTTGTTCAGTATGTCAAATGAGTAAAAGC
TATATTAACCAATGGGAATCACCAAATCAAGGGGACACAAAAACACCTCATTTGGTGAACAATCTACCTATGGAGAACAACATAGATATTCTTCACTCAAATTGTGAAAA
TGATAGTTGTTGGTCCCATGGTAGTAATAATCAATCTGCAACCACATTAACGAGTCACATTGAAGATCAAGATGGTCATGGTGATCAAAATACAGATGCAATCACTTCAA
CAATAGCAAATCAACTTTCCAACCAAAGACTTACTGAATTGTGTCAAGACTCATCACCAATTGGCTCCCCCACCAATTCAGAGCCAACTTCAACCAGACCTACCACAAAC
TCTTCAACATTACCTGATATAAGTGAATATCTTTTTGTCGACTTACCTTTTGATGCAGCTACAAGTCCCTCCAGCTTGAGACCGAATGTCACTTCTCTCAGTCTACCACA
AGTTGAGTTTGAACCACCAAGTAGCAGGAAGTCAGAATTGAGAACGGATGAGCGTACCATGACACACTCCTCACATCCTATGACCACAAGACTCAAAGCAAAACAACTTG
CAAACCAACAAATGAAAACATTCCTAGCCTCTGAAAATGGGAGTGAGTCTCAACTTGTTAACAAAACTGAAGTTGAACCCACAAGTTGGCGAGATGCTATAAATAAAGAA
GTATGGAAACAGGCTATGGAAGAAGAACTCCAAGCTCTCATTGACAACAAAACATGGGAGCTAGTTCCTAGACCATCCAACACAAATATCGTGGGTTCAAAATGGATATT
CAGAACAAAATTCAAGGAGGATGGAACACTCGATAGATACAAAGCAAGACTAGTTGCTCAAGGTTACTCTCAAGTTGAAGGAATTGACTATGGTGAAACCTTCAGCCCAG
TTATTAAACCTACAACTATTCGTCTAGTTCTTGCTCTTGCGGTATCCTACAATTGGCCACTCAAACAATTAGATGTGAAGAATGCCTTTCTCCATGGCCAACTCAAAGAA
GAGGTATATATGACTCAACCTCCAGGTTTTGCTCATCCTAACTTCCTTAACTATGTTTGCAGGTTGAAAAAGTCACTTTATGGACTCAAACAAGCACCCAGAGCATGGTT
TGATAAACTCTCCTCCTACCTTCTTCATATGGGGTTTGTATGTAGTAAGGCTGATCCATCACTCTTTATCCATAGACAAAACACTATACTCACCCTTATGCTTGTATATG
TCGATGATATTATACTTACAGGAAACAATATAAACTTTATTACAAACTTAGTTGCAAGTCTAGGAACTGAATTTGCCTTAAAAGACCTAGGTGCTCTACACTATTTCTTA
GGCGTTGAAGTACAAAATGTTCCAAATGGCTACCACATCACTCAAACCAAGTATACTAGAGACATACTTAAAAAGGCCAAAATGTTGGAAGTTTCACCCATAGCTACACC
AATGGCAACAAGCTACACCAAGAATCCTAAGGATGAAGAAAATGTGGACTCAACACAATACAAGAGTATAGTGGGTTCGCTCCAGTACCTCACCTTAACAAGACCAGATA
TTACATACTCAGTCAACAAAGTATGTCAACACATGCAGGAGCCTAAGATGCAAAATTTAAGGGAAGTCAAACGAATTCTTCGATATCTTAAACACACAATTCATCATGGC
ATTACTTTTCTTAAAAATAGTTCACTAAATCTTTATGCCTTCTGTGATGCAGATTGGGGTGGTTGTCCTATCACACGTAGAAGCACTACAGGATTTTGCGTGTTTCTTGG
ATCCAACATCATCTCTTGGTCTTCTAAAAAGCAACCAACAATCTCTCGCTCAAGCACTGAAGCGGAGTATAGAGCCATGGCTAATACGACAGCAGAAATCACATGGATAA
CGTATCTTCTTCGGGACATAGGAGTACCCCTTCACAAAGCACCAGAACTCTACTGTGACAACATGAGTGCACTACAAATGTCCATTAACCCAGTATTTCATGCTAGAACC
AAACATATCGAGTTGGACTATCATTTTGTTAGAGAAAAAGTTGCTCTTGGCTCATTGGTGACACGTTATGTACCATCATACAAACAACTCGCAGATATCTTAACCAAGCC
TCTATCCAAGGAGATGCATCGTGAGTTAACATCCAAACTTGGTGTTCGGGCAAACCCTACCCTCAGTTTGAGAACGGATGGGTCGTTGTTGGTGCAAGGTTTTGGAGATG
AGAATGTGCAAGAGTTTCTGATACCCTTGTTTGGTACAAGAAATAAGGCTGGGAGATGGACCCTAGAGGCGAAACCGGCAAGCAGGACGGGCCAAGATCGAAGGGGTCGG
GCTTTTGGCCCACCCCCTGCTCGGCCTCGGCCCAAGGCTGAGCCAAGGCCCATGGTCAGCCCGCTTGCGCGGGTCGAGCCCCTTCGGCTCCGTTTAATCCCTACCGCCTC
TGGCCACCCCGGTTCGGCCTGGTTTGTCCCGAAACGTCTTCAAATTCCTAAAAACCCTAGGAGCACGAGCAGCATCGGAGGCGGTGTGGCTAGCACCACACCAGTGTGCA
GGTTTTCCTTGATTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCATTGGTGGCACGTGAAGGTCAGGTGA
Protein sequenceShow/hide protein sequence
MVSEILSARDQEDMTETHSVEQTSKTKVVPTNHSSATEVGLTIQSFHQCSSLISIKLNTTNYLLWRSQVVPLVRSMGIEHHLLNDNPPDSKIVDKEGTNTINPQYLTWLT
NDGLLTSWLLGIISEEVLAMIEGLETARQVWKSLEETLLAITKENEIHISDSLHNLKKAMKKPIDDVSKVFTLARGLGLKYKDFRTAMLSKAPYPTFNQFVLALKAHEQM
NQFELEEENITLPNANQAYYSQRGRGRGRGRHFSSKGRGFQPTRQATSPQTEAHNISLPRLQSPSKHTTGNQILCQICGKANHIALECWKRFDHSYQMEEIPKALAAMAL
TEENDPALYVDSGATAHIVKNSGKISSLKTYQGKEKLYVGNGEGLNISHIGQGYLKTQNNNSLHLKNVLVVPRIKKNLISISKLTSDNDCSIIFNADGFVVKNKQGQVLA
MGTKRHGLYALKQQEVYTALAGFTNSAIGSTALSAFTNSDITLNTKSVASDFNKNSATNFKNEVSLSTWHKRLGHLNFASISHLSNTNMIQVSAWKKLHVVCSVCQMSKS
YINQWESPNQGDTKTPHLVNNLPMENNIDILHSNCENDSCWSHGSNNQSATTLTSHIEDQDGHGDQNTDAITSTIANQLSNQRLTELCQDSSPIGSPTNSEPTSTRPTTN
SSTLPDISEYLFVDLPFDAATSPSSLRPNVTSLSLPQVEFEPPSSRKSELRTDERTMTHSSHPMTTRLKAKQLANQQMKTFLASENGSESQLVNKTEVEPTSWRDAINKE
VWKQAMEEELQALIDNKTWELVPRPSNTNIVGSKWIFRTKFKEDGTLDRYKARLVAQGYSQVEGIDYGETFSPVIKPTTIRLVLALAVSYNWPLKQLDVKNAFLHGQLKE
EVYMTQPPGFAHPNFLNYVCRLKKSLYGLKQAPRAWFDKLSSYLLHMGFVCSKADPSLFIHRQNTILTLMLVYVDDIILTGNNINFITNLVASLGTEFALKDLGALHYFL
GVEVQNVPNGYHITQTKYTRDILKKAKMLEVSPIATPMATSYTKNPKDEENVDSTQYKSIVGSLQYLTLTRPDITYSVNKVCQHMQEPKMQNLREVKRILRYLKHTIHHG
ITFLKNSSLNLYAFCDADWGGCPITRRSTTGFCVFLGSNIISWSSKKQPTISRSSTEAEYRAMANTTAEITWITYLLRDIGVPLHKAPELYCDNMSALQMSINPVFHART
KHIELDYHFVREKVALGSLVTRYVPSYKQLADILTKPLSKEMHRELTSKLGVRANPTLSLRTDGSLLVQGFGDENVQEFLIPLFGTRNKAGRWTLEAKPASRTGQDRRGR
AFGPPPARPRPKAEPRPMVSPLARVEPLRLRLIPTASGHPGSAWFVPKRLQIPKNPRSTSSIGGGVASTTPVCRFSLICRPRLPPHLQIYHWWHVKVR