; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008350 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008350
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:18615713..18623629
RNA-Seq ExpressionLag0008350
SyntenyLag0008350
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.1e-26140.64Show/hide
Query:  RMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTW-------------------------------
        +M+Q   Q  GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW                               
Subjt:  RMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTW-------------------------------

Query:  -----------------------------------------DEAD-----------LAMIANALKNVTVISHQQ------PPAMEPTTVVNQVT------
                                                 D+ D            A   N L N+ V  +++      PP       V+QVT      
Subjt:  -----------------------------------------DEAD-----------LAMIANALKNVTVISHQQ------PPAMEPTTVVNQVT------

Query:  -----DEACVYCGEDHNYEFCPSNPASVFFVEAQQK--------------MNQPGFAKAQAKPQGKLP-------SDTK-HPRREGKEQVKAVTLRSGKP
                   CGE H  + CP +  S+ FV   +K                 P F+    + QG  P        +TK +PR++GK Q +AVTLR+G+ 
Subjt:  -----DEACVYCGEDHNYEFCPSNPASVFFVEAQQK--------------MNQPGFAKAQAKPQGKLP-------SDTK-HPRREGKEQVKAVTLRSGKP

Query:  LEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
        L+E  K  +   + +K +  E++ +                   +VE P     P     PFPQR + +    QF KFLE+ K+LHINIP  EA+EQMP+
Subjt:  LEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  NVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYP
         VKF+KDIL+KK+R G++ETV+LTEECSAI++N LPPK KDPG              RALCDLGASINLMP SIYR LG+ EA+PT++TLQLADRS+TYP
Subjt:  NVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYP

Query:  EGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSAD
        +G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQKGEL MRV ++++ FNVFKAMKFP+E ++C  + + ++   + +I +   
Subjt:  EGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSAD

Query:  KHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDV---FESLDLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEAL
           E+    ++++      D + ++E+ +  D    F+S  ++  +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L
Subjt:  KHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDV---FESLDLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEAL

Query:  IKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIP
        +++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCVPKKGG+TVV N  NE IP
Subjt:  IKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIP

Query:  TRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML------------------------------------------------------------------
        T+ V GWRVCMDYR+LNKATRKDHFPLPFIDQML                                                                  
Subjt:  TRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML------------------------------------------------------------------

Query:  ----------------------------------------IDWLVRPTTVSWMVILG---------ITRLLLLLRIRRKPLSPVLMGHSLLGE-------
                                                ++W      V   ++LG         + +  L    +  P + V    S LG        
Subjt:  ----------------------------------------IDWLVRPTTVSWMVILG---------ITRLLLLLRIRRKPLSPVLMGHSLLGE-------

Query:  --------------------------CLLASAMLQQHFSG--------------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENEL
                                  CL A   L++                  +MCDASD A+GA+LGQ++ K    IYYASK LN+AQ+NYTTTE EL
Subjt:  --------------------------CLLASAMLQQHFSG--------------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENEL

Query:  LVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL------
        L VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FPDEQLL      
Subjt:  LVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL------

Query:  --------------------------------------------------LLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAH
                                                          L RCV   E  +I EQCH+ PYGGHF   RT  + L  GFFWP LFKD H
Subjt:  --------------------------------------------------LLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAH

Query:  WFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS
         F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDY+SKWVEA+A   +D+K V  F++ +IF RFGTPRA++SD G+
Subjt:  WFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.6e-25646.51Show/hide
Query:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK
        ++PQG LPS+T+ +PR++GK Q +AVTLR+G+ L+E  K  +   + +K ++ E++ +                   +VE P     P     PFPQR +
Subjt:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK

Query:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI
         +  + QF KFLE+ K+LHINIP  EA+EQMP+ VKF+KDIL+KK+R G++ETV+LTEECSAI++N LPPK KDPGSFT+P +IG    GRALCDLGASI
Subjt:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI

Query:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV
        NLMP SIYR LG+ EA+PT++TLQLADRS+TYP+G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQKGEL MRV ++++ FNV
Subjt:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV

Query:  FKAMKFPDEMEDCSFIRILESTV------------IETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIE
        FKAMKFP+E ++C  + + +               +E  + D  D+ +E+  EV+       +LD     +  R E +       +R  P   +KPS+ +
Subjt:  FKAMKFPDEMEDCSFIRILESTV------------IETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIE

Query:  APTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIK
         PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLNP MKEVVKKE+IK
Subjt:  APTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIK

Query:  WLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML--------------------------
        WLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTR V GWRVCMDYR+LNKATRKDHFPLPFIDQML                          
Subjt:  WLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML--------------------------

Query:  --------------------------------------------------------------------------------IDWLVRPTTVSWMVILG---
                                                                                        ++W      V   ++LG   
Subjt:  --------------------------------------------------------------------------------IDWLVRPTTVSWMVILG---

Query:  ------ITRLLLLLRIRRKPLSPVLMGHSLLGE------------------CLLASAMLQQHFSG-----------------------------VMCDAS
              + +  L    +  P + V    S LG                   C L    +  +F                               +MCDAS
Subjt:  ------ITRLLLLLRIRRKPLSPVLMGHSLLGE------------------CLLASAMLQQHFSG-----------------------------VMCDAS

Query:  DAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKG
        D AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTE ELL VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG
Subjt:  DAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKG

Query:  SENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEA
        +EN IADHLSRL+  +   E + I+D+FPDEQLL                                                        L RCV   E 
Subjt:  SENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEA

Query:  KEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVS
         +ILEQCH+ PYGGHF G RT  + L  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDYVS
Subjt:  KEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVS

Query:  KWVEAIACHQSDAKTV
        KWVEA A   +D+K V
Subjt:  KWVEAIACHQSDAKTV

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.0e-25546.88Show/hide
Query:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK
        ++PQG LPS+T+ +PR++GK Q +AVTLR+G+ L+E  K       S +  V+ KE E                   +VE P     P     PFPQ+ +
Subjt:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK

Query:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI
         +  + QF KFLE+ K+LHINIP  EA+EQMP+ VKF+KDIL+KK+R G++ET +LTEEC+AI++N LPPK KDPGSFT+P +IG    GRALCDLGASI
Subjt:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI

Query:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV
        NLMP SIYR LG+ EA+PT++TLQLADRS+TYP+G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQKGEL MRV ++++ FNV
Subjt:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV

Query:  FKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEFCSLD---RKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPL
        FKAMKFP+E ++C  + + ++     +I +      E+    +IE+     L+     N  + F+   V ESL   +R  P   +KPS+ + PTL+LKPL
Subjt:  FKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEFCSLD---RKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPL

Query:  PDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYP
        P+HL YVYLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLN  MKEVVKKE+IKWLDAGIIYP
Subjt:  PDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML-----------------------------------
        I+DS+WVSPVQCVPKKGG+TVV N  NELIPTR V GWRVCMDYR+LNKATRKDHFPLPFIDQML                                   
Subjt:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML-----------------------------------

Query:  -----------------------------------------------------------------------IDWLVRPTTVSWMVILG---------ITR
                                                                               ++W      V   ++LG         + +
Subjt:  -----------------------------------------------------------------------IDWLVRPTTVSWMVILG---------ITR

Query:  LLLLLRIRRKPLSPV-----LMGHS---------------------------------------LLGECLLASAMLQQHFS---GVMCDASDAAVGAMLG
          L    +  P + V      +GH+                                       L G  + A  +    +S    +MCDASD A+GA+LG
Subjt:  LLLLLRIRRKPLSPV-----LMGHS---------------------------------------LLGECLLASAMLQQHFS---GVMCDASDAAVGAMLG

Query:  QKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHL
        Q++ K    IYYASK LN+AQ+NYTTTE ELL VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IADHL
Subjt:  QKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHL

Query:  SRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEAKEILEQCHS
        SRL+  +   E + I+D+FPDEQLL                                                        L RCV   E  +ILEQCH+
Subjt:  SRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEAKEILEQCHS

Query:  LPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACH
         PYGGHF G RT  + L  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDYVSKWVEA A  
Subjt:  LPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACH

Query:  QSDAKTV
         +D+K V
Subjt:  QSDAKTV

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]1.4e-0738.82Show/hide
Query:  PILIANDRTRANRMLQTV--GQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDE
        P + AN+    +  +Q +   + +GLS+EDP+ HL++FL + D+F + GVP + +RL LF  SL   A+ WL+S    SI TW++
Subjt:  PILIANDRTRANRMLQTV--GQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDE

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.3e-24443.34Show/hide
Query:  AKPQGKLPSDTKHPRREGKEQVKAVTLRSGKPLEE--SRKTQDL----NNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPF
        A+ +G  PS+T+      KEQ KA+TLRSG+ +E   S++T+      NN   KN V E+E+           +   S S PD       PP    PLP+
Subjt:  AKPQGKLPSDTKHPRREGKEQVKAVTLRSGKPLEE--SRKTQDL----NNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPF

Query:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCD
        PQR + +  D QF KFL+I K++HINIP  +A+EQMPN  KFLKDI++KK+R  EFETV L+EECSAI++  LP K KDPGSFT+P +IG     + LCD
Subjt:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCD

Query:  LGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEE
        LGASINLMPLS+YRKLG+ E + TT++LQLADRSI YP G IEDVLVKVDKFIF  DF++LD E D+ VP+ILGRPF A GRAL+DVQKGEL +RV  EE
Subjt:  LGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEE

Query:  VKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEF-----CSLDRKNEKELFR----CEDVFESLDLDQRKAPPIKPSLI-
        V FN+++AMKFP++   C  + ++E  V+E   +D    H E+        ++F     C  D     E F       +  + +    ++   ++P +  
Subjt:  VKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEF-----CSLDRKNEKELFR----CEDVFESLDLDQRKAPPIKPSLI-

Query:  -----------EAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNP
                   +  T +LK LP+HL+Y +LG+S T P+IVA+ L PE EE L+++L+++R A+GWT++DI+GISPS CMHKI++EE    SI+ QRRLNP
Subjt:  -----------EAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNP

Query:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQMLIDWLV---------
        AMKEVV+ E++K L+AGIIY I+DS+WVSPVQ VPKKGG+TVV N +NE IPTR V GWRVCMDYR+LNKATRKDHFPLPFIDQML D L          
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQMLIDWLV---------

Query:  --------------------------------------RPTTVSWMVILGITRL---------------------------LLLLRIRRKPL--------
                                               P T    ++   + +                           L+L R   K L        
Subjt:  --------------------------------------RPTTVSWMVILGITRL---------------------------LLLLRIRRKPL--------

Query:  ----SPVLMGH------------------------------SLLGE---------------------------------CLLA-SAMLQQHFSG------
              +++GH                              S LG                                  CL A +A+ ++  S       
Subjt:  ----SPVLMGH------------------------------SLLGE---------------------------------CLLA-SAMLQQHFSG------

Query:  -------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILL
               VMCDASD A+GA+LGQ++ K    IYYAS+ LNEAQ+NYTTTE E+L VVFA +KFR YL+ +KV VFTDHA++RYL SKKDAKPRLIRWILL
Subjt:  -------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILL

Query:  LQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQL------------------------------------------------------
        LQEFDLE++DKKGSEN +ADHLSRL+    +     I ++FPDEQL                                                      
Subjt:  LQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQL------------------------------------------------------

Query:  -LLLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNG
         ++ RCV   E + IL  CHS  YGGHF   RT  + L  GFFWP++F+D++   K CD CQR GN+  R E+PL  ILEVELFDVWGIDFMGPF PS G
Subjt:  -LLLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNG

Query:  NVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGSFIVIEVFPN----------GAITLQDEKDGRVFKVQKIVAAKIMLEQI
         V+ILLAVDYVSKWVEAIA   +DAK V +FL  +IF RFGTPRA++SDEG+    ++F N           A+    + +G+     + +  K +LE+ 
Subjt:  NVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGSFIVIEVFPN----------GAITLQDEKDGRVFKVQKIVAAKIMLEQI

Query:  SRTNRKDFS
          TNRKD++
Subjt:  SRTNRKDFS

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]7.8e-0635.25Show/hide
Query:  EGPVAANPQQ-----NPLLQQN--PLFEQNEQQNNQAENPILIANDRTRANRMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYS
        EG   A P+       P++  N   +  Q    NN    P LI+        M+Q   Q  G   +DP++HL  FL + D+  + GV  D +RL LFP+S
Subjt:  EGPVAANPQQ-----NPLLQQN--PLFEQNEQQNNQAENPILIANDRTRANRMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYS

Query:  LRDGAKSWLNSFALGSIRTWDE
        LRD A+ WL S   GSI +W +
Subjt:  LRDGAKSWLNSFALGSIRTWDE

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]4.9e-24247.44Show/hide
Query:  KPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKP
        +P G LPS+T+ +P+ + +E  KA+TLRSGK +E + K  D   + +K +  E  +        SN  A AS           P     PP PFPQR + 
Subjt:  KPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKP

Query:  KNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASIN
        + QD QF+KF+++ K+L INIP  EA+EQM + VKF+KDIL++K+R  EFETV+LTEECSAIL+  LPPK KDPGSFT+P +IG +  G+ALCDLGAS+N
Subjt:  KNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASIN

Query:  LMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVF
        LMPLSI+ KLG+ E +PT+V LQLADRS+ YP G +EDVLVKVDKFIF  DFI+LD E D  +P++LGRPF A GR LIDVQKGEL MRV +E+V FNVF
Subjt:  LMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVF

Query:  KAMKFPDEMEDCSFIRILES-----TVIETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLP
         AMKF ++ E C  +           ++E    D  +    + G+   E+   C  +        R    FES ++   K+   KPS+ E P L+LK LP
Subjt:  KAMKFPDEMEDCSFIRILES-----TVIETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLP

Query:  DHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPI
         HLKY +LGE  TLP+I++S L  EHEE L+++L++Y++AIGW +ADI+GISPSFCMHKI +E+    +I+ QRRLNP MKEVVKKE+IKWLDAGIIYPI
Subjt:  DHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPI

Query:  ADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQMLIDWLV-------------------------------
        +DS+WVSP+QCVPKKGG+TVV+N+ NELIPTR V GWRVCMDYR+LNKATRKDHFPLPFIDQML D L                                
Subjt:  ADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQMLIDWLV-------------------------------

Query:  ----------------RPTT--------VSWMVILGI---------------TRLLLLLRIRRK-------------PLSPVLMGHSLLGE---------
                         P+T         S M+  G+                 L+ L R+ ++             P S V    S LG          
Subjt:  ----------------RPTT--------VSWMVILGI---------------TRLLLLLRIRRK-------------PLSPVLMGHSLLGE---------

Query:  ------------------------CLLASAMLQQHFSG--------------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLV
                                C+ A   L++                  +MCDASD A+G +LGQ+  K  H IYYASK L+ AQ+NYTTTE ELL 
Subjt:  ------------------------CLLASAMLQQHFSG--------------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLV

Query:  VVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLLLLRCVLGA
        VV+AFEKFR YLVG+KV V+TDH++I+YL+ KKDAKPRLI+W+LLLQEFDLEIK +KGSEN +ADHLSRL+ +        I+DSFPDE+L+ +      
Subjt:  VVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLLLLRCVLGA

Query:  EAKEILEQCHS--LPYGGHFSGRRTTMRTLHCGFFW--PTLFKDAHWFYKQ--------CDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPS
           + +    S  +P   ++  ++  +  +   F W  P LFK       +        CD CQR GN+  R+EMPL YILEVELFDVWGIDFMGPF+ S
Subjt:  EAKEILEQCHS--LPYGGHFSGRRTTMRTLHCGFFW--PTLFKDAHWFYKQ--------CDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPS

Query:  NGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS
          N +ILLAVDYVSKWVEAIA   SDA  V  F++ +IF RFGTPRA++SDEG+
Subjt:  NGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase2.0e-24138.7Show/hide
Query:  MLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDEADLA---------MIANA------------
        M+Q   Q  GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW  ++           M+ N             
Subjt:  MLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDEADLA---------MIANA------------

Query:  --------------------------------LKNVTVISHQQ-------PPAMEPTTVVNQVT------------------------DEACVYCGEDHN
                                        L N  V +H +       PP       V+QVT                           C  CGE H 
Subjt:  --------------------------------LKNVTVISHQQ-------PPAMEPTTVVNQVT------------------------DEACVYCGEDHN

Query:  YEFCPSNPASVFFVE-----------------------------------------AQQKMNQPGFAK--------------------------------
         + CP +  S+ FV                                           QQ++ QP   K                                
Subjt:  YEFCPSNPASVFFVE-----------------------------------------AQQKMNQPGFAK--------------------------------

Query:  AQAKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQR
          ++PQG LPS+T+ +PR++GK Q +AVTLR+G+ L+E+ K  +   + +K ++ E   E G+                +VE P                
Subjt:  AQAKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQR

Query:  QKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGA
                        L++LHINIP  EA+EQMP+ VKF+KDIL+KK+  G++ETV+LTEECSAI++N LPPK KDPGSFT+P +IG    GRALCDLG 
Subjt:  QKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGA

Query:  SINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKF
           L            EA+PT++TLQLADRS+TYP+G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQ               
Subjt:  SINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKF

Query:  NVFKAMKFPDEMEDCSFIRILESTV------------IETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSL
           KAMKFP+E ++C  + + ++ V            +E  + D  D+ +E+  EV+            +  + F+   V ESL   +R AP   +KPS+
Subjt:  NVFKAMKFPDEMEDCSFIRILESTV------------IETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSL

Query:  IEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEV
         E PTL+LKPLP+HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLNP MKEVVKKE+
Subjt:  IEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEV

Query:  IKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML------------------------
        IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTR V GWRVCMDYR+LNKATRKDHFPL FIDQML                        
Subjt:  IKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML------------------------

Query:  ----------------------------------------------------------------------------------IDWLVRPTTVSWMVILG-
                                                                                          ++W      V   ++LG 
Subjt:  ----------------------------------------------------------------------------------IDWLVRPTTVSWMVILG-

Query:  --------ITRLLLLLRIRRKPLSPVLMGHSLLGE------------------CLLASAMLQQHFSG-----------------------------VMCD
                + +  L    +  P + V    S LG                   C L    +  +F                               +MCD
Subjt:  --------ITRLLLLLRIRRKPLSPVLMGHSLLGE------------------CLLASAMLQQHFSG-----------------------------VMCD

Query:  ASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDK
        ASD AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTE ELL VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+
Subjt:  ASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDK

Query:  KGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGA
        KG+EN IADHLSRL+  +   E + I+D+FPDEQLL                                                        L RCV   
Subjt:  KGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGA

Query:  EAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDY
        E  +ILEQCH+ PYGGHF G RT  + L  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDY
Subjt:  EAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDY

Query:  VSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS
        VSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD G+
Subjt:  VSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS

A0A2G9G6G2 Reverse transcriptase3.1e-24240.09Show/hide
Query:  MLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDEADLA---------MIANA------------
        M+Q   Q  GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW  ++           M+ N             
Subjt:  MLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDEADLA---------MIANA------------

Query:  --------------------------------LKNVTVISHQQPPAMEPTTV-----------------------------VNQV--TDEACVYCGEDHN
                                        L N  V +H +  +   T                               VNQV  T   C  CGE H 
Subjt:  --------------------------------LKNVTVISHQQPPAMEPTTV-----------------------------VNQV--TDEACVYCGEDHN

Query:  YEFCPSNPASVFFVEAQQK--------------MNQPGFAKAQAKPQGKLPSDTKHPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELE
         + CP +  S+ FV   +K                 P F+    + QG  P      ++ G++QV+   ++  KP  E    Q + + +     +E ++ 
Subjt:  YEFCPSNPASVFFVEAQQK--------------MNQPGFAKAQAKPQGKLPSDTKHPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELE

Query:  SGQGAGGSNKNAGASGSVPDVEPPYLPPP---------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFG
         GQ A   N      GS+P    P                V   P   ++K    + + K+    L++LHINIP  EA+EQMP+ VKF+KDIL+KK+R G
Subjt:  SGQGAGGSNKNAGASGSVPDVEPPYLPPP---------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFG

Query:  EFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIF
        ++E V+LTEECS I++N LPPK K+PGSFT+P +IG    GRALCDLGASINLMP SIYR LG+ EA+PT++TLQLADRS+TYP+G I+D+LVKVDKFIF
Subjt:  EFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIF

Query:  HVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILES------------TVIETTIQDSADKHSE
          DF++LD E D  VPIILGRPF A GR LIDVQ                  KAMKFP+E ++C  + + ++              +E  + D  D+ +E
Subjt:  HVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILES------------TVIETTIQDSADKHSE

Query:  KHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRK
        K  EV+            +  + F+   V ESL   +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ 
Subjt:  KHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRK

Query:  AIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRV
         IGWT+ADI+GISPSFCMHKI+LE+    SI+ QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTR V GWRV
Subjt:  AIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRV

Query:  CMDYRRLNKATRKDHFPLPFIDQML---------------------------------------------------------------------------
        CMDYR+LNKATRKDHFPLPFIDQML                                                                           
Subjt:  CMDYRRLNKATRKDHFPLPFIDQML---------------------------------------------------------------------------

Query:  -------------------------------IDWLVRPTTVSWMVILG---------ITRLLLLLRIRRKPLSPVLMGHSLLGE----------------
                                       ++W      V   ++LG         + +  L    +  P + V    S LG                 
Subjt:  -------------------------------IDWLVRPTTVSWMVILG---------ITRLLLLLRIRRKPLSPVLMGHSLLGE----------------

Query:  --CLLASAMLQQHFSGVMCDA------------------SDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVT
          C L    +  +F+    DA                   D AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTE ELL VVFAF+KFR YLV +KV 
Subjt:  --CLLASAMLQQHFSGVMCDA------------------SDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVT

Query:  VFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------
        V+TDHA+IRYL+ KKDA P LI W+LLLQEFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQLL                          
Subjt:  VFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------

Query:  ------------------------------LLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDE
                                      L RCV   E  +ILEQCH+ PYGGHF G RT  + L  GFFWP LFKDAH F   CD CQR  N+  R E
Subjt:  ------------------------------LLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDE

Query:  MPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSD
        MPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD
Subjt:  MPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSD

A0A2G9HWF8 Reverse transcriptase1.0e-26140.64Show/hide
Query:  RMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTW-------------------------------
        +M+Q   Q  GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW                               
Subjt:  RMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTW-------------------------------

Query:  -----------------------------------------DEAD-----------LAMIANALKNVTVISHQQ------PPAMEPTTVVNQVT------
                                                 D+ D            A   N L N+ V  +++      PP       V+QVT      
Subjt:  -----------------------------------------DEAD-----------LAMIANALKNVTVISHQQ------PPAMEPTTVVNQVT------

Query:  -----DEACVYCGEDHNYEFCPSNPASVFFVEAQQK--------------MNQPGFAKAQAKPQGKLP-------SDTK-HPRREGKEQVKAVTLRSGKP
                   CGE H  + CP +  S+ FV   +K                 P F+    + QG  P        +TK +PR++GK Q +AVTLR+G+ 
Subjt:  -----DEACVYCGEDHNYEFCPSNPASVFFVEAQQK--------------MNQPGFAKAQAKPQGKLP-------SDTK-HPRREGKEQVKAVTLRSGKP

Query:  LEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
        L+E  K  +   + +K +  E++ +                   +VE P     P     PFPQR + +    QF KFLE+ K+LHINIP  EA+EQMP+
Subjt:  LEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  NVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYP
         VKF+KDIL+KK+R G++ETV+LTEECSAI++N LPPK KDPG              RALCDLGASINLMP SIYR LG+ EA+PT++TLQLADRS+TYP
Subjt:  NVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYP

Query:  EGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSAD
        +G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQKGEL MRV ++++ FNVFKAMKFP+E ++C  + + ++   + +I +   
Subjt:  EGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSAD

Query:  KHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDV---FESLDLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEAL
           E+    ++++      D + ++E+ +  D    F+S  ++  +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L
Subjt:  KHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDV---FESLDLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEAL

Query:  IKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIP
        +++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCVPKKGG+TVV N  NE IP
Subjt:  IKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIP

Query:  TRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML------------------------------------------------------------------
        T+ V GWRVCMDYR+LNKATRKDHFPLPFIDQML                                                                  
Subjt:  TRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML------------------------------------------------------------------

Query:  ----------------------------------------IDWLVRPTTVSWMVILG---------ITRLLLLLRIRRKPLSPVLMGHSLLGE-------
                                                ++W      V   ++LG         + +  L    +  P + V    S LG        
Subjt:  ----------------------------------------IDWLVRPTTVSWMVILG---------ITRLLLLLRIRRKPLSPVLMGHSLLGE-------

Query:  --------------------------CLLASAMLQQHFSG--------------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENEL
                                  CL A   L++                  +MCDASD A+GA+LGQ++ K    IYYASK LN+AQ+NYTTTE EL
Subjt:  --------------------------CLLASAMLQQHFSG--------------VMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENEL

Query:  LVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL------
        L VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FPDEQLL      
Subjt:  LVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL------

Query:  --------------------------------------------------LLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAH
                                                          L RCV   E  +I EQCH+ PYGGHF   RT  + L  GFFWP LFKD H
Subjt:  --------------------------------------------------LLRCVLGAEAKEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAH

Query:  WFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS
         F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDY+SKWVEA+A   +D+K V  F++ +IF RFGTPRA++SD G+
Subjt:  WFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGS

A0A2G9HYA0 Reverse transcriptase7.6e-25746.51Show/hide
Query:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK
        ++PQG LPS+T+ +PR++GK Q +AVTLR+G+ L+E  K  +   + +K ++ E++ +                   +VE P     P     PFPQR +
Subjt:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK

Query:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI
         +  + QF KFLE+ K+LHINIP  EA+EQMP+ VKF+KDIL+KK+R G++ETV+LTEECSAI++N LPPK KDPGSFT+P +IG    GRALCDLGASI
Subjt:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI

Query:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV
        NLMP SIYR LG+ EA+PT++TLQLADRS+TYP+G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQKGEL MRV ++++ FNV
Subjt:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV

Query:  FKAMKFPDEMEDCSFIRILESTV------------IETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIE
        FKAMKFP+E ++C  + + +               +E  + D  D+ +E+  EV+       +LD     +  R E +       +R  P   +KPS+ +
Subjt:  FKAMKFPDEMEDCSFIRILESTV------------IETTIQDSADKHSEKHGEVIIEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIE

Query:  APTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIK
         PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLNP MKEVVKKE+IK
Subjt:  APTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIK

Query:  WLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML--------------------------
        WLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTR V GWRVCMDYR+LNKATRKDHFPLPFIDQML                          
Subjt:  WLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML--------------------------

Query:  --------------------------------------------------------------------------------IDWLVRPTTVSWMVILG---
                                                                                        ++W      V   ++LG   
Subjt:  --------------------------------------------------------------------------------IDWLVRPTTVSWMVILG---

Query:  ------ITRLLLLLRIRRKPLSPVLMGHSLLGE------------------CLLASAMLQQHFSG-----------------------------VMCDAS
              + +  L    +  P + V    S LG                   C L    +  +F                               +MCDAS
Subjt:  ------ITRLLLLLRIRRKPLSPVLMGHSLLGE------------------CLLASAMLQQHFSG-----------------------------VMCDAS

Query:  DAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKG
        D AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTE ELL VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG
Subjt:  DAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKG

Query:  SENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEA
        +EN IADHLSRL+  +   E + I+D+FPDEQLL                                                        L RCV   E 
Subjt:  SENVIADHLSRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEA

Query:  KEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVS
         +ILEQCH+ PYGGHF G RT  + L  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDYVS
Subjt:  KEILEQCHSLPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVS

Query:  KWVEAIACHQSDAKTV
        KWVEA A   +D+K V
Subjt:  KWVEAIACHQSDAKTV

A0A2G9HYD8 Reverse transcriptase2.4e-25546.88Show/hide
Query:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK
        ++PQG LPS+T+ +PR++GK Q +AVTLR+G+ L+E  K       S +  V+ KE E                   +VE P     P     PFPQ+ +
Subjt:  AKPQGKLPSDTK-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQK

Query:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI
         +  + QF KFLE+ K+LHINIP  EA+EQMP+ VKF+KDIL+KK+R G++ET +LTEEC+AI++N LPPK KDPGSFT+P +IG    GRALCDLGASI
Subjt:  PKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASI

Query:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV
        NLMP SIYR LG+ EA+PT++TLQLADRS+TYP+G IED+LVKVDKFIF  DF++LD E D  VPIILGRPF A GR LIDVQKGEL MRV ++++ FNV
Subjt:  NLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNV

Query:  FKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEFCSLD---RKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPL
        FKAMKFP+E ++C  + + ++     +I +      E+    +IE+     L+     N  + F+   V ESL   +R  P   +KPS+ + PTL+LKPL
Subjt:  FKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEFCSLD---RKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPL

Query:  PDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYP
        P+HL YVYLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI+LE+    S++ QRRLN  MKEVVKKE+IKWLDAGIIYP
Subjt:  PDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEEGSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML-----------------------------------
        I+DS+WVSPVQCVPKKGG+TVV N  NELIPTR V GWRVCMDYR+LNKATRKDHFPLPFIDQML                                   
Subjt:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQML-----------------------------------

Query:  -----------------------------------------------------------------------IDWLVRPTTVSWMVILG---------ITR
                                                                               ++W      V   ++LG         + +
Subjt:  -----------------------------------------------------------------------IDWLVRPTTVSWMVILG---------ITR

Query:  LLLLLRIRRKPLSPV-----LMGHS---------------------------------------LLGECLLASAMLQQHFS---GVMCDASDAAVGAMLG
          L    +  P + V      +GH+                                       L G  + A  +    +S    +MCDASD A+GA+LG
Subjt:  LLLLLRIRRKPLSPV-----LMGHS---------------------------------------LLGECLLASAMLQQHFS---GVMCDASDAAVGAMLG

Query:  QKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHL
        Q++ K    IYYASK LN+AQ+NYTTTE ELL VVFAF+KFR YLVG+KV V+TDHA+IRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IADHL
Subjt:  QKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHL

Query:  SRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEAKEILEQCHS
        SRL+  +   E + I+D+FPDEQLL                                                        L RCV   E  +ILEQCH+
Subjt:  SRLDLSSSLLEQSAISDSFPDEQLL--------------------------------------------------------LLRCVLGAEAKEILEQCHS

Query:  LPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACH
         PYGGHF G RT  + L  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF+PS GN++IL+AVDYVSKWVEA A  
Subjt:  LPYGGHFSGRRTTMRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACH

Query:  QSDAKTV
         +D+K V
Subjt:  QSDAKTV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.1e-2141.3Show/hide
Query:  DASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKD
        DASD A+GA+L Q      HP+ Y S+ LNE ++NY+T E ELL +V+A + FR YL+G    + +DH  + +L   KD   +L RW + L EFD +IK 
Subjt:  DASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGSKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKD

Query:  KKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLLL
         KG EN +AD LSR+ L  + L +     +  D   L+
Subjt:  KKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLLL

P0CT34 Transposon Tf2-1 polyprotein4.9e-1925.9Show/hide
Query:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R
        +CL++  +L +HF       +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+L ++ + + +R YL  +     + TDH ++  R
Subjt:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R

Query:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC
             +    RL RW L LQ+F+ EI  + GS N IAD LSR+            D S + + Q +I+D F ++ +        L  +L  E K + E  
Subjt:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC

Query:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW
                       LP     +  RT ++  H                    F W  + K    + + C  CQ   +   +   PL  I   E  ++  
Subjt:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW

Query:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD
         +DF+     S+G   + + VD  SK    + C +S  A+  AR     +   FG P+ +++D
Subjt:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD

P0CT35 Transposon Tf2-2 polyprotein4.9e-1925.9Show/hide
Query:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R
        +CL++  +L +HF       +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+L ++ + + +R YL  +     + TDH ++  R
Subjt:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R

Query:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC
             +    RL RW L LQ+F+ EI  + GS N IAD LSR+            D S + + Q +I+D F ++ +        L  +L  E K + E  
Subjt:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC

Query:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW
                       LP     +  RT ++  H                    F W  + K    + + C  CQ   +   +   PL  I   E  ++  
Subjt:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW

Query:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD
         +DF+     S+G   + + VD  SK    + C +S  A+  AR     +   FG P+ +++D
Subjt:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD

P0CT41 Transposon Tf2-12 polyprotein4.9e-1925.9Show/hide
Query:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R
        +CL++  +L +HF       +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+L ++ + + +R YL  +     + TDH ++  R
Subjt:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R

Query:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC
             +    RL RW L LQ+F+ EI  + GS N IAD LSR+            D S + + Q +I+D F ++ +        L  +L  E K + E  
Subjt:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC

Query:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW
                       LP     +  RT ++  H                    F W  + K    + + C  CQ   +   +   PL  I   E  ++  
Subjt:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW

Query:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD
         +DF+     S+G   + + VD  SK    + C +S  A+  AR     +   FG P+ +++D
Subjt:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD

Q9UR07 Transposon Tf2-11 polyprotein4.9e-1925.9Show/hide
Query:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R
        +CL++  +L +HF       +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+L ++ + + +R YL  +     + TDH ++  R
Subjt:  ECLLASAMLQQHFSG-----VMCDASDAAVGAMLGQK-QVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVGS--KVTVFTDHASI--R

Query:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC
             +    RL RW L LQ+F+ EI  + GS N IAD LSR+            D S + + Q +I+D F ++ +        L  +L  E K + E  
Subjt:  YLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRL------------DLSSSLLEQSAISDSFPDEQL------LLLRCVLGAEAKEILEQC

Query:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW
                       LP     +  RT ++  H                    F W  + K    + + C  CQ   +   +   PL  I   E  ++  
Subjt:  H-------------SLPYGGHFSGRRTTMRTLH------------------CGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL-FDVW

Query:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD
         +DF+     S+G   + + VD  SK    + C +S  A+  AR     +   FG P+ +++D
Subjt:  GIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQS-DAKTVARFLQSHIFVRFGTPRALVSD

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein4.7e-1767.27Show/hide
Query:  LHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
        L  GF+WPT FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM
Subjt:  LHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACGTGCCGCAACTTCCGCAGGTTCTTGAAGGTCCAGTAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCGCTGTTTGAGCAAAATGAGCAGCA
AAATAATCAGGCTGAGAATCCTATCCTGATAGCAAACGATAGGACCAGAGCCAATCGAATGTTGCAAACCGTGGGGCAACTCCATGGTTTGTCATCTGAAGACCCTCATT
TACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAG
TCATGGTTAAACTCTTTTGCTCTAGGATCAATTAGGACGTGGGATGAGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAACC
ACCAGCTATGGAGCCTACTACAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAAGACCATAACTACGAGTTTTGCCCCAGCAACCCAGCTTCTGTGT
TTTTTGTAGAAGCGCAACAAAAGATGAACCAGCCGGGATTTGCTAAAGCGCAGGCGAAGCCTCAAGGGAAACTTCCCTCGGATACTAAACACCCTCGAAGGGAAGGTAAG
GAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAATAATAGTGATAAAAATATTGTTGTTGAGAAAGAGTT
GGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATACTTGCCGCCCCCACCTTATGTACCACCTCTAC
CTTTTCCACAAAGACAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTCTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAA
ATGCCTAATAATGTTAAGTTTCTTAAGGATATTTTGACTAAAAAGAAAAGATTTGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGG
GCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACCATGCCTGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGC
CTCTTTCGATCTATCGAAAGTTAGGTATTCGTGAAGCTAGGCCTACCACAGTAACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTC
TTAGTGAAAGTAGATAAATTCATATTTCATGTTGATTTTATTATTTTAGACTATGAGACTGATAAATATGTCCCAATTATTCTAGGTCGTCCATTTTTTGCTATTGGTAG
GGCATTAATAGATGTTCAAAAAGGGGAATTAATAATGAGAGTCTGTAATGAGGAAGTGAAATTTAACGTGTTTAAAGCCATGAAGTTTCCAGACGAAATGGAGGATTGCT
CCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGACAACAATACAGGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTATTATAGAAGATTTTGAATTTTGT
TCTTTAGATAGGAAAAATGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGGAAAGCTCCCCCAATTAAGCCATCCCTGATTGAGGC
ACCCACTTTAGATTTGAAGCCCTTACCAGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGCCGGAGCATGAGG
AGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCATTCTAGAGGAG
GGATCCTTTAGGAGTATTGATCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGCTGGATGCTGGGATCATCTATCCAATTGC
CGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGAGTTGATTCCAACTAGGATAGTAATTGGATGGA
GGGTTTGCATGGATTACAGAAGGCTTAATAAAGCTACCCGAAAGGATCATTTCCCTCTACCATTTATTGACCAGATGTTGATCGATTGGCTGGTTAGGCCTACTACTGTT
TCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGGCGAATGCCTTCT
GGCCTCTGCAATGCTCCAACAACATTTCAGCGGTGTAATGTGTGATGCGAGTGATGCTGCGGTAGGTGCGATGCTGGGGCAAAAGCAGGTCAAATTTATCCATCCTATAT
ATTATGCAAGCAAGGTTTTAAATGAAGCACAAGTCAACTATACAACTACTGAAAATGAGTTGTTAGTTGTGGTGTTCGCTTTTGAGAAGTTCCGACCATATTTGGTTGGA
TCCAAAGTCACAGTTTTCACGGATCATGCATCAATAAGGTACTTAATGTCTAAGAAAGATGCAAAGCCTAGACTAATTCGTTGGATTTTACTATTGCAGGAGTTCGACTT
GGAAATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCTGTCATCATCTTTGCTGGAGCAATCTGCCATTTCCGATTCTTTTCCAG
ATGAACAACTTTTGTTGTTGAGGTGTGTTTTAGGTGCTGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTTGCCGTATGGAGGTCATTTCAGCGGTCGGAGGACAACT
ATGAGGACTTTGCATTGCGGATTTTTCTGGCCTACGTTATTCAAGGATGCCCATTGGTTCTACAAGCAATGTGATGCTTGCCAACGGAGAGGAAATTTGGGGCCTAGAGA
TGAAATGCCTCTTACTTATATTTTAGAAGTTGAATTATTTGATGTATGGGGTATTGACTTTATGGGGCCATTTCTCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAG
TTGATTATGTGTCCAAATGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTAGCAAGGTTTCTTCAATCGCACATTTTTGTGCGGTTTGGGACACCTAGG
GCTCTAGTGAGTGATGAGGGATCGTTTATTGTGATTGAGGTTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGTTCAGAAGATTGT
TGCAGCAAAGATAATGCTGGAGCAAATTTCTCGTACGAATAGGAAGGATTTTAGTATGATTGAAATTCACGATTTTACCGTTAGGGATTTGAATTGCAAAGTAAATATGA
ACGATACGACTCGAGAGGAAAATGTTGAGGAGAATCAAGAACAGCAGGCTGAGATGGTGAGAGACGAGCAGGCAGACGTAGTGTCTGAAAGAGGAAACGAGCAGGACCAA
GAAGCTCGTGTTGAGGTAATCATGCCCGAACCGCCGAAACGCCGCCGCATAAAGAGAAAGGTCGGCCGTATTCAGGTGATTTGGACTGATACCCCATCACCACCATCATC
AGATTCTGAGAGAGAAAAGGCAGAGCGAGAGGAAAGAGAGAAGAAAGAAGTGAGGACAAAGAAAGAGAAGAAACAGAGAACAAAATACGCTGAGCTTCTGAAAAGAGATT
TCTTGTTCGACCGTGGTTTCAGTTGTGATCTTCCACAGTTTCTGAGGACCGGAGTCGAGGTGGATTGGAGTCCAAGTGCTATCAATGCACTGTACAACGTTCAAAACTTC
CCCCATGCGGCGTATAATGAGATGGCTGTGGCGCCATCTGATGAGCAACTAAGTGAGGCTGTAAGGAGAAGATGCTTCCAACAACACATGACTCGACAATCTCAGGGAAC
GGGTTCCCTAGCTTTTGCCATCTTACGGTCTCTCAGTATTAACGTAGGGAGGATCATTGCGAATGAGATTTCTGGATGTTGGAAAAGGAAGGTGGGGAAACTATTCTTTC
CGAACACAATCATGATGCTGTGCAGAAGAGCAGGGGTTCCGGTGGATGAGGGAGATGTGATCCTATTTGATAAAGGGATCATAGACACGCCCAAATTGGCACGGCTCCAG
CGTGTGCAGGAGAAGTTTGCTGAGAGGCAGACTTTAACCTTCTGGAGCTATGTTAAGAATCGGGATGCCGACTTAAAGAGGGCGCTGCAAGAGAATTTTTCAAACCCATA
CCCAACCCTTCCCATATTCCCTGACGATCTGTTGAACCCCTGGATTCCGCCCCCACCTGCTGAAAGAGGAGAAGAGAATGATGATGAAGAACAGGGTCAGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACGTGCCGCAACTTCCGCAGGTTCTTGAAGGTCCAGTAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCGCTGTTTGAGCAAAATGAGCAGCA
AAATAATCAGGCTGAGAATCCTATCCTGATAGCAAACGATAGGACCAGAGCCAATCGAATGTTGCAAACCGTGGGGCAACTCCATGGTTTGTCATCTGAAGACCCTCATT
TACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAG
TCATGGTTAAACTCTTTTGCTCTAGGATCAATTAGGACGTGGGATGAGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAACC
ACCAGCTATGGAGCCTACTACAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAAGACCATAACTACGAGTTTTGCCCCAGCAACCCAGCTTCTGTGT
TTTTTGTAGAAGCGCAACAAAAGATGAACCAGCCGGGATTTGCTAAAGCGCAGGCGAAGCCTCAAGGGAAACTTCCCTCGGATACTAAACACCCTCGAAGGGAAGGTAAG
GAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAATAATAGTGATAAAAATATTGTTGTTGAGAAAGAGTT
GGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATACTTGCCGCCCCCACCTTATGTACCACCTCTAC
CTTTTCCACAAAGACAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTCTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAA
ATGCCTAATAATGTTAAGTTTCTTAAGGATATTTTGACTAAAAAGAAAAGATTTGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGG
GCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACCATGCCTGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGC
CTCTTTCGATCTATCGAAAGTTAGGTATTCGTGAAGCTAGGCCTACCACAGTAACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTC
TTAGTGAAAGTAGATAAATTCATATTTCATGTTGATTTTATTATTTTAGACTATGAGACTGATAAATATGTCCCAATTATTCTAGGTCGTCCATTTTTTGCTATTGGTAG
GGCATTAATAGATGTTCAAAAAGGGGAATTAATAATGAGAGTCTGTAATGAGGAAGTGAAATTTAACGTGTTTAAAGCCATGAAGTTTCCAGACGAAATGGAGGATTGCT
CCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGACAACAATACAGGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTATTATAGAAGATTTTGAATTTTGT
TCTTTAGATAGGAAAAATGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGGAAAGCTCCCCCAATTAAGCCATCCCTGATTGAGGC
ACCCACTTTAGATTTGAAGCCCTTACCAGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGCCGGAGCATGAGG
AGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCATTCTAGAGGAG
GGATCCTTTAGGAGTATTGATCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGCTGGATGCTGGGATCATCTATCCAATTGC
CGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGAGTTGATTCCAACTAGGATAGTAATTGGATGGA
GGGTTTGCATGGATTACAGAAGGCTTAATAAAGCTACCCGAAAGGATCATTTCCCTCTACCATTTATTGACCAGATGTTGATCGATTGGCTGGTTAGGCCTACTACTGTT
TCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGGCGAATGCCTTCT
GGCCTCTGCAATGCTCCAACAACATTTCAGCGGTGTAATGTGTGATGCGAGTGATGCTGCGGTAGGTGCGATGCTGGGGCAAAAGCAGGTCAAATTTATCCATCCTATAT
ATTATGCAAGCAAGGTTTTAAATGAAGCACAAGTCAACTATACAACTACTGAAAATGAGTTGTTAGTTGTGGTGTTCGCTTTTGAGAAGTTCCGACCATATTTGGTTGGA
TCCAAAGTCACAGTTTTCACGGATCATGCATCAATAAGGTACTTAATGTCTAAGAAAGATGCAAAGCCTAGACTAATTCGTTGGATTTTACTATTGCAGGAGTTCGACTT
GGAAATAAAGGACAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCTGTCATCATCTTTGCTGGAGCAATCTGCCATTTCCGATTCTTTTCCAG
ATGAACAACTTTTGTTGTTGAGGTGTGTTTTAGGTGCTGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTTGCCGTATGGAGGTCATTTCAGCGGTCGGAGGACAACT
ATGAGGACTTTGCATTGCGGATTTTTCTGGCCTACGTTATTCAAGGATGCCCATTGGTTCTACAAGCAATGTGATGCTTGCCAACGGAGAGGAAATTTGGGGCCTAGAGA
TGAAATGCCTCTTACTTATATTTTAGAAGTTGAATTATTTGATGTATGGGGTATTGACTTTATGGGGCCATTTCTCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAG
TTGATTATGTGTCCAAATGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTAGCAAGGTTTCTTCAATCGCACATTTTTGTGCGGTTTGGGACACCTAGG
GCTCTAGTGAGTGATGAGGGATCGTTTATTGTGATTGAGGTTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGTTCAGAAGATTGT
TGCAGCAAAGATAATGCTGGAGCAAATTTCTCGTACGAATAGGAAGGATTTTAGTATGATTGAAATTCACGATTTTACCGTTAGGGATTTGAATTGCAAAGTAAATATGA
ACGATACGACTCGAGAGGAAAATGTTGAGGAGAATCAAGAACAGCAGGCTGAGATGGTGAGAGACGAGCAGGCAGACGTAGTGTCTGAAAGAGGAAACGAGCAGGACCAA
GAAGCTCGTGTTGAGGTAATCATGCCCGAACCGCCGAAACGCCGCCGCATAAAGAGAAAGGTCGGCCGTATTCAGGTGATTTGGACTGATACCCCATCACCACCATCATC
AGATTCTGAGAGAGAAAAGGCAGAGCGAGAGGAAAGAGAGAAGAAAGAAGTGAGGACAAAGAAAGAGAAGAAACAGAGAACAAAATACGCTGAGCTTCTGAAAAGAGATT
TCTTGTTCGACCGTGGTTTCAGTTGTGATCTTCCACAGTTTCTGAGGACCGGAGTCGAGGTGGATTGGAGTCCAAGTGCTATCAATGCACTGTACAACGTTCAAAACTTC
CCCCATGCGGCGTATAATGAGATGGCTGTGGCGCCATCTGATGAGCAACTAAGTGAGGCTGTAAGGAGAAGATGCTTCCAACAACACATGACTCGACAATCTCAGGGAAC
GGGTTCCCTAGCTTTTGCCATCTTACGGTCTCTCAGTATTAACGTAGGGAGGATCATTGCGAATGAGATTTCTGGATGTTGGAAAAGGAAGGTGGGGAAACTATTCTTTC
CGAACACAATCATGATGCTGTGCAGAAGAGCAGGGGTTCCGGTGGATGAGGGAGATGTGATCCTATTTGATAAAGGGATCATAGACACGCCCAAATTGGCACGGCTCCAG
CGTGTGCAGGAGAAGTTTGCTGAGAGGCAGACTTTAACCTTCTGGAGCTATGTTAAGAATCGGGATGCCGACTTAAAGAGGGCGCTGCAAGAGAATTTTTCAAACCCATA
CCCAACCCTTCCCATATTCCCTGACGATCTGTTGAACCCCTGGATTCCGCCCCCACCTGCTGAAAGAGGAGAAGAGAATGATGATGAAGAACAGGGTCAGGAAGATTGA
Protein sequenceShow/hide protein sequence
MENVPQLPQVLEGPVAANPQQNPLLQQNPLFEQNEQQNNQAENPILIANDRTRANRMLQTVGQLHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAK
SWLNSFALGSIRTWDEADLAMIANALKNVTVISHQQPPAMEPTTVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVEAQQKMNQPGFAKAQAKPQGKLPSDTKHPRREGK
EQVKAVTLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYLPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQ
MPNNVKFLKDILTKKKRFGEFETVSLTEECSAILKNGLPPKAKDPGSFTMPVSIGGKELGRALCDLGASINLMPLSIYRKLGIREARPTTVTLQLADRSITYPEGKIEDV
LVKVDKFIFHVDFIILDYETDKYVPIILGRPFFAIGRALIDVQKGELIMRVCNEEVKFNVFKAMKFPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEVIIEDFEFC
SLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKIILEE
GSFRSIDQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRIVIGWRVCMDYRRLNKATRKDHFPLPFIDQMLIDWLVRPTTV
SWMVILGITRLLLLLRIRRKPLSPVLMGHSLLGECLLASAMLQQHFSGVMCDASDAAVGAMLGQKQVKFIHPIYYASKVLNEAQVNYTTTENELLVVVFAFEKFRPYLVG
SKVTVFTDHASIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDLSSSLLEQSAISDSFPDEQLLLLRCVLGAEAKEILEQCHSLPYGGHFSGRRTT
MRTLHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFLPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFVRFGTPR
ALVSDEGSFIVIEVFPNGAITLQDEKDGRVFKVQKIVAAKIMLEQISRTNRKDFSMIEIHDFTVRDLNCKVNMNDTTREENVEENQEQQAEMVRDEQADVVSERGNEQDQ
EARVEVIMPEPPKRRRIKRKVGRIQVIWTDTPSPPSSDSEREKAEREEREKKEVRTKKEKKQRTKYAELLKRDFLFDRGFSCDLPQFLRTGVEVDWSPSAINALYNVQNF
PHAAYNEMAVAPSDEQLSEAVRRRCFQQHMTRQSQGTGSLAFAILRSLSINVGRIIANEISGCWKRKVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIDTPKLARLQ
RVQEKFAERQTLTFWSYVKNRDADLKRALQENFSNPYPTLPIFPDDLLNPWIPPPPAERGEENDDEEQGQED