; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022579 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022579
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:33527020..33537824
RNA-Seq ExpressionLag0022579
SyntenyLag0022579
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.7e-26440.79Show/hide
Query:  VNQV--IEEACVYCGEDHNYEFCP----------------SNPASVLAQP-----PQLSWG---GQGSNMQTQQKVNQPGFAKAQVLPQQNKQACPTKFR
        VNQV  I   C  CGE H  + CP                +NP S    P     P  SW    GQGS  + QQ     G  +    P Q K       +
Subjt:  VNQV--IEEACVYCGEDHNYEFCP----------------SNPASVLAQP-----PQLSWG---GQGSNMQTQQKVNQPGFAKAQVLPQQNKQACPTKFR

Query:  SSLEAMMKEFMARTDA---AIQRKLPQILNTL-----------------KEGKEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPP
         SLE  + +FMA T A    ++ ++ Q+ N +                 ++GK Q +AVTLR+  E  +                               
Subjt:  SSLEAMMKEFMARTDA---AIQRKLPQILNTL-----------------KEGKEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPP

Query:  LPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA
         P   +++    + + K+    L++LHINIP  EA+EQMP+Y KF+KDIL+KK+ LG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GRA
Subjt:  LPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVC
        LCDLG    L           GE +PT++TLQL DRS+TYP+G IED+LVKVDKFIFP DF++LD E +  VPIILG PF ATGR LIDVQK    M+  
Subjt:  LCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVC

Query:  NE---------------------------------------EVKFNVFK---AMKYLDEMEDCSFIRILESTIVKIAIQDSTD---KHLEGH------GE
        NE                                       E  + V K   A KY       S  R   S ++K +I++      K L  H      GE
Subjt:  NE---------------------------------------EVKFNVFK---AMKYLDEMEDCSFIRILESTIVKIAIQDSTD---KHLEGH------GE

Query:  ---------------------------------------GISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQ
                                               GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLD GIIYPI D++WVSPVQ
Subjt:  ---------------------------------------GISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQ

Query:  CVPKKGGVTM----------------------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRRMP
        CVPKKGG+T+                                                                I IAPEDQEKITFTCPYGTFAFRRMP
Subjt:  CVPKKGGVTM----------------------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRRMP

Query:  FGLCNAPATFQ-CK-----------------------------------VLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEP
        FGLCNAPATFQ C                                    VLKRCEDT+L+LNWEKCHFMV+EGIVLGH++S  G+EVD+AK+E IE+L P
Subjt:  FGLCNAPATFQ-CK-----------------------------------VLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEP

Query:  PNSVKGIQSFLGHAGFYRR---------------------------------------------------------------------------------
        P SVKG++SFLGHAGFYRR                                                                                 
Subjt:  PNSVKGIQSFLGHAGFYRR---------------------------------------------------------------------------------

Query:  ----VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK------------------------------------------
             LN+AQ+NYTTTEKELLAVVFAF+KFR +LVG+KV V+TDHAAIRYL+ KKDAK                                          
Subjt:  ----VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK------------------------------------------

Query:  ----------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQR
                        V  D PW                                                  CV   E  +ILEQCH+SPYGGHF G R
Subjt:  ----------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQR

Query:  TAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFL
        TA +IL  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F+
Subjt:  TAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFL

Query:  QSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG------
        + +IF RFGTPRA++SD G HF N     LL+KY +KH+I+TPYHPQ +GQ E+S R IK ILEK V  +RKDWS RLDEALWAYRTAYKTP+G      
Subjt:  QSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG------

Query:  ------------------AI-------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPF
                          AI             R+LQLNEL+EFR  +YENA++YKEK K  H+KKI  + F  GQ VL +NSRLKLFPGK+KS+WSGPF
Subjt:  ------------------AI-------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPF

Query:  VVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSL
         + EVFPHGA+ L+++     FKVN QR+KHYWGE    ++ S+
Subjt:  VVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSL

PIN00904.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.3e-26640.77Show/hide
Query:  DELNPGIARPQIQAANFEMKRADLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCP----------------SNPASVLAQP---
        +   P  A   I+        A +  +  ++KN  V    Q QH P           C  CGE H  + CP                +NP S    P   
Subjt:  DELNPGIARPQIQAANFEMKRADLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCP----------------SNPASVLAQP---

Query:  --PQLSWG---GQGSNMQTQQKVNQPGFAKAQVLPQQNKQACPTKFRSSLEAMMKEFMARTDAAIQRKLPQILNTLKEGKEQVKAVTLRSELETGQGAGG
          P  SW    GQGS  + QQ+  Q      QV P   ++      + SLE  + +FMA T A  +    QI        +   A+  R      QG+  
Subjt:  --PQLSWG---GQGSNMQTQQKVNQPGFAKAQVLPQQNKQACPTKFRSSLEAMMKEFMARTDAAIQRKLPQILNTLKEGKEQVKAVTLRSELETGQGAGG

Query:  SNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK
        SN +  +   V       +     V   P   +++    + + K+    L++LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++E V+LTEECS I++
Subjt:  SNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK

Query:  NGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVP
        N LPPK K+PGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GE +PT++TLQL DRS+TYP+G I+D+LVKVDKFIFP DF++LD E +  VP
Subjt:  NGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVP

Query:  IILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFK--------AMKYLDEME-------------DCSFIRILE------------------STIVKI
        IILG PF ATGR LIDVQK        +E    ++F         A + LD +E             DC  ++ L+                  S ++K 
Subjt:  IILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFK--------AMKYLDEME-------------DCSFIRILE------------------STIVKI

Query:  AIQDSTD---KHLEGH------GE---------------------------------------GISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKK
        +I++      K L  H      GE                                       GISPSFCMHKI LE+    SIE QRRLNP MKEVVKK
Subjt:  AIQDSTD---KHLEGH------GE---------------------------------------GISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKK

Query:  EVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM----------------------------------------------------------------IT
        E+IKWLD GIIYPI D++WVSPVQCVPKKGG+T+                                                                I 
Subjt:  EVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM----------------------------------------------------------------IT

Query:  IAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ------------------------------------CKVLKRCEDTHLVLNWEKCHFMVKEGIVL
        IAPEDQEK TFTCPYGTFAFRRMPFGLCNAPATFQ                                      VLKRCEDT+LVLNWEKCHFMV+EGIVL
Subjt:  IAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ------------------------------------CKVLKRCEDTHLVLNWEKCHFMVKEGIVL

Query:  GHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR---------------------------------------------------------
        GH++S  G+EVD+AK+E IE+L PP SVKG++SFLGHAGFYRR                                                         
Subjt:  GHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR---------------------------------------------------------

Query:  -----------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-----------------------------
                          LN+AQ+NYTTTEKELLAVVFAF+KFR +LV +KV V+TDHAAIRYL+ KKDA                              
Subjt:  -----------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-----------------------------

Query:  -----------------------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQC
                                     V  + PW                                                  CV   E  +ILEQC
Subjt:  -----------------------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQC

Query:  HSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIA
        H+SPYGGHF G RTA +IL  GFFWP LFKDAH F   CD CQR  N+  R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A
Subjt:  HSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIA

Query:  CHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRT
           +D+K V  F++ +IF RFGTPRA++SD   +F N     LL+KY +KH+I TPYHPQ +G  E+S R IK ILEK V  +RKDWS RLDEALWAYRT
Subjt:  CHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRT

Query:  AYKTPLG------------------------AI-------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKL
        AYKTP+G                        AI             R+LQLNEL+EFR  +YENA++YKEKTK  HDKKI  + F  GQ VL +NSRLKL
Subjt:  AYKTPLG------------------------AI-------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKL

Query:  FPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWG
        FPGK+KS+W G F + EVFPHGA+ L++E     FK+N +R+KHYWG
Subjt:  FPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWG

PIN26668.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.6e-28041.88Show/hide
Query:  DELNPGIARPQIQAANFEMKRADLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCPSNPASVL----AQPPQLS---------WG
        +   P  A   I+        A +  +  ++KN  V    Q QH P           C  CGE H    CP++  S+     A+ PQ +         W 
Subjt:  DELNPGIARPQIQAANFEMKRADLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCPSNPASVL----AQPPQLS---------WG

Query:  GQGSNMQTQQKVNQPGFAKAQVLPQQNKQACPTKFRSSLEAMMKEFMARTDA---AIQRKLPQILNTL-----------------KEGKEQVKAVTLRSE
         Q  N        Q    + Q   QQ  Q    + + SLE  + +FMA T      ++ ++ Q+ N +                 ++GK Q +AVTLR+ 
Subjt:  GQGSNMQTQQKVNQPGFAKAQVLPQQNKQACPTKFRSSLEAMMKEFMARTDA---AIQRKLPQILNTL-----------------KEGKEQVKAVTLRSE

Query:  LETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSL
         E  +      K  G    +   E   V       PL   Q+Q+ K    QF KFLE+ K+LHIN P  EA+EQMP+Y KF+K IL+KK+RLG++ETV+L
Subjt:  LETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSL

Query:  TEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIIL
        TEECSAI++N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GE +PT++TLQL +RS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIIL

Query:  DYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYLDEMEDC-------------------------------------------
        D E +  VPIILG PF ATGR LIDVQKG+LTMRV ++++ FNVFKAMK+ +E ++C                                           
Subjt:  DYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYLDEMEDC-------------------------------------------

Query:  ----------SFIRILESTIVKIAIQDSTD---KHLEGH------GE---------------------------------------GISPSFCMHKIALE
                  S  R   S ++K +I++S     K L  H      GE                                       GIS SFCMHKI LE
Subjt:  ----------SFIRILESTIVKIAIQDSTD---KHLEGH------GE---------------------------------------GISPSFCMHKIALE

Query:  EGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTMI-----TIAPE--------------------------------
        +    S+E QRRLNP MKEVVKKE+IKW+D GIIYPI D++WVSPVQCVPKKGG+T++      + P                                 
Subjt:  EGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTMI-----TIAPE--------------------------------

Query:  -----------------DQEKITFTCPYGTFAFRRMPFGLCNAPATFQ-CK-----------------------------------VLKRCEDTHLVLNW
                         DQEK TFTCPYGTFAFRR+PFGLCNAPATFQ C                                    VLKRCEDT+LVLNW
Subjt:  -----------------DQEKITFTCPYGTFAFRRMPFGLCNAPATFQ-CK-----------------------------------VLKRCEDTHLVLNW

Query:  EKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR--------------------------------------------
        +KCHFMV+EGIVL H++S  G+EV++AK+E IE+L PP SVKGI+SFLGHAGFYRR                                            
Subjt:  EKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR--------------------------------------------

Query:  -----------------------------------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-----
                                                  LN+ Q+NYTTTEKELLAVVFAF+KFR +LVG+KV V+TDHAAIRYL+ KKDAK     
Subjt:  -----------------------------------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-----

Query:  ----------------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQCHSSPYGG
                              V  D PW                                                  CV   E  +ILEQCH+SPYGG
Subjt:  ----------------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQCHSSPYGG

Query:  HFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAK
        HF G RTA +IL  GFFWP LFKDAH F   CD CQR GN+  R EMPL  IL+VELFDVWGIDF+GPF PS GN++IL+AVDYVSKWVEA+A   +D+K
Subjt:  HFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAK

Query:  TVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
         V  F++ +IF RFGTPRA++SD G HF N      L+KY +KH+I TPYHPQ +GQ E+S R IK ILEK V  +R DWS RLDEALWAYRT YKTP+G
Subjt:  TVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

Query:  ------------------------AI-------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKS
                                AI             R+LQLNEL+EFR  +YENA++YKEKTK  HDKKI  + F  GQ VL +NSRLKLFP K+K 
Subjt:  ------------------------AI-------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKS

Query:  KWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWG
        +WSGPF + EVFPHGA+ L++E     FKVN QR+KHYWG
Subjt:  KWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWG

XP_012853091.1 PREDICTED: uncharacterized protein LOC105972661 [Erythranthe guttata]3.7e-26140.26Show/hide
Query:  SLEAMMKEFMARTDAAIQRKLPQILNTLKEG-KEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPP-------PPYVPPLPFPQRQRPKNQD
        +LE  M +F  + +      LP      ++G  EQ KAV+LR+  +  + A  +    G        E    PP       P   P +P+PQRQ     +
Subjt:  SLEAMMKEFMARTDAAIQRKLPQILNTLKEG-KEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPP-------PPYVPPLPFPQRQRPKNQD

Query:  GQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPL
          +K FLE L +LHINIP   A+E MP++ KFLKD+++KK++ GE E +SLTE+CSAIL   +PPK  DPG FTIPV+IGGK   ++L DLGASINLMP 
Subjt:  GQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPL

Query:  SVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMK
        SV++ LG+G+++  +VTLQL DRS+ YP+G +EDVLVKVDKFIFP DF++LD E +K +P+ILG PF  TGR +IDV KG L+M + +E +KF+VF+ MK
Subjt:  SVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMK

Query:  YLDEMEDCSFIRILE-----------------STIVKIAIQDS-----------------------------------------------TDKHLE--GH
        +  E+++C    +L+                 S  V  ++++                                                +  HL+   H
Subjt:  YLDEMEDCSFIRILE-----------------STIVKIAIQDS-----------------------------------------------TDKHLE--GH

Query:  G---------------------------------------EGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSP
        G                                       +GISP+FC HKI LEE    S+EQQRRLNP MKEVVKKE+IKWLD GII+PI D+ WVSP
Subjt:  G---------------------------------------EGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSP

Query:  VQCVPKKGGVTM----------------------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRR
        VQCVPKKGG+T+                                                                I IAPEDQ+K TFTCP+GTFAFRR
Subjt:  VQCVPKKGGVTM----------------------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRR

Query:  MPFGLCNAPATFQ------------------------------------CKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERL
        MPFGLCNA ATFQ                                     KVL+RCE+T+LVLNWEKCHFMV+EGIVLGH++SK  LEVDRAKIE IE+L
Subjt:  MPFGLCNAPATFQ------------------------------------CKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERL

Query:  EPPNSVKGIQSFLGHAGFYRR-------------------------------------------------------------------------------
         PP SVKGI+SFLGHAGFYRR                                                                               
Subjt:  EPPNSVKGIQSFLGHAGFYRR-------------------------------------------------------------------------------

Query:  ------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK------------------------------------VVKD
               LN+AQ+NYTTTEKELLAVVFAFEKFR +L+G+KV VFTDH+A++YL+ KK+AK                                     VKD
Subjt:  ------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK------------------------------------VVKD

Query:  --------------------APW--------------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQR
                             PW                                                  CV  +E  +IL  CH S Y GH+ G+R
Subjt:  --------------------APW--------------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQR

Query:  TAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFL
        TA ++L  GFFWPTLFKD + F   CD CQR GN+  R E+PL  ++EVE FDVWGIDFMGPF PS  N++IL+AVDYVSKWVEA A   +D+  V  FL
Subjt:  TAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFL

Query:  QSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGAI----
        +  IF RFGTPRA++SD G HF N     LLAKY +KH++A  YHPQ NGQAE++ R IK ILEKVV P+RKDWS +LD+ LWAYRT YKTPLG      
Subjt:  QSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGAI----

Query:  ---------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPF
                                         RMLQLNEL+E R  +YE+ R+YKEKTK  HDKKI  +EF KGQ VL YNSRLKLFPGK+KS+WSGPF
Subjt:  ---------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPF

Query:  VVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRARKERENEEEEVAITPEVQKVKAKKK
         +++VFPHGA+ L D +  + FKVNGQ +KHY G EF  +  S+        +  +V I+P+    K   K
Subjt:  VVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRARKERENEEEEVAITPEVQKVKAKKK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]4.1e-26841.25Show/hide
Query:  SLEAMMKEFMARTDAAIQRKLPQILNTLKEGKEQVKAVTLRS----------ELETGQGAGGSNKDAGASGSVPYVE--------PPYVP----PPPYVP
        +LE  + +     +A  +   P   NT    KEQ KA+TLRS          E ET   A  + +          VE        PP +     PP    
Subjt:  SLEAMMKEFMARTDAAIQRKLPQILNTLKEGKEQVKAVTLRS----------ELETGQGAGGSNKDAGASGSVPYVE--------PPYVP----PPPYVP

Query:  PLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGR
        PLP+PQR + +  D QF KFL+I K++HINIP  +A+EQMPNYAKFLKDI++KK+RL EFETV L+EECSAI++  LP K KDPGSFT+P +IG     +
Subjt:  PLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGR

Query:  ALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRV
         LCDLGASINLMPLSVYRKLG+GE++ TT++LQL DRSI YP G IEDVLVKVDKFIFP DF++LD E ++ VP+ILG PF ATGRAL+DVQKGELT+RV
Subjt:  ALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRV

Query:  CNEEVKFNVFKAMKYLDEMEDCSFIRILESTIVKIAIQDSTDKHLE------GHG---------------------------------------------
          EEV FN+++AMK+ ++   C  + ++E  +V+   +D    HLE       H                                              
Subjt:  CNEEVKFNVFKAMKYLDEMEDCSFIRILESTIVKIAIQDSTDKHLE------GHG---------------------------------------------

Query:  --------------------------------------------------------------------------EGISPSFCMHKIALEEGSFRSIEQQR
                                                                                  +GISPS CMHKI +EE    SIE QR
Subjt:  --------------------------------------------------------------------------EGISPSFCMHKIALEEGSFRSIEQQR

Query:  RLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM------------------------------------------------------
        RLNPAMKEVV+ E++K L+ GIIY I D++WVSPVQ VPKKGG+T+                                                      
Subjt:  RLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM------------------------------------------------------

Query:  ----------ITIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ------------------------------------CKVLKRCEDTHLVLNWE
                  I IAPEDQEK TFTCPYGTFAFRRMPFGLCNAPATFQ                                      VL+RCED +LVLNWE
Subjt:  ----------ITIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ------------------------------------CKVLKRCEDTHLVLNWE

Query:  KCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYR----------------------------------------------
        KCHFMV+EGIVLGHR+S  G+EVDRAKI  IE+L PP +VKGI+SFLGHAGFYR                                              
Subjt:  KCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYR----------------------------------------------

Query:  ---------------------------------------RVLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK------
                                               R LNEAQ+NYTTTEKE+LAVVFA +KFR +L+ +KV VFTDHAA+RYL +KKDAK      
Subjt:  ---------------------------------------RVLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK------

Query:  -------------------------------------VVKDA-------------PW-------------------------------------------
                                             V+++A             PW                                           
Subjt:  -------------------------------------VVKDA-------------PW-------------------------------------------

Query:  -------CVSGDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPS
               CV  +E + IL  CHSS YGGHF   RTA ++L  GFFWP++F+D++   K CD CQR GN+  R E+PL  ILEVELFDVWGIDFMGPFPPS
Subjt:  -------CVSGDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPS

Query:  NGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKV
         G V+ILLAVDYVSKWVEAIA   +DAK V +FL  +IF RFGTPRA++SDEG HF N +   LL+KY +KH+IA  YHPQ NGQAEIS R IK ILEK 
Subjt:  NGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKV

Query:  VHPSRKDWSFRLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKK
        V+ +RKDW+ +LD+ALWAYRTA+KTP+G                                       R+LQLNE++EFR  +YENA++YKE+TK  HDK+
Subjt:  VHPSRKDWSFRLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKK

Query:  IKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQ
        I  +EF  GQ+VL +NSRLKLFPGK++S+W+GP+ + +V   GAI L+D K G +F+VNGQR+KHY+GE+ +
Subjt:  IKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQ

TrEMBL top hitse value%identityAlignment
A0A2G9GEK6 DNA-directed DNA polymerase5.6e-24742.07Show/hide
Query:  VNQVIEE--ACVYCGEDHNYEFCPSNPASVLAQPPQLSWGGQGSNMQTQQKVNQPGFAKAQVLPQQN--KQACPTKFRSSLEAMMKEFMARTDAAIQRKL
        VNQV      C  CGE H  E  P +  S+     Q     Q S          PG+ +       N  +Q    +F+   +  +++ M     +++  L
Subjt:  VNQVIEE--ACVYCGEDHNYEFCPSNPASVLAQPPQLSWGGQGSNMQTQQKVNQPGFAKAQVLPQQN--KQACPTKFRSSLEAMMKEFMARTDAAIQRKL

Query:  PQILNTL---------KEGKEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLV
         Q + +L         ++GK Q +AVTL +  E  +      K  G        E     P          +R + +  + QF+KFL+I K+LHINIP  
Subjt:  PQILNTL---------KEGKEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLV

Query:  EAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQL
        EA+EQMP+Y KF+KDIL+KK+RLG++E V+LTEE SAI++N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GE +PT++TLQL
Subjt:  EAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQL

Query:  GDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNE-------------------------------
         DR +TYP G IED+ +KVDK IFP DF++LD E +  +PIILG PF AT    +  ++   TM+  NE                               
Subjt:  GDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNE-------------------------------

Query:  --------EVKFNVFK---AMKYLDEMEDCSFIRILESTIVKIAIQDSTDKHLE---------------------GHGEGISPSFCMHKIALEEGSFRSI
                E    V K   A KY       S  R   S ++K +I++     L+                        +GISPSFCMHKI LE+    S+
Subjt:  --------EVKFNVFK---AMKYLDEMEDCSFIRILESTIVKIAIQDSTDKHLE---------------------GHGEGISPSFCMHKIALEEGSFRSI

Query:  EQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM--------------------------------------------------
        E QRRLN  MKEVVKKE+IKWLD GIIYPI +++WVSPVQCVPKK G+TM                                                  
Subjt:  EQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM--------------------------------------------------

Query:  --------------ITIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ--------------CKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISK
                      I IAP+DQEK TFTCPYGTFAFRRMPFGLCNAPATFQ                VLKRCEDT+LVLNWEKCHFMV+EGIVLGH++S 
Subjt:  --------------ITIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ--------------CKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISK

Query:  NGLEVDRAKIEVIERLEPPN-----------SVKGIQSFLG-------HAGFY-RRVLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTV-------
         G+E+D+AK+E IE+L PP            S   I + LG       H+ +Y  + LN+AQ+NYTTTEKELLAVVFAF+KFR +LV   + +       
Subjt:  NGLEVDRAKIEVIERLEPPN-----------SVKGIQSFLG-------HAGFY-RRVLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTV-------

Query:  --FTDHAAIRYLMAKKD---------------AKVVKDAPW--------------------------------------------------CVSGDEAKE
            DH +     AK D               A V  D PW                                                  CV   E   
Subjt:  --FTDHAAIRYLMAKKD---------------AKVVKDAPW--------------------------------------------------CVSGDEAKE

Query:  ILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKW
        +LEQCH+SPYGGHF   RTA +IL  GFF P LFKDAH F   CD CQR GN+  R EM L  ILEVELFDVWGIDFMGPF PS GN++IL AVDYVSKW
Subjt:  ILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKW

Query:  VEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEAL
        VEA+A   +D+K V  F++ +IF RFGTPRA++SD G HF N     LL+KY             A+GQ E+S R IK ILEK +  +RKDWS  LDEAL
Subjt:  VEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEAL

Query:  WAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYN
        WAYRTA+KTP+G                                       R+LQLNEL+EFR  +YENA++YKEKTK  HDKKI  + F  GQ VL +N
Subjt:  WAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYN

Query:  SRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRARKE
        SRLKLF GK+KS+WSGPF + EVF HGA+ L++E     FKVN QR+KHYWG     ++ S+   ++
Subjt:  SRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRARKE

A0A2K3NPD0 Reverse transcriptase2.7e-24139.22Show/hide
Query:  YVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPV
        ++   P    LP+P  Q  K+++ Q+ +FL+I K+L INIP  EA+EQMP YAKF+K+ILTKK++  + E + L   CSAI++  LP K KDPG  T+PV
Subjt:  YVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPV

Query:  SIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDV
        +IG   +G+AL DLG+SINL+PLSV  ++G  ++  T +TLQL D+SIT P G  EDVLVKVDKF+FPIDF+++D E +  VP+ILG PF  T R +ID+
Subjt:  SIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDV

Query:  QKGELTMRVCNEEVKFNVFKAMKY-------------------------------------------------------LDEMEDCSFIR----------
          G + +RV +EEV FN+++AMK+                                                       LDE+E+ S +           
Subjt:  QKGELTMRVCNEEVKFNVFKAMKY-------------------------------------------------------LDEMEDCSFIR----------

Query:  ----------------------------ILESTIVKIAIQDSTDKHLEGHG----------EGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKE
                                    ++ S+ + I  ++S  + L+ +           +GISPS+CMH I +E+      + QRRLNP MKEVV+KE
Subjt:  ----------------------------ILESTIVKIAIQDSTDKHLEGHG----------EGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKE

Query:  VIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM----------------------------------------------------------------ITI
        V+K L+ G+IYPI D++WVSPVQ VPKKGG+T+                                                                IT+
Subjt:  VIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM----------------------------------------------------------------ITI

Query:  APEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ-------------------------------C-----KVLKRCEDTHLVLNWEKCHFMVKEGIVLG
         PED EK  FTCP+G FA+RRMPFGLCNAPATFQ                               C      VLKRC +T+LVLNWEKCHFMV EGIVLG
Subjt:  APEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ-------------------------------C-----KVLKRCEDTHLVLNWEKCHFMVKEGIVLG

Query:  HRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR----------------------------------------------------------
        H+IS  G+EVD+AK+EVIE+L PP ++KGI+SFLGHAGFYRR                                                          
Subjt:  HRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR----------------------------------------------------------

Query:  ---------------------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-------------------
                                   VLN+AQINY TTEKELLA+V+A EKFR +L+GSK+ V+TDHAAI+YL+ K D+K                   
Subjt:  ---------------------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-------------------

Query:  -------------------------------------VVKDAPW--------------------------------------------------CVSGDE
                                             +V++ PW                                                  CV+ +E
Subjt:  -------------------------------------VVKDAPW--------------------------------------------------CVSGDE

Query:  AKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYV
        A  IL  CH+SPYGGH++G+RTA +IL  GFFWPT+FKD++ + + CD CQR G +  R+EMPL  ILEVE+FD WGIDF+GPFP S  N +IL+AVDYV
Subjt:  AKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYV

Query:  SKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLD
        SKWVEAIA  ++D KTV +FL+ +IF RFGTPR L+SD G HF N+ L K L  Y +KH+IA+PYHPQ NGQAE+S R IK ILEK V  SRKDWS +LD
Subjt:  SKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLD

Query:  EALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVL
        EALWAYRTA+K+P+G                                       R  QL+ELEE R  +YE++++YK+K K  HDK+I  ++F  GQKVL
Subjt:  EALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVL

Query:  HYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEE
         +NSRLKLFPGK+KSKWSGPF++ EV P+GA+ ++D +  R + VNGQR+K Y+G E
Subjt:  HYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEE

A0A2K3PBF7 Reverse transcriptase6.4e-24339.97Show/hide
Query:  LPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA
        LP+P     K+++ Q+ +FLEI K+L IN+P  EA+EQMP YAKF+K+ILTKK+ L E ET+ L   CSAI++  LP K KDPG  T+PV+IG  ++G+A
Subjt:  LPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVC
        L DLG+SINL+PLSV  ++G  E+R T +TLQL D+SI  P G  EDVLVKVDKFIFPIDF+++D E +  VP+ILG  F  T R +ID+  G + +RV 
Subjt:  LCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVC

Query:  NEEVKFNVFKAMKYLDEMEDCSFIRILESTIVKIAIQ---------------------------------------------------DSTD--------
        +EEV FN+++AMK+  E + C  +   E  I+ + IQ                                                   DS +        
Subjt:  NEEVKFNVFKAMKYLDEMEDCSFIRILESTIVKIAIQ---------------------------------------------------DSTD--------

Query:  --KHLEGH---------------------------------------------GEGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDT
          K L  H                                              +GISPS+CMH I +E+      + QRRLNP MKEVV+KEV+K L+ 
Subjt:  --KHLEGH---------------------------------------------GEGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDT

Query:  GIIYPIVDNNWVSPVQCVPKKGGVTM----------------------------------------------------------------ITIAPEDQEK
        G+IYPI D+ WVSPVQ VPKKGG+T+                                                                IT+ PED EK
Subjt:  GIIYPIVDNNWVSPVQCVPKKGGVTM----------------------------------------------------------------ITIAPEDQEK

Query:  ITFTCPYGTFAFRRMPFGLCNAPATFQ-------------------------------C-----KVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNG
          FTCP+G FA+RRMPFGLCNAPATFQ                               C      VLKRC +T+LVLNWEKCHFMV EGIVLGH+IS  G
Subjt:  ITFTCPYGTFAFRRMPFGLCNAPATFQ-------------------------------C-----KVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNG

Query:  LEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR-----------------------------------------------------------------
        +EVD+AK+EVIE+L PP +VKGI+SFLGHAGFYRR                                                                 
Subjt:  LEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRR-----------------------------------------------------------------

Query:  --------------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK--------------------------
                            VLNEAQINY TTEKELLA+V+A EKFR +L+GSK+ V+TDHAAI+YL+ K D+K                          
Subjt:  --------------------VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK--------------------------

Query:  ------------------------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQ
                                      ++++ PW                                                  CV+ +EA  IL  
Subjt:  ------------------------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQ

Query:  CHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAI
        CH+SPYGGH++G+RTA ++L  GFFWPTLFKDA+   ++CD CQ  G +  R+EMPL  IL VE+FD WGIDF+GPFP S  N +IL+AVDYVSKWVEAI
Subjt:  CHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAI

Query:  ACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYR
        A  ++D KTV +FL+ +IF RFGTPR L+SD G HF N+ L K L  Y ++H++A+PYHPQ NGQAE+S R IK ILEK V  SRKDWS +LD+ALWAYR
Subjt:  ACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYR

Query:  TAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLK
        TA+K+P+G                                       R LQL+ELEE R  +YE++++YKEK K  HDKKI SKEF  GQ VL +NSRLK
Subjt:  TAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLK

Query:  LFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEF
        LFPGK+KSKWSGPF + E+ P+GA+ L+D K    + VNGQR+K Y+G EF
Subjt:  LFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRVKHYWGEEF

A0A2Z6MNM7 Reverse transcriptase1.8e-24038.36Show/hide
Query:  ILNTLKEGKEQVKAVTLRSELETGQGAG-----------------------------GSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQ
        + NT    KE  K++T RS  E G+G G                                K+     S    +   +PP  +   LP+P     K+++ Q
Subjt:  ILNTLKEGKEQVKAVTLRSELETGQGAG-----------------------------GSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQ

Query:  FKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSV
        + +FL+I K+L INIP  EA+EQMP YAKF+K+ILTKKKR+ + ET+ L   CSAI++  LP K KDPG  T+PV+IG   +G+AL DLG+SINL+PLSV
Subjt:  FKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSV

Query:  YRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY-
          ++G  ++  T +TLQL D+S+  P G  EDVLVKVDKF+FPIDF+++D E +  VP+ILG PF  T R +ID+  G + +RV +EEV FN+++AMKY 
Subjt:  YRKLGIGEVRPTTVTLQLGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY-

Query:  -------------------------------------------------------------------LDEMEDCS-------FIRILESTIVKIAIQDST
                                                                           ++EM+D S        ++ L S +  + +++ +
Subjt:  -------------------------------------------------------------------LDEMEDCS-------FIRILESTIVKIAIQDST

Query:  DK------HLEGHGE----------------------GISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCV
         K       L  HGE                      GISPS+CMH I +EE      + QRRLNP MKEVV+KEV+K L+ G+IYPI D+ WVSPVQ V
Subjt:  DK------HLEGHGE----------------------GISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCV

Query:  PKKGGVTM----------------------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRRMPFG
        PKKGG+T+                                                                IT+ PED EK +FTCP+G FA RRMPFG
Subjt:  PKKGGVTM----------------------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRRMPFG

Query:  LCNAPATFQ-------------------------------C-----KVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPN
        LCNAPATFQ                               C      VLKRC +T+LVLNWEKCHFMV EGIVLGH+IS  G+EVD+AKIEVIE+L PP 
Subjt:  LCNAPATFQ-------------------------------C-----KVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPN

Query:  SVKGIQSFLGHAGFYRR-----------------------------------------------------------------------------------
        +VKGI+SFLGHAGFYRR                                                                                   
Subjt:  SVKGIQSFLGHAGFYRR-----------------------------------------------------------------------------------

Query:  --VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK--------------------------------------------
          VLN+AQINY TTEKELLA+V+A EKFR +L+GSK+ V+TDHAAI+YL+ K D+K                                            
Subjt:  --VLNEAQINYTTTEKELLAVVFAFEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK--------------------------------------------

Query:  ------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQRTAMR
                    +V++ PW                                                  CV+ +EA  IL  CH+SPYGGH++G+RTA +
Subjt:  ------------VVKDAPW--------------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQRTAMR

Query:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHI
        IL  GFFWPTLFKD++ + + CD CQ+ G +  R+EMPL  ILEVE+FD WGIDF+GPFP S  N +IL+AVDYVSKWVEAIA  ++D KTV +FL+ +I
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHI

Query:  FARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGAI--------
        F RFGTPR L+SD G HF N+ L K L  Y +KH++A+PYHPQ NGQAE+S R +K ILEK V  SRKDWS +LDEALWAYRTA+K+P+G          
Subjt:  FARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGAI--------

Query:  -----------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIE
                                     R  QL+E++E R  +YE++++YKEK K  HDK+I +K F  GQ VL +NSRLK+FPGK+KSKWSGPF V +
Subjt:  -----------------------------RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIE

Query:  VFPHGAITLQDEKDGRVFKVNGQRVKHYWGEE
        V P+GAI ++D +  R + VNGQR+K Y+G E
Subjt:  VFPHGAITLQDEKDGRVFKVNGQRVKHYWGEE

A0A6P8CBX2 Reverse transcriptase7.1e-25037.29Show/hide
Query:  FDELNPGIARPQIQAANFEMKRA--------DLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCPSNPASVLAQPPQLSW--GGQ
        +DE +  I      A N++ +R+        D+  IAN    ++ ++ Q  +   A   N      C  C   H+   C S   S      Q+++    Q
Subjt:  FDELNPGIARPQIQAANFEMKRA--------DLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCPSNPASVLAQPPQLSW--GGQ

Query:  GSNMQTQQKVNQPGFAKAQVLPQQN-----------------KQACPTKFRSSLEAMMKEFMARTDAAIQRKLPQILN----------------------
         SN         PG+        +N                 + A P + +S +E +M  +M +TD  +Q +   I N                      
Subjt:  GSNMQTQQKVNQPGFAKAQVLPQQN-----------------KQACPTKFRSSLEAMMKEFMARTDAAIQRKLPQILN----------------------

Query:  TLKEGKEQVKAVTLRS--ELETGQGAGGSNKDAGA--SGSVPYVEP--PYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
          +E  + V A+ LRS  ELE       + +++     G     EP    +   PYVPP+PFP R + +  D QF KFL++ K+L INIP  EA++QMP+
Subjt:  TLKEGKEQVKAVTLRS--ELETGQGAGGSNKDAGA--SGSVPYVEP--PYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSI
        YA+F+KD+LTKK++    E V LT ECS IL+     LP K +D GSFT+P +IG       L D GASINLMPLS++RKLG+GE + T +TLQL DRSI
Subjt:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQLGDRSI

Query:  TYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYLDEMEDCSFIRILESTI-------
         YP+G +E+VLVKVDKFIFP+DFI+L+ E ++ VP+ILG PF ATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++  I       
Subjt:  TYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYLDEMEDCSFIRILESTI-------

Query:  -----VKIAIQDSTD------------------------------------------------------------------------------------K
             ++  ++D  D                                                                                    +
Subjt:  -----VKIAIQDSTD------------------------------------------------------------------------------------K

Query:  HLEGHG------EGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM----------------
        H E  G      +GISP  C H+I LE      ++ QRRLNP +KEVVKKEV+K LD GIIYPI D+ WVSPVQ VPKKGG+T+                
Subjt:  HLEGHG------EGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTM----------------

Query:  ------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ-CK------------
                                                        I IAPEDQEK TFTCPYGTFAFRRMPFGLCNAPATFQ C             
Subjt:  ------------------------------------------------ITIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQ-CK------------

Query:  -----------------------VLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYR--------
                               VLKRC++T+L+LNWEKCHFMV+EGIVLGH++SK G+EVDRAK+E+IE+L PP S KG++SFLGHAGFYR        
Subjt:  -----------------------VLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYR--------

Query:  -----------------------------------------------------------------------------RVLNEAQINYTTTEKELLAVVFA
                                                                                     R LNEAQ NY TTEKELLAV+FA
Subjt:  -----------------------------------------------------------------------------RVLNEAQINYTTTEKELLAVVFA

Query:  FEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-------------------------VVKD------------------------------APW------
         +KFR +L+GSK+ V+TDHAA++YL AK DAK                         VV D                               PW      
Subjt:  FEKFRQHLVGSKVTVFTDHAAIRYLMAKKDAK-------------------------VVKD------------------------------APW------

Query:  --------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDAC
                                                    CV   E   I++ CHS   GGHF  +RTA +IL CGF+WP +F D   +   C  C
Subjt:  --------------------------------------------CVSGDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDAC

Query:  QRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTK
        QR GN+  R E+P   IL +ELFDVWGIDFMGPFP S  N +IL+AVDYVSKWVEA+A   +DA+ V RFL+ +IF+RFG PRA++SD G HF N    K
Subjt:  QRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTK

Query:  LLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG------------------------AI---------
        LL+KY + H+IATPYHPQ  GQ E+S R IK ILEK V+ SRKDWS +LD+ALWAYRTA+KTP+G                        AI         
Subjt:  LLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG------------------------AI---------

Query:  ----RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRV
            R+LQLN++ E R+ +YENAR+YKE+ K  HD+ I  +EF  GQKVL YNSRLKLFPGK+KS+WSGPFV+  VFP+GA+ L+ E D R FKVNG  +
Subjt:  ----RMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAITLQDEKDGRVFKVNGQRV

Query:  KHYWGEE
        KHY+  E
Subjt:  KHYWGEE

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.8e-6327.2Show/hide
Query:  MFKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTA
        MF+ F A+ E     K   +  D G EY                 F  + GI     +P  P +N V+ER  RT+ +  R+MV  + L +  WGEA+ TA
Subjt:  MFKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTA

Query:  VYILNRVPNK---EVNKTPYELWTGKRPSLRNLHIWGCPTEARPYRPNEKKLDSRTISCYFVGN--------DMVTLPIIVSKEIQDNDTQ---------
         Y++NR+P++   + +KTPYE+W  K+P L++L ++G        +  + K D ++    FVG         D V    IV++++  ++T          
Subjt:  VYILNRVPNK---EVNKTPYELWTGKRPSLRNLHIWGCPTEARPYRPNEKKLDSRTISCYFVGN--------DMVTLPIIVSKEIQDNDTQ---------

Query:  DQVSTLDIITSQDNTQDTSLYSIDQTQQTQEVP-------LRRSTRERRSAIPNDYIVFFQ---------------------------------------
        + V   D   S++         I QT+   E         L+ S        PND     Q                                       
Subjt:  DQVSTLDIITSQDNTQDTSLYSIDQTQQTQEVP-------LRRSTRERRSAIPNDYIVFFQ---------------------------------------

Query:  ------------------EHEDSIGL---TENDPINFLQARK-----------------------------------------RSKVDKWIEVMRDKMKF
                          EH   IG+   T+ND I  +  R                                          R     W E +  ++  
Subjt:  ------------------EHEDSIGL---TENDPINFLQARK-----------------------------------------RSKVDKWIEVMRDKMKF

Query:  MTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP-----------------------MDVKTTFLNGDIDETI
           N  W + K PE K  +  +W+F  K +  GN  RYKAR V +GFTQK  IDY+ETF P                       MDVKT FLNG + E I
Subjt:  MTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP-----------------------MDVKTTFLNGDIDETI

Query:  YMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSG--SRYIFLVLYVDDTLLACSDLNLPQETNTFLTKHFEM
        YM  P+  +  S    VCKL KAIYGL+Q +R W+  F   +    F  + VD C+Y  + G  +  I+++LYVDD ++A  D+        +L + F M
Subjt:  YMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSG--SRYIFLVLYVDDTLLACSDLNLPQETNTFLTKHFEM

Query:  KDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKIPYSSVVGSLMYAQVCTRPDIAFIVGV
         DL      +GI I  +  +  + LSQ +Y+ K+LS+++M  C    TP+  +  + L     SD    E    P  S++G LMY  +CTRPD+   V +
Subjt:  KDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKIPYSSVVGSLMYAQVCTRPDIAFIVGV

Query:  LGRYLS
        L RY S
Subjt:  LGRYLS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-10135.91Show/hide
Query:  MFKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTA
        +F+ F A VE +  RK K +RSD GGEY  R           F  +    GI  +  +PG P  N VAER NRT+ + VRSM+  + L +  WGEA++TA
Subjt:  MFKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTA

Query:  VYILNRVPNKEVN-KTPYELWTGKRPSLRNLHIWGCPTEARPYRPNEKKLDSRTISCYFVG---------------------------------------
         Y++NR P+  +  + P  +WT K  S  +L ++GC   A   +    KLD ++I C F+G                                       
Subjt:  VYILNRVPNKEVN-KTPYELWTGKRPSLRNLHIWGCPTEARPYRPNEKKLDSRTISCYFVG---------------------------------------

Query:  -------NDMVTLPIIVSKEIQDNDTQDQVSTL----DIITSQDNTQDTSLYSIDQTQQTQE--VPLRRSTR---ERRSAIPNDYIVFFQEHEDSIGLTE
                + VT+P   +       T D+VS        +  Q    D  +  ++   Q +E   PLRRS R   E R     +Y++   + E       
Subjt:  -------NDMVTLPIIVSKEIQDNDTQDQVSTL----DIITSQDNTQDTSLYSIDQTQQTQE--VPLRRSTR---ERRSAIPNDYIVFFQEHEDSIGLTE

Query:  NDPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP-----------
          P +  +     + ++ ++ M+++M+ +  NG + LV+LP+GK+ + CKW+FK K+D    + RYKAR V KGF QK GID+ E F P           
Subjt:  NDPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP-----------

Query:  ------------MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKN-SGSRYIFLVLY
                    +DVKT FL+GD++E IYMEQPE F +   K MVCKL K++YGL+Q  RQWY KF + + S ++     D CVY K  S + +I L+LY
Subjt:  ------------MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKN-SGSRYIFLVLY

Query:  VDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQK
        VDD L+   D  L  +    L+K F+MKDLG A  +LG+ I+R+R+   L LSQ+ YI++VL R++M   +P  TP+A   K S   CP +  E+  M K
Subjt:  VDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQK

Query:  IPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDP
        +PYSS VGSLMYA VCTRPDIA  VGV+ R+L +P
Subjt:  IPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDP

P25600 Putative transposon Ty5-1 protein YCL074W5.6e-2631.93Show/hide
Query:  MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVLYVDDTLLACSDLNL
        MDV T FLN  +DE IY++QP  F    +   V +L   +YGL+Q    W    +N +    F  +  +  +Y +++    I++ +YVDD L+A     +
Subjt:  MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVLYVDDTLLACSDLNL

Query:  PQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKI-PYSSVVGSLMY
               LTK + MKDLG     LG++I    S G + LS + YI K  S  ++N  +   TP+        N  P  +     ++ I PY S+VG L++
Subjt:  PQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKI-PYSSVVGSLMY

Query:  AQVCTRPDIAFIVGVLGRYLSDPECVAGRLEGANSVLQ
             RPDI++ V +L R+L +P  +   LE A  VL+
Subjt:  AQVCTRPDIAFIVGVLGRYLSDPECVAGRLEGANSVLQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.8e-4432.17Show/hide
Query:  ENDPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHI-GCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP---------
        E++P   +QA K    ++W   M  ++     N  WDLV  P     I GC+WIF  K +S G++ RYKAR V KG+ Q+ G+DY ETF P         
Subjt:  ENDPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHI-GCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP---------

Query:  --------------MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVL
                      +DV   FL G + + +YM QP  F      + VCKL+KA+YGL+Q  R WY +  N + +  F  +V D  ++    G   +++++
Subjt:  --------------MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVL

Query:  YVDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQ
        YVDD L+  +D  L   T   L++ F +KD     + LGI   R  +   L LSQ+ YI  +L+R +M   +P  TP+A   K SL    K         
Subjt:  YVDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQ

Query:  KIPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDPECVAGRLEGANSVLQQNWEQKLPHHSSLAKLTNRILL
           Y  +VGSL Y    TRPDI++ V  L +++  P      L+    +L+  +    P+H    K  N + L
Subjt:  KIPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDPECVAGRLEGANSVLQQNWEQKLPHHSSLAKLTNRILL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-4324.32Show/hide
Query:  FKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTAV
        F  FK+ VE + + +   + SD GGE+    D            +L + GI      P  P  N ++ER++R + +M  +++ H+S+ +  W  A   AV
Subjt:  FKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTAV

Query:  YILNRVPNKEVN-KTPYELWTGKRPSLRNLHIWGCPTE--ARPYRPNEKKLDSRTISCYFVGNDMV----------TLPIIVSKEIQDNDTQDQVSTLD-
        Y++NR+P   +  ++P++   G+ P+   L ++GC      RPY  N  KL+ ++  C F+G  +           T  +  S+ +Q ++     ST + 
Subjt:  YILNRVPNKEVN-KTPYELWTGKRPSLRNLHIWGCPTE--ARPYRPNEKKLDSRTISCYFVGNDMV----------TLPIIVSKEIQDNDTQDQVSTLD-

Query:  -IITSQDNTQD-----------------------------------------------------TSLYSIDQTQQT------------------------
         + TSQ+   D                                                     +S+ S   ++ T                        
Subjt:  -IITSQDNTQD-----------------------------------------------------TSLYSIDQTQQT------------------------

Query:  ----------------QEVPLRR--------------------------STRERRSAIPNDYIVFF-----------------------QEHEDSIGLTE
                        Q  PL +                          ST      +P   I+                         Q++  +  L  
Subjt:  ----------------QEVPLRR--------------------------STRERRSAIPNDYIVFF-----------------------QEHEDSIGLTE

Query:  N-DPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHI-GCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP---------
        N +P   +QA K    D+W + M  ++     N  WDLV  P     I GC+WIF  K +S G++ RYKAR V KG+ Q+ G+DY ETF P         
Subjt:  N-DPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHI-GCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP---------

Query:  --------------MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVL
                      +DV   FL G + + +YM QP  F        VC+L+KAIYGL+Q  R WY +    + +  F  ++ D  ++    G   I++++
Subjt:  --------------MDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVL

Query:  YVDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQ
        YVDD L+  +D  L + T   L++ F +K+  +  + LGI   R   QG L LSQ+ Y   +L+R +M   +P  TP+A   K +L+   K         
Subjt:  YVDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQ

Query:  KIPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDP
           Y  +VGSL Y    TRPD+++ V  L +Y+  P
Subjt:  KIPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-4630.21Show/hide
Query:  IQDNDTQDQVSTLDIITSQDNTQDTSLYSIDQTQQTQEVPLRRSTRERRSAIPNDY------------IVFFQEHEDSIGLTENDPINFLQARKRSKVDK
        + D D     S++DI+ S +   D    S+  + +          R R+ A   DY            I  F  +E    L  +  +   +A++ S  ++
Subjt:  IQDNDTQDQVSTLDIITSQDNTQDTSLYSIDQTQQTQEVPLRRSTRERRSAIPNDY------------IVFFQEHEDSIGLTENDPINFLQARKRSKVDK

Query:  ------WIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP----------------------
              W   M D++  M     W++  LP  KK IGCKW++K K +S G IERYKAR V KG+TQ++GID+ ETF P                      
Subjt:  ------WIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFP----------------------

Query:  -MDVKTTFLNGDIDETIYMEQPENFALGSSKSM----VCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVLYVDDTLLAC
         +D+   FLNGD+DE IYM+ P  +A     S+    VC LKK+IYGL+Q SRQW+ KF   +  F F  +  D   + K + + ++ +++YVDD ++  
Subjt:  -MDVKTTFLNGDIDETIYMEQPENFALGSSKSM----VCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVLYVDDTLLAC

Query:  SDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKIPYSSVVG
        ++     E  + L   F+++DLG   + LG+ I   RS   + + Q+ Y   +L    + GC+P   P+     FS +    S  + ++ +   Y  ++G
Subjt:  SDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKIPYSSVVG

Query:  SLMYAQVCTRPDIAFIVGVLGRYLSDP
         LMY Q+ TR DI+F V  L ++   P
Subjt:  SLMYAQVCTRPDIAFIVGVLGRYLSDP

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.3e-0536.92Show/hide
Query:  NRTLKDMVRSMVCHSSLSEFLWGEALKTAVYILNRVPNKEVN-KTPYELWTGKRPSLRNLHIWGC
        NRT+ + VRSM+C   L +    +A  TAV+I+N+ P+  +N   P E+W    P+   L  +GC
Subjt:  NRTLKDMVRSMVCHSSLSEFLWGEALKTAVYILNRVPNKEVN-KTPYELWTGKRPSLRNLHIWGC

ATMG00750.1 GAG/POL/ENV polyprotein1.7e-1766.07Show/hide
Query:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
        +L  GF+WPT FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM

ATMG00810.1 DNA/RNA polymerases superfamily protein6.8e-1133.33Show/hide
Query:  IFLVLYVDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHE
        ++L+LYVDD LL  S   L       L+  F MKDLG   + LGI I    S   L LSQ  Y +++L+   M  C+P  TP+  +   S++     D  
Subjt:  IFLVLYVDDTLLACSDLNLPQETNTFLTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHE

Query:  RIEMQKIPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDP
                + S+VG+L Y  + TRPDI++ V ++ + + +P
Subjt:  RIEMQKIPYSSVVGSLMYAQVCTRPDIAFIVGVLGRYLSDP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.7e-1242.47Show/hide
Query:  WIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFPM
        W + M++++  ++ N  W LV  P  +  +GCKW+FKTK  S G ++R KAR V KGF Q++GI + ET+ P+
Subjt:  WIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKETFFPM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAATCTTTTAAAGCTGAAGTTGAGCTTCAACTTGAAAGGAAAAGTAAGGTCATTAGATCTGACCGTGGAGGAGAATATTATGGCAGATATGATGGATCAGGTGA
ACAATGTCCAGGACCCTTTGCGCTTTTCCTTAAAGAATGTGGCATAGTTCCTCAGTATATGATGCCTGGGAAACCTAGCATGAATGAAGTTGCTGAGAGACGTAATAGGA
CTCTTAAGGATATGGTAAGAAGTATGGTTTGTCATTCTTCTTTGTCAGAGTTCCTTTGGGGAGAAGCATTGAAAACTGCAGTATACATACTTAACAGAGTCCCTAACAAA
GAAGTAAACAAAACCCCATATGAGTTGTGGACTGGGAAAAGACCTAGCCTAAGGAATTTACACATTTGGGGTTGTCCAACAGAGGCAAGGCCTTATAGGCCAAATGAAAA
GAAGTTGGACTCAAGAACCATAAGCTGCTACTTTGTTGGGAATGACATGGTTACCTTACCTATCATTGTCAGTAAAGAAATTCAAGATAATGATACTCAAGATCAAGTTT
CAACACTTGATATCATTACTTCTCAAGACAACACTCAAGATACATCCCTCTACTCTATAGATCAAACTCAACAAACTCAAGAAGTGCCATTAAGAAGATCCACTAGAGAA
AGGAGAAGTGCAATACCAAATGATTATATTGTATTTTTTCAAGAACATGAAGATAGCATTGGCCTTACAGAAAATGATCCTATCAATTTTCTACAGGCAAGGAAACGTTC
TAAGGTAGACAAGTGGATTGAGGTTATGCGAGATAAGATGAAATTTATGACTGACAATGGTGTTTGGGATCTAGTTAAGTTGCCTGAAGGAAAGAAACATATTGGTTGTA
AATGGATATTTAAAACCAAAAGGGATTCAAAAGGCAATATTGAGAGATATAAAGCACGCCCTGTTACAAAAGGTTTCACTCAAAAGGATGGCATAGATTATAAAGAGACT
TTCTTTCCAATGGATGTCAAGACGACTTTCCTCAATGGAGACATTGATGAAACAATTTATATGGAGCAACCAGAAAATTTTGCGTTAGGAAGTTCAAAGTCAATGGTTTG
CAAACTGAAGAAAGCCATCTATGGACTCGAGCAAGTCTCTCGTCAATGGTACCACAAATTTCATAATGTCATTACCTCTTTCAGTTTTGAGGCTAATGTAGTTGATGAAT
GTGTATACCATAAGAATAGTGGGAGTAGGTATATTTTCCTGGTATTGTATGTAGATGACACCTTACTTGCTTGCAGTGATTTAAACCTACCACAAGAAACCAATACCTTT
CTAACAAAACATTTTGAAATGAAAGATCTTGGAAATGCCTCTTTTGTATTAGGTATACACATACTGCGAGATCGTTCCCAAGGTATCTTAGGGTTATCACAAAAGAGTTA
CATTGATAAAGTATTGAGTAGGTATGACATGAATGGATGTCAGCCAGGTGATACACCTGTAGCTAAAGAAGATAAATTTAGTTTAAACCAGTGCCCCAAAAGCGATCATG
AAAGAATTGAGATGCAAAAGATTCCTTATTCATCCGTTGTAGGGAGTCTAATGTATGCTCAAGTTTGCACCCGACCTGATATAGCTTTTATAGTGGGAGTGTTGGGTAGA
TATCTGAGTGACCCAGAATGTGTTGCTGGGCGACTGGAGGGAGCAAATTCTGTGTTGCAGCAAAACTGGGAGCAAAAACTGCCACATCACAGCTCATTAGCCAAGTTGAC
AAACCGAATTCTGTTGAGTTATTCTCGTGATAAAGGAGCAAGGAGAGCCCTACACGTGTCCAGATCCAGCGAACCCCCAGAATCGCTTGCTGCAGCAAAATCGCCGCTGG
AGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTTTTCCAATGTTTGATGAGTTGAATCCA
GGGATTGCACGTCCTCAAATCCAAGCGGCAAATTTTGAAATGAAACGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCC
CCAGCATGGACCTGCTGCAGTGGTGAACCAAGTCATAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTGG
CGCAACCACCCCAACTTTCATGGGGAGGACAAGGAAGTAATATGCAAACACAACAAAAGGTGAACCAGCCGGGATTTGCTAAAGCGCAGGTATTGCCCCAGCAAAATAAG
CAGGCTTGCCCCACAAAATTCAGGAGTTCTCTTGAGGCGATGATGAAGGAATTTATGGCTCGTACAGATGCCGCAATTCAAAGGAAACTTCCTCAGATACTGAACACCCT
GAAGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGAGTTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCTGTTCCAT
ATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATT
CTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAGATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTT
TGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACTATACCTGTATCTATAGGTGGAAAAG
AGTTAGGGAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGCTAGGTATTGGTGAAGTTAGGCCTACCACAGTTACGCTCCAA
TTAGGTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTGAAGGTAGATAAATTCATATTTCCTATTGATTTTATTATTTTAGACTATGAGGCTAA
TAAACATGTCCCAATCATTTTAGGTCATCCATTTTCGGCTACTGGTAGGGCATTAATAGATGTTCAGAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAAT
TTAATGTGTTTAAAGCCATGAAGTATCTAGACGAAATGGAAGATTGCTCTTTCATCAGGATTCTAGAGAGCACAATTGTTAAGATAGCAATACAAGATTCGACTGACAAG
CATTTGGAAGGTCATGGAGAGGGAATTAGCCCATCTTTTTGTATGCATAAAATCGCTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGC
AATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATACCGGGATCATTTATCCAATTGTAGACAACAATTGGGTGAGCCCTGTCCAATGTGTTCCTAAGAAAG
GAGGTGTCACTATGATTACTATTGCTCCTGAGGATCAGGAAAAAATCACTTTCACCTGCCCTTATGGGACGTTTGCTTTCAGGCGAATGCCTTTTGGCCTTTGCAATGCT
CCAGCAACATTTCAGTGTAAGGTGTTAAAGAGATGTGAGGATACCCATCTAGTTCTTAATTGGGAAAAATGCCACTTCATGGTGAAGGAGGGCATAGTGTTAGGTCATAG
GATTTCTAAGAATGGTCTAGAAGTTGATAGAGCAAAAATTGAGGTTATTGAGAGACTAGAACCACCGAATTCAGTGAAGGGAATTCAGAGTTTTTTAGGCCATGCTGGAT
TTTATAGGAGGGTTTTAAATGAAGCACAAATCAACTACACAACTACTGAAAAGGAGTTGTTAGCTGTTGTGTTTGCTTTTGAGAAATTCCGGCAACATTTGGTTGGATCC
AAAGTCACGGTGTTCACGGATCATGCAGCAATAAGGTATTTAATGGCTAAGAAAGATGCAAAGGTAGTCAAGGATGCCCCTTGGTGTGTTTCAGGTGATGAAGCAAAGGA
AATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGCCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTTTTTTGGCCTACATTATTTAAGGATG
CCCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTAGAAGTTGAATTATTCGATGTATGG
GGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCCAAGTGGGTGGAGGCCATCGCATGCCATCAGAGTGA
TGCCAAGACAGTAGCACGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGAGCTCTAGTGAGTGATGAGGGTATGCATTTTGTTAATAATATCTTAACTA
AGCTTTTAGCTAAGTATGAGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAAGAGGGTAATTAAAGCTATTCTGGAGAAA
GTAGTCCATCCATCTAGGAAGGATTGGTCTTTTAGGTTGGATGAGGCACTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGGAGCAATAAGAATGCTGCAGCTTAA
TGAATTAGAGGAATTTCGCCAATTTTCTTATGAAAATGCGAGAATGTATAAGGAAAAGACTAAGCTGCGGCATGACAAGAAAATTAAATCTAAAGAGTTTGGCAAGGGTC
AGAAAGTCTTGCATTATAATTCTAGATTGAAATTATTTCCCGGGAAAGTAAAATCTAAATGGTCAGGACCGTTTGTTGTGATTGAGGTTTTCCCCCATGGAGCAATTACT
TTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGTGAATGGACAGCGTGTGAAACACTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGAGCGAGGAAAGA
AAGAGAGAATGAGGAGGAAGAGGTTGCGATTACCCCTGAGGTACAGAAAGTAAAGGCGAAGAAGAAAAAGACCCCGGAGGAGAAAGAAGCCAAGAGAAGAAGGAGGCAAC
ATAGGGTTGTAGAACAAGAAGAGGTTCAGGAAGTGGCAGAGGATGTTGCCACTACAGTAGCGGAAGGAGATACACAAGAACCTGAAGTGCAACACCCAGCGGCGGTCGAA
TCAATAGACGCTGATACCGAAGGAATGCAGGAAGAAAACCCTGAAGAAAATCAAGAAACACAAGCTGACGAGGTTCGAGAAGAAGAGGCAGCGACTGTGCCTGAAAAAGA
GACTGAAAGAGAGCCAGTTCAGGAGGCTCGTGTTGAGGTCGTCATGCCCAAACCTCCGAGACGTCGCCGCATTAAGCGGAAGGCTGGGCGCGTTCAGATGATTTGGACTG
ATACCCCATCACCATCATCATCGGATTCTGAAAAAAGAGGGCAGAAATTTCCCCCAGCGACATATAATGAGATGGTTGTAGCGCCATCTAATGAGCAGCTGAGTGAGGCC
CTGCGGGAGGTAGGTATTGAAGGGGCACAGTGGCAGCTGTCTAAGACTGAGAAGAGGACGTTCCAGTCGGCTTATTTGAAAAGGGAAGCGAACATGTGGATGAGATTTAT
CAGACAAAGGATGCTTCCAACAACTGACGACTCGACAATCTCCAGGGAACGGGTTCTTCTAGCTTTTGGAATTTTGCGGTCTCTCAGCATTGACGTAGGGAAGATCATTG
CTAGTGAAATTTCTGGATGTTGGAAAAAGAAAGTGGGGAAACTGTTTTTTCCGAACACAATCACGATGCTATGCAGCAGGGCAGGAGTGCCCACGGTTCTAGATGATATA
ATTTTGCTTGACAAGGGAATTATAGACACACCTAATTTGGCACGACTCAAGCGTATGCAGGAGGTACGTCAAGGTGGACTTGTCTACGGCATCAACACGATTTTAGAACA
ACTGGCACTTTCGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAGCTTTAACCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAATCTTTTAAAGCTGAAGTTGAGCTTCAACTTGAAAGGAAAAGTAAGGTCATTAGATCTGACCGTGGAGGAGAATATTATGGCAGATATGATGGATCAGGTGA
ACAATGTCCAGGACCCTTTGCGCTTTTCCTTAAAGAATGTGGCATAGTTCCTCAGTATATGATGCCTGGGAAACCTAGCATGAATGAAGTTGCTGAGAGACGTAATAGGA
CTCTTAAGGATATGGTAAGAAGTATGGTTTGTCATTCTTCTTTGTCAGAGTTCCTTTGGGGAGAAGCATTGAAAACTGCAGTATACATACTTAACAGAGTCCCTAACAAA
GAAGTAAACAAAACCCCATATGAGTTGTGGACTGGGAAAAGACCTAGCCTAAGGAATTTACACATTTGGGGTTGTCCAACAGAGGCAAGGCCTTATAGGCCAAATGAAAA
GAAGTTGGACTCAAGAACCATAAGCTGCTACTTTGTTGGGAATGACATGGTTACCTTACCTATCATTGTCAGTAAAGAAATTCAAGATAATGATACTCAAGATCAAGTTT
CAACACTTGATATCATTACTTCTCAAGACAACACTCAAGATACATCCCTCTACTCTATAGATCAAACTCAACAAACTCAAGAAGTGCCATTAAGAAGATCCACTAGAGAA
AGGAGAAGTGCAATACCAAATGATTATATTGTATTTTTTCAAGAACATGAAGATAGCATTGGCCTTACAGAAAATGATCCTATCAATTTTCTACAGGCAAGGAAACGTTC
TAAGGTAGACAAGTGGATTGAGGTTATGCGAGATAAGATGAAATTTATGACTGACAATGGTGTTTGGGATCTAGTTAAGTTGCCTGAAGGAAAGAAACATATTGGTTGTA
AATGGATATTTAAAACCAAAAGGGATTCAAAAGGCAATATTGAGAGATATAAAGCACGCCCTGTTACAAAAGGTTTCACTCAAAAGGATGGCATAGATTATAAAGAGACT
TTCTTTCCAATGGATGTCAAGACGACTTTCCTCAATGGAGACATTGATGAAACAATTTATATGGAGCAACCAGAAAATTTTGCGTTAGGAAGTTCAAAGTCAATGGTTTG
CAAACTGAAGAAAGCCATCTATGGACTCGAGCAAGTCTCTCGTCAATGGTACCACAAATTTCATAATGTCATTACCTCTTTCAGTTTTGAGGCTAATGTAGTTGATGAAT
GTGTATACCATAAGAATAGTGGGAGTAGGTATATTTTCCTGGTATTGTATGTAGATGACACCTTACTTGCTTGCAGTGATTTAAACCTACCACAAGAAACCAATACCTTT
CTAACAAAACATTTTGAAATGAAAGATCTTGGAAATGCCTCTTTTGTATTAGGTATACACATACTGCGAGATCGTTCCCAAGGTATCTTAGGGTTATCACAAAAGAGTTA
CATTGATAAAGTATTGAGTAGGTATGACATGAATGGATGTCAGCCAGGTGATACACCTGTAGCTAAAGAAGATAAATTTAGTTTAAACCAGTGCCCCAAAAGCGATCATG
AAAGAATTGAGATGCAAAAGATTCCTTATTCATCCGTTGTAGGGAGTCTAATGTATGCTCAAGTTTGCACCCGACCTGATATAGCTTTTATAGTGGGAGTGTTGGGTAGA
TATCTGAGTGACCCAGAATGTGTTGCTGGGCGACTGGAGGGAGCAAATTCTGTGTTGCAGCAAAACTGGGAGCAAAAACTGCCACATCACAGCTCATTAGCCAAGTTGAC
AAACCGAATTCTGTTGAGTTATTCTCGTGATAAAGGAGCAAGGAGAGCCCTACACGTGTCCAGATCCAGCGAACCCCCAGAATCGCTTGCTGCAGCAAAATCGCCGCTGG
AGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTTTTCCAATGTTTGATGAGTTGAATCCA
GGGATTGCACGTCCTCAAATCCAAGCGGCAAATTTTGAAATGAAACGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCC
CCAGCATGGACCTGCTGCAGTGGTGAACCAAGTCATAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTGG
CGCAACCACCCCAACTTTCATGGGGAGGACAAGGAAGTAATATGCAAACACAACAAAAGGTGAACCAGCCGGGATTTGCTAAAGCGCAGGTATTGCCCCAGCAAAATAAG
CAGGCTTGCCCCACAAAATTCAGGAGTTCTCTTGAGGCGATGATGAAGGAATTTATGGCTCGTACAGATGCCGCAATTCAAAGGAAACTTCCTCAGATACTGAACACCCT
GAAGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGAGTTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCTGTTCCAT
ATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATT
CTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAGATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTT
TGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACTATACCTGTATCTATAGGTGGAAAAG
AGTTAGGGAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGCTAGGTATTGGTGAAGTTAGGCCTACCACAGTTACGCTCCAA
TTAGGTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTGAAGGTAGATAAATTCATATTTCCTATTGATTTTATTATTTTAGACTATGAGGCTAA
TAAACATGTCCCAATCATTTTAGGTCATCCATTTTCGGCTACTGGTAGGGCATTAATAGATGTTCAGAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAAT
TTAATGTGTTTAAAGCCATGAAGTATCTAGACGAAATGGAAGATTGCTCTTTCATCAGGATTCTAGAGAGCACAATTGTTAAGATAGCAATACAAGATTCGACTGACAAG
CATTTGGAAGGTCATGGAGAGGGAATTAGCCCATCTTTTTGTATGCATAAAATCGCTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGC
AATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATACCGGGATCATTTATCCAATTGTAGACAACAATTGGGTGAGCCCTGTCCAATGTGTTCCTAAGAAAG
GAGGTGTCACTATGATTACTATTGCTCCTGAGGATCAGGAAAAAATCACTTTCACCTGCCCTTATGGGACGTTTGCTTTCAGGCGAATGCCTTTTGGCCTTTGCAATGCT
CCAGCAACATTTCAGTGTAAGGTGTTAAAGAGATGTGAGGATACCCATCTAGTTCTTAATTGGGAAAAATGCCACTTCATGGTGAAGGAGGGCATAGTGTTAGGTCATAG
GATTTCTAAGAATGGTCTAGAAGTTGATAGAGCAAAAATTGAGGTTATTGAGAGACTAGAACCACCGAATTCAGTGAAGGGAATTCAGAGTTTTTTAGGCCATGCTGGAT
TTTATAGGAGGGTTTTAAATGAAGCACAAATCAACTACACAACTACTGAAAAGGAGTTGTTAGCTGTTGTGTTTGCTTTTGAGAAATTCCGGCAACATTTGGTTGGATCC
AAAGTCACGGTGTTCACGGATCATGCAGCAATAAGGTATTTAATGGCTAAGAAAGATGCAAAGGTAGTCAAGGATGCCCCTTGGTGTGTTTCAGGTGATGAAGCAAAGGA
AATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGCCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTTTTTTGGCCTACATTATTTAAGGATG
CCCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTAGAAGTTGAATTATTCGATGTATGG
GGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCCAAGTGGGTGGAGGCCATCGCATGCCATCAGAGTGA
TGCCAAGACAGTAGCACGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGAGCTCTAGTGAGTGATGAGGGTATGCATTTTGTTAATAATATCTTAACTA
AGCTTTTAGCTAAGTATGAGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAAGAGGGTAATTAAAGCTATTCTGGAGAAA
GTAGTCCATCCATCTAGGAAGGATTGGTCTTTTAGGTTGGATGAGGCACTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGGAGCAATAAGAATGCTGCAGCTTAA
TGAATTAGAGGAATTTCGCCAATTTTCTTATGAAAATGCGAGAATGTATAAGGAAAAGACTAAGCTGCGGCATGACAAGAAAATTAAATCTAAAGAGTTTGGCAAGGGTC
AGAAAGTCTTGCATTATAATTCTAGATTGAAATTATTTCCCGGGAAAGTAAAATCTAAATGGTCAGGACCGTTTGTTGTGATTGAGGTTTTCCCCCATGGAGCAATTACT
TTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGTGAATGGACAGCGTGTGAAACACTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGAGCGAGGAAAGA
AAGAGAGAATGAGGAGGAAGAGGTTGCGATTACCCCTGAGGTACAGAAAGTAAAGGCGAAGAAGAAAAAGACCCCGGAGGAGAAAGAAGCCAAGAGAAGAAGGAGGCAAC
ATAGGGTTGTAGAACAAGAAGAGGTTCAGGAAGTGGCAGAGGATGTTGCCACTACAGTAGCGGAAGGAGATACACAAGAACCTGAAGTGCAACACCCAGCGGCGGTCGAA
TCAATAGACGCTGATACCGAAGGAATGCAGGAAGAAAACCCTGAAGAAAATCAAGAAACACAAGCTGACGAGGTTCGAGAAGAAGAGGCAGCGACTGTGCCTGAAAAAGA
GACTGAAAGAGAGCCAGTTCAGGAGGCTCGTGTTGAGGTCGTCATGCCCAAACCTCCGAGACGTCGCCGCATTAAGCGGAAGGCTGGGCGCGTTCAGATGATTTGGACTG
ATACCCCATCACCATCATCATCGGATTCTGAAAAAAGAGGGCAGAAATTTCCCCCAGCGACATATAATGAGATGGTTGTAGCGCCATCTAATGAGCAGCTGAGTGAGGCC
CTGCGGGAGGTAGGTATTGAAGGGGCACAGTGGCAGCTGTCTAAGACTGAGAAGAGGACGTTCCAGTCGGCTTATTTGAAAAGGGAAGCGAACATGTGGATGAGATTTAT
CAGACAAAGGATGCTTCCAACAACTGACGACTCGACAATCTCCAGGGAACGGGTTCTTCTAGCTTTTGGAATTTTGCGGTCTCTCAGCATTGACGTAGGGAAGATCATTG
CTAGTGAAATTTCTGGATGTTGGAAAAAGAAAGTGGGGAAACTGTTTTTTCCGAACACAATCACGATGCTATGCAGCAGGGCAGGAGTGCCCACGGTTCTAGATGATATA
ATTTTGCTTGACAAGGGAATTATAGACACACCTAATTTGGCACGACTCAAGCGTATGCAGGAGGTACGTCAAGGTGGACTTGTCTACGGCATCAACACGATTTTAGAACA
ACTGGCACTTTCGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAGCTTTAACCTTCTGA
Protein sequenceShow/hide protein sequence
MFKSFKAEVELQLERKSKVIRSDRGGEYYGRYDGSGEQCPGPFALFLKECGIVPQYMMPGKPSMNEVAERRNRTLKDMVRSMVCHSSLSEFLWGEALKTAVYILNRVPNK
EVNKTPYELWTGKRPSLRNLHIWGCPTEARPYRPNEKKLDSRTISCYFVGNDMVTLPIIVSKEIQDNDTQDQVSTLDIITSQDNTQDTSLYSIDQTQQTQEVPLRRSTRE
RRSAIPNDYIVFFQEHEDSIGLTENDPINFLQARKRSKVDKWIEVMRDKMKFMTDNGVWDLVKLPEGKKHIGCKWIFKTKRDSKGNIERYKARPVTKGFTQKDGIDYKET
FFPMDVKTTFLNGDIDETIYMEQPENFALGSSKSMVCKLKKAIYGLEQVSRQWYHKFHNVITSFSFEANVVDECVYHKNSGSRYIFLVLYVDDTLLACSDLNLPQETNTF
LTKHFEMKDLGNASFVLGIHILRDRSQGILGLSQKSYIDKVLSRYDMNGCQPGDTPVAKEDKFSLNQCPKSDHERIEMQKIPYSSVVGSLMYAQVCTRPDIAFIVGVLGR
YLSDPECVAGRLEGANSVLQQNWEQKLPHHSSLAKLTNRILLSYSRDKGARRALHVSRSSEPPESLAAAKSPLEQNEQQNNQAENPILVANDRTRAIRAYAFPMFDELNP
GIARPQIQAANFEMKRADLAMIANALKNVTVISHQQPQHGPAAVVNQVIEEACVYCGEDHNYEFCPSNPASVLAQPPQLSWGGQGSNMQTQQKVNQPGFAKAQVLPQQNK
QACPTKFRSSLEAMMKEFMARTDAAIQRKLPQILNTLKEGKEQVKAVTLRSELETGQGAGGSNKDAGASGSVPYVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEI
LKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEVRPTTVTLQ
LGDRSITYPEGKIEDVLVKVDKFIFPIDFIILDYEANKHVPIILGHPFSATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYLDEMEDCSFIRILESTIVKIAIQDSTDK
HLEGHGEGISPSFCMHKIALEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDTGIIYPIVDNNWVSPVQCVPKKGGVTMITIAPEDQEKITFTCPYGTFAFRRMPFGLCNA
PATFQCKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRVLNEAQINYTTTEKELLAVVFAFEKFRQHLVGS
KVTVFTDHAAIRYLMAKKDAKVVKDAPWCVSGDEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW
GIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGMHFVNNILTKLLAKYEIKHRIATPYHPQANGQAEISKRVIKAILEK
VVHPSRKDWSFRLDEALWAYRTAYKTPLGAIRMLQLNELEEFRQFSYENARMYKEKTKLRHDKKIKSKEFGKGQKVLHYNSRLKLFPGKVKSKWSGPFVVIEVFPHGAIT
LQDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRARKERENEEEEVAITPEVQKVKAKKKKTPEEKEAKRRRRQHRVVEQEEVQEVAEDVATTVAEGDTQEPEVQHPAAVE
SIDADTEGMQEENPEENQETQADEVREEEAATVPEKETEREPVQEARVEVVMPKPPRRRRIKRKAGRVQMIWTDTPSPSSSDSEKRGQKFPPATYNEMVVAPSNEQLSEA
LREVGIEGAQWQLSKTEKRTFQSAYLKREANMWMRFIRQRMLPTTDDSTISRERVLLAFGILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCSRAGVPTVLDDI
ILLDKGIIDTPNLARLKRMQEVRQGGLVYGINTILEQLALSASRQEFAERQALTF