; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014392 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014392
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr12:310147..319927
RNA-Seq ExpressionLag0014392
SyntenyLag0014392
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR003441 - NAC domain
IPR012337 - Ribonuclease H-like superfamily
IPR036093 - NAC domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]0.0e+0054.87Show/hide
Query:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINL---MPLSV----------YRK-------
        MP+Y KF+KDIL+K++                                      G    GRALCDLG    L    P S+          Y K       
Subjt:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINL---MPLSV----------YRK-------

Query:  --------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQ
                      LD E D +VPIILGRPFLATGR LIDVQ                  KAMK+P+E ++C  + + ++ V            +E  + 
Subjt:  --------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQ

Query:  DSANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWT
        D  ++ +E+  E                      AP   +KPS+ E PTL+LKPLP+HL Y YLGESDTLP+I++S L     E L+++L+ ++ AIGWT
Subjt:  DSANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWT

Query:  LADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYR
        +ADI+GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPT+TVTGWRVCMDYR
Subjt:  LADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYR

Query:  RLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFS
        +LNKATRKDHFPL FIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNA ATFQRCM+AIF+DM+E+ +EVFMDDFS
Subjt:  RLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFS

Query:  VFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNL
        V+G SF  CL+NL  VLKRCEDT+L+LNWEKCHFMV+EGI LGH++S   +EVD+AK+E IE+L PP SVKG++SFLGHAGFYRRFIKDFSKISKPLCNL
Subjt:  VFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNL

Query:  LCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYL
        L  D  F F+  CR AF  LK  LISAPI+  P+W+ PFE+MCDASD AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTEKEL AVVFAF+KFR YL
Subjt:  LCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYL

Query:  VGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYN
        VG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL+A +AS         V W Y 
Subjt:  VGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYN

Query:  AISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASK
         I   + C                                                                                            
Subjt:  AISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASK

Query:  GSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRG
        G  P D+  +QKKKF  D + +FWD+ F++KQ  D+I+RRCV   E  +ILEQC+ SPYGGHF G RTA +IL  GFFWP LFKDAH F   CD CQR G
Subjt:  GSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRG

Query:  NLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAK
        N+  R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+K
Subjt:  NLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAK

Query:  YGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
        YG+KH+I+TPYHPQ +GQ E+SN+EIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+G
Subjt:  YGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]0.0e+0052.82Show/hide
Query:  MPNYAKFLKDILTKRK--GGKEL-----------------------GRALCDLGASINLMPLSVYRK---------------------------------
        MP+Y KF+KDIL+K++  G  E                         RALCDLGASINLMP S+YR                                  
Subjt:  MPNYAKFLKDILTKRK--GGKEL-----------------------GRALCDLGASINLMPLSVYRK---------------------------------

Query:  ------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQDS
                    LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              +E  + D 
Subjt:  ------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQDS

Query:  ANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLA
         ++ +E+  E                      AP   +KPS+ E PTL+LKPLP HL Y YLGESDTLP+I++S L     E L+++L+ ++ AIGWT+A
Subjt:  ANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLA

Query:  DIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRL
        DI+GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCVPKKGG+TVV N  NE IPTKTVTGWRVCMDYR+L
Subjt:  DIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRL

Query:  NKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVF
        NKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR+PF LCNA ATFQRCM+AIF+DM+E+ +EVFMDDFSV+
Subjt:  NKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVF

Query:  GGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLC
        G SF  CL+NL  VLKRCEDT+LVLNWEKCHFMV+EGI LGH++S   +EVD+AK+E IE+L P  SVKG++SFLGHAGFYRRFIKDF KISKPLC LL 
Subjt:  GGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLC

Query:  TDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVG
         D  F F+  C  AF+ LK  LISAPI+  P+W+ PFE+MCDASD A+GA+LGQ++ K    IYYASK LN+AQ+NYTTTEKEL AVVFAF+KFR YLVG
Subjt:  TDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVG

Query:  SKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAI
        +KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FPDEQL+A +AS         V W Y  I
Subjt:  SKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAI

Query:  SELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGS
           + C                                                                                            G 
Subjt:  SELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGS

Query:  HPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNL
         P D+  +QKKKF  D + +FWD+ F++KQ  D+I+RRCV   E  +I EQC+ SPYGGHF   RTA +IL  GFFWP LFKD H F   CD CQR GN+
Subjt:  HPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNL

Query:  GPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYG
          R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDY+SKWVEA+A   +D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+KYG
Subjt:  GPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYG

Query:  IKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG-----------------------------------------
        +KH+I+TPYHPQ +GQ E+SN+EIK  LEK V  +RKDWS RLDEALWAYRTA+KTP+G                                         
Subjt:  IKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG-----------------------------------------

Query:  -------PFIVIEVFSNGAITLQDEKDGRVFKENGQCVKHYW
               PF + EV  +GA+ L+++     FK N Q +KHYW
Subjt:  -------PFIVIEVFSNGAITLQDEKDGRVFKENGQCVKHYW

XP_012838027.1 PREDICTED: uncharacterized protein LOC105958568 [Erythranthe guttata]0.0e+0052.25Show/hide
Query:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINLMPLSVYRK--------------------
        MP++ KFLKD+++K++                                      GGK   ++L DLGASINLMP SV++                     
Subjt:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINLMPLSVYRK--------------------

Query:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILE----------
                                 LD E DK++P+ILGRPFL TGR +IDV  G LTM + NE +KF+VF+ MK+P E+++C    +L+          
Subjt:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILE----------

Query:  -------STVIETTIQ-----------DSANKHSEKHGE--------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL
               S  + T+++           DSA   S    E        +   KPS+IE P L+LK LP HLKY+YLG  +TLP+I++S L    E+ L+++
Subjt:  -------STVIETTIQ-----------DSANKHSEKHGE--------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL

Query:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT
        L++++ A+GWT+ADI+GISP+FCMHKI LEE    S+EQQRRLNP MKEVVKKE+IKWLDAGII+PI+DS  VSPVQCVPKKGG+TV+ N+ NELIP++T
Subjt:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT

Query:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE
        VTGWR+CMDYR+LNKATRKDHFPLPFIDQMLDRLA Q +YCFLDGYSGYNQI IAPEDQ+KTTFTCP+GTFAFRRMPFGLCNA ATFQRCM+AIF+DM+E
Subjt:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE

Query:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD
          +E+FMDDFSVFG ++ +CL  L KVL+RCE+T+LVLNWEKCHFMV+EGI LGH++SK  LEVDRAK+E IE+L PP SVKGI+SFLGHAGFYRRFIKD
Subjt:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD

Query:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV
        FSK++KPLCNLL  D  F FN +C KAF+ LK  L++API+  P+WN PFE+MCDASD AVGA+LGQ++ K  H IYYASK LN+AQ+NYTTTEKEL A+
Subjt:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV

Query:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS
        VFAFEKFR YL+G+KV VFTDH+A++YL+ KK+AKPRLIRW+LLLQEFDLEI+D KGSEN +ADH SRL+    + +   I+++FPDEQL+A        
Subjt:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS

Query:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK
         N + V W  + ++ L+                                      +H+                                          
Subjt:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK

Query:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF
                      P ++ + Q+KKF HD  F+FW+E F+++Q SD +IRRCV  +E  +IL  C+ S Y GH+ G+RTA ++L  GFFWPTLFKD + F
Subjt:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF

Query:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF
           CD CQR GN+  R E+PL  ++EVE FDVWGIDFMGPF PS  N++IL+AVDYVSKWVEA A   +D+  V  FL+  IF RFGTPRA++SD GTHF
Subjt:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG--PFIVI
         N     LLAKYG+KH++A  YHPQ NGQAE++N+EIK ILEKVV P+RKDWS +LD+ALWAYRTAYKTPLG  PF +I
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG--PFIVI

XP_012841295.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105961608 [Erythranthe guttata]0.0e+0052.08Show/hide
Query:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINLMPLSVYRK--------------------
        MP++ KFLKD+++K++                                      GGK   ++L DLGASINLMP SV++                     
Subjt:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINLMPLSVYRK--------------------

Query:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILE----------
                                 LD E DK++P+ILGRPFL TGR +IDV  G L+M + +E +KF+VF+ MK+P E+++C    +L+          
Subjt:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILE----------

Query:  -------STVIETTIQ-----------DSANKHSEKHGE--------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL
               S  + T+++           DSA   S    E        +   KPS+IE P L+LK LP HLKY+YLG  +TLPII++S L    E+ L+++
Subjt:  -------STVIETTIQ-----------DSANKHSEKHGE--------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL

Query:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT
        L++++ A+GWT+ADI+GISP+FCMHKI LEE    S+EQQRRLNP MKEVVKKE+IKWLDAGII+PI+DS WVS VQCVPK GG+TV+ N+ NELIP++T
Subjt:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT

Query:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE
        VTGWR+CMDYR+LNKATRKDHFPLPFIDQMLDRLA Q +YCFLDGYSGYNQI IAPEDQ+KTTFTCP+GTFAFR MPFGLCNA ATFQRCM+AIF+DM+E
Subjt:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE

Query:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD
          +E+FMDDFSVFG ++ +CL  L KVL+RCE+T+LVLNWEKCHFMV+EGI LGH++SK  LEVDRAK+E IE+L PP SVKGI+SFLGHAGFYRRFIKD
Subjt:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD

Query:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV
        FSK++KPLCNLL  D  F FN +C KAF+ LK  L++API+  P+WN PFE+MCDASD AVGA+LGQ++ K  H IYYASK LN+AQ+NYTTTEKEL AV
Subjt:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV

Query:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS
        VFAFEKFR YL+G+KV VFTDH+A++YL+ KK+AKPRLIRW+LLLQEFDLEI+D KGSEN +ADHLSRL+    + +   I+++FPDEQL+A        
Subjt:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS

Query:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK
         N + V W  + ++ L+                                      +H+                                          
Subjt:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK

Query:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF
                      P ++ + Q+KKF HD  F+FW+E F+++QCSD +IRRCV  +E ++IL  C+ S Y GH+ G+RTA ++L  GFFWPTLFKD + F
Subjt:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF

Query:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF
           CD CQR GN+  R E+PL  ++EVE FDVWGIDFMGPF PS  N++IL+AVDYVSKWVEA A   +D+  V  FL+  IF RFGTPRA+++D GTHF
Subjt:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG--PFIVI
         N     LLAKYG+KH++A  YHPQ NGQAE++N+EIK ILEKVV P+ KDWS +LD+ALWAYRTAYKTPLG  PF +I
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG--PFIVI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]0.0e+0052.35Show/hide
Query:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINLMPLSVYRK--------------------
        MPNYAKFLKDI++K++                                      G     + LCDLGASINLMPLSVYRK                    
Subjt:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINLMPLSVYRK--------------------

Query:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQD
                                 LD E D++VP+ILGRPFLATGRAL+DVQKGELT+RV  EEV FN+++AMK+P++   C  + ++E  V+E   +D
Subjt:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQD

Query:  SANKHSEK--------------------------------------------HGEAPPIKPSLI------------EAPTLDLKPLPDHLKYVYLGESDT
            H E+                                              E   ++P +             +  T +LK LP+HL+Y +LG+S T
Subjt:  SANKHSEK--------------------------------------------HGEAPPIKPSLI------------EAPTLDLKPLPDHLKYVYLGESDT

Query:  LPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVP
         P+IVA+ L PE EE L+++L+++R A+GWT++DI+GISPS CMHKI +EE    SIE QRRLNPAMKEVV+ E++K L+AGIIY I+DS+WVSPVQ VP
Subjt:  LPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVP

Query:  KKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGL
        KKGG+TVV N +NE IPT+TVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG +YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGL
Subjt:  KKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGL

Query:  CNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNS
        CNA ATFQRCM+AIFSDM+E  +E+FMDDFSVFG SF  CL NL  VL+RCED +LVLNWEKCHFMV+EGI LGHR+S   +EVDRAK+  IE+L PP +
Subjt:  CNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNS

Query:  VKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYAS
        VKGI+SFLGHAGFYRRFIKDFSK+SKPLCNLL  +  F F+  C +AF  +K  LISAP++  P+W+ PFEVMCDASD A+GA+LGQ++ K    IYYAS
Subjt:  VKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYAS

Query:  KVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSA
        + LNEAQ+NYTTTEKE+ AVVFA +KFR YL+ +KV VFTDHAA+RYL SKKDAKPRLIRWILLLQEFDLE++DKKGSEN +ADHLSRL+    +     
Subjt:  KVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSA

Query:  ISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDA
        I ++FPDEQL A                                                              +  +P+    V   A +         
Subjt:  ISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDA

Query:  TVFPYSERAFKRQRLDAKTKNQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTA
                                          P D+ + Q+KKF HD K++ WDE  ++K+C D IIRRCV   E + IL  C+ S YGGHF   RTA
Subjt:  TVFPYSERAFKRQRLDAKTKNQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTA

Query:  MRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQS
         ++L  GFFWP++F+D++   K CD CQR GN+  R E+PL  ILEVELFDVWGIDFMGPFPPS G V+ILLAVDYVSKWVEAIA   +DAK V +FL  
Subjt:  MRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQS

Query:  HIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
        +IF RFGTPRA++SDEGTHF N +   LL+KYG+KH+IA  YHPQ NGQAEISN+EIK ILEK V+ +RKDW+ +LD+ALWAYRTA+KTP+G
Subjt:  HIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase0.0e+0054.87Show/hide
Query:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINL---MPLSV----------YRK-------
        MP+Y KF+KDIL+K++                                      G    GRALCDLG    L    P S+          Y K       
Subjt:  MPNYAKFLKDILTKRK--------------------------------------GGKELGRALCDLGASINL---MPLSV----------YRK-------

Query:  --------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQ
                      LD E D +VPIILGRPFLATGR LIDVQ                  KAMK+P+E ++C  + + ++ V            +E  + 
Subjt:  --------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQ

Query:  DSANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWT
        D  ++ +E+  E                      AP   +KPS+ E PTL+LKPLP+HL Y YLGESDTLP+I++S L     E L+++L+ ++ AIGWT
Subjt:  DSANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWT

Query:  LADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYR
        +ADI+GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPT+TVTGWRVCMDYR
Subjt:  LADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYR

Query:  RLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFS
        +LNKATRKDHFPL FIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNA ATFQRCM+AIF+DM+E+ +EVFMDDFS
Subjt:  RLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFS

Query:  VFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNL
        V+G SF  CL+NL  VLKRCEDT+L+LNWEKCHFMV+EGI LGH++S   +EVD+AK+E IE+L PP SVKG++SFLGHAGFYRRFIKDFSKISKPLCNL
Subjt:  VFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNL

Query:  LCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYL
        L  D  F F+  CR AF  LK  LISAPI+  P+W+ PFE+MCDASD AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTEKEL AVVFAF+KFR YL
Subjt:  LCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYL

Query:  VGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYN
        VG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL+A +AS         V W Y 
Subjt:  VGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYN

Query:  AISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASK
         I   + C                                                                                            
Subjt:  AISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASK

Query:  GSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRG
        G  P D+  +QKKKF  D + +FWD+ F++KQ  D+I+RRCV   E  +ILEQC+ SPYGGHF G RTA +IL  GFFWP LFKDAH F   CD CQR G
Subjt:  GSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRG

Query:  NLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAK
        N+  R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+K
Subjt:  NLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAK

Query:  YGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
        YG+KH+I+TPYHPQ +GQ E+SN+EIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+G
Subjt:  YGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

A0A2G9G6G2 Reverse transcriptase0.0e+0053.41Show/hide
Query:  MPNYAKFLKDILTKRK--GGKEL------------------------------------GRALCDLGASINLMPLSVYRK--------------------
        MP+Y KF+KDIL+K++  G  E+                                    GRALCDLGASINLMP S+YR                     
Subjt:  MPNYAKFLKDILTKRK--GGKEL------------------------------------GRALCDLGASINLMPLSVYRK--------------------

Query:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILES---------
                                 LD E D +VPIILGRPFLATGR LIDVQ                  KAMK+P+E ++C  + + ++         
Subjt:  -------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILES---------

Query:  ---TVIETTIQDSANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL
             +E  + D  ++ +EK  E                      AP   +KPS+ E PTL+LKPLP HL Y YLGESDTLP+I++S L     E L+++
Subjt:  ---TVIETTIQDSANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL

Query:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT
        L+ ++  IGWT+ADI+GISPSFCMHKI LE+    SIE QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPT+T
Subjt:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT

Query:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE
        VTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNA ATFQRCM+AIF+DM+E
Subjt:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE

Query:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD
        + +EVFMD+FSV+G SF  CL+NL  VLKRCEDT+LVLNWEKCHFMV+EGI LGH++S   +EVD+AK+E IE+L PP SVKG++SFLGHAGFYRRFIKD
Subjt:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD

Query:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV
        FSKISKPLCNLL  D  F FN  C  AF  LK  LISAPI+  P+            D AVGA+LGQ++ K    IYYASK LN+AQ+NYTTTEKEL AV
Subjt:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV

Query:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS
        VFAF+KFR YLV +KV V+TDHAAIRYL+ KKDA P LI W+LLLQEFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL+A +AS    
Subjt:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS

Query:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK
             V W Y  I   + C                                                                                 
Subjt:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK

Query:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF
                   G  P D+  +QKKK   D + +FW++ F+ KQ  D+I+RRCV   E  +ILEQC+ SPYGGHF G RTA +IL  GFFWP LFKDAH F
Subjt:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF

Query:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF
           CD CQR  N+  R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD  T+F
Subjt:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
         N     LL+KYG+KH+I TPYHPQ +G  E+SN+EIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+G
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

A0A2G9HWF8 Reverse transcriptase0.0e+0052.82Show/hide
Query:  MPNYAKFLKDILTKRK--GGKEL-----------------------GRALCDLGASINLMPLSVYRK---------------------------------
        MP+Y KF+KDIL+K++  G  E                         RALCDLGASINLMP S+YR                                  
Subjt:  MPNYAKFLKDILTKRK--GGKEL-----------------------GRALCDLGASINLMPLSVYRK---------------------------------

Query:  ------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQDS
                    LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              +E  + D 
Subjt:  ------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQDS

Query:  ANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLA
         ++ +E+  E                      AP   +KPS+ E PTL+LKPLP HL Y YLGESDTLP+I++S L     E L+++L+ ++ AIGWT+A
Subjt:  ANKHSEKHGE----------------------APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLA

Query:  DIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRL
        DI+GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCVPKKGG+TVV N  NE IPTKTVTGWRVCMDYR+L
Subjt:  DIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRL

Query:  NKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVF
        NKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR+PF LCNA ATFQRCM+AIF+DM+E+ +EVFMDDFSV+
Subjt:  NKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVF

Query:  GGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLC
        G SF  CL+NL  VLKRCEDT+LVLNWEKCHFMV+EGI LGH++S   +EVD+AK+E IE+L P  SVKG++SFLGHAGFYRRFIKDF KISKPLC LL 
Subjt:  GGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLC

Query:  TDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVG
         D  F F+  C  AF+ LK  LISAPI+  P+W+ PFE+MCDASD A+GA+LGQ++ K    IYYASK LN+AQ+NYTTTEKEL AVVFAF+KFR YLVG
Subjt:  TDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVG

Query:  SKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAI
        +KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FPDEQL+A +AS         V W Y  I
Subjt:  SKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAI

Query:  SELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGS
           + C                                                                                            G 
Subjt:  SELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGS

Query:  HPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNL
         P D+  +QKKKF  D + +FWD+ F++KQ  D+I+RRCV   E  +I EQC+ SPYGGHF   RTA +IL  GFFWP LFKD H F   CD CQR GN+
Subjt:  HPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNL

Query:  GPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYG
          R EMPL  ILEVELFDVWGIDFMGPF PS GN++IL+AVDY+SKWVEA+A   +D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+KYG
Subjt:  GPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYG

Query:  IKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG-----------------------------------------
        +KH+I+TPYHPQ +GQ E+SN+EIK  LEK V  +RKDWS RLDEALWAYRTA+KTP+G                                         
Subjt:  IKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG-----------------------------------------

Query:  -------PFIVIEVFSNGAITLQDEKDGRVFKENGQCVKHYW
               PF + EV  +GA+ L+++     FK N Q +KHYW
Subjt:  -------PFIVIEVFSNGAITLQDEKDGRVFKENGQCVKHYW

A0A6P8CBX2 Reverse transcriptase0.0e+0051.54Show/hide
Query:  MPNYAKFLKDILTKRK-----------------------------------------GGKELGRALCDLGASINLMPLSVYRK-----------------
        MP+YA+F+KD+LTK++                                         G       L D GASINLMPLS++RK                 
Subjt:  MPNYAKFLKDILTKRK-----------------------------------------GGKELGRALCDLGASINLMPLSVYRK-----------------

Query:  ----------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV----
                                    L+ E D++VP+ILGRPFLATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++  +    
Subjt:  ----------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV----

Query:  --------IETTIQD-----SANKHSEKHGE--------------APPIKP--SLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL
                +E+ ++D       ++H E+  E                  KP  SL ++P L+LKPLP HLKY YLG  DTLPII++S L  + E+ L+ +
Subjt:  --------IETTIQD-----SANKHSEKHGE--------------APPIKP--SLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKL

Query:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT
        L+++++AIGWT+ADI+GISP  C H+I LE      ++ QRRLNP +KEVVKKEV+K LDAGIIYPI+DS WVSPVQ VPKKGG+TVV N+ N+LIPT+T
Subjt:  LQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKT

Query:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE
        VTGWRVC+DYR+LN ATRKDHFPLPFIDQML++LAG  YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNA ATFQRCM++IFSDM+E
Subjt:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIE

Query:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD
        + +E+FMDDFSVFG SF+SCL NLG VLKRC++T+L+LNWEKCHFMV+EGI LGH++SK  +EVDRAKVE+IE+L PP S KG++SFLGHAGFYRRFIKD
Subjt:  STVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKD

Query:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV
        FSKIS+PLCNLL  D  F FN +C +AF  LK  L SAP++ APNW LPFE+MCDASD AVGA+LGQ++GK  H IYYAS+ LNEAQ NY TTEKEL AV
Subjt:  FSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAV

Query:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS
        +FA +KFRPYL+GSK+ V+TDHAA++YL +K DAKPRLIRWILLLQEFDLEI+D KG+ENV+ADHLSRL+        S I++ FPDEQ        L+ 
Subjt:  VFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYS

Query:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK
          ++ + W  + ++ ++                                                                                   
Subjt:  FNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTK

Query:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF
          NI+         P  +  +QKKKF HD K++FWDE +++K C+D +IRRCV   E   I++ C+    GGHF  +RTA +IL CGF+WP +F D   +
Subjt:  NQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWF

Query:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF
           C  CQR GN+  R E+P   IL +ELFDVWGIDFMGPFP S  N +IL+AVDYVSKWVEA+A   +DA+ V RFL+ +IF+RFG PRA++SD G+HF
Subjt:  YKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
         N    KLL+KYG+ H+IATPYHPQ  GQ E+SN+EIK ILEK V+ SRKDWS +LD+ALWAYRTA+KTP+G
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

A0A6P8DLJ8 Reverse transcriptase0.0e+0053.71Show/hide
Query:  MPNYAKFLKDILTKRK--GGKELGRALCDLGASIN----------------LMPLSVYRKLDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEE
        MP+YA+F+KD+LTK++   G E    +   G SI                 + P+  +  L+ E D++VP+ILGRPFLATG+ALIDV++G+LT+RV NE+
Subjt:  MPNYAKFLKDILTKRK--GGKELGRALCDLGASIN----------------LMPLSVYRKLDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEE

Query:  VKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQD-----SANKHSEKHGE--------------APPIKP--SLIEAPTLDLKPLPD
        + FNV+ A+K  D+ + C  I I++  +            +E+ ++D       ++H E+  E                  KP  SL ++P L+LKPLP 
Subjt:  VKFNVFKAMKYPDEMEDCSFIRILESTV------------IETTIQD-----SANKHSEKHGE--------------APPIKP--SLIEAPTLDLKPLPD

Query:  HLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIA
        HLKY YLG  DTLPII++S L  + E+ L+ +L+++++AIGWT+ADI+GISP  C H+I LE      ++ QRRLNP +KEVVKKEV+K LDAGIIYPI+
Subjt:  HLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIA

Query:  DSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPY
        DS WVSPVQ VPKKGG+TVV N+ NELIPT+TVTGWRVC+DYR+LN ATRKDHFPLPFIDQML++L G  YYCFLDGYSGYNQI IAPEDQEKTTFTCPY
Subjt:  DSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPY

Query:  GTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAK
        GTFAFRRMPFGLCNA ATFQRCM++IFSDM+E+ +E+FMDDFSVFG SF+SCL NLG VLKRC++T+L+LNWEKCHFMV+EGI LGH++SK  +EVDRAK
Subjt:  GTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISKNCLEVDRAK

Query:  VEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQK
        VE+IE+L PP S KG++SFLGHAGFYRRFIKDFSKIS+PLCNLL  D  F FN +C +AF  LK  L SAP++ APNW LPFE+MC ASD AVGA+LGQ+
Subjt:  VEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQK

Query:  QGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSR
        +GK  H IYYAS+ LNEAQ NY TTEKEL AV+FA +KFRPYL+GSK+ V+TDHAA++YL +K DAKPRLIRWILLLQEFDLEI+D KG+ENV+ADHLSR
Subjt:  QGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSR

Query:  LDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLT
        L+        S I++ FPDEQ        L+   ++ + W  + ++ ++                                                   
Subjt:  LDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLT

Query:  ASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFS
                                          NI+         P  +  +QKKKF HD K++FWDE +++K C+D +IRRCV   E   I++ C+  
Subjt:  ASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFS

Query:  PYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQ
          GGHF  +RTA +IL CGF+WP +F D   +   C  CQR GN+  R E+P   IL +ELFDVWGIDFMGPFP S  N +IL+AVDYVSKWVEA+A   
Subjt:  PYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQ

Query:  SDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYK
        +DA+ V RFL+ +IF+R G PRA++SD G+HF N    KLL+KYG+ H+IATPYHPQ  GQ E+SN+EIK ILEK V+ SRKDWS +LD+ALWAYRTA+K
Subjt:  SDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYK

Query:  TPLG
        TP+G
Subjt:  TPLG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.1e-7536.07Show/hide
Query:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDG
        A ++ V+ ++   L+ GII   ++S + SP+  VPKK      S K            +R+ +DYR+LN+ T  D  P+P +D++L +L    Y+  +D 
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDG

Query:  YSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHF
          G++QI + PE   KT F+  +G + + RMPFGL NA ATFQRCM  I   ++     V++DD  VF  S    L +LG V ++    +L L  +KC F
Subjt:  YSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHF

Query:  MVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTD-HVFYFNADCRKAFETLKAALISAPILCAP
        + +E   LGH ++ + ++ +  K+E I++   P   K I++FLG  G+YR+FI +F+ I+KP+   L  +  +   N +   AF+ LK  +   PIL  P
Subjt:  MVKEGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTD-HVFYFNADCRKAFETLKAALISAPILCAP

Query:  NWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILL
        ++   F +  DASDVA+GA+L Q      HP+ Y S+ LNE ++NY+T EKEL A+V+A + FR YL+G    + +DH  + +L   KD   +L RW + 
Subjt:  NWNLPFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILL

Query:  LQEFDLEIKDKKGSENVIADHLSRLD-PSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMV
        L EFD +IK  KG EN +AD LSR+    ++L EQ+  S    +  LI      L +FN +++
Subjt:  LQEFDLEIKDKKGSENVIADHLSRLD-PSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMV

P10394 Retrovirus-related Pol polyprotein from transposon 4127.7e-8227.27Show/hide
Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSG
        E ++ +V K +   I+ P + S + SP+  VPKK              P      WR+ +DYR++NK    D FPLP ID +LD+L    Y+  LD  SG
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSG

Query:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVK
        ++QI +    ++ T+F+   G++ F R+PFGL  A  +FQR M   FS +  S   ++MDD  V G S +  L NL +V  +C + +L L+ EKC F + 
Subjt:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVK

Query:  EGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNL
        E   LGH+ +   +  D  K +VI+    P+     + F+    +YRRFIK+F+  S+ +  L   +  F +  +C+KAF  LK+ LI+  +L  P+++ 
Subjt:  EGIALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNL

Query:  PFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEF
         F +  DAS  A GA+L Q       P+ YAS+   + + N +TTE+EL+A+ +A   FRPY+ G   TV TDH  + YL S  +   +L R  L L+E+
Subjt:  PFEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEF

Query:  DLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKS-GEKSNLGQQQASVE
        +  ++  KG +N +AD LSR+                  + +  +I      F            S    C+ +E++   +++K    + N+ +   + E
Subjt:  DLEIKDKKGSENVIADHLSRLDPSSFLLEQSAISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKS-GEKSNLGQQQASVE

Query:  TLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGSHPIDM-DWRQKKKFKHDA--KFFFWDESFM--YK
           +  L  +    + +       RYD               +    LD     Q +   +       I M  W  KK F+H +  KF       +   K
Subjt:  TLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRLDAKTKNQNISFTSASKGSHPIDM-DWRQKKKFKHDA--KFFFWDESFM--YK

Query:  QCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVE-LFDVWGIDFMGPF
            + + +  +  E + IL   +  P  G  +G    +  +   ++W  + K    + ++C  CQ +       + P+T     E  FD   +D +GP 
Subjt:  QCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVE-LFDVWGIDFMGPF

Query:  PPS-NGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAI
        P S NGN + +  +  ++K++ AI      AKTVA+ +      ++G  +  ++D GT + N+I+T L     IK+  +T +H Q  G  E S++ +   
Subjt:  PPS-NGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNKEIKAI

Query:  LEKVVHPSRKDWSFRLDEALWAYRT
        +   +   + DW   L   ++ + T
Subjt:  LEKVVHPSRKDWSFRLDEALWAYRT

P20825 Retrovirus-related Pol polyprotein from transposon 2971.4e-6735.71Show/hide
Query:  VKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN
        V+ +V + L+ G+I   ++S + SP   VPKK   +  +NK            +RV +DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++
Subjt:  VKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN

Query:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEG
        QI +  E   KT F+   G + + RMPFGL NA ATFQRCM  I   ++     V++DD  +F  S    L+++  V  +  D +L L  +KC F+ KE 
Subjt:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEG

Query:  IALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLL-CTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLP
          LGH ++ + ++ +  KV+ I     P   K I++FLG  G+YR+FI +++ I+KP+ + L     +     +  +AFE LKA +I  PIL  P++   
Subjt:  IALGHRISKNCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLL-CTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLP

Query:  FEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFD
        F +  DAS++A+GA+L Q      HPI + S+ LN+ ++NY+  EKEL A+V+A + FR YL+G +  + +DH  +R+L + K+   +L RW + L E+ 
Subjt:  FEVMCDASDVAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFD

Query:  LEIKDKKGSENVIADHLSRL
         +I   KG EN +AD LSR+
Subjt:  LEIKDKKGSENVIADHLSRL

Q5Z6B6 NAC domain-containing protein 762.1e-7962.5Show/hide
Query:  PDSTVSSTHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKA
        P     S  PGFRFHPTDEELLYYYLRKKV+YEAI+LDVIRE+DLNKLEPWDLKD+CRIG+G QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDKA
Subjt:  PDSTVSSTHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKA

Query:  IHMGNNSKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQK--------PEVSQEQLDYYAHTKLGSS---SASADE
        I +  N+ RIGMRKTLVFY GRAPHG+KTDWIMHEYRLD +N  V +QEDGWVVCRVF KKS +          V  + L ++ H     +     +AD 
Subjt:  IHMGNNSKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQK--------PEVSQEQLDYYAHTKLGSS---SASADE

Query:  VGAGIGELKNNHMQEPYNDY----SFDSCMQLPQLFSPESSVACPSQL
           G G   ++H+ +P++ Y    SFD  MQLPQL S +     P  L
Subjt:  VGAGIGELKNNHMQEPYNDY----SFDSCMQLPQLFSPESSVACPSQL

Q9MA17 Protein SOMBRERO6.3e-8450.55Show/hide
Query:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN
        S  PGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK+IH+ N+
Subjt:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN

Query:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----
        SK+IG+RKTLVFYTGRAPHGQKT+WIMHEYRLDD   E ++QEDGWVVCRVFKKK+      QEQ   + H     S+ +  +    I    NNH     
Subjt:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----

Query:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--
                           + E  N  S  S M LPQLFSP+S+ A  +  A+       +N     C QN+ RLT           DWSFL++LL +  
Subjt:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--

Query:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK
                          D  +   +  +DHL    G S+S   +  FP+ Y  L + A+ +KF K
Subjt:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK

Arabidopsis top hitse value%identityAlignment
AT1G33280.1 NAC domain containing protein 152.5e-6776.47Show/hide
Query:  PGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNNSKR
        PGFRFHPTDEELL+YYL+KK+SYE  E++VI+EVDLNK+EPWDL+D+C+IGS  QNEWYFFSHKD+KYPTG+RTNRAT +GFWKATGRDK I   N+ K+
Subjt:  PGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNNSKR

Query:  IGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKS
        IGMRKTLVFY GRAPHGQKTDWIMHEYR++D   E    EDGWVVCRVFKKK+
Subjt:  IGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKS

AT1G79580.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein4.5e-8550.55Show/hide
Query:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN
        S  PGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK+IH+ N+
Subjt:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN

Query:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----
        SK+IG+RKTLVFYTGRAPHGQKT+WIMHEYRLDD   E ++QEDGWVVCRVFKKK+      QEQ   + H     S+ +  +    I    NNH     
Subjt:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----

Query:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--
                           + E  N  S  S M LPQLFSP+S+ A  +  A+       +N     C QN+ RLT           DWSFL++LL +  
Subjt:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--

Query:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK
                          D  +   +  +DHL    G S+S   +  FP+ Y  L + A+ +KF K
Subjt:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK

AT1G79580.2 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein4.5e-8550.55Show/hide
Query:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN
        S  PGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK+IH+ N+
Subjt:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN

Query:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----
        SK+IG+RKTLVFYTGRAPHGQKT+WIMHEYRLDD   E ++QEDGWVVCRVFKKK+      QEQ   + H     S+ +  +    I    NNH     
Subjt:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----

Query:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--
                           + E  N  S  S M LPQLFSP+S+ A  +  A+       +N     C QN+ RLT           DWSFL++LL +  
Subjt:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--

Query:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK
                          D  +   +  +DHL    G S+S   +  FP+ Y  L + A+ +KF K
Subjt:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK

AT1G79580.3 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein4.5e-8550.55Show/hide
Query:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN
        S  PGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK+IH+ N+
Subjt:  STHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNN

Query:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----
        SK+IG+RKTLVFYTGRAPHGQKT+WIMHEYRLDD   E ++QEDGWVVCRVFKKK+      QEQ   + H     S+ +  +    I    NNH     
Subjt:  SKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVGAGIGELKNNH-----

Query:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--
                           + E  N  S  S M LPQLFSP+S+ A  +  A+       +N     C QN+ RLT           DWSFL++LL +  
Subjt:  -------------------MQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATP----ISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLAS--

Query:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK
                          D  +   +  +DHL    G S+S   +  FP+ Y  L + A+ +KF K
Subjt:  ------------------DQQSRTKSTLSDHLPAAMGHSNSKKFSFPFPYPYPHLSSAADFIKFSK

AT4G10350.1 NAC domain containing protein 701.7e-6867.58Show/hide
Query:  PGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNNSKR
        PGFRFHPTDEELL+YYL+KK+SY+  E++VIREVDLNKLEPWDL+++C+IGS  QNEWYFFSHKD+KYPTG+RTNRAT AGFWKATGRDK I   N+ K+
Subjt:  PGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKAIHMGNNSKR

Query:  IGMRKTLVFYTGRAPHGQKTDWIMHEYRLDD-ENPEVQLQEDGWVVCRVFKKKSQKPEVSQ-----EQLDYYAHTKLGSSSA
        IGMRKTLVFY GRAPHGQKTDWIMHEYRL+D ++P+    EDGWVVCRVF KK+    V++       LD + H    ++ A
Subjt:  IGMRKTLVFYTGRAPHGQKTDWIMHEYRLDD-ENPEVQLQEDGWVVCRVFKKKSQKPEVSQ-----EQLDYYAHTKLGSSSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAGAAAAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCT
TTCGGTCTATCGGAAGTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGAGAAT
TAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAATATCCAGATGAAATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGCACAGTT
ATTGAGACAACAATACAAGATTCGGCTAATAAGCATTCAGAAAAGCATGGAGAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTT
ACCGGATCATCTAAAGTATGTGTATCTTGGGGAGAGTGATACGTTGCCCATTATTGTCGCATCAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGC
AATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAA
CAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGT
CCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGAGCTGATTCCAACCAAGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGC
TTAATAAAGCCACCCGAAAGGATCATTTCCCTCTACCATTTATCGACCAGATGTTGGATCGATTGGCTGGTCAAGCCTATTACTGTTTCTTGGATGGTTATTCTGGGTAT
AACCAGATTACTATTGCTCCTGAAGATCAGGAGAAAACCACTTTCACCTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTTGGCCTTTGCAATGCTCTAGCAAC
ATTTCAGCGGTGTATGCTAGCAATTTTTTCTGACATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTTGGAGGGTCTTTTCAGAGTTGTCTGGATA
ATTTAGGAAAGGTATTAAAAAGATGTGAGGATACCCATCTAGTTCTTAATTGGGAGAAGTGTCACTTCATGGTGAAGGAGGGGATAGCGTTAGGTCATAGGATTTCTAAA
AATTGTCTAGAAGTTGATAGAGCAAAAGTTGAGGTTATTGAAAGGCTAGAGCCACCGAATTCAGTGAAGGGAATTCAGAGTTTTTTAGGACATGCTGGATTTTATAGGAG
GTTCATAAAGGATTTTTCCAAAATCAGTAAACCTCTTTGTAACTTATTGTGCACTGATCATGTTTTTTACTTTAATGCAGATTGTAGGAAGGCTTTTGAAACGTTAAAGG
CTGCTTTAATCTCAGCACCCATTCTCTGCGCACCTAATTGGAATTTGCCATTCGAGGTAATGTGTGATGCGAGTGATGTTGCGGTAGGTGCGATGCTGGGGCAAAAGCAG
GGCAAATTTATTCATCCTATATATTATGCAAGCAAGGTTTTAAATGAAGCACAGGTCAACTACACAACTACTGAAAAGGAGTTGTCAGCTGTTGTGTTTGCTTTTGAGAA
ATTTCGACCATATTTGGTTGGATCTAAAGTCACGGTGTTCACGGATCATGCAGCAATAAGGTATTTAATGTCTAAGAAGGATGCAAAGCCGAGACTAATTCGTTGGATTT
TATTACTGCAGGAATTTGACTTGGAGATAAAAGACAAGAAGGGATCAGAGAATGTCATTGCAGATCATTTATCTCGTCTTGATCCATCATCATTTTTGCTGGAGCAATCT
GCCATTTCCGATTCTTTTCCAGATGAGCAGCTGATAGCGAGCATAGCTAGCTATCTTTACTCATTTAATTTAGAAATGGTTAATTGGAGCTATAATGCAATATCAGAGTT
AATCGGGTGCTCGAGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGC
GTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCG
TTTAAGAGGCAGCGTCTCGACGCTAAGACGAAAAATCAGAATATAAGCTTCACTTCGGCTAGTAAAGGGAGTCACCCTATTGATATGGATTGGAGGCAGAAGAAAAAGTT
TAAGCATGATGCGAAATTTTTCTTTTGGGATGAGTCGTTTATGTATAAGCAATGCTCTGACGATATTATTCGTAGGTGTGTTTCAGGTGCTGAAGCAAAAGAAATCCTGG
AGCAATGTTACTTTTCGCCATATGGAGGTCATTTCAGCGGACAAAGGACAGCTATGAGGATTTTGCATTGCGGATTCTTCTGGCCTACATTATTCAAGGATGCCCATTGG
TTCTACAAGCAATGTGACGCTTGCCAAAGGAGAGGAAATTTGGGGCCAAGAGATGAAATGCCTCTTACTTACATTTTAGAAGTTGAATTATTTGATGTATGGGGTATTGA
CTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCTAAGTGGGTGGAGGCCATCGCATGCCATCAGAGTGATGCCAAGA
CAGTAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTTTAGTGAGTGATGAGGGTACACATTTTGTTAATAATATCTTAACCAAGCTTTTA
GCTAAGTATGGGATTAAGCATAGGATAGCTACTCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAAGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCA
TCCATCTCGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTTTAGGACCGTTTATTGTGATTGAGGTTTTCTCCAATG
GAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTATTCAAGGAGAATGGACAGTGTGTGAAGCATTATTGGGGTTCGGCAGATTGTTGCGGCAAAGATATTGCTGGA
GCAAAATATGCCGAGTTAGAAGGGTTTCCGTTGGAATTTATTATTGTTACCGTTGGATTTGGCTTTGAATTCTTGCAGAGGGCGAAAGAACAAGAAAAAGTGGCAGAGGA
TGAGGCTGCTGCAGTGGTTGAAGAAGACCTGGAAGAACCGCAAGAACAGATCCCAGATGAGGGGAAAGACGTGGCCTGCAAAACGGAAAACCTGCACACCGGTGTGGTGC
TAACCACACCGCCTCCGATGCTTAAGTCAGAAAACAGAAGTAGGAGGGCTAGAGAGCAAGAGAGAGAGGAGAATAGAGTTCAAGTCCCCTTATTCAGTGGCGAAGAGGGG
TATAAATACCTGCTCATGCTCCTAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGACGAACCAGGCGGAACCAGGGCGGTTAGAGGCATCAAGGATCAAAAGGAGGCGACC
GAGCTCGACCCGCGCAAGCAGGCCGAATGGTCGGCCTCGGCCTTGGGCCGAGGCCAACCATAGGGGCCGGGCCAAGTTGGCCCGACCCTTTGGTCGGTTCTTCCTCTGGG
TCGGTCTCTTGGTCCTATTTCTGCTTGGATGTCCTCGTCAGCTCCTTGTGCATTGGAGTGATCCAAAATTACCTATAACACATGTATCAAATCCACATCACCAAATTGCA
ATTTTGGAACTTGGCATCAGTTCTAATTTCAATTTGATGATGTTGAATAATCTTGTCTCTTCTCCTCTCGTCGCTTCTAATTTCTCCCCAGACTCAACTGTTTCCTCAAC
ACACCCAGGCTTTCGATTCCATCCAACAGACGAGGAGCTTCTTTATTACTACCTCAGGAAGAAGGTTTCGTATGAGGCCATTGAGCTCGATGTTATCAGAGAAGTGGATC
TAAACAAACTGGAGCCTTGGGACCTCAAAGATAAATGTAGAATTGGATCTGGTCATCAAAACGAGTGGTATTTCTTTAGCCATAAAGACAAAAAATATCCAACTGGTACT
CGAACAAATAGAGCAACCAGTGCTGGATTTTGGAAGGCAACAGGGAGAGACAAAGCCATTCACATGGGCAATAATTCCAAGAGGATTGGCATGAGGAAGACACTGGTGTT
CTACACAGGTCGTGCTCCTCACGGCCAAAAGACTGATTGGATCATGCATGAATACCGTCTCGACGATGAAAATCCCGAGGTTCAGCTGCAGGAAGATGGGTGGGTGGTGT
GCAGGGTTTTCAAGAAAAAAAGTCAGAAGCCAGAGGTTTCGCAAGAGCAGTTAGATTATTATGCGCATACGAAGCTGGGCAGTAGTTCTGCTTCGGCCGACGAAGTGGGT
GCAGGAATAGGAGAGCTAAAAAACAATCATATGCAAGAGCCATATAATGATTATAGCTTTGATTCCTGCATGCAGCTGCCGCAGTTGTTTAGTCCAGAGTCATCAGTTGC
ATGCCCATCGCAGCTTGCAACACCTATCTCTTTAAACGCTGCCACAGATTCATGTCCTCAAAATATCTGGAGACTAACTTGCGGGCTCCTGCAACAGGAGCGGTTGAACA
CTGATTGGTCATTCTTGAATAGGCTGCTTGCTTCCGACCAACAATCACGTACCAAGTCTACGCTTTCGGATCACCTTCCAGCTGCTATGGGACACTCTAACTCTAAAAAG
TTTTCATTTCCATTTCCCTATCCTTATCCCCATCTTAGCTCTGCCGCCGACTTTATCAAATTCTCCAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAGAAAAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCT
TTCGGTCTATCGGAAGTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGAGAAT
TAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAATATCCAGATGAAATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGCACAGTT
ATTGAGACAACAATACAAGATTCGGCTAATAAGCATTCAGAAAAGCATGGAGAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTT
ACCGGATCATCTAAAGTATGTGTATCTTGGGGAGAGTGATACGTTGCCCATTATTGTCGCATCAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGC
AATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAA
CAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGT
CCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGAGCTGATTCCAACCAAGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGC
TTAATAAAGCCACCCGAAAGGATCATTTCCCTCTACCATTTATCGACCAGATGTTGGATCGATTGGCTGGTCAAGCCTATTACTGTTTCTTGGATGGTTATTCTGGGTAT
AACCAGATTACTATTGCTCCTGAAGATCAGGAGAAAACCACTTTCACCTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTTGGCCTTTGCAATGCTCTAGCAAC
ATTTCAGCGGTGTATGCTAGCAATTTTTTCTGACATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTTGGAGGGTCTTTTCAGAGTTGTCTGGATA
ATTTAGGAAAGGTATTAAAAAGATGTGAGGATACCCATCTAGTTCTTAATTGGGAGAAGTGTCACTTCATGGTGAAGGAGGGGATAGCGTTAGGTCATAGGATTTCTAAA
AATTGTCTAGAAGTTGATAGAGCAAAAGTTGAGGTTATTGAAAGGCTAGAGCCACCGAATTCAGTGAAGGGAATTCAGAGTTTTTTAGGACATGCTGGATTTTATAGGAG
GTTCATAAAGGATTTTTCCAAAATCAGTAAACCTCTTTGTAACTTATTGTGCACTGATCATGTTTTTTACTTTAATGCAGATTGTAGGAAGGCTTTTGAAACGTTAAAGG
CTGCTTTAATCTCAGCACCCATTCTCTGCGCACCTAATTGGAATTTGCCATTCGAGGTAATGTGTGATGCGAGTGATGTTGCGGTAGGTGCGATGCTGGGGCAAAAGCAG
GGCAAATTTATTCATCCTATATATTATGCAAGCAAGGTTTTAAATGAAGCACAGGTCAACTACACAACTACTGAAAAGGAGTTGTCAGCTGTTGTGTTTGCTTTTGAGAA
ATTTCGACCATATTTGGTTGGATCTAAAGTCACGGTGTTCACGGATCATGCAGCAATAAGGTATTTAATGTCTAAGAAGGATGCAAAGCCGAGACTAATTCGTTGGATTT
TATTACTGCAGGAATTTGACTTGGAGATAAAAGACAAGAAGGGATCAGAGAATGTCATTGCAGATCATTTATCTCGTCTTGATCCATCATCATTTTTGCTGGAGCAATCT
GCCATTTCCGATTCTTTTCCAGATGAGCAGCTGATAGCGAGCATAGCTAGCTATCTTTACTCATTTAATTTAGAAATGGTTAATTGGAGCTATAATGCAATATCAGAGTT
AATCGGGTGCTCGAGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGC
GTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCG
TTTAAGAGGCAGCGTCTCGACGCTAAGACGAAAAATCAGAATATAAGCTTCACTTCGGCTAGTAAAGGGAGTCACCCTATTGATATGGATTGGAGGCAGAAGAAAAAGTT
TAAGCATGATGCGAAATTTTTCTTTTGGGATGAGTCGTTTATGTATAAGCAATGCTCTGACGATATTATTCGTAGGTGTGTTTCAGGTGCTGAAGCAAAAGAAATCCTGG
AGCAATGTTACTTTTCGCCATATGGAGGTCATTTCAGCGGACAAAGGACAGCTATGAGGATTTTGCATTGCGGATTCTTCTGGCCTACATTATTCAAGGATGCCCATTGG
TTCTACAAGCAATGTGACGCTTGCCAAAGGAGAGGAAATTTGGGGCCAAGAGATGAAATGCCTCTTACTTACATTTTAGAAGTTGAATTATTTGATGTATGGGGTATTGA
CTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCTAAGTGGGTGGAGGCCATCGCATGCCATCAGAGTGATGCCAAGA
CAGTAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTTTAGTGAGTGATGAGGGTACACATTTTGTTAATAATATCTTAACCAAGCTTTTA
GCTAAGTATGGGATTAAGCATAGGATAGCTACTCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAAGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCA
TCCATCTCGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTTTAGGACCGTTTATTGTGATTGAGGTTTTCTCCAATG
GAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTATTCAAGGAGAATGGACAGTGTGTGAAGCATTATTGGGGTTCGGCAGATTGTTGCGGCAAAGATATTGCTGGA
GCAAAATATGCCGAGTTAGAAGGGTTTCCGTTGGAATTTATTATTGTTACCGTTGGATTTGGCTTTGAATTCTTGCAGAGGGCGAAAGAACAAGAAAAAGTGGCAGAGGA
TGAGGCTGCTGCAGTGGTTGAAGAAGACCTGGAAGAACCGCAAGAACAGATCCCAGATGAGGGGAAAGACGTGGCCTGCAAAACGGAAAACCTGCACACCGGTGTGGTGC
TAACCACACCGCCTCCGATGCTTAAGTCAGAAAACAGAAGTAGGAGGGCTAGAGAGCAAGAGAGAGAGGAGAATAGAGTTCAAGTCCCCTTATTCAGTGGCGAAGAGGGG
TATAAATACCTGCTCATGCTCCTAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGACGAACCAGGCGGAACCAGGGCGGTTAGAGGCATCAAGGATCAAAAGGAGGCGACC
GAGCTCGACCCGCGCAAGCAGGCCGAATGGTCGGCCTCGGCCTTGGGCCGAGGCCAACCATAGGGGCCGGGCCAAGTTGGCCCGACCCTTTGGTCGGTTCTTCCTCTGGG
TCGGTCTCTTGGTCCTATTTCTGCTTGGATGTCCTCGTCAGCTCCTTGTGCATTGGAGTGATCCAAAATTACCTATAACACATGTATCAAATCCACATCACCAAATTGCA
ATTTTGGAACTTGGCATCAGTTCTAATTTCAATTTGATGATGTTGAATAATCTTGTCTCTTCTCCTCTCGTCGCTTCTAATTTCTCCCCAGACTCAACTGTTTCCTCAAC
ACACCCAGGCTTTCGATTCCATCCAACAGACGAGGAGCTTCTTTATTACTACCTCAGGAAGAAGGTTTCGTATGAGGCCATTGAGCTCGATGTTATCAGAGAAGTGGATC
TAAACAAACTGGAGCCTTGGGACCTCAAAGATAAATGTAGAATTGGATCTGGTCATCAAAACGAGTGGTATTTCTTTAGCCATAAAGACAAAAAATATCCAACTGGTACT
CGAACAAATAGAGCAACCAGTGCTGGATTTTGGAAGGCAACAGGGAGAGACAAAGCCATTCACATGGGCAATAATTCCAAGAGGATTGGCATGAGGAAGACACTGGTGTT
CTACACAGGTCGTGCTCCTCACGGCCAAAAGACTGATTGGATCATGCATGAATACCGTCTCGACGATGAAAATCCCGAGGTTCAGCTGCAGGAAGATGGGTGGGTGGTGT
GCAGGGTTTTCAAGAAAAAAAGTCAGAAGCCAGAGGTTTCGCAAGAGCAGTTAGATTATTATGCGCATACGAAGCTGGGCAGTAGTTCTGCTTCGGCCGACGAAGTGGGT
GCAGGAATAGGAGAGCTAAAAAACAATCATATGCAAGAGCCATATAATGATTATAGCTTTGATTCCTGCATGCAGCTGCCGCAGTTGTTTAGTCCAGAGTCATCAGTTGC
ATGCCCATCGCAGCTTGCAACACCTATCTCTTTAAACGCTGCCACAGATTCATGTCCTCAAAATATCTGGAGACTAACTTGCGGGCTCCTGCAACAGGAGCGGTTGAACA
CTGATTGGTCATTCTTGAATAGGCTGCTTGCTTCCGACCAACAATCACGTACCAAGTCTACGCTTTCGGATCACCTTCCAGCTGCTATGGGACACTCTAACTCTAAAAAG
TTTTCATTTCCATTTCCCTATCCTTATCCCCATCTTAGCTCTGCCGCCGACTTTATCAAATTCTCCAAGTAG
Protein sequenceShow/hide protein sequence
MPNYAKFLKDILTKRKGGKELGRALCDLGASINLMPLSVYRKLDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV
IETTIQDSANKHSEKHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESDTLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQ
QRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTKTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGY
NQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNALATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGIALGHRISK
NCLEVDRAKVEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFYFNADCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMLGQKQ
GKFIHPIYYASKVLNEAQVNYTTTEKELSAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENVIADHLSRLDPSSFLLEQS
AISDSFPDEQLIASIASYLYSFNLEMVNWSYNAISELIGCSRREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERA
FKRQRLDAKTKNQNISFTSASKGSHPIDMDWRQKKKFKHDAKFFFWDESFMYKQCSDDIIRRCVSGAEAKEILEQCYFSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHW
FYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLL
AKYGIKHRIATPYHPQANGQAEISNKEIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGPFIVIEVFSNGAITLQDEKDGRVFKENGQCVKHYWGSADCCGKDIAG
AKYAELEGFPLEFIIVTVGFGFEFLQRAKEQEKVAEDEAAAVVEEDLEEPQEQIPDEGKDVACKTENLHTGVVLTTPPPMLKSENRSRRAREQEREENRVQVPLFSGEEG
YKYLLMLLGFLGIRRRFGTNQAEPGRLEASRIKRRRPSSTRASRPNGRPRPWAEANHRGRAKLARPFGRFFLWVGLLVLFLLGCPRQLLVHWSDPKLPITHVSNPHHQIA
ILELGISSNFNLMMLNNLVSSPLVASNFSPDSTVSSTHPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGT
RTNRATSAGFWKATGRDKAIHMGNNSKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLDDENPEVQLQEDGWVVCRVFKKKSQKPEVSQEQLDYYAHTKLGSSSASADEVG
AGIGELKNNHMQEPYNDYSFDSCMQLPQLFSPESSVACPSQLATPISLNAATDSCPQNIWRLTCGLLQQERLNTDWSFLNRLLASDQQSRTKSTLSDHLPAAMGHSNSKK
FSFPFPYPYPHLSSAADFIKFSK