; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035349 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035349
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:20091053..20099775
RNA-Seq ExpressionLag0035349
SyntenyLag0035349
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]5.2e-25840.82Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S +FE+ KF+G GDF+LWRKK++A+LVQ KVAKIL++ E LP+ ++E+++++MDE+A+ T++LYLSD VLR VDE     E+WKKL+S+YL KSL NKI 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP
        +KE+ FGY+MD +K LE+NLDEF KI +D+ NIGEK+ DENQA+ILLNSL E+Y+E+KAAIKYG DSLT+ +VLDAL+++++E + +RK+ E L  RGR 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP

Query:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------
        E      K  K  +   RSKSKGK +R+CF CHK GH  K+C   +  +      N  +G NSA I +  DS +                          
Subjt:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------

Query:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G
                                     C +KG GSV++   D   +IL++VRYVP LKRNLISLG LD++G   + EN                   G
Subjt:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G

Query:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------
        LY L G +    A   + +    + ++LWHKRLAHVSE+GL  L +QGLLG  K+V LPFCE C +GKS RV+F +  H T+GILDYIHSDL        
Subjt:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------

Query:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------
                                 K + F  F  WK  VENQT +K+K ++TDN LEF++  F   C   G  +H T+  TPQQNGL+           
Subjt:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR
                        +E++ Q                    D  N P   SE                                  +L NYQLTRDR +
Subjt:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR

Query:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR
        R    P R+  A       D  AYAL C  D+  +EPLT+E+AI S++K QW   M+EEL SL KN TW+LV KP   KL+  KW+YK+KP   G +KPR
Subjt:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR

Query:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK
        +KARL+                            LS+   FD+ +EQMDVTT FLHGELEE IYM QP G+EV+GKE+ VCRL KSLYGLKQSPRQWY  
Subjt:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK

Query:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY
        FD FI   GF +  YD+CVY K       +YLL+YVDDM++ S+D + I  LK QL +EFEMKD+G   +ILGM++ R+R  G L I+Q  Y+ K++++Y
Subjt:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY

Query:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL
         + ++K V  PLA HFRLS +  P +++ER EM ++PY + VGS+MY+M C+RPD+ +AMS++SR+MS+ GK HW+AVKW++RY+KG+ +  L YSR   
Subjt:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL

Query:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH
        K + L G++DAD+AADLDKRRS SG+IF L+ N VSW+ +LQ V ALSTTESE ISL EAV E  WL  +V ELL       +  DSQSAI L+KNP +H
Subjt:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH

Query:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR
        +RSKHID ++HY+R     ++++L KV +  N +DMLTK L+A R
Subjt:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]6.3e-28042.44Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S +FE+ KF+G GDFSLWRKK++A+LVQ KVAKIL++ E LP+ ++E+++++MDE+A+ST++LYLSD VLR VDE     E+WKKL+S+YL KSL NKI 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP
        +KE+ FGY+MD +K LE+NLDEF KI +D+ NIGEK+ DENQA+ILLNSL E+Y+E+KAAIKYGRDSLT+ +VLDAL+++++E + +RK+ E L  RGR 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP

Query:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------
        E      K  K  +   RSKSKGK +R+CF CHK GH  K+C   +  +      N  +G NSA I + YDS +                          
Subjt:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------

Query:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G
                                     C +KG GSV++   D   +IL++VRYVP LKRNLISLG LD++G   + EN                   G
Subjt:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G

Query:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------
        LY L G +    A   + +      ++LWHKRLAHVSE+GL  L +QGLLG  K+V LPFCE C +GKS RV+F +  H T+GILDY+HSDL        
Subjt:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------

Query:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------
                                 K + F  F  WK  VENQT +K+K ++TDN LEF++  F   C   G  +H T+  TPQQNGL+           
Subjt:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR
                        +E++ Q                    D  N P   SE                                  +L NYQLTRDR +
Subjt:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR

Query:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR
        R    P R+  A       D  AYAL C  D+  +EPLT+E+AI S++K QW   M+EEL SL KN TW+LV KP   KL+  KW+YK+KP   G +KPR
Subjt:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR

Query:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK
        +KARLVAKGYTQ+ G+D+ EIFSPVVR+SSI ++LS+   FD+ +EQMDVTTAFLHGELEE IYM QP G+EV+GKE+ VCRL KSLYGLKQSPRQWY +
Subjt:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK

Query:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY
        FD FI   GF +  YD+CVY K       +YLL+YVDDM++ S+D ++I  LK QL +EFEMKD+G   +ILGM++ R++  G L I+Q  Y+ K++++Y
Subjt:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY

Query:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL
         + ++K V  PLA HFRLS +  P +++ER EM ++PY + VGS+MY+M C+RPD+ +AMS++SR+MS+ GK HW+AVKW++RY+KG+ +  L YSR   
Subjt:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL

Query:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH
        K + L G++DAD+AADLDKRRS SG+IF L+ N VSW+ +LQ V ALSTTESE ISL EAV E  WL  +V ELL       +  DSQSAI L+KNP +H
Subjt:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH

Query:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI
        +RSKHID ++HY+R     ++++L KV +  N +DMLTK L+A R  +L+D L +
Subjt:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI

KAF5775409.1 putative RNA-directed DNA polymerase [Helianthus annuus]1.6e-23838.19Show/hide
Query:  MSAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKI
        +S KFELEKFDG+ DFSLWR K++A+LV   V   L+  + LP  LS  ++KE+ E A S +IL L D VLR+V +  +A  +W KL+++Y+ KSL N++
Subjt:  MSAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKI

Query:  LLKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKE--TEALTVR
         LK+RL+ ++M+S K LE++ D+F K+ LD+ NI   ++DE++AI+ L SL  SY+     + Y R++L++  VL AL SK+++ +T  KE  ++ L VR
Subjt:  LLKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKE--TEALTVR

Query:  GRPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHE----SYDST-----------------------
        GRPENR +         +  RSKSK K  R+CF C+   HL KDC + + K    +  +   S ++ E     YDS                        
Subjt:  GRPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHE----SYDST-----------------------

Query:  -------------------------KCSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG-----------------------QPENGLY
                                  C I G+G++   + + T  +L +VRY+P LKR+L+SLGT ++ G                       + +N +Y
Subjt:  -------------------------KCSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG-----------------------QPENGLY

Query:  TLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL----------
         L+G+   +  + V+ +   N   L WH+RL HVS +GL EL+KQ ++ +  S  + FCE C +GK  RV+F +  H T+GILDY+HSDL          
Subjt:  TLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL----------

Query:  -----------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSRE-----------
                               KS+ F+ F  WK  +ENQT +KIKK++TDN LE+ ++ F+ +C +SG  +HLTIP TPQQNGL+             
Subjt:  -----------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSRE-----------

Query:  --------------------------------------------------------------------------------------------EEP-----
                                                                                                    E P     
Subjt:  --------------------------------------------------------------------------------------------EEP-----

Query:  --------------------------------------------QPEDPSNVPVQTSEENLTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALIC
                                                    Q E+   V  +T+ E   +Y L+RDR+RR +  P RF Q N N+     +A A I 
Subjt:  --------------------------------------------QPEDPSNVPVQTSEENLTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALIC

Query:  DDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRY
         ++  ++E LTY QAI S+ K +W   MQEE++SL+KN TW LV+KP++ K+V CKW+YK+K  + G    RFKARLVAKG+TQ+ G+DYTEIFSPVV++
Subjt:  DDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRY

Query:  SSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVV
        +SI +LL++ A  DLELEQ+DV TAFLHG L+E+IYM QP GF   G E  VC L +SLYGLKQSPRQWYK+FD F+ S GF +  YDSCVY +      
Subjt:  SSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVV

Query:  VVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEE
         VYLL+YVDDMLIA +   +I+ +K  LKSEFEMK++G A +ILGM + R+R    L + QS Y++KI++ Y +E  K V  PL  HF+LS   SPK+E+
Subjt:  VVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEE

Query:  ERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIF
        E + M  VPY++VVGS+MY+M C+RPD+A+A+SVVSRY+S  GK HWEAVKWI+RY+KGT ++GL++   +     +VG+ DADFA D DK RS +GY+F
Subjt:  ERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIF

Query:  TLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVA
         +  N VSW+S LQ + ALSTTESE ++L EAV E  WL G++ EL    N   +  D+Q AI LSKN  YH+R+KHI  + H++R+    + +++  V 
Subjt:  TLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVA

Query:  SANNPTDMLTKPLTAARM-MHLIDL
        +  N  D+ TK L  ++  M L D+
Subjt:  SANNPTDMLTKPLTAARM-MHLIDL

RVW58503.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.0e-25140.49Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        +AKF++EKF GK DF LWR K++A+LVQ  +   L   ++LP  + E ++ E+ E A S +IL L D+VLR+V +  +A E+W KL+S+Y+ KSL N++ 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKET--EALTVRG
         K +L+ ++M  +  +ED+LD F KI LD+ NI   I DE++AI+LL SL  SY  +K AI YGRDSLT   V   L ++++  + + KE   E L +RG
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKET--EALTVRG

Query:  RPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHESYDSTK--------------CSIKGVGSVKLNL
        R E R       K   +K RSKSK K+ + CF CHK GH  KDC   +     +   EG ++A I + YDS +              C I G G+V++  
Subjt:  RPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHESYDSTK--------------CSIKGVGSVKLNL

Query:  MDETHKILSSVRYVPDLKRNLISLGTLDKAG-----QP-----------------ENGLYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLA
         D   ++L  VRY+P+LKRNLISLG LDK+G     +P                 +NGLYTL G +     + V+      ++  LWH+RL H+S +GL 
Subjt:  MDETHKILSSVRYVPDLKRNLISLGTLDKAG-----QP-----------------ENGLYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLA

Query:  ELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL-----------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENF
        EL+KQG+LG  K   LPFCE C  GK+ RV+F +++H+TQ  LDYIHSDL            S+ F  F  WK LVE QT++K+KK++TDN LEFLS +F
Subjt:  ELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL-----------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENF

Query:  KNMCTKSGFQKHLTIPNTPQQNGLS---------------------------------------------------------------------------
         ++C K G  +H T+  TPQQNGL+                                                                           
Subjt:  KNMCTKSGFQKHLTIPNTPQQNGLS---------------------------------------------------------------------------

Query:  -------------------------------------------------------------------REEEPQPEDPSNVPVQTSEE------------N
                                                                             E  QPE       +T++E             
Subjt:  -------------------------------------------------------------------REEEPQPEDPSNVPVQTSEE------------N

Query:  LTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYK
        L +Y L RDRQ+R ++PP R+ QA       +  A+AL   ++    EP TY++AI S    QWV  +QEE++SL KN TW LV KP++ K+V  KW++K
Subjt:  LTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYK

Query:  LKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLY
         K    G   PR+KARLVAKG++Q+ G+DY EIFSPVV++SSI +LL+ VA  DLEL+Q+DV TAFLHGEL+E IYM+ P GF    K+  VC L KSLY
Subjt:  LKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLY

Query:  GLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVIN
        GLKQSPRQWYK+FD ++   GF    +D CVY K      +VYLL+YVDDML+A ++   +E++K  LK+EFEMKD+GSA +ILGM + R+R    L ++
Subjt:  GLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVIN

Query:  QSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGT
        Q+ Y+ K++ R+ + N KTV  PL QHFRLS   +P++ EE++ ME +PY+S+VGS+MY M CSRPD+A+A+S++SRYMS  GKPHW+AVKW+ +Y+ GT
Subjt:  QSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGT

Query:  TNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQ
         + GL+Y   S  ++ L G+ DAD+A ++D R+S +GY+FT+F   VSW+++LQSV ALSTTE+E +++ EAV E  WL G+  EL     ++ V+ D+Q
Subjt:  TNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQ

Query:  SAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI
        SAI L+KN  +H+RSKHID R H++R+     EI +GKV + +NP+DMLTK L   +  H ++L+ +
Subjt:  SAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.0e-24539.85Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S +FE+ KF+G GDF+LWRKK++A+LVQ KVAKIL++ E LP+ ++E+++++MDE+A+ST++LYLSD VLR VDE     E+WKKL+S+YL KSL NKI 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP
        +KE+ FGY+MD +K LE+NLDEF KI +D+ NIGEK+ DENQA+ILLNSL E+Y+E+KAAIKYGRDSLT+ +VLDAL+++++E + +RK+ E L  RGR 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP

Query:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------
        E      K  K  +   RSKSKGK +R+CF CHK GH  K+C   +  +      N  +G NSA I + YDS +                          
Subjt:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------

Query:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G
                                     C +KG GSV++   D   +IL++VRYVP LKRNLISLG LD++G   + EN                   G
Subjt:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G

Query:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------
        LY L G +    A   + +      ++LWHKRLAHVSE+GL  L +QGLLG  K+V LPFCE C +GKS RV+F +  H T+GILDY+HSDL        
Subjt:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------

Query:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------
                                 K + F  F  WK  VENQT +K+K ++TDN LEF++  F   C   G  +H T+  TPQQNGL+           
Subjt:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR
                        +E++ Q                    D  N P   SE                                  +L NYQLTRDR +
Subjt:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR

Query:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR
        R    P R+  A       D  AYAL C  D+  +EPLT+E+AI S++K QW   M+EEL SL KN TW+LV KP   KL+  KW+YK+KP   G +KPR
Subjt:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR

Query:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK
        +KARLVAKGYTQ+ G+D+ EIFSPVVR+SSI ++LS+   FD+ +EQMDVTTAFLHGELEE IYM QP G+EV+GKE+ VCRL KSLYGLKQSPRQWY +
Subjt:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK

Query:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY
        FD FI   GF +  YD+CVY K       +YLL+YVDDM++ S+D ++I  LK QL +EFEMKD+G   +ILGM++ R++  G L I+Q  Y+ K++++Y
Subjt:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY

Query:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL
         + ++K V  PLA HFRLS +  P +++ER EM ++PY + VGS+MY+M C+RPD+ +AMS                          + +  L YSR   
Subjt:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL

Query:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH
        K + L G++DAD+AADLDKR                      S   L    +E ISL EAV E  WL  +V ELL       +  DSQSAI L+KNP +H
Subjt:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH

Query:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR
        +RSKHID ++HY+R     ++++L KV +  N +DMLTK L+A R
Subjt:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR

TrEMBL top hitse value%identityAlignment
A0A2P5YYC3 Uncharacterized protein7.1e-23738.26Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S K+++EKF GK  FSLWR K++A+LVQ  + K L   + LP  LSE ++ +M E A S ++L L D VLR+V +   A  +W +L+S Y+ KSLTN++ 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETR-----TQRKETEALT
        LK+RL+  +M+   P+  +LD+F  I +D+ NI  KIDDE+QAII+L SL  SY+     + YGRD LTL  V +AL S ++  +      +  E E L 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETR-----TQRKETEALT

Query:  VRGRPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQ-NSANIHESYDST------------------------
         RGR        K      +K   +S+ K+  +C+YC K GH+  DC K + K++ Q ++  + N A+   S D+                         
Subjt:  VRGRPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQ-NSANIHESYDST------------------------

Query:  -----------------------KCSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDK----------------------AGQPENGLYTLN
                                C + G+G+V++ + D   + L+ VR++P++K+NLISL TLDK                       G  E GLY L+
Subjt:  -----------------------KCSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDK----------------------AGQPENGLYTLN

Query:  GASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL-------------
        G+S    A        ++    LWH RL H+SE+GL+ L K+GLL    +  L FCE C  GK  RV+F+  +HKT+G +DY HSDL             
Subjt:  GASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL-------------

Query:  --------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSRE--------------
                            K +V   F  +K L+ENQT KKIK+ +TDN LEF S  F   C   G  +H T+  TPQQNG++                
Subjt:  --------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSRE--------------

Query:  ------------------------------EEPQ------PEDPSNV---------------------------------------PVQTSEENLTNYQL
                                      + P+      P D S +                                       P   S + + +  +
Subjt:  ------------------------------EEPQ------PEDPSNV---------------------------------------PVQTSEENLTNYQL

Query:  TRDRQ---RRTIRPPSRFE-----------QANCNIEIPDSAAYALICDDDNEIS---------------------------------------------
        T D     R T     + E           Q  C ++ P       + DD  E+                                              
Subjt:  TRDRQ---RRTIRPPSRFE-----------QANCNIEIPDSAAYALICDDDNEIS---------------------------------------------

Query:  -EPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVL
         EP +Y +A+  +  +QW   M EE+ SL KN+TW LV+ P   K+V CKW++K K  + G    RFKARLVAKG+TQ+ GIDY E+FSPVV++SSI VL
Subjt:  -EPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVL

Query:  LSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLI
        L++VA+ DLELEQ+DV TAFLHGELEE IYMRQP GF V GKE+ VC L KSLYGLKQSPRQWYK+FD F+   G+ +  YD+CVY +  +    +YLL+
Subjt:  LSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLI

Query:  YVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEME
        YVDDMLIAS+++S+I +LK QL  EFEMKD+G+A +ILGM++ R+R AG L ++Q  Y+EK+++R+ ++ AKTV  PLA HF+LS  LSP+S+EE+++M 
Subjt:  YVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEME

Query:  HVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNT
        H+PYSS VGS+MY M C+RPDI+ A+SVVSRYMS  GK HW+AVKWI+RY++G+ +  L+Y +     S + GY D+D+A DLDKRRS +GY+FT     
Subjt:  HVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNT

Query:  VSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPT
        +SW++ LQS  ALSTTE+E ++L EAV E  W+ GLV  L    +   VF DSQSAI L+KN  +H+R+KHID RYH++RE     +I + KVA+  NP 
Subjt:  VSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPT

Query:  DMLTKPLTAARMMHLIDLLK
        DMLTK + A +  H +DL++
Subjt:  DMLTKPLTAARMMHLIDLLK

A0A438FEW0 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-25140.49Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        +AKF++EKF GK DF LWR K++A+LVQ  +   L   ++LP  + E ++ E+ E A S +IL L D+VLR+V +  +A E+W KL+S+Y+ KSL N++ 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKET--EALTVRG
         K +L+ ++M  +  +ED+LD F KI LD+ NI   I DE++AI+LL SL  SY  +K AI YGRDSLT   V   L ++++  + + KE   E L +RG
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKET--EALTVRG

Query:  RPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHESYDSTK--------------CSIKGVGSVKLNL
        R E R       K   +K RSKSK K+ + CF CHK GH  KDC   +     +   EG ++A I + YDS +              C I G G+V++  
Subjt:  RPENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHESYDSTK--------------CSIKGVGSVKLNL

Query:  MDETHKILSSVRYVPDLKRNLISLGTLDKAG-----QP-----------------ENGLYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLA
         D   ++L  VRY+P+LKRNLISLG LDK+G     +P                 +NGLYTL G +     + V+      ++  LWH+RL H+S +GL 
Subjt:  MDETHKILSSVRYVPDLKRNLISLGTLDKAG-----QP-----------------ENGLYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLA

Query:  ELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL-----------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENF
        EL+KQG+LG  K   LPFCE C  GK+ RV+F +++H+TQ  LDYIHSDL            S+ F  F  WK LVE QT++K+KK++TDN LEFLS +F
Subjt:  ELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL-----------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENF

Query:  KNMCTKSGFQKHLTIPNTPQQNGLS---------------------------------------------------------------------------
         ++C K G  +H T+  TPQQNGL+                                                                           
Subjt:  KNMCTKSGFQKHLTIPNTPQQNGLS---------------------------------------------------------------------------

Query:  -------------------------------------------------------------------REEEPQPEDPSNVPVQTSEE------------N
                                                                             E  QPE       +T++E             
Subjt:  -------------------------------------------------------------------REEEPQPEDPSNVPVQTSEE------------N

Query:  LTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYK
        L +Y L RDRQ+R ++PP R+ QA       +  A+AL   ++    EP TY++AI S    QWV  +QEE++SL KN TW LV KP++ K+V  KW++K
Subjt:  LTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYK

Query:  LKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLY
         K    G   PR+KARLVAKG++Q+ G+DY EIFSPVV++SSI +LL+ VA  DLEL+Q+DV TAFLHGEL+E IYM+ P GF    K+  VC L KSLY
Subjt:  LKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLY

Query:  GLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVIN
        GLKQSPRQWYK+FD ++   GF    +D CVY K      +VYLL+YVDDML+A ++   +E++K  LK+EFEMKD+GSA +ILGM + R+R    L ++
Subjt:  GLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVIN

Query:  QSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGT
        Q+ Y+ K++ R+ + N KTV  PL QHFRLS   +P++ EE++ ME +PY+S+VGS+MY M CSRPD+A+A+S++SRYMS  GKPHW+AVKW+ +Y+ GT
Subjt:  QSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGT

Query:  TNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQ
         + GL+Y   S  ++ L G+ DAD+A ++D R+S +GY+FT+F   VSW+++LQSV ALSTTE+E +++ EAV E  WL G+  EL     ++ V+ D+Q
Subjt:  TNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQ

Query:  SAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI
        SAI L+KN  +H+RSKHID R H++R+     EI +GKV + +NP+DMLTK L   +  H ++L+ +
Subjt:  SAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class2.5e-25840.82Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S +FE+ KF+G GDF+LWRKK++A+LVQ KVAKIL++ E LP+ ++E+++++MDE+A+ T++LYLSD VLR VDE     E+WKKL+S+YL KSL NKI 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP
        +KE+ FGY+MD +K LE+NLDEF KI +D+ NIGEK+ DENQA+ILLNSL E+Y+E+KAAIKYG DSLT+ +VLDAL+++++E + +RK+ E L  RGR 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP

Query:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------
        E      K  K  +   RSKSKGK +R+CF CHK GH  K+C   +  +      N  +G NSA I +  DS +                          
Subjt:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------

Query:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G
                                     C +KG GSV++   D   +IL++VRYVP LKRNLISLG LD++G   + EN                   G
Subjt:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G

Query:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------
        LY L G +    A   + +    + ++LWHKRLAHVSE+GL  L +QGLLG  K+V LPFCE C +GKS RV+F +  H T+GILDYIHSDL        
Subjt:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------

Query:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------
                                 K + F  F  WK  VENQT +K+K ++TDN LEF++  F   C   G  +H T+  TPQQNGL+           
Subjt:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR
                        +E++ Q                    D  N P   SE                                  +L NYQLTRDR +
Subjt:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR

Query:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR
        R    P R+  A       D  AYAL C  D+  +EPLT+E+AI S++K QW   M+EEL SL KN TW+LV KP   KL+  KW+YK+KP   G +KPR
Subjt:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR

Query:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK
        +KARL+                            LS+   FD+ +EQMDVTT FLHGELEE IYM QP G+EV+GKE+ VCRL KSLYGLKQSPRQWY  
Subjt:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK

Query:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY
        FD FI   GF +  YD+CVY K       +YLL+YVDDM++ S+D + I  LK QL +EFEMKD+G   +ILGM++ R+R  G L I+Q  Y+ K++++Y
Subjt:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY

Query:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL
         + ++K V  PLA HFRLS +  P +++ER EM ++PY + VGS+MY+M C+RPD+ +AMS++SR+MS+ GK HW+AVKW++RY+KG+ +  L YSR   
Subjt:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL

Query:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH
        K + L G++DAD+AADLDKRRS SG+IF L+ N VSW+ +LQ V ALSTTESE ISL EAV E  WL  +V ELL       +  DSQSAI L+KNP +H
Subjt:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH

Query:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR
        +RSKHID ++HY+R     ++++L KV +  N +DMLTK L+A R
Subjt:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR

A0A5A7UB25 Putative gag-pol polyprotein3.0e-28042.44Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S +FE+ KF+G GDFSLWRKK++A+LVQ KVAKIL++ E LP+ ++E+++++MDE+A+ST++LYLSD VLR VDE     E+WKKL+S+YL KSL NKI 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP
        +KE+ FGY+MD +K LE+NLDEF KI +D+ NIGEK+ DENQA+ILLNSL E+Y+E+KAAIKYGRDSLT+ +VLDAL+++++E + +RK+ E L  RGR 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP

Query:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------
        E      K  K  +   RSKSKGK +R+CF CHK GH  K+C   +  +      N  +G NSA I + YDS +                          
Subjt:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------

Query:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G
                                     C +KG GSV++   D   +IL++VRYVP LKRNLISLG LD++G   + EN                   G
Subjt:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G

Query:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------
        LY L G +    A   + +      ++LWHKRLAHVSE+GL  L +QGLLG  K+V LPFCE C +GKS RV+F +  H T+GILDY+HSDL        
Subjt:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------

Query:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------
                                 K + F  F  WK  VENQT +K+K ++TDN LEF++  F   C   G  +H T+  TPQQNGL+           
Subjt:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR
                        +E++ Q                    D  N P   SE                                  +L NYQLTRDR +
Subjt:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR

Query:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR
        R    P R+  A       D  AYAL C  D+  +EPLT+E+AI S++K QW   M+EEL SL KN TW+LV KP   KL+  KW+YK+KP   G +KPR
Subjt:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR

Query:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK
        +KARLVAKGYTQ+ G+D+ EIFSPVVR+SSI ++LS+   FD+ +EQMDVTTAFLHGELEE IYM QP G+EV+GKE+ VCRL KSLYGLKQSPRQWY +
Subjt:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK

Query:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY
        FD FI   GF +  YD+CVY K       +YLL+YVDDM++ S+D ++I  LK QL +EFEMKD+G   +ILGM++ R++  G L I+Q  Y+ K++++Y
Subjt:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY

Query:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL
         + ++K V  PLA HFRLS +  P +++ER EM ++PY + VGS+MY+M C+RPD+ +AMS++SR+MS+ GK HW+AVKW++RY+KG+ +  L YSR   
Subjt:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL

Query:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH
        K + L G++DAD+AADLDKRRS SG+IF L+ N VSW+ +LQ V ALSTTESE ISL EAV E  WL  +V ELL       +  DSQSAI L+KNP +H
Subjt:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH

Query:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI
        +RSKHID ++HY+R     ++++L KV +  N +DMLTK L+A R  +L+D L +
Subjt:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKI

A0A5D3DNU1 Putative gag-pol polyprotein4.9e-24639.85Show/hide
Query:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL
        S +FE+ KF+G GDF+LWRKK++A+LVQ KVAKIL++ E LP+ ++E+++++MDE+A+ST++LYLSD VLR VDE     E+WKKL+S+YL KSL NKI 
Subjt:  SAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKIL

Query:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP
        +KE+ FGY+MD +K LE+NLDEF KI +D+ NIGEK+ DENQA+ILLNSL E+Y+E+KAAIKYGRDSLT+ +VLDAL+++++E + +RK+ E L  RGR 
Subjt:  LKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRP

Query:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------
        E      K  K  +   RSKSKGK +R+CF CHK GH  K+C   +  +      N  +G NSA I + YDS +                          
Subjt:  ENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC---RKLQYKNKHQNKKEGQNSANIHESYDSTK--------------------------

Query:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G
                                     C +KG GSV++   D   +IL++VRYVP LKRNLISLG LD++G   + EN                   G
Subjt:  -----------------------------CSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAG---QPEN-------------------G

Query:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------
        LY L G +    A   + +      ++LWHKRLAHVSE+GL  L +QGLLG  K+V LPFCE C +GKS RV+F +  H T+GILDY+HSDL        
Subjt:  LYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL--------

Query:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------
                                 K + F  F  WK  VENQT +K+K ++TDN LEF++  F   C   G  +H T+  TPQQNGL+           
Subjt:  -------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR
                        +E++ Q                    D  N P   SE                                  +L NYQLTRDR +
Subjt:  ----------------REEEPQPE------------------DPSNVPVQTSE---------------------------------ENLTNYQLTRDRQR

Query:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR
        R    P R+  A       D  AYAL C  D+  +EPLT+E+AI S++K QW   M+EEL SL KN TW+LV KP   KL+  KW+YK+KP   G +KPR
Subjt:  RTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPR

Query:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK
        +KARLVAKGYTQ+ G+D+ EIFSPVVR+SSI ++LS+   FD+ +EQMDVTTAFLHGELEE IYM QP G+EV+GKE+ VCRL KSLYGLKQSPRQWY +
Subjt:  FKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKK

Query:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY
        FD FI   GF +  YD+CVY K       +YLL+YVDDM++ S+D ++I  LK QL +EFEMKD+G   +ILGM++ R++  G L I+Q  Y+ K++++Y
Subjt:  FDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRY

Query:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL
         + ++K V  PLA HFRLS +  P +++ER EM ++PY + VGS+MY+M C+RPD+ +AMS                          + +  L YSR   
Subjt:  CLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSL

Query:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH
        K + L G++DAD+AADLDKR                      S   L    +E ISL EAV E  WL  +V ELL       +  DSQSAI L+KNP +H
Subjt:  KDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYH

Query:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR
        +RSKHID ++HY+R     ++++L KV +  N +DMLTK L+A R
Subjt:  DRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAAR

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.9e-11225.35Show/hide
Query:  AKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILE--KPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKI
        AK  ++ FDG+  +++W+ +++A+L +  V K+++   P ++ +   +A+R      A ST+I YLSDS L        A +I + LD++Y  KSL +++
Subjt:  AKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILE--KPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKI

Query:  LLKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIK-YGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRG
         L++RL   ++ S   L  +   F ++  ++   G KI++ ++   LL +L   Y  I  AI+    ++LTL  V + L  ++++ +    +T    +  
Subjt:  LLKERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIK-YGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRG

Query:  RPENRLNTPK----INKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC----RKLQYKNKHQNKKEGQNSAN---------IHESYDSTKCSIKGVGSVK
           N  NT K     N+  + K   K   K   +C +C + GH+ KDC    R L  KNK +N+K+ Q + +         ++ +     C         
Subjt:  RPENRLNTPK----INKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC----RKLQYKNKHQNKKEGQNSAN---------IHESYDSTKCSIKGVGSVK

Query:  LNLMDETHKILSSVRYVPDLK--------------------------------------RNLISLGTLDKAGQ-----------PENGLYTLNGASCVD-
         +L+++      SV  VP LK                                       NL+S+  L +AG             +NGL  +  +  ++ 
Subjt:  LNLMDETHKILSSVRYVPDLK--------------------------------------RNLISLGTLDKAGQ-----------PENGLYTLNGASCVD-

Query:  ------PANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGL-----LGKTKSVSLPFCEQCTLGKSARVQFNRSMHKT--QGILDYIHSDL------
               A ++     NN    LWH+R  H+S+  L E++++ +     L     +S   CE C  GK AR+ F +   KT  +  L  +HSD+      
Subjt:  ------PANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGL-----LGKTKSVSLPFCEQCTLGKSARVQFNRSMHKT--QGILDYIHSDL------

Query:  ---------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSRE-------
                                   KSDVF  F ++    E   N K+  +  DN  E+LS   +  C K G   HLT+P+TPQ NG+S         
Subjt:  ---------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSRE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------EEPQPEDPSNVP------VQTSEEN----LTNYQLTRDRQ-----------RRTIRPPSRFEQANCN------
                                   ++ +  +  N P      +QT   N      N Q  +D +           R+ I+     E   C+      
Subjt:  ---------------------------EEPQPEDPSNVP------VQTSEEN----LTNYQLTRDRQ-----------RRTIRPPSRFEQANCN------

Query:  --------------------------------------------------------IEIPDSAAYAL-------ICDDDNEISE------------PLTY
                                                                IEI +  +  L         ++DN +++            P ++
Subjt:  --------------------------------------------------------IEIPDSAAYAL-------ICDDDNEISE------------PLTY

Query:  EQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQ
        ++    ++KS W   +  ELN+   NNTW + ++PE   +VD +W++ +K +  G N  R+KARLVA+G+TQ+Y IDY E F+PV R SS   +LSLV Q
Subjt:  EQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQ

Query:  FDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHV-VVVYLLIYVDDM
        ++L++ QMDV TAFL+G L+EEIYMR P G  +    ++VC+L K++YGLKQ+ R W++ F+  ++ C FV    D C+YI  + ++   +Y+L+YVDD+
Subjt:  FDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHV-VVVYLLIYVDDM

Query:  LIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYS
        +IA+ D++++   K  L  +F M D+      +G+ +        + ++QS Y++KI+ ++ +EN   V  PL    +++  L    E+      + P  
Subjt:  LIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYS

Query:  SVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQ-NTVSWR
        S++G LMY+M C+RPD+  A++++SRY S      W+ +K ++RY+KGT +  LI+ +    ++ ++GY D+D+A     R+ST+GY+F +F  N + W 
Subjt:  SVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQ-NTVSWR

Query:  SHLQSVTALSTTESESISLVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDML
        +  Q+  A S+TE+E ++L EAV E  WL  L+  + +   N + ++ D+Q  I+++ NP  H R+KHID +YH+ RE+  N  I L  + + N   D+ 
Subjt:  SHLQSVTALSTTESESISLVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDML

Query:  TKPLTAARMMHLIDLLKIMQ
        TKPL AAR + L D L ++Q
Subjt:  TKPLTAARMMHLIDLLKIMQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-18433.01Show/hide
Query:  KFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKILLK
        K+E+ KF+G   FS W+++++ +L+Q  + K+L+     P+ +      ++DE A S + L+LSD V+  + + + A  IW +L+S+Y++K+LTNK+ LK
Subjt:  KFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKILLK

Query:  ERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGR---
        ++L+   M        +L+ F  +   + N+G KI++E++AI+LLNSL  SY  +   I +G+ ++ L+ V  AL   +   +    + +AL   GR   
Subjt:  ERLFGYRMDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGR---

Query:  -PENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC-RKLQYKNKHQNKKEGQNSANIHESYDST--------KC-----------------
           +  N  +   R ++K+RSKS+    R C+ C++ GH  +DC    + K +   +K   N+A + ++ D+         +C                 
Subjt:  -PENRLNTPKINKRDQNKHRSKSKGKETRECFYCHKSGHLIKDC-RKLQYKNKHQNKKEGQNSANIHESYDST--------KC-----------------

Query:  -----------------------------SIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDK----------------------AGQPENGL
                                      I G+G + +        +L  VR+VPDL+ NLIS   LD+                       G     L
Subjt:  -----------------------------SIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDK----------------------AGQPENGL

Query:  YTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL---------
        Y  N   C    N       +     LWHKR+ H+SEKGL  L K+ L+   K  ++  C+ C  GK  RV F  S  +   ILD ++SD+         
Subjt:  YTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL---------

Query:  ------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS------------
                                K  VF+ F  +  LVE +T +K+K++++DN  E+ S  F+  C+  G +   T+P TPQ NG++            
Subjt:  ------------------------KSDVFETFSNWKTLVENQTNKKIKKIQTDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLS------------

Query:  ---------------------------------------------------------------REEEPQPED----------------------------
                                                                       +E+  + +D                            
Subjt:  ---------------------------------------------------------------REEEPQPED----------------------------

Query:  ---------------------------PSNVPVQTSEENLTNYQLTRD------------------------------RQRRTIRPPSRFEQANCNIEIP
                                   P+ V + ++  N T+ + T D                              +     +P  R E+        
Subjt:  ---------------------------PSNVPVQTSEENLTNYQLTRD------------------------------RQRRTIRPPSRFEQANCNIEIP

Query:  DSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENK-PRFKARLVAKGYTQQYGIDY
         S  Y LI DD     EP + ++ +    K+Q +  MQEE+ SL KN T+ LVE P+  + + CKW++KLK    G+ K  R+KARLV KG+ Q+ GID+
Subjt:  DSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENK-PRFKARLVAKGYTQQYGIDY

Query:  TEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSC
         EIFSPVV+ +SI  +LSL A  DLE+EQ+DV TAFLHG+LEEEIYM QP GFEV GK++ VC+L KSLYGLKQ+PRQWY KFD F++S  ++K   D C
Subjt:  TEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSC

Query:  VYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRL
        VY K  +    + LL+YVDDMLI  +D   I +LK  L   F+MKD+G A QILGM +VR R +  L ++Q  Y+E++++R+ ++NAK V  PLA H +L
Subjt:  VYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRL

Query:  SKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLD
        SK + P + EE+  M  VPYSS VGSLMY M C+RPDIA A+ VVSR++ + GK HWEAVKWI+RY++GTT   L +  GS  D  L GY+DAD A D+D
Subjt:  SKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLD

Query:  KRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTM
         R+S++GY+FT     +SW+S LQ   ALSTTE+E I+  E   E  WL   + EL     +  V+ DSQSAI LSKN  YH R+KHID RYH++RE   
Subjt:  KRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTM

Query:  NEEIKLGKVASANNPTDMLTK
        +E +K+ K+++  NP DMLTK
Subjt:  NEEIKLGKVASANNPTDMLTK

P25600 Putative transposon Ty5-1 protein YCL074W4.7e-3630.91Show/hide
Query:  MDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDIS
        MDV TAFL+  ++E IY++QPPGF      + V  L   +YGLKQ+P  W +  +  ++  GF + + +  +Y +  +    +Y+ +YVDD+L+A+    
Subjt:  MDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDIS

Query:  KIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMY
          +R+K +L   + MKD+G   + LG+N +     G + ++   Y+ K      +   K    PL     L +  SP  +      +  PY S+VG L++
Subjt:  KIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMY

Query:  VMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRS-HLQSVTA
             RPDI++ +S++SR++ +    H E+ + ++RY+  T +  L Y  GS     L  Y DA   A  D   ST GY+  L    V+W S  L+ V  
Subjt:  VMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRS-HLQSVTA

Query:  LSTTESESISLVEAVNE
        + +TE+E I+  E V E
Subjt:  LSTTESESISLVEAVNE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.0e-8035.88Show/hide
Query:  SEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEH-KLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHV
        SEP T  QA+K E   +W   M  E+N+ + N+TW LV  P  H  +V C+W++  K +  G +  R+KARLVAKGY Q+ G+DY E FSPV++ +SI +
Subjt:  SEPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEH-KLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHV

Query:  LLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLL
        +L +       + Q+DV  AFL G L +++YM QPPGF  + + N VC+L K+LYGLKQ+PR WY +   ++ + GFV    D+ +++  +    +VY+L
Subjt:  LLSLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLL

Query:  IYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEM
        +YVDD+LI   D + +      L   F +KD       LG+     R    L ++Q  Y+  ++ R  +  AK V  P+A   +LS     K  +  +  
Subjt:  IYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEM

Query:  EHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQN
            Y  +VGSL Y +  +RPDI++A++ +S++M    + H +A+K I+RY+ GT N+G+   +G+     L  YSDAD+A D D   ST+GYI  L  +
Subjt:  EHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQN

Query:  TVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANN
         +SW S  Q     S+TE+E  S+    +E  W+  L+ EL +  T    ++ D+  A  L  NP +H R KHI   YH++R +  +  +++  V++ + 
Subjt:  TVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANN

Query:  PTDMLTKPLT
          D LTKPL+
Subjt:  PTDMLTKPLT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-7532.71Show/hide
Query:  PNTPQQNGLSREEEPQPEDPSNVPVQTSEENLTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQ
        PN+P  +  S    P P  P+   +Q + +   N      R +  IR P++            S A +L  +     SEP T  QA+K +   +W   M 
Subjt:  PNTPQQNGLSREEEPQPEDPSNVPVQTSEENLTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTYEQAIKSENKSQWVTTMQ

Query:  EELNSLMKNNTWALVEKPEEH-KLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLH
         E+N+ + N+TW LV  P     +V C+W++  K +  G +  R+KARLVAKGY Q+ G+DY E FSPV++ +SI ++L +       + Q+DV  AFL 
Subjt:  EELNSLMKNNTWALVEKPEEH-KLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDVTTAFLH

Query:  GELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQL
        G L +E+YM QPPGF  + + + VCRL K++YGLKQ+PR WY +   ++ + GFV    D+ +++       ++Y+L+YVDD+LI   D   ++     L
Subjt:  GELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQL

Query:  KSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDI
           F +K+       LG+     R    L ++Q  Y   ++ R  +  AK V  P+A   +L+     K  +  +      Y  +VGSL Y +  +RPD+
Subjt:  KSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDI

Query:  AFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESIS
        ++A++ +S+YM      HW A+K ++RY+ GT ++G+   +G+     L  YSDAD+A D D   ST+GYI  L  + +SW S  Q     S+TE+E  S
Subjt:  AFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESIS

Query:  LVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLT
        +    +E  W+  L+ EL +  ++   ++ D+  A  L  NP +H R KHI   YH++R +  +  +++  V++ +   D LTKPL+
Subjt:  LVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.0e-7534.58Show/hide
Query:  EPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLL
        EP TY +A   +    W   M +E+ ++   +TW +   P   K + CKW+YK+K +  G  + R+KARLVAKGYTQQ GID+ E FSPV + +S+ ++L
Subjt:  EPLTYEQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLL

Query:  SLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKE----NSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVY
        ++ A ++  L Q+D++ AFL+G+L+EEIYM+ PPG+  R  +    N+VC L KS+YGLKQ+ RQW+ KF   +   GFV+   D   ++K     + + 
Subjt:  SLVAQFDLELEQMDVTTAFLHGELEEEIYMRQPPGFEVRGKE----NSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVY

Query:  LLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERK
        +L+YVDD++I S + + ++ LK QLKS F+++D+G     LG+ + R+  AG  +  + Y L+ +      E     C P +     S   S  S  +  
Subjt:  LLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERK

Query:  EMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLF
        + +   Y  ++G LMY +  +R DI+FA++ +S++       H +AV  I+ YIKGT   GL YS  S  +  L  +SDA F +  D RRST+GY   L 
Subjt:  EMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLF

Query:  QNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASA
         + +SW+S  Q V + S+ E+E  +L  A +E  WL     EL L  +    +F D+ +AI ++ N  +H+R+KHI++  H +RE+++  +  L     A
Subjt:  QNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFEL-LGDTNQLSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASA

Query:  NNPTDMLTK---PLTAARMMHLIDLLKIMQLGRLL
         +  D  T+   P+    +M+++ +  +  L  L+
Subjt:  NNPTDMLTK---PLTAARMMHLIDLLKIMQLGRLL

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.2e-0737.18Show/hide
Query:  MTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGY
        +T +RPD+ FA++ +S++ S       +AV  ++ Y+KGT   GL YS  S  D  L  ++D+D+A+  D RRS +G+
Subjt:  MTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGY

ATMG00300.1 Gag-Pol-related retrotransposon family protein4.7e-1543.88Show/hide
Query:  GQPENGLYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL
        G   + LY L G S     +N+  T     E  LWH RLAH+S++G+  L K+G L  +K  SL FCE C  GK+ RV F+   H T+  LDY+HSDL
Subjt:  GQPENGLYTLNGASCVDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDL

ATMG00810.1 DNA/RNA polymerases superfamily protein4.2e-2430.8Show/hide
Query:  VYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEE
        +YLL+YVDD+L+     + +  L  QL S F MKD+G     LG+  ++   +G L ++Q+ Y E+I+    + + K +  PL    +L+ ++S     +
Subjt:  VYLLIYVDDMLIASQDISKIERLKLQLKSEFEMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEE

Query:  RKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFT
          +     + S+VG+L Y +T +RPDI++A+++V + M +     ++ +K ++RY+KGT  +GL   + S  +  +  + D+D+A     RRST+G+   
Subjt:  RKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDLGKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFT

Query:  LFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTW
        L  N +SW +  Q   + S+TE+E  +L     E TW
Subjt:  LFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.1e-1541.23Show/hide
Query:  LTYEQAIKSENKS--------QWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYS
        LT    IK E KS         W   MQEEL++L +N TW LV  P    ++ CKW++K K    G    R KARLVAKG+ Q+ GI + E +SPVVR +
Subjt:  LTYEQAIKSENKS--------QWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYS

Query:  SIHVLLSLVAQFDL
        +I  +L++  Q ++
Subjt:  SIHVLLSLVAQFDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCTAAGTTTGAGCTTGAGAAATTTGATGGTAAAGGGGATTTTAGCTTGTGGAGGAAGAAACTAAAGGCTATGTTAGTGCAACTTAAGGTTGCTAAGATTTTAGA
AAAACCAGAAGACCTTCCAGAACCCCTATCCGAAGCAAAGAGAAAAGAGATGGATGAAATTGCCTTTAGCACTATGATTCTTTATCTATCTGACTCTGTGTTACGCCAAG
TAGATGAGATAAACAATGCTGTTGAAATATGGAAGAAGCTAGATTCTATGTACTTGAATAAGTCTTTAACAAATAAAATCTTGTTGAAGGAAAGACTATTTGGTTATAGA
ATGGACTCAACAAAACCTTTGGAAGATAATCTAGATGAATTCGTTAAGATTTTCTTAGATGTAACAAACATCGGTGAGAAAATTGATGATGAAAACCAAGCAATAATATT
GTTAAATTCGCTGTCTGAAAGCTATAAAGAGATAAAAGCTGCTATAAAATATGGAAGGGACTCTTTGACATTGAGACTTGTCCTTGATGCTTTAAGATCAAAAGATATGG
AAACTAGAACACAGAGAAAAGAAACTGAGGCATTAACAGTGAGGGGAAGACCTGAAAATAGACTGAATACTCCAAAGATTAACAAGAGAGACCAAAACAAACATAGATCA
AAATCAAAGGGCAAAGAAACCAGGGAGTGTTTTTATTGCCATAAATCAGGTCACCTCATTAAAGACTGTAGGAAGCTTCAGTATAAAAACAAACATCAGAATAAGAAGGA
GGGCCAAAACTCAGCTAATATCCATGAGAGTTACGATTCAACAAAGTGTTCTATCAAAGGTGTTGGTTCTGTAAAACTCAACTTAATGGACGAAACTCATAAAATTCTGT
CATCCGTTAGATATGTCCCTGATTTGAAAAGAAACCTTATCTCATTAGGGACTTTAGATAAGGCTGGGCAACCTGAAAACGGATTGTATACTTTGAATGGGGCATCATGT
GTTGACCCTGCCAATAATGTAGTTCCAACCCAACCTAATAATAGTGAAGCCCTATTGTGGCACAAGAGATTAGCACATGTTAGTGAGAAGGGGTTGGCTGAATTGCAAAA
ACAGGGGCTTCTTGGAAAAACTAAATCAGTATCTTTACCCTTCTGTGAGCAGTGTACACTTGGGAAATCTGCTAGGGTCCAATTCAATCGATCAATGCATAAAACTCAAG
GCATTTTAGATTATATTCATTCTGATCTTAAATCAGATGTGTTTGAAACTTTCTCCAACTGGAAGACTCTTGTGGAAAACCAAACTAATAAGAAGATTAAGAAAATTCAA
ACAGATAATGACCTAGAGTTTTTAAGTGAAAACTTCAAGAACATGTGCACCAAATCTGGATTTCAGAAGCATTTAACTATACCAAACACACCCCAACAAAATGGTCTTTC
GAGAGAGGAAGAACCTCAACCAGAAGATCCTTCAAATGTCCCTGTACAAACTTCAGAAGAAAATTTAACCAACTACCAACTTACTAGAGATAGGCAGAGACGCACAATTA
GGCCTCCTTCCAGATTTGAACAGGCAAATTGCAACATAGAAATTCCAGATTCAGCGGCTTATGCATTAATATGTGATGATGATAATGAAATTTCAGAACCTTTAACCTAT
GAACAAGCCATAAAATCAGAAAACAAATCTCAGTGGGTTACAACAATGCAAGAGGAGTTAAACTCCCTTATGAAGAATAATACTTGGGCTTTAGTGGAGAAACCCGAAGA
GCATAAGTTGGTAGATTGTAAATGGTTGTACAAACTAAAACCTAGTCTTAAAGGAGAAAATAAGCCAAGATTTAAGGCTAGGTTAGTAGCCAAAGGGTACACACAACAAT
ATGGAATAGATTATACTGAAATATTTTCACCAGTTGTGAGATACTCTTCCATACATGTACTGTTATCCTTGGTAGCTCAATTTGATCTAGAACTAGAACAAATGGACGTT
ACAACCGCGTTCTTACATGGGGAACTAGAAGAGGAAATCTACATGAGACAACCACCGGGATTTGAGGTGAGGGGAAAAGAAAATTCGGTATGCAGATTATTAAAATCACT
TTATGGCTTGAAACAATCTCCTCGTCAATGGTATAAGAAATTTGATGGATTTATTCAGTCTTGTGGCTTTGTTAAAAGAAAATATGACAGCTGTGTGTATATTAAATATG
AGAACCATGTTGTTGTTGTATATCTCCTCATATATGTAGATGATATGCTTATAGCCTCTCAAGATATCAGTAAGATTGAACGACTAAAATTACAATTGAAGAGTGAATTT
GAAATGAAAGACATAGGGTCAGCATTTCAAATTCTTGGCATGAATTTAGTAAGAAACAGAGGAGCTGGAACCTTAGTTATAAATCAGTCATATTATCTTGAGAAAATAGT
GAAGAGATACTGCCTTGAGAATGCAAAAACAGTATGTGTACCTTTAGCACAACACTTTAGACTGTCTAAGGCTTTATCACCAAAATCAGAAGAAGAAAGGAAAGAGATGG
AACATGTACCATATTCAAGTGTTGTAGGAAGCTTGATGTATGTAATGACATGTTCTAGACCAGACATAGCCTTTGCCATGAGTGTTGTGAGTCGATACATGAGTGATCTA
GGGAAACCACACTGGGAAGCCGTTAAATGGATCATAAGGTACATCAAAGGAACAACCAATTATGGACTGATTTATTCAAGAGGGAGCTTAAAAGACAGTTGCCTAGTAGG
ATACTCTGATGCAGACTTCGCAGCAGATCTGGATAAGAGAAGATCAACCTCAGGGTATATTTTTACTTTGTTTCAGAATACAGTGAGTTGGAGATCCCATCTCCAATCAG
TAACAGCTCTCTCTACAACAGAATCGGAGTCTATCTCTCTCGTTGAAGCAGTAAATGAAGGTACTTGGTTAACGGGCTTAGTTTTCGAATTGCTTGGTGATACTAATCAG
CTTTCTGTTTTTAGTGACAGCCAGAGTGCCATTGCACTGTCTAAAAATCCACAGTACCATGATCGTTCGAAGCATATCGATACGAGGTATCATTACCTCAGGGAGAAAAC
TATGAATGAAGAAATCAAGCTGGGAAAGGTAGCATCTGCAAATAACCCAACCGATATGCTGACCAAACCTCTCACAGCAGCTAGAATGATGCACTTGATTGACCTACTTA
AAATCATGCAGTTGGGGCGTTTGTTGTCGGTTTTTGCCCGCGACGACACATGGTGGCTGAAGCTGCAAAATAGGGGCGGTGGGCATTGGGCTGCAGGTGGAGTGACGACG
ACAGTTTGGTGGTCAGACATGAAAGAGAAAGAAATCAATGGAAAAGGGGAGCTACATTGCTACGTTAAACCAAGAGAGGGAGAAGATGAGGACCCACGACGTGCGAAGAC
CCACGACCTCTCTCTCCCTCGTCCCTCCTTGAGCACGAAGACCCACGACTTCTCTCTCCCTCGTTCCTCATTGCGCGTGAAGACCCACGATGAGACAACCACAATTCTCT
CACTCCTCGTGTGCAAAGGGGAAAGGTCTGAAACGAAGGGGAAATCCCATTCGTTTCAAATCGATTCCAAGAAAGAGACGGAGATGGCGCAAGGGCGACGATGCGAGGAT
GGAGCGGCAGCGACGGCGATGGGGAAAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCTAAGTTTGAGCTTGAGAAATTTGATGGTAAAGGGGATTTTAGCTTGTGGAGGAAGAAACTAAAGGCTATGTTAGTGCAACTTAAGGTTGCTAAGATTTTAGA
AAAACCAGAAGACCTTCCAGAACCCCTATCCGAAGCAAAGAGAAAAGAGATGGATGAAATTGCCTTTAGCACTATGATTCTTTATCTATCTGACTCTGTGTTACGCCAAG
TAGATGAGATAAACAATGCTGTTGAAATATGGAAGAAGCTAGATTCTATGTACTTGAATAAGTCTTTAACAAATAAAATCTTGTTGAAGGAAAGACTATTTGGTTATAGA
ATGGACTCAACAAAACCTTTGGAAGATAATCTAGATGAATTCGTTAAGATTTTCTTAGATGTAACAAACATCGGTGAGAAAATTGATGATGAAAACCAAGCAATAATATT
GTTAAATTCGCTGTCTGAAAGCTATAAAGAGATAAAAGCTGCTATAAAATATGGAAGGGACTCTTTGACATTGAGACTTGTCCTTGATGCTTTAAGATCAAAAGATATGG
AAACTAGAACACAGAGAAAAGAAACTGAGGCATTAACAGTGAGGGGAAGACCTGAAAATAGACTGAATACTCCAAAGATTAACAAGAGAGACCAAAACAAACATAGATCA
AAATCAAAGGGCAAAGAAACCAGGGAGTGTTTTTATTGCCATAAATCAGGTCACCTCATTAAAGACTGTAGGAAGCTTCAGTATAAAAACAAACATCAGAATAAGAAGGA
GGGCCAAAACTCAGCTAATATCCATGAGAGTTACGATTCAACAAAGTGTTCTATCAAAGGTGTTGGTTCTGTAAAACTCAACTTAATGGACGAAACTCATAAAATTCTGT
CATCCGTTAGATATGTCCCTGATTTGAAAAGAAACCTTATCTCATTAGGGACTTTAGATAAGGCTGGGCAACCTGAAAACGGATTGTATACTTTGAATGGGGCATCATGT
GTTGACCCTGCCAATAATGTAGTTCCAACCCAACCTAATAATAGTGAAGCCCTATTGTGGCACAAGAGATTAGCACATGTTAGTGAGAAGGGGTTGGCTGAATTGCAAAA
ACAGGGGCTTCTTGGAAAAACTAAATCAGTATCTTTACCCTTCTGTGAGCAGTGTACACTTGGGAAATCTGCTAGGGTCCAATTCAATCGATCAATGCATAAAACTCAAG
GCATTTTAGATTATATTCATTCTGATCTTAAATCAGATGTGTTTGAAACTTTCTCCAACTGGAAGACTCTTGTGGAAAACCAAACTAATAAGAAGATTAAGAAAATTCAA
ACAGATAATGACCTAGAGTTTTTAAGTGAAAACTTCAAGAACATGTGCACCAAATCTGGATTTCAGAAGCATTTAACTATACCAAACACACCCCAACAAAATGGTCTTTC
GAGAGAGGAAGAACCTCAACCAGAAGATCCTTCAAATGTCCCTGTACAAACTTCAGAAGAAAATTTAACCAACTACCAACTTACTAGAGATAGGCAGAGACGCACAATTA
GGCCTCCTTCCAGATTTGAACAGGCAAATTGCAACATAGAAATTCCAGATTCAGCGGCTTATGCATTAATATGTGATGATGATAATGAAATTTCAGAACCTTTAACCTAT
GAACAAGCCATAAAATCAGAAAACAAATCTCAGTGGGTTACAACAATGCAAGAGGAGTTAAACTCCCTTATGAAGAATAATACTTGGGCTTTAGTGGAGAAACCCGAAGA
GCATAAGTTGGTAGATTGTAAATGGTTGTACAAACTAAAACCTAGTCTTAAAGGAGAAAATAAGCCAAGATTTAAGGCTAGGTTAGTAGCCAAAGGGTACACACAACAAT
ATGGAATAGATTATACTGAAATATTTTCACCAGTTGTGAGATACTCTTCCATACATGTACTGTTATCCTTGGTAGCTCAATTTGATCTAGAACTAGAACAAATGGACGTT
ACAACCGCGTTCTTACATGGGGAACTAGAAGAGGAAATCTACATGAGACAACCACCGGGATTTGAGGTGAGGGGAAAAGAAAATTCGGTATGCAGATTATTAAAATCACT
TTATGGCTTGAAACAATCTCCTCGTCAATGGTATAAGAAATTTGATGGATTTATTCAGTCTTGTGGCTTTGTTAAAAGAAAATATGACAGCTGTGTGTATATTAAATATG
AGAACCATGTTGTTGTTGTATATCTCCTCATATATGTAGATGATATGCTTATAGCCTCTCAAGATATCAGTAAGATTGAACGACTAAAATTACAATTGAAGAGTGAATTT
GAAATGAAAGACATAGGGTCAGCATTTCAAATTCTTGGCATGAATTTAGTAAGAAACAGAGGAGCTGGAACCTTAGTTATAAATCAGTCATATTATCTTGAGAAAATAGT
GAAGAGATACTGCCTTGAGAATGCAAAAACAGTATGTGTACCTTTAGCACAACACTTTAGACTGTCTAAGGCTTTATCACCAAAATCAGAAGAAGAAAGGAAAGAGATGG
AACATGTACCATATTCAAGTGTTGTAGGAAGCTTGATGTATGTAATGACATGTTCTAGACCAGACATAGCCTTTGCCATGAGTGTTGTGAGTCGATACATGAGTGATCTA
GGGAAACCACACTGGGAAGCCGTTAAATGGATCATAAGGTACATCAAAGGAACAACCAATTATGGACTGATTTATTCAAGAGGGAGCTTAAAAGACAGTTGCCTAGTAGG
ATACTCTGATGCAGACTTCGCAGCAGATCTGGATAAGAGAAGATCAACCTCAGGGTATATTTTTACTTTGTTTCAGAATACAGTGAGTTGGAGATCCCATCTCCAATCAG
TAACAGCTCTCTCTACAACAGAATCGGAGTCTATCTCTCTCGTTGAAGCAGTAAATGAAGGTACTTGGTTAACGGGCTTAGTTTTCGAATTGCTTGGTGATACTAATCAG
CTTTCTGTTTTTAGTGACAGCCAGAGTGCCATTGCACTGTCTAAAAATCCACAGTACCATGATCGTTCGAAGCATATCGATACGAGGTATCATTACCTCAGGGAGAAAAC
TATGAATGAAGAAATCAAGCTGGGAAAGGTAGCATCTGCAAATAACCCAACCGATATGCTGACCAAACCTCTCACAGCAGCTAGAATGATGCACTTGATTGACCTACTTA
AAATCATGCAGTTGGGGCGTTTGTTGTCGGTTTTTGCCCGCGACGACACATGGTGGCTGAAGCTGCAAAATAGGGGCGGTGGGCATTGGGCTGCAGGTGGAGTGACGACG
ACAGTTTGGTGGTCAGACATGAAAGAGAAAGAAATCAATGGAAAAGGGGAGCTACATTGCTACGTTAAACCAAGAGAGGGAGAAGATGAGGACCCACGACGTGCGAAGAC
CCACGACCTCTCTCTCCCTCGTCCCTCCTTGAGCACGAAGACCCACGACTTCTCTCTCCCTCGTTCCTCATTGCGCGTGAAGACCCACGATGAGACAACCACAATTCTCT
CACTCCTCGTGTGCAAAGGGGAAAGGTCTGAAACGAAGGGGAAATCCCATTCGTTTCAAATCGATTCCAAGAAAGAGACGGAGATGGCGCAAGGGCGACGATGCGAGGAT
GGAGCGGCAGCGACGGCGATGGGGAAAAGATAA
Protein sequenceShow/hide protein sequence
MSAKFELEKFDGKGDFSLWRKKLKAMLVQLKVAKILEKPEDLPEPLSEAKRKEMDEIAFSTMILYLSDSVLRQVDEINNAVEIWKKLDSMYLNKSLTNKILLKERLFGYR
MDSTKPLEDNLDEFVKIFLDVTNIGEKIDDENQAIILLNSLSESYKEIKAAIKYGRDSLTLRLVLDALRSKDMETRTQRKETEALTVRGRPENRLNTPKINKRDQNKHRS
KSKGKETRECFYCHKSGHLIKDCRKLQYKNKHQNKKEGQNSANIHESYDSTKCSIKGVGSVKLNLMDETHKILSSVRYVPDLKRNLISLGTLDKAGQPENGLYTLNGASC
VDPANNVVPTQPNNSEALLWHKRLAHVSEKGLAELQKQGLLGKTKSVSLPFCEQCTLGKSARVQFNRSMHKTQGILDYIHSDLKSDVFETFSNWKTLVENQTNKKIKKIQ
TDNDLEFLSENFKNMCTKSGFQKHLTIPNTPQQNGLSREEEPQPEDPSNVPVQTSEENLTNYQLTRDRQRRTIRPPSRFEQANCNIEIPDSAAYALICDDDNEISEPLTY
EQAIKSENKSQWVTTMQEELNSLMKNNTWALVEKPEEHKLVDCKWLYKLKPSLKGENKPRFKARLVAKGYTQQYGIDYTEIFSPVVRYSSIHVLLSLVAQFDLELEQMDV
TTAFLHGELEEEIYMRQPPGFEVRGKENSVCRLLKSLYGLKQSPRQWYKKFDGFIQSCGFVKRKYDSCVYIKYENHVVVVYLLIYVDDMLIASQDISKIERLKLQLKSEF
EMKDIGSAFQILGMNLVRNRGAGTLVINQSYYLEKIVKRYCLENAKTVCVPLAQHFRLSKALSPKSEEERKEMEHVPYSSVVGSLMYVMTCSRPDIAFAMSVVSRYMSDL
GKPHWEAVKWIIRYIKGTTNYGLIYSRGSLKDSCLVGYSDADFAADLDKRRSTSGYIFTLFQNTVSWRSHLQSVTALSTTESESISLVEAVNEGTWLTGLVFELLGDTNQ
LSVFSDSQSAIALSKNPQYHDRSKHIDTRYHYLREKTMNEEIKLGKVASANNPTDMLTKPLTAARMMHLIDLLKIMQLGRLLSVFARDDTWWLKLQNRGGGHWAAGGVTT
TVWWSDMKEKEINGKGELHCYVKPREGEDEDPRRAKTHDLSLPRPSLSTKTHDFSLPRSSLRVKTHDETTTILSLLVCKGERSETKGKSHSFQIDSKKETEMAQGRRCED
GAAATAMGKR