; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020573 (gene) of Snake gourd v1 genome

Gene IDTan0020573
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG03:58979151..58985621
RNA-Seq ExpressionTan0020573
SyntenyTan0020573
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN80930.1 hypothetical protein VITISV_005279 [Vitis vinifera]4.5e-20340.05Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KMKD +SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F DQ+VVEKIMVSVP KFE+KIS+ EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKF PC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +WLI
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K   +VLY+  L QNLLS+AQ+L N +++ FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----
                                                                   Q CESC+ GK  R PFP+  S RA  KLEL+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----

Query:  -------------------------NGKEYTSKQFDKFCKDLEIQH------------QLSIAYTPQQN------------------------DRYWA--
                                   K      F  F K +E Q             +L+  Y+PQQN                           WA  
Subjt:  -------------------------NGKEYTSKQFDKFCKDLEIQH------------QLSIAYTPQQN------------------------DRYWA--

Query:  ---------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFE
                       V+ KT              L   G          +R KLDE+A K VF+GYAA+SKGY I+ L+  KI++SRDV  DE+SYW+++
Subjt:  ---------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFE

Query:  EQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTW
         +K+ +   T   ILE       +I+S   +G  PL      V+AT D P  K++ L               C  +    +    AMK E++ I +N TW
Subjt:  EQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTW

Query:  ELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----------------SWHDMTQYDYCFSLHIW----GWKIFHLDVKSTFTNGDLEEEIYV-----
        +L E P  KNA+GVKWVFR K+NSDGS+ ++KA                 ++  + ++D    L       GWK++HLDVKS F NG L EEIYV     
Subjt:  ELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----------------SWHDMTQYDYCFSLHIW----GWKIFHLDVKSTFTNGDLEEEIYV-----

Query:  --------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKM
                                                                            N   GM+I Q    IFI+QRKYA  ILKKFK+
Subjt:  --------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKM

Query:  ETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDS
        E+CK V+T LA N K+SKNDGEKL  P+ YRSL+G LLYLT +RPDLMF ASLLSR+M+SPS +H G+ KRVL+Y+KGT + GIW+  +G +KL GY+DS
Subjt:  ETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDS

Query:  DWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        DWAG VDD KSTS Y F++G+ +  WNSRKQEVVAQST EAEYIS+
Subjt:  DWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

RVW63137.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.8e-20438.8Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  K ++ +H+ L+DHIF +I++ +T KQ WDKL  EF+GS RVK V+LLTLKREFE++KMKD +SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F +Q+VVEKIMVSVP KFE+KIS  EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKF PC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+QH  EQ  +    + ++   LFMAS   + ++  +WLI
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K + +VLY+  L QNLLS+AQ+L N ++V FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----
                                                                   Q CESC+ GK  R PFP+  S RA  KLEL+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----

Query:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------
                                                               DNG EYTSK+F  FC++  I HQL+  Y+PQQN            
Subjt:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------

Query:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE
                       WA                 V+ KT              L   G          +R KLDE+A K VF+GY A+SKGY I+ L+  
Subjt:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE

Query:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII
        KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S   +G  PL      V+AT D P  K++ L               C  +    +
Subjt:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII

Query:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDV
            AMK E++ I +N TW+L E P  KNA+GVKWVFR K+NSDGS+ ++KA                      + HD  +     +  + GWK++HLDV
Subjt:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDV

Query:  KSTFTNGDLEEEIYV-------------------------------------------------------------------------------------
        KS F NG L EEIYV                                                                                     
Subjt:  KSTFTNGDLEEEIYV-------------------------------------------------------------------------------------

Query:  ------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFA
                          N   GM+I Q    IFI+QRKY   ILKKFK+E+CK V+T LA N K+SKNDGEKL  P+ YRSL+GSLLYLT +RPDLMF 
Subjt:  ------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFA

Query:  ASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        ASLLSR+M+ PS +H G+ KRVL+Y+KGT + GIW+  TG +KL GY+DSDWAG VDD KST  YVF++G+ +  WNSRKQEV AQST EAEYIS+
Subjt:  ASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

RVW63791.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.1e-20539.77Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KMKD +SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F DQ+VVEKIMVSVP KFE+KIS+ EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKFPPC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +WLI
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K + +VLY+  L QNLLS+AQ+L N ++V FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----
                                                                   Q CESC+ GK  R PFP+  S RA  KLEL+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----

Query:  -------------------------NGKEYTSKQFDKFCKDLEIQHQLSI---------AYTPQQNDRYWAVEDKTLMKLG-------------------
                                   K      F  F K +E Q   ++          Y+PQQN      +++T+M++                    
Subjt:  -------------------------NGKEYTSKQFDKFCKDLEIQHQLSI---------AYTPQQNDRYWAVEDKTLMKLG-------------------

Query:  ---------------------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSN
                             +R KLDE+A K VF+GYAA+SKGY I+ L+  KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S 
Subjt:  ---------------------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSN

Query:  DRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSV
          +G  PL      V+AT D P  K++ L               C  +    +    AMK E++ I +N TW+L E P  KNA+GVKWVFR K+NSDGS+
Subjt:  DRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSV

Query:  NKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDVKSTFTNGDLEEEIYV--------------------------------
         ++KA                      + HD  +     +  + GWK++HLDVKS F NG L EEIYV                                
Subjt:  NKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDVKSTFTNGDLEEEIYV--------------------------------

Query:  -----------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKK
                                                                               N   GM+I Q    IFI+QRKYA  ILKK
Subjt:  -----------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKK

Query:  FKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGY
        FK+E+CK V+T LA N K+SKNDGEKL  P+ YRSL+GSLLYLT +RPDLMF ASLLSR+M+SPS +H G+ KRVL+Y+KGT + GIW+  TG +KL GY
Subjt:  FKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGY

Query:  SDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        +DSDWAG VDD KSTS Y F++G+ +  WNSRKQEVVAQST EAEYIS+
Subjt:  SDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

RVW88032.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.3e-20540.58Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T K  WDKL  EFEGS RVK V+LL LKR FE++KMKD + V 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY  ++M +VNQIRL GE F DQ+VVEKIMV VP KFE+KIS+ +ES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKG-GASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQT-QHPQEQANCA-HNNHETNFLFMASHT------NDNK
        +   KGK  G+S+KGKFP C +C++TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+ Q P++ AN    + ++   LF+AS        + ++
Subjt:  DQGIKGKG-GASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQT-QHPQEQANCA-HNNHETNFLFMASHT------NDNK

Query:  STSWLIDSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFS----------------
          +WLIDSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+  K + +VLY+  L QNLLS+AQ+L N ++                
Subjt:  STSWLIDSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFS----------------

Query:  VIFKDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----------------------------------------------------------
        +    Q CESC+ GK  R PFP+  S +A  KLEL++SD+C                                                           
Subjt:  VIFKDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----------------------------------------------------------

Query:  -DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQNDRYWAVEDKTLMKLG--------------------------------QRSKLDEKAIKRVFIGYAA
         DNG EY SK+ + FC++  I HQL   Y+PQQN   +  +++T+M++                                 +R KLDE+A K VF+GYAA
Subjt:  -DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQNDRYWAVEDKTLMKLG--------------------------------QRSKLDEKAIKRVFIGYAA

Query:  DSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL----------
        +SKGY I+ L+  KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S   +G  PL      V+AT D    K++ L           
Subjt:  DSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL----------

Query:  ---RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFS
            C  +    +     MK E++ I +N TW+L E P  KNA+GVKWVFR K+NSDGS+ ++KA                      + HD  +     +
Subjt:  ---RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFS

Query:  LHIWGWKIFHLDVKSTFTNGDLEEEIYVNQLK--------------------------------------------------------------------
          + GWK++HLDVKSTF NG L EEIYV Q K                                                                    
Subjt:  LHIWGWKIFHLDVKSTFTNGDLEEEIYVNQLK--------------------------------------------------------------------

Query:  -----------------------------------GMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSL
                                           GM+I Q    IFI+QRKYA  ILKKFK+E+CK V+T LA N K+SKNDGEKL  P+ YRSL+GSL
Subjt:  -----------------------------------GMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSL

Query:  LYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQS
        LYLT ++PDLMF ASLLSR+++SP  +H G+ KRVL+Y+KGT + GIW+  TG +KL GY+DSDWAG VDD KSTS YVF++ + +  WNSRKQEVVAQS
Subjt:  LYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQS

Query:  TVEAEYISI
        T EAEYIS+
Subjt:  TVEAEYISI

RVX13462.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]6.6e-20739.98Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KMKD++SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F DQ+VVEKIMVSVP KFE+KIS+ EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKFPPC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W I
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ I++S+Q KV LG+GE V A+GKGT  + TK+G K + +VLY+  L QNLLS+AQ+L N ++V FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----
                                                                   Q CESC+ GK  R PFP+  S RA  KL+L+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----

Query:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------
                                                               DNG EYTSK+F  FC++  I HQL+  Y+PQQN            
Subjt:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------

Query:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE
                       WA                 V+ KT              L   G          +R KLDE+A K VF+GYAA SKGY I+ L   
Subjt:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE

Query:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII
        KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S   +G  PL      V+AT D P  K++ L               C  +    +
Subjt:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII

Query:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDY-----CFSLH-----------IWGWKIFHLDVK
            AMK E++ I +N TW+L E P  KNA+GVKWVFR K+NS GS+ ++KA      +  +   DY       ++H             GWK++HLDVK
Subjt:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDY-----CFSLH-----------IWGWKIFHLDVK

Query:  STFTNGDLEEEIYVNQLKGMKI------------------------------------------------------LQEDM-------------------
        S F NG L EEIYV Q +G ++                                                      LQ D+                   
Subjt:  STFTNGDLEEEIYVNQLKGMKI------------------------------------------------------LQEDM-------------------

Query:  ---SIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKG
            IFI+QRKYA  ILKKFK+E+CK V+T LA N K+SKND EKL  P+ YRSL+GSLLYLT +RPDLMF ASLLSR+M+SPS +H G+ KRVL+Y+KG
Subjt:  ---SIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKG

Query:  TLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        T + GIW+  TG +KL GY+DSDWAG VDD KSTS Y F++G+ +  WNSRKQEVVAQST EAEYIS+
Subjt:  TLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

TrEMBL top hitse value%identityAlignment
A0A438FT52 Retrovirus-related Pol polyprotein from transposon RE18.7e-20538.8Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  K ++ +H+ L+DHIF +I++ +T KQ WDKL  EF+GS RVK V+LLTLKREFE++KMKD +SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F +Q+VVEKIMVSVP KFE+KIS  EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKF PC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+QH  EQ  +    + ++   LFMAS   + ++  +WLI
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K + +VLY+  L QNLLS+AQ+L N ++V FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----
                                                                   Q CESC+ GK  R PFP+  S RA  KLEL+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----

Query:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------
                                                               DNG EYTSK+F  FC++  I HQL+  Y+PQQN            
Subjt:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------

Query:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE
                       WA                 V+ KT              L   G          +R KLDE+A K VF+GY A+SKGY I+ L+  
Subjt:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE

Query:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII
        KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S   +G  PL      V+AT D P  K++ L               C  +    +
Subjt:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII

Query:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDV
            AMK E++ I +N TW+L E P  KNA+GVKWVFR K+NSDGS+ ++KA                      + HD  +     +  + GWK++HLDV
Subjt:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDV

Query:  KSTFTNGDLEEEIYV-------------------------------------------------------------------------------------
        KS F NG L EEIYV                                                                                     
Subjt:  KSTFTNGDLEEEIYV-------------------------------------------------------------------------------------

Query:  ------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFA
                          N   GM+I Q    IFI+QRKY   ILKKFK+E+CK V+T LA N K+SKNDGEKL  P+ YRSL+GSLLYLT +RPDLMF 
Subjt:  ------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFA

Query:  ASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        ASLLSR+M+ PS +H G+ KRVL+Y+KGT + GIW+  TG +KL GY+DSDWAG VDD KST  YVF++G+ +  WNSRKQEV AQST EAEYIS+
Subjt:  ASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

A0A438FV11 Retrovirus-related Pol polyprotein from transposon RE13.9e-20539.77Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KMKD +SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F DQ+VVEKIMVSVP KFE+KIS+ EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKFPPC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +WLI
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K + +VLY+  L QNLLS+AQ+L N ++V FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----
                                                                   Q CESC+ GK  R PFP+  S RA  KLEL+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----

Query:  -------------------------NGKEYTSKQFDKFCKDLEIQHQLSI---------AYTPQQNDRYWAVEDKTLMKLG-------------------
                                   K      F  F K +E Q   ++          Y+PQQN      +++T+M++                    
Subjt:  -------------------------NGKEYTSKQFDKFCKDLEIQHQLSI---------AYTPQQNDRYWAVEDKTLMKLG-------------------

Query:  ---------------------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSN
                             +R KLDE+A K VF+GYAA+SKGY I+ L+  KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S 
Subjt:  ---------------------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSN

Query:  DRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSV
          +G  PL      V+AT D P  K++ L               C  +    +    AMK E++ I +N TW+L E P  KNA+GVKWVFR K+NSDGS+
Subjt:  DRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSV

Query:  NKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDVKSTFTNGDLEEEIYV--------------------------------
         ++KA                      + HD  +     +  + GWK++HLDVKS F NG L EEIYV                                
Subjt:  NKYKA----------------------SWHDMTQYDYCFSLHIWGWKIFHLDVKSTFTNGDLEEEIYV--------------------------------

Query:  -----------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKK
                                                                               N   GM+I Q    IFI+QRKYA  ILKK
Subjt:  -----------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKK

Query:  FKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGY
        FK+E+CK V+T LA N K+SKNDGEKL  P+ YRSL+GSLLYLT +RPDLMF ASLLSR+M+SPS +H G+ KRVL+Y+KGT + GIW+  TG +KL GY
Subjt:  FKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGY

Query:  SDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        +DSDWAG VDD KSTS Y F++G+ +  WNSRKQEVVAQST EAEYIS+
Subjt:  SDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

A0A438HU89 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-20640.58Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T K  WDKL  EFEGS RVK V+LL LKR FE++KMKD + V 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY  ++M +VNQIRL GE F DQ+VVEKIMV VP KFE+KIS+ +ES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKG-GASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQT-QHPQEQANCA-HNNHETNFLFMASHT------NDNK
        +   KGK  G+S+KGKFP C +C++TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+ Q P++ AN    + ++   LF+AS        + ++
Subjt:  DQGIKGKG-GASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQT-QHPQEQANCA-HNNHETNFLFMASHT------NDNK

Query:  STSWLIDSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFS----------------
          +WLIDSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+  K + +VLY+  L QNLLS+AQ+L N ++                
Subjt:  STSWLIDSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFS----------------

Query:  VIFKDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----------------------------------------------------------
        +    Q CESC+ GK  R PFP+  S +A  KLEL++SD+C                                                           
Subjt:  VIFKDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----------------------------------------------------------

Query:  -DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQNDRYWAVEDKTLMKLG--------------------------------QRSKLDEKAIKRVFIGYAA
         DNG EY SK+ + FC++  I HQL   Y+PQQN   +  +++T+M++                                 +R KLDE+A K VF+GYAA
Subjt:  -DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQNDRYWAVEDKTLMKLG--------------------------------QRSKLDEKAIKRVFIGYAA

Query:  DSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL----------
        +SKGY I+ L+  KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S   +G  PL      V+AT D    K++ L           
Subjt:  DSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL----------

Query:  ---RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFS
            C  +    +     MK E++ I +N TW+L E P  KNA+GVKWVFR K+NSDGS+ ++KA                      + HD  +     +
Subjt:  ---RCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA----------------------SWHDMTQYDYCFS

Query:  LHIWGWKIFHLDVKSTFTNGDLEEEIYVNQLK--------------------------------------------------------------------
          + GWK++HLDVKSTF NG L EEIYV Q K                                                                    
Subjt:  LHIWGWKIFHLDVKSTFTNGDLEEEIYVNQLK--------------------------------------------------------------------

Query:  -----------------------------------GMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSL
                                           GM+I Q    IFI+QRKYA  ILKKFK+E+CK V+T LA N K+SKNDGEKL  P+ YRSL+GSL
Subjt:  -----------------------------------GMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSL

Query:  LYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQS
        LYLT ++PDLMF ASLLSR+++SP  +H G+ KRVL+Y+KGT + GIW+  TG +KL GY+DSDWAG VDD KSTS YVF++ + +  WNSRKQEVVAQS
Subjt:  LYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQS

Query:  TVEAEYISI
        T EAEYIS+
Subjt:  TVEAEYISI

A0A438JWX0 Retrovirus-related Pol polyprotein from transposon RE23.2e-20739.98Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KMKD++SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F DQ+VVEKIMVSVP KFE+KIS+ EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKFPPC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W I
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ I++S+Q KV LG+GE V A+GKGT  + TK+G K + +VLY+  L QNLLS+AQ+L N ++V FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----
                                                                   Q CESC+ GK  R PFP+  S RA  KL+L+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVC-----

Query:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------
                                                               DNG EYTSK+F  FC++  I HQL+  Y+PQQN            
Subjt:  -------------------------------------------------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------

Query:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE
                       WA                 V+ KT              L   G          +R KLDE+A K VF+GYAA SKGY I+ L   
Subjt:  ------------DRYWA-----------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSE

Query:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII
        KI++SRDV  DE+SYW+++ +K+ +   T   ILE       +I+S   +G  PL      V+AT D P  K++ L               C  +    +
Subjt:  KIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTII

Query:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDY-----CFSLH-----------IWGWKIFHLDVK
            AMK E++ I +N TW+L E P  KNA+GVKWVFR K+NS GS+ ++KA      +  +   DY       ++H             GWK++HLDVK
Subjt:  NGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDY-----CFSLH-----------IWGWKIFHLDVK

Query:  STFTNGDLEEEIYVNQLKGMKI------------------------------------------------------LQEDM-------------------
        S F NG L EEIYV Q +G ++                                                      LQ D+                   
Subjt:  STFTNGDLEEEIYVNQLKGMKI------------------------------------------------------LQEDM-------------------

Query:  ---SIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKG
            IFI+QRKYA  ILKKFK+E+CK V+T LA N K+SKND EKL  P+ YRSL+GSLLYLT +RPDLMF ASLLSR+M+SPS +H G+ KRVL+Y+KG
Subjt:  ---SIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKG

Query:  TLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        T + GIW+  TG +KL GY+DSDWAG VDD KSTS Y F++G+ +  WNSRKQEVVAQST EAEYIS+
Subjt:  TLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

A5B9M8 Integrase catalytic domain-containing protein2.2e-20340.05Show/hide
Query:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG
        GLW  V +  DP PLG N T+ Q++ +EEEKLK  KA++ +H+ L+DHIF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KMKD +SV 
Subjt:  GLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMKDSDSVG

Query:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK
        DY+ ++M +VNQ+RL GE F DQ+VVEKIMVSVP KFE+KIS+ EES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       +   K
Subjt:  DYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETK

Query:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI
        +   K + G+S+KGKF PC +CK+TNH EK+CW K K  ++C +CNK GH+EK+C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +WLI
Subjt:  DQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWLI

Query:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------
        DSGCT+HM K +++F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K   +VLY+  L QNLLS+AQ+L N +++ FK+                 
Subjt:  DSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKD-----------------

Query:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----
                                                                   Q CESC+ GK  R PFP+  S RA  KLEL+HSD+C     
Subjt:  -----------------------------------------------------------QICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCD----

Query:  -------------------------NGKEYTSKQFDKFCKDLEIQH------------QLSIAYTPQQN------------------------DRYWA--
                                   K      F  F K +E Q             +L+  Y+PQQN                           WA  
Subjt:  -------------------------NGKEYTSKQFDKFCKDLEIQH------------QLSIAYTPQQN------------------------DRYWA--

Query:  ---------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFE
                       V+ KT              L   G          +R KLDE+A K VF+GYAA+SKGY I+ L+  KI++SRDV  DE+SYW+++
Subjt:  ---------------VEDKT--------------LMKLG----------QRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFE

Query:  EQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTW
         +K+ +   T   ILE       +I+S   +G  PL      V+AT D P  K++ L               C  +    +    AMK E++ I +N TW
Subjt:  EQKIVRDSSTNFEILEQIEFSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLL-------------RCICKFFTIINGFFAMKEELEMINKNNTW

Query:  ELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----------------SWHDMTQYDYCFSLHIW----GWKIFHLDVKSTFTNGDLEEEIYV-----
        +L E P  KNA+GVKWVFR K+NSDGS+ ++KA                 ++  + ++D    L       GWK++HLDVKS F NG L EEIYV     
Subjt:  ELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----------------SWHDMTQYDYCFSLHIW----GWKIFHLDVKSTFTNGDLEEEIYV-----

Query:  --------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKM
                                                                            N   GM+I Q    IFI+QRKYA  ILKKFK+
Subjt:  --------------------------------------------------------------------NQLKGMKILQEDMSIFIAQRKYAQGILKKFKM

Query:  ETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDS
        E+CK V+T LA N K+SKNDGEKL  P+ YRSL+G LLYLT +RPDLMF ASLLSR+M+SPS +H G+ KRVL+Y+KGT + GIW+  +G +KL GY+DS
Subjt:  ETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDS

Query:  DWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI
        DWAG VDD KSTS Y F++G+ +  WNSRKQEVVAQST EAEYIS+
Subjt:  DWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISI

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.0e-3022.02Show/hide
Query:  QNLLSIAQLLHNKFSVIFKDQICESCQEGKMHRLPFPK-GGSFRAKDKLELVHSDVC-------------------------------------------
        +N+ S   LL+N   +    +ICE C  GK  RLPF +       K  L +VHSDVC                                           
Subjt:  QNLLSIAQLLHNKFSVIFKDQICESCQEGKMHRLPFPK-GGSFRAKDKLELVHSDVC-------------------------------------------

Query:  -----------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------------------DRYWAVEDKTLMKL------------
                         DNG+EY S +  +FC    I + L++ +TPQ N                          +W     T   L            
Subjt:  -----------------DNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN------------------------DRYWAVEDKTLMKL------------

Query:  ------------------------------GQRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIID-----------------------EDS
                                       ++ K D+K+ K +F+GY  +  G+ ++D  +EK I++RDV++D                       E+ 
Subjt:  ------------------------------GQRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIID-----------------------EDS

Query:  YWDFEEQKIVRDSSTN-FEILEQIEFSFDSIDSNDRD--------------------------------GQYPLS------NDDDLVDATG---------
         +  + +KI++    N  +  + I+F  DS +S +++                                 +Y L+       DD L ++ G         
Subjt:  YWDFEEQKIVRDSSTN-FEILEQIEFSFDSIDSNDRD--------------------------------GQYPLS------NDDDLVDATG---------

Query:  ------------DNP------------------------NYKVKSLLRCICKFFTIINGF-----------------FAMKEELEMINKNNTWELVEQPR
                    DNP                        N +  SL + +    TI N                    A+  EL     NNTW + ++P 
Subjt:  ------------DNP------------------------NYKVKSLLRCICKFFTIINGF-----------------FAMKEELEMINKNNTWELVEQPR

Query:  GKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDY---------------CFSLHI-WGWKIFHLDVKSTFTNGDLEEEIY-------------
         KN +  +WVF +KYN  G+  +YKA      +    Q DY                 SL I +  K+  +DVK+ F NG L+EEIY             
Subjt:  GKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDY---------------CFSLHI-WGWKIFHLDVKSTFTNGDLEEEIY-------------

Query:  --------------------------------------------------------------------------------------VNQLK---GMKILQ
                                                                                              +N++K   G++I  
Subjt:  --------------------------------------------------------------------------------------VNQLK---GMKILQ

Query:  EDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLY-LTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLK
        ++  I+++Q  Y + IL KF ME C AVST L + +     + ++  N T  RSLIG L+Y +  +RPDL  A ++LSRY +  +   +   KRVLRYLK
Subjt:  EDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLY-LTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLK

Query:  GTLDFGIWFE--FTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGN-RIFSWNSRKQEVVAQSTVEAEYISI
        GT+D  + F+       K+ GY DSDWAG   D KST+ Y+F + +  +  WN+++Q  VA S+ EAEY+++
Subjt:  GTLDFGIWFE--FTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGN-RIFSWNSRKQEVVAQSTVEAEYISI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3624.18Show/hide
Query:  KDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCDNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN-----------------------------
        KDQ+ +  Q  K H L   + G      KL+ + S   DNG EYTS++F+++C    I+H+ ++  TPQ N                             
Subjt:  KDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCDNGKEYTSKQFDKFCKDLEIQHQLSIAYTPQQN-----------------------------

Query:  -----------------------DRYWAVEDKTLMKL-------------GQRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWD
                               +R W  ++ +   L              QR+KLD+K+I  +FIGY  +  GY ++D   +K+I SRDV+  E     
Subjt:  -----------------------DRYWAVEDKTLMKL-------------GQRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWD

Query:  FEE--QKIVRDSSTNF----------------------------EILEQIEFSFDSIDSNDRDGQ-------------------------YPLSNDDDLV
          +  +K+      NF                            E++EQ E   + ++  +   Q                         Y L +DD   
Subjt:  FEE--QKIVRDSSTNF----------------------------EILEQIEFSFDSIDSNDRDGQ-------------------------YPLSNDDDLV

Query:  DATGDNPNYKVKSLLRCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-------SWHDMTQYDYCFSLHI
        ++  +  ++  K+ L              AM+EE+E + KN T++LVE P+GK  L  KWVF++K + D  + +YKA              +D  FS  +
Subjt:  DATGDNPNYKVKSLLRCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-------SWHDMTQYDYCFSLHI

Query:  --------------WGWKIFHLDVKSTFTNGDLEEEIYV-------------------------------------------------------------
                         ++  LDVK+ F +GDLEEEIY+                                                             
Subjt:  --------------WGWKIFHLDVKSTFTNGDLEEEIYV-------------------------------------------------------------

Query:  ------------------------------------------NQLKGMKILQEDMS--IFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSK----ND
                                                   Q+ GMKI++E  S  ++++Q KY + +L++F M+  K VST LA +LK+SK      
Subjt:  ------------------------------------------NQLKGMKILQEDMS--IFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSK----ND

Query:  GEKLSNPTN--YRSLIGSLLY-LTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVF
         E+  N     Y S +GSL+Y +  +RPD+  A  ++SR++ +P + H+   K +LRYL+GT    + F  +  + L GY+D+D AG +D+ KS++ Y+F
Subjt:  GEKLSNPTN--YRSLIGSLLY-LTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVF

Query:  SLGNRIFSWNSRKQEVVAQSTVEAEYIS
        +      SW S+ Q+ VA ST EAEYI+
Subjt:  SLGNRIFSWNSRKQEVVAQSTVEAEYIS

P92519 Uncharacterized mitochondrial protein AtMg008102.3e-2935.71Show/hide
Query:  VKSTFTNGDLEEEIYVNQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAAS
        + STF+  DL     V+   G++I      +F++Q KYA+ IL    M  CK +ST L   L  S +   K  +P+++RS++G+L YLT +RPD+ +A +
Subjt:  VKSTFTNGDLEEEIYVNQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAAS

Query:  LLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICI
        ++ + M+ P+   F + KRVLRY+KGT+  G++      L +  + DSDWAGC    +ST+ +   LG  I SW++++Q  V++S+ E EY ++ +
Subjt:  LLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.8e-3627.84Show/hide
Query:  AMKEELEMINKNNTWELVEQPRGK-NALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDYC---------FSLHI-------WGWKIFHLDVKSTF
        AM  E+     N+TW+LV  P      +G +W+F  KYNSDGS+N+YKA      ++     DY           S+ I         W I  LDV + F
Subjt:  AMKEELEMINKNNTWELVEQPRGK-NALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDYC---------FSLHI-------WGWKIFHLDVKSTF

Query:  TNGDLEEEIYVNQ-------------------LKGMK-----------------------------ILQEDMSI--------------------------
          G L +++Y++Q                   L G+K                             +LQ   SI                          
Subjt:  TNGDLEEEIYVNQ-------------------LKGMK-----------------------------ILQEDMSI--------------------------

Query:  ----------------------------FIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLS
                                     ++QR+Y   +L +  M T K V+T +A + K+S   G KL++PT YR ++GSL YL  +RPD+ +A + LS
Subjt:  ----------------------------FIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLS

Query:  RYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICIMYEIKQY
        ++M+ P+E H    KR+LRYL GT + GI+ +    L L  YSD+DWAG  DD  ST+ Y+  LG+   SW+S+KQ+ V +S+ EAEY S+       Q+
Subjt:  RYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICIMYEIKQY

Query:  GFANCLVIWDLNLKNLPI--CSVITSLLLLLRPIQYNMVEQSTSKSNFI
           + L    + L   P+  C  + +  L   P+ ++ ++      +FI
Subjt:  GFANCLVIWDLNLKNLPI--CSVITSLLLLLRPIQYNMVEQSTSKSNFI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-3728.51Show/hide
Query:  AMKEELEMINKNNTWELV-EQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDYC---------FSLHI-------WGWKIFHLDVKSTF
        AM  E+     N+TW+LV   P     +G +W+F  K+NSDGS+N+YKA      ++     DY           S+ I         W I  LDV + F
Subjt:  AMKEELEMINKNNTWELV-EQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDYC---------FSLHI-------WGWKIFHLDVKSTF

Query:  TNGDLEEEIYVNQ-------------------LKGMK-----------------------------ILQ-------------------------------
          G L +E+Y++Q                   + G+K                             +LQ                               
Subjt:  TNGDLEEEIYVNQ-------------------LKGMK-----------------------------ILQ-------------------------------

Query:  ----------EDMSIF-------------IAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLS
                  ED+  F             ++QR+Y   +L +  M T K V+T +AT+ K++ + G KL +PT YR ++GSL YL  +RPDL +A + LS
Subjt:  ----------EDMSIF-------------IAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAASLLS

Query:  RYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICIMYEIKQY
        +YM+ P++ H+   KRVLRYL GT D GI+ +    L L  YSD+DWAG  DD  ST+ Y+  LG+   SW+S+KQ+ V +S+ EAEY S+       Q+
Subjt:  RYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICIMYEIKQY

Query:  GFANCLVIWDLNLKNLPI--CSVITSLLLLLRPIQYNMVEQSTSKSNFI
           + L    + L + P+  C  + +  L   P+ ++ ++      +FI
Subjt:  GFANCLVIWDLNLKNLPI--CSVITSLLLLLRPIQYNMVEQSTSKSNFI

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein2.0e-1521.05Show/hide
Query:  GLWESVSTNVDPQP-----LGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSA--RVKAVKLLTLKREFEMLKM
        GLW+ V   V   P     L   +   ++    +  +K+ KAL ++ ++L+D +F + +   +AK  WD L +  E +   R++ V +  L+++ E LKM
Subjt:  GLWESVSTNVDPQP-----LGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSA--RVKAVKLLTLKREFEMLKM

Query:  KDSDSVGDYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAE
         D +S   Y  K + I+ ++  +     D  + + +  ++   F+   S  EE  D+  ++   L+     +     +     +E  F         + +
Subjt:  KDSDSVGDYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAE

Query:  DDRLETKDQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQANCAHNNHETNFLFMASHTNDNKST
        D RL++K +                C  C K NH +++C  +              HT+K         +  +++    +       L   ++ +D    
Subjt:  DDRLETKDQGIKGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQANCAHNNHETNFLFMASHTNDNKST

Query:  SWLIDSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKK-IFSVLYVLRLSQNLLSIAQLLHNKFSVIFKDQ-ICESCQEGK
         W+I      +M   +  F+ +D++ ++ V    G  +L EGKG   +  K+G+KK I +V++V  L++N+LS  +++  ++S+    Q  C  C  G+
Subjt:  SWLIDSGCTTHMAKDINLFSQIDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKK-IFSVLYVLRLSQNLLSIAQLLHNKFSVIFKDQ-ICESCQEGK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.3e-3024.55Show/hide
Query:  AMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDYCFSLH----------------IWGWKIFHLDVKSTFT
        AM +E+  +   +TWE+   P  K  +G KWV+++KYNSDG++ +YKA      +      D+  +                  I+ + +  LD+ + F 
Subjt:  AMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA-----SWHDMTQYDYCFSLH----------------IWGWKIFHLDVKSTFT

Query:  NGDLEEEIYV------------------------------------------------------------------------------------------
        NGDL+EEIY+                                                                                          
Subjt:  NGDLEEEIYV------------------------------------------------------------------------------------------

Query:  -NQLK---------------GMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAAS
         +QLK               G++I +    I I QRKYA  +L +  +  CK  S  +  ++  S + G    +   YR LIG L+YL  +R D+ FA +
Subjt:  -NQLK---------------GMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAAS

Query:  LLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEY
         LS++  +P   H     ++L Y+KGT+  G+++     ++L  +SD+ +  C D  +ST+ Y   LG  + SW S+KQ+VV++S+ EAEY
Subjt:  LLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEY

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.9e-1037.18Show/hide
Query:  LYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSY
        +YLT +RPDL FA + LS++ ++          +VL Y+KGT+  G+++  T +L+L  ++DSDWA C D  +S + +
Subjt:  LYLTTSRPDLMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSY

ATMG00810.1 DNA/RNA polymerases superfamily protein1.7e-3035.71Show/hide
Query:  VKSTFTNGDLEEEIYVNQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAAS
        + STF+  DL     V+   G++I      +F++Q KYA+ IL    M  CK +ST L   L  S +   K  +P+++RS++G+L YLT +RPD+ +A +
Subjt:  VKSTFTNGDLEEEIYVNQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPDLMFAAS

Query:  LLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICI
        ++ + M+ P+   F + KRVLRY+KGT+  G++      L +  + DSDWAGC    +ST+ +   LG  I SW++++Q  V++S+ E EY ++ +
Subjt:  LLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.9e-0850Show/hide
Query:  AMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA
        AM+EEL+ +++N TW LV  P  +N LG KWVF+ K +SDG++++ KA
Subjt:  AMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACGCCACCGTACAAATTATTGCGAGAGATCGAAAGACCTCCGTGAACCGTGAGGTCGACTTAGTTAAAGTATTGTGGCGGAACCACCTTACGGAAGAAGCTAC
GGGGGAGCGAGAAGAGGAAGGAAGCACCGCCGTCGAACACTCCACGTTCGGGATTCCTCGGTTGAGTTGCAGTCTCCTCTCGCGGAGCCCTTGCGCACTTGGCGATACCA
CTCGTGATCCCCCGCCCACCGCCACCATTCGCGCACAAATTCCGTTTCACGACGGCGAGTCCATCGGAGCAGTGAACGCTCGGCCGCTCGCGCCCACCAACCTCGGACGG
AAGGTCATTCGCCAGCCTTCTCGCCGAATCTCGACTCAGATCGCCCTCGTTAGCCGCCGCGAAGCCGCTCTCGTCGGCGGTAGAAGTCCGCCACCGCTTCGTGAGGTGAC
TTTCGAAGCCGATCGCAAGCCGTCGCAAGACCGAACCACCGCCGATCCACGAACGCTGCGGCGGCAAAGACCGGCTTCAATTCGCGACTCGACTCCACGCACGCCGGACA
CCGCGATTTGTCAGCCGGTACCTTACAATTATGATTTTAGTGAAGGCTTAGTGAATAATTTCGAACTGTTGTGTGAGTCAGGTTCCACCAAAAGAGGAGTAGTCAAGGAA
TTCGGGTTAAGTCTTGGAGTGGAATTCGAATTAAGGGGCGTTACACTTGGCCTTTGGGAATCTGTCTCAACTAATGTTGATCCTCAACCATTAGGAGAAAATCTGACGTT
GAATCAGATAAGACTACACGAAGAGGAGAAATTGAAAAATCCGAAAGCTTTATCCGTTATTCATGCTGCTTTATCAGATCATATTTTTGCTAGGATCATTGATTGTAAAA
CAGCAAAACAAGCTTGGGATAAATTGCATGAGGAATTTGAAGGAAGTGCGAGGGTGAAAGCTGTCAAATTATTGACTCTCAAGAGAGAGTTTGAGATGCTGAAAATGAAG
GATTCAGATTCTGTGGGGGACTACACAACAAAAGTGATGACTATCGTAAACCAGATAAGACTATCTGGTGAAAATTTTCCAGATCAAAGAGTTGTGGAAAAAATAATGGT
TAGTGTTCCCAACAAATTTGAATCGAAGATCTCATCCAGAGAGGAGTCTTCTGATTTGACTACTCTGTCTATAGCTGAGTTAATTAGCAAATTACAAGCTCAAGAACAAA
GGGATACAATGCGCAATGAAGAGCATGATGAGGGTGCATTTCATGCCAAGTCTAAAGGCAAGAAACCTGTTGCAGAAGATGATAGACTAGAGACCAAAGATCAGGGGATC
AAGGGAAAAGGAGGAGCATCAAAGAAAGGTAAATTTCCTCCTTGTCATTATTGTAAAAAGACCAATCATATTGAGAAAAATTGTTGGTCCAAAAACAAGCAAGCCTATCA
TTGTGAATATTGCAACAAGTATGGTCATACAGAGAAGTTTTGTTGTGCCAAGAAAACCCAAACCCAACATCCTCAGGAACAAGCGAATTGTGCCCACAATAATCATGAAA
CAAATTTTTTGTTTATGGCATCTCATACCAATGACAACAAGTCAACCTCTTGGCTTATTGATAGTGGATGCACTACTCACATGGCTAAAGATATCAATCTTTTTAGCCAA
ATTGACAAATCCATACAGTCTAAGGTGGTCCTTGGACATGGTGAGACAGTACTAGCTGAAGGTAAAGGTACTGCCATTATGCATACTAAGCAAGGTGAAAAGAAAATATT
CAGTGTCTTATATGTTCTAAGATTATCTCAAAACTTGCTCAGTATTGCTCAATTGTTGCATAACAAATTTTCTGTGATTTTCAAGGACCAAATTTGTGAAAGTTGTCAAG
AAGGGAAGATGCACAGATTACCTTTTCCCAAAGGCGGCAGCTTTAGAGCCAAAGACAAGCTTGAATTAGTGCACAGTGATGTTTGCGATAACGGGAAGGAATATACTTCC
AAGCAATTTGACAAGTTTTGTAAAGATTTGGAAATCCAACACCAGTTGTCCATTGCCTATACACCACAACAGAATGATCGCTACTGGGCGGTTGAGGACAAGACTCTTAT
GAAGCTTGGTCAGAGATCTAAACTTGATGAGAAGGCAATTAAGAGAGTATTTATTGGCTATGCTGCTGATTCAAAAGGGTACATAATTTTTGATTTGAACTCAGAAAAGA
TTATCTTAAGTCGTGATGTCATTATTGATGAAGATTCTTATTGGGATTTTGAGGAGCAAAAAATTGTGAGAGATTCATCCACCAATTTTGAAATCTTAGAACAAATTGAA
TTCTCCTTTGATTCAATTGATTCTAATGATCGAGATGGCCAATATCCTCTTTCAAACGATGATGATCTTGTTGATGCAACTGGTGATAATCCAAATTACAAGGTAAAATC
ACTTCTTCGATGTATATGCAAGTTCTTCACAATCATCAATGGATTCTTTGCTATGAAAGAAGAATTGGAGATGATCAATAAAAATAATACATGGGAGCTTGTAGAGCAAC
CACGAGGAAAGAACGCTTTGGGTGTGAAATGGGTGTTCAGAATGAAGTATAATTCTGATGGTTCAGTCAACAAATATAAAGCAAGTTGGCACGACATGACACAATATGAT
TATTGCTTTAGTTTGCACATTTGGGGGTGGAAAATATTCCATCTAGATGTCAAATCGACTTTTACGAATGGTGATCTTGAGGAGGAGATTTATGTGAATCAACTGAAGGG
TATGAAAATTCTTCAAGAAGATATGAGTATTTTTATTGCCCAAAGAAAGTATGCTCAAGGAATTTTGAAGAAATTTAAAATGGAAACATGTAAAGCTGTTTCCACTCTTT
TAGCGACAAACTTGAAAGTATCAAAGAATGATGGTGAGAAGCTATCTAATCCTACAAATTATAGAAGTTTAATAGGAAGTCTCTTGTATTTGACAACATCCAGGCCAGAC
TTGATGTTTGCTGCAAGTTTGCTCTCAAGATATATGAATTCACCTAGTGAAATTCATTTTGGTATTACTAAAAGAGTCCTTAGATACTTAAAGGGTACTCTTGATTTTGG
AATTTGGTTCGAATTTACTGGTAATTTGAAATTAACTGGATATTCCGACAGTGATTGGGCTGGTTGTGTTGATGATTCTAAAAGCACTTCCAGTTATGTTTTTTCTCTTG
GAAATAGAATTTTCTCTTGGAATTCAAGAAAACAAGAAGTAGTGGCTCAATCTACTGTTGAAGCTGAATATATTTCAATTTGCATCATGTATGAAATCAAGCAATATGGC
TTCGCAAATTGCTTAGTGATTTGGGATTTGAACCTAAAGAACCTACCAATTTGTTCTGTGATAACAAGTCTGCTATTGCTATTGCGTCCAATCCAGTACAACATGGTAGA
ACAAAGCACATCAAAGTCAAATTTCATTTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACGCCACCGTACAAATTATTGCGAGAGATCGAAAGACCTCCGTGAACCGTGAGGTCGACTTAGTTAAAGTATTGTGGCGGAACCACCTTACGGAAGAAGCTAC
GGGGGAGCGAGAAGAGGAAGGAAGCACCGCCGTCGAACACTCCACGTTCGGGATTCCTCGGTTGAGTTGCAGTCTCCTCTCGCGGAGCCCTTGCGCACTTGGCGATACCA
CTCGTGATCCCCCGCCCACCGCCACCATTCGCGCACAAATTCCGTTTCACGACGGCGAGTCCATCGGAGCAGTGAACGCTCGGCCGCTCGCGCCCACCAACCTCGGACGG
AAGGTCATTCGCCAGCCTTCTCGCCGAATCTCGACTCAGATCGCCCTCGTTAGCCGCCGCGAAGCCGCTCTCGTCGGCGGTAGAAGTCCGCCACCGCTTCGTGAGGTGAC
TTTCGAAGCCGATCGCAAGCCGTCGCAAGACCGAACCACCGCCGATCCACGAACGCTGCGGCGGCAAAGACCGGCTTCAATTCGCGACTCGACTCCACGCACGCCGGACA
CCGCGATTTGTCAGCCGGTACCTTACAATTATGATTTTAGTGAAGGCTTAGTGAATAATTTCGAACTGTTGTGTGAGTCAGGTTCCACCAAAAGAGGAGTAGTCAAGGAA
TTCGGGTTAAGTCTTGGAGTGGAATTCGAATTAAGGGGCGTTACACTTGGCCTTTGGGAATCTGTCTCAACTAATGTTGATCCTCAACCATTAGGAGAAAATCTGACGTT
GAATCAGATAAGACTACACGAAGAGGAGAAATTGAAAAATCCGAAAGCTTTATCCGTTATTCATGCTGCTTTATCAGATCATATTTTTGCTAGGATCATTGATTGTAAAA
CAGCAAAACAAGCTTGGGATAAATTGCATGAGGAATTTGAAGGAAGTGCGAGGGTGAAAGCTGTCAAATTATTGACTCTCAAGAGAGAGTTTGAGATGCTGAAAATGAAG
GATTCAGATTCTGTGGGGGACTACACAACAAAAGTGATGACTATCGTAAACCAGATAAGACTATCTGGTGAAAATTTTCCAGATCAAAGAGTTGTGGAAAAAATAATGGT
TAGTGTTCCCAACAAATTTGAATCGAAGATCTCATCCAGAGAGGAGTCTTCTGATTTGACTACTCTGTCTATAGCTGAGTTAATTAGCAAATTACAAGCTCAAGAACAAA
GGGATACAATGCGCAATGAAGAGCATGATGAGGGTGCATTTCATGCCAAGTCTAAAGGCAAGAAACCTGTTGCAGAAGATGATAGACTAGAGACCAAAGATCAGGGGATC
AAGGGAAAAGGAGGAGCATCAAAGAAAGGTAAATTTCCTCCTTGTCATTATTGTAAAAAGACCAATCATATTGAGAAAAATTGTTGGTCCAAAAACAAGCAAGCCTATCA
TTGTGAATATTGCAACAAGTATGGTCATACAGAGAAGTTTTGTTGTGCCAAGAAAACCCAAACCCAACATCCTCAGGAACAAGCGAATTGTGCCCACAATAATCATGAAA
CAAATTTTTTGTTTATGGCATCTCATACCAATGACAACAAGTCAACCTCTTGGCTTATTGATAGTGGATGCACTACTCACATGGCTAAAGATATCAATCTTTTTAGCCAA
ATTGACAAATCCATACAGTCTAAGGTGGTCCTTGGACATGGTGAGACAGTACTAGCTGAAGGTAAAGGTACTGCCATTATGCATACTAAGCAAGGTGAAAAGAAAATATT
CAGTGTCTTATATGTTCTAAGATTATCTCAAAACTTGCTCAGTATTGCTCAATTGTTGCATAACAAATTTTCTGTGATTTTCAAGGACCAAATTTGTGAAAGTTGTCAAG
AAGGGAAGATGCACAGATTACCTTTTCCCAAAGGCGGCAGCTTTAGAGCCAAAGACAAGCTTGAATTAGTGCACAGTGATGTTTGCGATAACGGGAAGGAATATACTTCC
AAGCAATTTGACAAGTTTTGTAAAGATTTGGAAATCCAACACCAGTTGTCCATTGCCTATACACCACAACAGAATGATCGCTACTGGGCGGTTGAGGACAAGACTCTTAT
GAAGCTTGGTCAGAGATCTAAACTTGATGAGAAGGCAATTAAGAGAGTATTTATTGGCTATGCTGCTGATTCAAAAGGGTACATAATTTTTGATTTGAACTCAGAAAAGA
TTATCTTAAGTCGTGATGTCATTATTGATGAAGATTCTTATTGGGATTTTGAGGAGCAAAAAATTGTGAGAGATTCATCCACCAATTTTGAAATCTTAGAACAAATTGAA
TTCTCCTTTGATTCAATTGATTCTAATGATCGAGATGGCCAATATCCTCTTTCAAACGATGATGATCTTGTTGATGCAACTGGTGATAATCCAAATTACAAGGTAAAATC
ACTTCTTCGATGTATATGCAAGTTCTTCACAATCATCAATGGATTCTTTGCTATGAAAGAAGAATTGGAGATGATCAATAAAAATAATACATGGGAGCTTGTAGAGCAAC
CACGAGGAAAGAACGCTTTGGGTGTGAAATGGGTGTTCAGAATGAAGTATAATTCTGATGGTTCAGTCAACAAATATAAAGCAAGTTGGCACGACATGACACAATATGAT
TATTGCTTTAGTTTGCACATTTGGGGGTGGAAAATATTCCATCTAGATGTCAAATCGACTTTTACGAATGGTGATCTTGAGGAGGAGATTTATGTGAATCAACTGAAGGG
TATGAAAATTCTTCAAGAAGATATGAGTATTTTTATTGCCCAAAGAAAGTATGCTCAAGGAATTTTGAAGAAATTTAAAATGGAAACATGTAAAGCTGTTTCCACTCTTT
TAGCGACAAACTTGAAAGTATCAAAGAATGATGGTGAGAAGCTATCTAATCCTACAAATTATAGAAGTTTAATAGGAAGTCTCTTGTATTTGACAACATCCAGGCCAGAC
TTGATGTTTGCTGCAAGTTTGCTCTCAAGATATATGAATTCACCTAGTGAAATTCATTTTGGTATTACTAAAAGAGTCCTTAGATACTTAAAGGGTACTCTTGATTTTGG
AATTTGGTTCGAATTTACTGGTAATTTGAAATTAACTGGATATTCCGACAGTGATTGGGCTGGTTGTGTTGATGATTCTAAAAGCACTTCCAGTTATGTTTTTTCTCTTG
GAAATAGAATTTTCTCTTGGAATTCAAGAAAACAAGAAGTAGTGGCTCAATCTACTGTTGAAGCTGAATATATTTCAATTTGCATCATGTATGAAATCAAGCAATATGGC
TTCGCAAATTGCTTAGTGATTTGGGATTTGAACCTAAAGAACCTACCAATTTGTTCTGTGATAACAAGTCTGCTATTGCTATTGCGTCCAATCCAGTACAACATGGTAGA
ACAAAGCACATCAAAGTCAAATTTCATTTATTGA
Protein sequenceShow/hide protein sequence
MANATVQIIARDRKTSVNREVDLVKVLWRNHLTEEATGEREEEGSTAVEHSTFGIPRLSCSLLSRSPCALGDTTRDPPPTATIRAQIPFHDGESIGAVNARPLAPTNLGR
KVIRQPSRRISTQIALVSRREAALVGGRSPPPLREVTFEADRKPSQDRTTADPRTLRRQRPASIRDSTPRTPDTAICQPVPYNYDFSEGLVNNFELLCESGSTKRGVVKE
FGLSLGVEFELRGVTLGLWESVSTNVDPQPLGENLTLNQIRLHEEEKLKNPKALSVIHAALSDHIFARIIDCKTAKQAWDKLHEEFEGSARVKAVKLLTLKREFEMLKMK
DSDSVGDYTTKVMTIVNQIRLSGENFPDQRVVEKIMVSVPNKFESKISSREESSDLTTLSIAELISKLQAQEQRDTMRNEEHDEGAFHAKSKGKKPVAEDDRLETKDQGI
KGKGGASKKGKFPPCHYCKKTNHIEKNCWSKNKQAYHCEYCNKYGHTEKFCCAKKTQTQHPQEQANCAHNNHETNFLFMASHTNDNKSTSWLIDSGCTTHMAKDINLFSQ
IDKSIQSKVVLGHGETVLAEGKGTAIMHTKQGEKKIFSVLYVLRLSQNLLSIAQLLHNKFSVIFKDQICESCQEGKMHRLPFPKGGSFRAKDKLELVHSDVCDNGKEYTS
KQFDKFCKDLEIQHQLSIAYTPQQNDRYWAVEDKTLMKLGQRSKLDEKAIKRVFIGYAADSKGYIIFDLNSEKIILSRDVIIDEDSYWDFEEQKIVRDSSTNFEILEQIE
FSFDSIDSNDRDGQYPLSNDDDLVDATGDNPNYKVKSLLRCICKFFTIINGFFAMKEELEMINKNNTWELVEQPRGKNALGVKWVFRMKYNSDGSVNKYKASWHDMTQYD
YCFSLHIWGWKIFHLDVKSTFTNGDLEEEIYVNQLKGMKILQEDMSIFIAQRKYAQGILKKFKMETCKAVSTLLATNLKVSKNDGEKLSNPTNYRSLIGSLLYLTTSRPD
LMFAASLLSRYMNSPSEIHFGITKRVLRYLKGTLDFGIWFEFTGNLKLTGYSDSDWAGCVDDSKSTSSYVFSLGNRIFSWNSRKQEVVAQSTVEAEYISICIMYEIKQYG
FANCLVIWDLNLKNLPICSVITSLLLLLRPIQYNMVEQSTSKSNFIY