; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G19740 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G19740
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:15427369..15430582
RNA-Seq ExpressionCSPI01G19740
SyntenyCSPI01G19740
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN75440.1 hypothetical protein VITISV_007304 [Vitis vinifera]0.0e+0057.51Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q+S WVID GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH +L H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+  H+RKP +L+LV+SDV GP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED--------------
        +PKDERSKLDAKT+ CVF+GYGQDE GY+ YD + KKL RS DVVF+E+ TI  IEK +  ES+HS  LIDL    LT   TQ+ED              
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED--------------

Query:  --EVQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA
          +V++E   D ++                 EQ      P+D+ LRRS RDR PSTRYS ++Y+LLTD GEPESY EA++DE+K +W DAM+DEMESLH 
Subjt:  --EVQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA

Query:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN
        NH+FELVKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK    K+                         S DLE++QMDVKTAFLHGDLDK+    
Subjt:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN

Query:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG
             + K K             +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIV RNVSRI+NLKKQLSKS AMKDLG
Subjt:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG

Query:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF
          K++LGI+I RDRASKKL M QE+YIEKV  RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASA+ SLMY MVCTRPDIA+ VGVV+RF
Subjt:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF

Query:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH
        +SNP + HWEAVK IMRY RGTS L+LTFG GK ILVGY DSDMAGD+ +RKSTSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAA EACKE+LW+K 
Subjt:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH

Query:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        F++ELGFKQQRYV+Y DNQSAIH  +N+++H+R+K IDVRYHW+RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S  GM +S+ +
Subjt:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

RVW22327.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0063.88Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNGDKDIS
         CLAG+Q RV FK+ +H+RKP                             +DHSRKIW+YTLKTKDQ+L  FKQF+A VER++GEKLKC+RTDNG +   
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNGDKDIS

Query:  YSHLRVFGSKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED
          +   F      H   DERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    LT   TQ+ED
Subjt:  YSHLRVFGSKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED

Query:  EVQNEQF----FDT-------YESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFEL
        E  ++Q      +T        +  EQ      P D+ LRRS RDR PSTRYS ++Y+LLTDGGEPESY EA+EDE+K +W DAM+DEMESLH NH+FEL
Subjt:  EVQNEQF----FDT-------YESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFEL

Query:  VKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLI
        VKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGL  SLDLE++QMDVKTAFLHGDLDK+         +
Subjt:  VKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLI

Query:  RKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVL
         K K             +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI++LKKQLSKS AMKDLG  K++L
Subjt:  RKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVL

Query:  GIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRK
        GI+I RDRASKKL M QE+YIEKVL RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMY MVCTRPDIA+ VGVV+RF+SNPR+
Subjt:  GIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRK

Query:  QHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELG
         HWEAVK IMRY RGTS L+LTFG GK ILVGY DSDMAGD+ +R+STSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAAAEACKE+LW+K F++ELG
Subjt:  QHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELG

Query:  FKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM
        FKQQRYV+Y DNQSAIH  +N+++H+R+K ID+RYHW+RDALND LFE+EKIHTD+NGSDMLTK  P  KL +  S  GM
Subjt:  FKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM

RVW24680.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0059.93Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+ +H+RKP +L+LV+SDVCGP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDEVQNEQF-------
        +PKDERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    LT   TQ+EDE  ++Q        
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDEVQNEQF-------

Query:  -----FDTYESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNK
              + ++    + +   PT V +   V ++ P+   +P++  L        SY EA+EDE+K +W DAM+DEMESLH NH+FELVKLPKGK  LKN+
Subjt:  -----FDTYESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNK

Query:  WVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK---------
        WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGLA SLDLE++QMDVKTAFLHGDLDK+         + K K         
Subjt:  WVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK---------

Query:  ---SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKL
            +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI++LKKQLSKS AMKDLG  K++LGI+I +DRASKKL
Subjt:  ---SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKL

Query:  YMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYS
         M QE+YIEKVL RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMY MVCTRPDIA+ VGVV+RF+SNP + HWEAVK IMRY 
Subjt:  YMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYS

Query:  RGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQRYVIYYDNQ
        RGTS L+LTFG GK ILVGY DSDMAGD+ +R+STSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAAAEACKE+LW+K F++ELGFKQQRYV+Y DNQ
Subjt:  RGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQRYVIYYDNQ

Query:  SAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        SAIH  +N+++H+R+K IDVRYHW+RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S  GM +S+S+
Subjt:  SAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

RVW85908.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0060.9Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+ +H+RKP +L+LV+SDVCGP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------
        +PKDERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    L    TQ+EDE             
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------

Query:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA
           V++E   D ++                 EQ      P+D+ LRRS RDR PSTRYS ++Y+LLTDGGEPESY EA+EDE+K +W DAM+DEMESLH 
Subjt:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA

Query:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN
        NH+FELVKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGLA SLDLE++QMDVKTAFLHGDLDK+    
Subjt:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN

Query:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG
             + K K             +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI+NLKKQLSKS AMKDLG
Subjt:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG

Query:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF
          K++LGI+I RDRASKKL M QE+YIEKVL RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMYVMVCTRPDIA+ VGVV+RF
Subjt:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF

Query:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH
        +SNP + HWEAVK IMRY RGTS L+LTFG GK ILVGY DSDMAGD+ +R+STSGYL TF+GG+VSWQS+LQKC+ LSTTEAEYIAAAEACKE+LW+K 
Subjt:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH

Query:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        F++ELGFKQQRYV+Y DNQSAIH  +N+++H+R+K IDVRYHW+RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S VGM +S+ +
Subjt:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

RVW88205.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0060.07Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+V  GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+ +H+RKP +L+LV+SDVCGP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------
        +PKDERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    L    TQ+EDE             
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------

Query:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA
           V++E   D ++                 EQ     VP+D+ LRRS RDR PST YS ++Y+LLTDGGE ESY EA+EDE+K +W DAM+DEMESLH 
Subjt:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA

Query:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN
        NH+FELVKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGLA SLDLE++QMDVKTAFLHGDLDK+    
Subjt:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN

Query:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG
             + K K             +  APRQWYKKFESVMG+QGYRKTT DHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI+NLKKQLSKS AMKDLG
Subjt:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG

Query:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF
          K++LGI+I RDRASKKL M QE+YIEK+L RF M + K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMY MVCTRPDIA+ VGVV+RF
Subjt:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF

Query:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH
        +SNP + HWEAVK IMRY RGTS L+LTFG GK I VGY DSDM GD+ +R+STSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAA EACKE+LW+K 
Subjt:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH

Query:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        F++ELGFKQQRYV+Y DNQSAIH  +N+++H+R+K IDVRYH +RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S  GM +S+ +
Subjt:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

TrEMBL top hitse value%identityAlignment
A0A438CGI4 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0063.88Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNGDKDIS
         CLAG+Q RV FK+ +H+RKP                             +DHSRKIW+YTLKTKDQ+L  FKQF+A VER++GEKLKC+RTDNG +   
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNGDKDIS

Query:  YSHLRVFGSKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED
          +   F      H   DERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    LT   TQ+ED
Subjt:  YSHLRVFGSKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED

Query:  EVQNEQF----FDT-------YESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFEL
        E  ++Q      +T        +  EQ      P D+ LRRS RDR PSTRYS ++Y+LLTDGGEPESY EA+EDE+K +W DAM+DEMESLH NH+FEL
Subjt:  EVQNEQF----FDT-------YESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFEL

Query:  VKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLI
        VKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGL  SLDLE++QMDVKTAFLHGDLDK+         +
Subjt:  VKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLI

Query:  RKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVL
         K K             +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI++LKKQLSKS AMKDLG  K++L
Subjt:  RKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVL

Query:  GIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRK
        GI+I RDRASKKL M QE+YIEKVL RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMY MVCTRPDIA+ VGVV+RF+SNPR+
Subjt:  GIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRK

Query:  QHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELG
         HWEAVK IMRY RGTS L+LTFG GK ILVGY DSDMAGD+ +R+STSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAAAEACKE+LW+K F++ELG
Subjt:  QHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELG

Query:  FKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM
        FKQQRYV+Y DNQSAIH  +N+++H+R+K ID+RYHW+RDALND LFE+EKIHTD+NGSDMLTK  P  KL +  S  GM
Subjt:  FKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM

A0A438CN91 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0059.93Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+ +H+RKP +L+LV+SDVCGP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDEVQNEQF-------
        +PKDERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    LT   TQ+EDE  ++Q        
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDEVQNEQF-------

Query:  -----FDTYESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNK
              + ++    + +   PT V +   V ++ P+   +P++  L        SY EA+EDE+K +W DAM+DEMESLH NH+FELVKLPKGK  LKN+
Subjt:  -----FDTYESSEQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNK

Query:  WVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK---------
        WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGLA SLDLE++QMDVKTAFLHGDLDK+         + K K         
Subjt:  WVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK---------

Query:  ---SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKL
            +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI++LKKQLSKS AMKDLG  K++LGI+I +DRASKKL
Subjt:  ---SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKL

Query:  YMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYS
         M QE+YIEKVL RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMY MVCTRPDIA+ VGVV+RF+SNP + HWEAVK IMRY 
Subjt:  YMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYS

Query:  RGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQRYVIYYDNQ
        RGTS L+LTFG GK ILVGY DSDMAGD+ +R+STSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAAAEACKE+LW+K F++ELGFKQQRYV+Y DNQ
Subjt:  RGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQRYVIYYDNQ

Query:  SAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        SAIH  +N+++H+R+K IDVRYHW+RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S  GM +S+S+
Subjt:  SAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

A0A438HN89 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0060.9Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+ +H+RKP +L+LV+SDVCGP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------
        +PKDERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    L    TQ+EDE             
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------

Query:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA
           V++E   D ++                 EQ      P+D+ LRRS RDR PSTRYS ++Y+LLTDGGEPESY EA+EDE+K +W DAM+DEMESLH 
Subjt:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA

Query:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN
        NH+FELVKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGLA SLDLE++QMDVKTAFLHGDLDK+    
Subjt:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN

Query:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG
             + K K             +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI+NLKKQLSKS AMKDLG
Subjt:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG

Query:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF
          K++LGI+I RDRASKKL M QE+YIEKVL RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMYVMVCTRPDIA+ VGVV+RF
Subjt:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF

Query:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH
        +SNP + HWEAVK IMRY RGTS L+LTFG GK ILVGY DSDMAGD+ +R+STSGYL TF+GG+VSWQS+LQKC+ LSTTEAEYIAAAEACKE+LW+K 
Subjt:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH

Query:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        F++ELGFKQQRYV+Y DNQSAIH  +N+++H+R+K IDVRYHW+RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S VGM +S+ +
Subjt:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

A0A438HUT4 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0060.07Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q++SWVID+GASIH T +++FF SY+  DFG+V  GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH RL H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+ +H+RKP +L+LV+SDVCGP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------
        +PKDERSKLDAKT+ CVF+GYGQDE GYR YDP+ KKL+RSRDVVF+E+ TI  IEK +  ES+HS  LIDL    L    TQ+EDE             
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDE-------------

Query:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA
           V++E   D ++                 EQ     VP+D+ LRRS RDR PST YS ++Y+LLTDGGE ESY EA+EDE+K +W DAM+DEMESLH 
Subjt:  ---VQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA

Query:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN
        NH+FELVKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK F+QKKGIDFDEIF+PVVKMSSIRVVLGLA SLDLE++QMDVKTAFLHGDLDK+    
Subjt:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN

Query:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG
             + K K             +  APRQWYKKFESVMG+QGYRKTT DHC+FVQK  +DDFVILLLYVDDILIVGRNVSRI+NLKKQLSKS AMKDLG
Subjt:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG

Query:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF
          K++LGI+I RDRASKKL M QE+YIEK+L RF M + K VSS L SHFK+ S+ SPST+K+ EDM +V YASAV SLMY MVCTRPDIA+ VGVV+RF
Subjt:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF

Query:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH
        +SNP + HWEAVK IMRY RGTS L+LTFG GK I VGY DSDM GD+ +R+STSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAA EACKE+LW+K 
Subjt:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH

Query:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        F++ELGFKQQRYV+Y DNQSAIH  +N+++H+R+K IDVRYH +RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S  GM +S+ +
Subjt:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

A5C9D7 Integrase catalytic domain-containing protein0.0e+0057.51Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KND+  + D +     DF I+ D DVVN A Q+S WVID GASIH T +++FF SY+  DFG+VR GNDGSA A+ + DV L+  NG+ L LKNVKHIPD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLISTGKLDDEGFCNTF +  WKLT+GS+VIAKG   SSLY M A+++DS IN V+D++  ELWH +L H+SEKGL IL KK+ L  +K   LKR  
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CLAG+Q RV FK+  H+RKP +L+LV+SDV GP+KTK+LG +LYFVTF +DHSRKIW+YTLKTKDQVL  FKQF+A VER++GEKLKC+RTDNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH
                                                                                         + +ISY HLRVFG KAFVH
Subjt:  ---------------------------------------------------------------------------------DKDISYSHLRVFGSKAFVH

Query:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED--------------
        +PKDERSKLDAKT+ CVF+GYGQDE GY+ YD + KKL RS DVVF+E+ TI  IEK +  ES+HS  LIDL    LT   TQ+ED              
Subjt:  VPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIED--------------

Query:  --EVQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA
          +V++E   D ++                 EQ      P+D+ LRRS RDR PSTRYS ++Y+LLTD GEPESY EA++DE+K +W DAM+DEMESLH 
Subjt:  --EVQNEQFFDTYESS---------------EQLVETVVPTDVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHA

Query:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN
        NH+FELVKLPKGK  LKN+WVY++K EEHTS+P YKARLVVK    K+                         S DLE++QMDVKTAFLHGDLDK+    
Subjt:  NHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWN

Query:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG
             + K K             +  APRQWYKKFESVMG+QGYRKTTSDHC+FVQK  +DDFVILLLYVDDILIV RNVSRI+NLKKQLSKS AMKDLG
Subjt:  NQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLG

Query:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF
          K++LGI+I RDRASKKL M QE+YIEKV  RF MS+ K VSS L SHFK+ S+ SPST+K+ EDM +V YASA+ SLMY MVCTRPDIA+ VGVV+RF
Subjt:  SAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKD-EDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRF

Query:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH
        +SNP + HWEAVK IMRY RGTS L+LTFG GK ILVGY DSDMAGD+ +RKSTSGYLMTF+GG+VSWQS+LQKC+ALSTTEAEYIAA EACKE+LW+K 
Subjt:  MSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKH

Query:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK
        F++ELGFKQQRYV+Y DNQSAIH  +N+++H+R+K IDVRYHW+RDALND LFE+EKIHTD+NGSDMLTK  PR KL +  S  GM +S+ +
Subjt:  FVKELGFKQQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.7e-10027.8Show/hide
Query:  RNGSRLILKNVKHIPDTRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTK
        RN   + L++V    +   NL+S  +L + G    FD     ++K  +++ K  N   L  +      +Y      + N  LWH+R  HIS+  L  + +
Subjt:  RNGSRLILKNVKHIPDTRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTK

Query:  KSYLPD---LKSTPL--KRYPRCLAGRQMRVTF---KSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFY
        K+   D   L +  L  +    CL G+Q R+ F   K   H ++P  L +VHSDVCGP+   +L    YFV F +  +     Y +K K  V   F+ F 
Subjt:  KSYLPD---LKSTPL--KRYPRCLAGRQMRVTF---KSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFY

Query:  AFVERETGEKLKCVRTDNG-------------DKDISY--------------------------------------------------------------
        A  E     K+  +  DNG              K ISY                                                              
Subjt:  AFVERETGEKLKCVRTDNG-------------DKDISY--------------------------------------------------------------

Query:  --------------SHLRVFGSKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTI----AYIEKI---DEPESKHS
                       HLRVFG+  +VH+ K+++ K D K+   +F+GY  +  G++L+D +N+K I +RDVV  E   +       E +   D  ES++ 
Subjt:  --------------SHLRVFGSKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTI----AYIEKI---DEPESKHS

Query:  DILIDLSSTSLTRSSTQIEDEVQNEQFF-DTYES--------SEQLVETVVPTDVSLRRSV---RDRRPSTRY---------------------SPNEYL
        +   D      T    +   E  N QF  D+ ES        S ++++T  P +     ++   +D + S +Y                     +PNE  
Subjt:  DILIDLSSTSLTRSSTQIEDEVQNEQFF-DTYES--------SEQLVETVVPTDVSLRRSV---RDRRPSTRY---------------------SPNEYL

Query:  ----------------LLTDGGE--------------------------------------PESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLP
                           DG E                                      P S++E    + K+ W +A+  E+ +   N+T+ + K P
Subjt:  ----------------LLTDGGE--------------------------------------PESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLP

Query:  KGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDK-------KSIWNNQR
        + K  + ++WV+ +K+ E  +   YKARLV + F+QK  ID++E FAPV ++SS R +L L    +L+V QMDVKTAFL+G L +       + I  N  
Subjt:  KGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDK-------KSIWNNQR

Query:  VLIRKEKSI---PFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYN-DDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIV
         + +  K+I     A R W++ FE  + +  +  ++ D CI++    N ++ + +LLYVDD++I   +++R+NN K+ L +   M DL   K  +GI+I 
Subjt:  VLIRKEKSI---PFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYN-DDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIV

Query:  RDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAV
         +    K+Y+SQ  Y++K+L +F M    +VS+ LPS            N DED +     S +  LMY+M+CTRPD+   V +++R+ S    + W+ +
Subjt:  RDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAV

Query:  KCIMRYSRGTSSLRLTFGDG---KSILVGYIDSDMAGDLASRKSTSGYLM-TFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFK
        K ++RY +GT  ++L F      ++ ++GY+DSD AG    RKST+GYL   F    + W +K Q  +A S+TEAEY+A  EA +E LWLK  +  +  K
Subjt:  KCIMRYSRGTSSLRLTFGDG---KSILVGYIDSDMAGDLASRKSTSGYLM-TFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFK

Query:  QQRYV-IYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM
         +  + IY DNQ  I    N S H R K ID++YH+ R+ + + +  LE I T++  +D+ TK  P ++    R  +G+
Subjt:  QQRYV-IYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM

P0CV72 Secreted RxLR effector protein 1612.3e-3051.88Show/hide
Query:  MSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTF-GDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGS
        M  V Y SAV ++MY+MV TRPD+A  VGV+++F S+P   HW+A+K ++RY + T +  L F   G + LVGY D+D AGD+ SR+STSGYL    GG 
Subjt:  MSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTF-GDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGS

Query:  VSWQSKLQKCIALSTTEAEYIAAAEACKEMLWL
        VSW+SK Q+ +ALS+TE EY+A +EA +E +WL
Subjt:  VSWQSKLQKCIALSTTEAEYIAAAEACKEMLWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.0e-26547.85Show/hide
Query:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD
        KNDD++ A      N   FI  + + ++L+  +S WV+D  AS H T  R+ F  Y   DFG V+ GN   +    I D+ +K   G  L+LK+V+H+PD
Subjt:  KNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPD

Query:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP
         RMNLIS   LD +G+ + F N  W+LTKGS+VIAKG    +LY  +A+I    +N   DE +V+LWHKR+ H+SEKGL+IL KKS +   K T +K   
Subjt:  TRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYP

Query:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----
         CL G+Q RV+F++S   RK N+L+LV+SDVCGP++ +S+G   YFVTF +D SRK+W+Y LKTKDQV Q F++F+A VERETG KLK +R+DNG     
Subjt:  RCLAGRQMRVTFKSSQHSRKPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNG-----

Query:  ----------------------------------------------------------------------------------DKDISYSHLRVFGSKAFV
                                                                                          +K++SYSHL+VFG +AF 
Subjt:  ----------------------------------------------------------------------------------DKDISYSHLRVFGSKAFV

Query:  HVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTS---LTRSSTQIEDEVQNEQFFDT
        HVPK++R+KLD K+  C+F+GYG +EFGYRL+DP+ KK+IRSRDVVF E +     +  ++ ++      + + STS    +  ST  E   Q EQ  + 
Subjt:  HVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTS---LTRSSTQIEDEVQNEQFFDT

Query:  YESSEQLVETVV----PTD-----VSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKN
         E  EQL E V     PT        LRRS R R  S RY   EY+L++D  EPES +E +    KN+   AM++EMESL  N T++LV+LPKGK  LK 
Subjt:  YESSEQLVETVV----PTD-----VSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKN

Query:  KWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKK---------SIWNNQRVLIRKE
        KWV+K+K +       YKARLVVK F QKKGIDFDEIF+PVVKM+SIR +L LA SLDLEVEQ+DVKTAFLHGDL+++          +   + ++ +  
Subjt:  KWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKK---------SIWNNQRVLIRKE

Query:  KS---IPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKK
        KS   +  APRQWY KF+S M  Q Y KT SD C++ ++   ++F+ILLLYVDD+LIVG++   I  LK  LSKS  MKDLG A+++LG++IVR+R S+K
Subjt:  KS---IPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKK

Query:  LYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDE-DMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRY
        L++SQEKYIE+VLERF M   K VS+ L  H K+  K  P+T +++ +M+KV Y+SAV SLMY MVCTRPDIAH VGVV+RF+ NP K+HWEAVK I+RY
Subjt:  LYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDE-DMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRY

Query:  SRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQRYVIYYDN
         RGT+   L FG    IL GY D+DMAGD+ +RKS++GYL TF+GG++SWQSKLQKC+ALSTTEAEYIAA E  KEM+WLK F++ELG  Q+ YV+Y D+
Subjt:  SRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQRYVIYYDN

Query:  QSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM
        QSAI   +N+ +H+RTK IDVRYHW+R+ ++DE  ++ KI T+ N +DMLTK  PR+K EL +  VGM
Subjt:  QSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-6332.71Show/hide
Query:  RPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKN-KWVYKIKHEEHTSKPHYKARLVVKEFSQKKGID
        +P+ +YS    + L    EP +  +A++DE    W +AM  E+ +   NHT++LV  P     +   +W++  K+    S   YKARLV K ++Q+ G+D
Subjt:  RPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKN-KWVYKIKHEEHTSKPHYKARLVVKEFSQKKGID

Query:  FDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDH
        + E F+PV+K +SIR+VLG+A      + Q+DV  AFL G L      +     I K++             +  APR WY +  + +   G+  + SD 
Subjt:  FDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSDH

Query:  CIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSL-PSHF
         +FV +      V +L+YVDDILI G + + ++N    LS+  ++KD       LGI+    R    L++SQ +YI  +L R  M   K V++ + PS  
Subjt:  CIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSL-PSHF

Query:  KMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSI-LVGYIDSDMAGDLAS
          +   +  T+  E      Y   V SL Y +  TRPDI++ V  +++FM  P ++H +A+K I+RY  GT +  +    G ++ L  Y D+D AGD   
Subjt:  KMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSI-LVGYIDSDMAGDLAS

Query:  RKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQR-YVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALN
          ST+GY++      +SW SK QK +  S+TEAEY + A    EM W+   + ELG +  R  VIY DN  A +   N  FHSR K I + YH++R+ + 
Subjt:  RKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQR-YVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALN

Query:  DELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMT
             +  + T    +D LTK   R+  + + S +G+T
Subjt:  DELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-5930.73Show/hide
Query:  RRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKN-KWVYKIKHEEHTSKPHYKARLVVKEFSQKKGI
        R+P+ +YS      L    EP +  +A++D+    W  AM  E+ +   NHT++LV  P     +   +W++  K     S   YKARLV K ++Q+ G+
Subjt:  RRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKN-KWVYKIKHEEHTSKPHYKARLVVKEFSQKKGI

Query:  DFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSD
        D+ E F+PV+K +SIR+VLG+A      + Q+DV  AFL G L  +   +     + K++             +  APR WY +  + +   G+  + SD
Subjt:  DFDEIFAPVVKMSSIRVVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEK------------SIPFAPRQWYKKFESVMGKQGYRKTTSD

Query:  HCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHF
          +FV +      + +L+YVDDILI G +   + +    LS+  ++K+       LGI+    R  + L++SQ +Y   +L R  M   K V++ + +  
Subjt:  HCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHF

Query:  KMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSI-LVGYIDSDMAGDLAS
        K+      S  K  D ++  Y   V SL Y +  TRPD+++ V  ++++M  P   HW A+K ++RY  GT    +    G ++ L  Y D+D AGD   
Subjt:  KMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSI-LVGYIDSDMAGDLAS

Query:  RKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFK-QQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALN
          ST+GY++      +SW SK QK +  S+TEAEY + A    E+ W+   + ELG +     VIY DN  A +   N  FHSR K I + YH++R+ + 
Subjt:  RKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFK-QQRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALN

Query:  DELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM
             +  + T    +D LTK   R   + +   +G+
Subjt:  DELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.7e-6033.61Show/hide
Query:  EPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLG
        EP +Y EA E      W  AM DE+ ++   HT+E+  LP  K  +  KWVYKIK+    +   YKARLV K ++Q++GIDF E F+PV K++S++++L 
Subjt:  EPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLG

Query:  LATSLDLEVEQMDVKTAFLHGDLDKK---------------SIWNNQRVLIRKE-KSIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVIL
        ++   +  + Q+D+  AFL+GDLD++               S+  N    ++K    +  A RQW+ KF   +   G+ ++ SDH  F+ KI    F+ +
Subjt:  LATSLDLEVEQMDVKTAFLHGDLDKK---------------SIWNNQRVLIRKE-KSIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVIL

Query:  LLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDED-
        L+YVDDI+I   N + ++ LK QL     ++DLG  K  LG++I R  A   + + Q KY   +L+   +   K      PS   M    + S +   D 
Subjt:  LLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDED-

Query:  MSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSI-LVGYIDSDMAGDLASRKSTSGYLMTFAGGS
        +   +Y   +  LMY+ + TR DI+  V  +++F   PR  H +AV  I+ Y +GT    L +     + L  + D+       +R+ST+GY M      
Subjt:  MSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSI-LVGYIDSDMAGDLASRKSTSGYLMTFAGGS

Query:  VSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQR-YVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRD
        +SW+SK Q+ ++ S+ EAEY A + A  EM+WL  F +EL     +  +++ DN +AIH   NA FH RTK I+   H +R+
Subjt:  VSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQQR-YVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRD

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.4e-1438.6Show/hide
Query:  GIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYIN---TVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYPRCLAGRQMRVTFKSSQHSR
        G+ K+ KG   I KG    SLY +   +     N   T  DE    LWH RL+H+S++G+++L KK +L   K + LK    C+ G+  RV F + QH+ 
Subjt:  GIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYIN---TVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYPRCLAGRQMRVTFKSSQHSR

Query:  KPNVLELVHSDVCG
        K N L+ VHSD+ G
Subjt:  KPNVLELVHSDVCG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.0e-2536.64Show/hide
Query:  LLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDED
        LLLYVDDIL+ G + + +N L  QLS + +MKDLG     LGIQI    +   L++SQ KY E++L    M   K +S+ LP    +    S ST K  D
Subjt:  LLLYVDDILIVGRNVSRINNLKKQLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDED

Query:  MSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILV-GYIDSDMAGDLASRKSTSGYLMTFAGGS
         S   + S V +L Y +  TRPDI++ V +V + M  P    ++ +K ++RY +GT    L       + V  + DSD AG  ++R+ST+G+        
Subjt:  MSKVSYASAVESLMYVMVCTRPDIAHVVGVVNRFMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILV-GYIDSDMAGDLASRKSTSGYLMTFAGGS

Query:  VSWQSKLQKCIALSTTEAEYIAAAEACKEMLW
        +SW +K Q  ++ S+TE EY A A    E+ W
Subjt:  VSWQSKLQKCIALSTTEAEYIAAAEACKEMLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-1036.76Show/hide
Query:  EPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLG
        EP+S   A++D     W  AM++E+++L  N T+ LV  P  +  L  KWV+K K     +    KARLV K F Q++GI F E ++PVV+ ++IR +L 
Subjt:  EPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIRVVLG

Query:  LATSLDL-EVEQMDVKTAFLHGDLDKKSIWNNQRVL
        +A  L++ +      K  F  G   KK I  N  VL
Subjt:  LATSLDL-EVEQMDVKTAFLHGDLDKKSIWNNQRVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAATGACGATGATAGTGATGCTGATACAATCACTGTAGCCAATGAAGATTTTTTCATCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACAAAGCAGTTG
GGTGATTGATAATGGTGCATCAATTCATGGTACTTCGAAGAGAGAATTTTTTGCATCATATTCTCCTAGTGATTTTGGCAACGTTAGGACGGGTAATGACGGATCAGCAA
ATGCAGTTGTCATCGAAGATGTATACTTGAAGAACAGAAATGGTTCTAGGTTGATTTTGAAAAATGTGAAACATATTCCTGATACTCGCATGAACTTGATTTCTACAGGT
AAGCTTGATGACGAAGGTTTCTGCAATACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATAGTTATAGCAAAGGGACAAAATTTTTCTTCACTTTACTACAT
GGATGCAAAAATCATGGATTCTTATATAAATACAGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGTCATATAAGTGAGAAAGGTTTAAAGATTTTAA
CCAAAAAAAGTTATCTTCCTGATTTAAAGAGTACACCTCTAAAACGGTATCCTCGTTGTTTGGCAGGAAGGCAGATGAGAGTTACCTTTAAATCATCTCAGCATTCAAGG
AAGCCAAATGTACTAGAGTTGGTACATTCTGATGTGTGTGGTCCCGTGAAAACAAAGTCGCTTGGGAGTGCTTTGTATTTTGTGACATTTACTAATGATCATTCAAGGAA
AATATGGATTTACACCTTGAAGACTAAAGATCAAGTGTTGCAAGCGTTTAAACAGTTTTATGCCTTTGTTGAGAGAGAAACTGGTGAAAAGCTCAAGTGTGTTAGAACTG
ATAATGGAGATAAGGATATATCTTACAGTCACCTACGTGTCTTTGGTTCTAAAGCTTTTGTTCATGTACCCAAAGATGAGAGATCAAAGCTTGATGCAAAAACTAAAGCA
TGTGTGTTTCTTGGTTATGGCCAAGATGAGTTTGGTTATAGATTATATGATCCAATTAACAAAAAACTTATAAGAAGTCGAGATGTTGTATTTGTTGAAGAGCAAACAAT
AGCATACATTGAGAAAATAGATGAACCAGAGTCTAAACATAGTGATATTCTGATTGATTTGAGCTCAACCTCTTTGACACGGTCTTCTACACAGATAGAAGATGAGGTTC
AAAATGAACAGTTTTTTGATACATATGAGAGTTCTGAGCAGTTAGTTGAAACAGTTGTTCCTACAGATGTTTCACTCAGGAGATCTGTTAGAGATCGACGTCCGTCAACA
AGATATTCACCTAATGAATATTTGTTATTGACTGATGGGGGAGAACCTGAGAGTTATGAAGAGGCTATAGAGGATGAGCACAAAAATGAGTGGAATGATGCAATGAAAGA
TGAGATGGAGTCTTTACATGCAAACCACACTTTTGAGCTTGTTAAGTTGCCCAAAGGAAAAACTGAACTGAAAAATAAGTGGGTTTACAAAATTAAACATGAAGAACATA
CCTCAAAGCCACATTACAAAGCAAGATTGGTTGTCAAAGAGTTCAGTCAGAAGAAAGGTATCGACTTCGATGAAATTTTTGCTCCAGTTGTCAAGATGTCCTCCATACGT
GTTGTTTTAGGTTTGGCAACCAGTCTTGACTTAGAGGTTGAGCAGATGGATGTTAAGACTGCATTTCTTCATGGGGATTTAGACAAGAAATCTATATGGAACAACCAGAG
GGTTTTGATCAGAAAGGAAAAGAGCATCCCATTTGCACCGAGACAGTGGTACAAGAAGTTTGAATCAGTTATGGGGAAGCAAGGCTACAGAAAAACTACTTCTGATCATT
GTATTTTTGTCCAAAAAATTTATAATGATGATTTTGTTATATTATTACTCTATGTTGATGATATTTTGATTGTTGGCAGGAATGTTTCGAGAATAAATAACTTGAAGAAA
CAGTTGAGCAAATCCCTTGCCATGAAGGATTTGGGGTCAGCAAAGAAAGTTCTTGGAATTCAAATAGTTCGAGACAGAGCATCCAAGAAGTTATACATGTCACAGGAAAA
ATACATAGAGAAAGTACTTGAACGTTTCAAGATGAGTCAAGTAAAATCAGTTAGTTCTTCTTTACCCAGTCACTTCAAAATGATCAGTAAACAGAGCCCTTCTACAAATA
AAGATGAGGATATGAGTAAGGTCTCGTATGCTTCAGCAGTTGAAAGCCTAATGTATGTCATGGTATGTACTAGACCTGATATTGCTCATGTTGTTGGTGTTGTTAATCGT
TTTATGTCTAATCCAAGAAAACAACATTGGGAGGCAGTCAAGTGCATCATGAGATATTCGAGAGGTACTTCCAGTTTGAGGCTTACTTTTGGGGATGGAAAGTCAATACT
TGTTGGGTATATTGATTCAGATATGGCAGGGGACTTAGCCAGCAGAAAGTCTACTTCCGGTTACTTGATGACATTTGCAGGTGGTTCAGTGTCCTGGCAGTCAAAATTGC
AGAAATGTATTGCCCTTTCTACAACTGAAGCAGAGTATATTGCAGCAGCAGAAGCATGTAAAGAGATGTTGTGGTTGAAGCACTTTGTAAAAGAGCTTGGCTTCAAGCAA
CAACGATATGTGATATACTATGACAATCAGAGTGCTATTCACTTTGGTCAGAATGCTTCATTTCATTCCAGAACAAAGGATATTGATGTGAGATATCACTGGCTTAGAGA
TGCTTTAAATGATGAATTGTTTGAGCTTGAGAAAATACACACTGATCATAATGGATCTGATATGTTGACAAAGAACAAACCAAGATCAAAGCTTGAGTTATATCGCTCCA
CAGTGGGAATGACAACTTCATCCTCAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAATGACGATGATAGTGATGCTGATACAATCACTGTAGCCAATGAAGATTTTTTCATCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACAAAGCAGTTG
GGTGATTGATAATGGTGCATCAATTCATGGTACTTCGAAGAGAGAATTTTTTGCATCATATTCTCCTAGTGATTTTGGCAACGTTAGGACGGGTAATGACGGATCAGCAA
ATGCAGTTGTCATCGAAGATGTATACTTGAAGAACAGAAATGGTTCTAGGTTGATTTTGAAAAATGTGAAACATATTCCTGATACTCGCATGAACTTGATTTCTACAGGT
AAGCTTGATGACGAAGGTTTCTGCAATACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATAGTTATAGCAAAGGGACAAAATTTTTCTTCACTTTACTACAT
GGATGCAAAAATCATGGATTCTTATATAAATACAGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGTCATATAAGTGAGAAAGGTTTAAAGATTTTAA
CCAAAAAAAGTTATCTTCCTGATTTAAAGAGTACACCTCTAAAACGGTATCCTCGTTGTTTGGCAGGAAGGCAGATGAGAGTTACCTTTAAATCATCTCAGCATTCAAGG
AAGCCAAATGTACTAGAGTTGGTACATTCTGATGTGTGTGGTCCCGTGAAAACAAAGTCGCTTGGGAGTGCTTTGTATTTTGTGACATTTACTAATGATCATTCAAGGAA
AATATGGATTTACACCTTGAAGACTAAAGATCAAGTGTTGCAAGCGTTTAAACAGTTTTATGCCTTTGTTGAGAGAGAAACTGGTGAAAAGCTCAAGTGTGTTAGAACTG
ATAATGGAGATAAGGATATATCTTACAGTCACCTACGTGTCTTTGGTTCTAAAGCTTTTGTTCATGTACCCAAAGATGAGAGATCAAAGCTTGATGCAAAAACTAAAGCA
TGTGTGTTTCTTGGTTATGGCCAAGATGAGTTTGGTTATAGATTATATGATCCAATTAACAAAAAACTTATAAGAAGTCGAGATGTTGTATTTGTTGAAGAGCAAACAAT
AGCATACATTGAGAAAATAGATGAACCAGAGTCTAAACATAGTGATATTCTGATTGATTTGAGCTCAACCTCTTTGACACGGTCTTCTACACAGATAGAAGATGAGGTTC
AAAATGAACAGTTTTTTGATACATATGAGAGTTCTGAGCAGTTAGTTGAAACAGTTGTTCCTACAGATGTTTCACTCAGGAGATCTGTTAGAGATCGACGTCCGTCAACA
AGATATTCACCTAATGAATATTTGTTATTGACTGATGGGGGAGAACCTGAGAGTTATGAAGAGGCTATAGAGGATGAGCACAAAAATGAGTGGAATGATGCAATGAAAGA
TGAGATGGAGTCTTTACATGCAAACCACACTTTTGAGCTTGTTAAGTTGCCCAAAGGAAAAACTGAACTGAAAAATAAGTGGGTTTACAAAATTAAACATGAAGAACATA
CCTCAAAGCCACATTACAAAGCAAGATTGGTTGTCAAAGAGTTCAGTCAGAAGAAAGGTATCGACTTCGATGAAATTTTTGCTCCAGTTGTCAAGATGTCCTCCATACGT
GTTGTTTTAGGTTTGGCAACCAGTCTTGACTTAGAGGTTGAGCAGATGGATGTTAAGACTGCATTTCTTCATGGGGATTTAGACAAGAAATCTATATGGAACAACCAGAG
GGTTTTGATCAGAAAGGAAAAGAGCATCCCATTTGCACCGAGACAGTGGTACAAGAAGTTTGAATCAGTTATGGGGAAGCAAGGCTACAGAAAAACTACTTCTGATCATT
GTATTTTTGTCCAAAAAATTTATAATGATGATTTTGTTATATTATTACTCTATGTTGATGATATTTTGATTGTTGGCAGGAATGTTTCGAGAATAAATAACTTGAAGAAA
CAGTTGAGCAAATCCCTTGCCATGAAGGATTTGGGGTCAGCAAAGAAAGTTCTTGGAATTCAAATAGTTCGAGACAGAGCATCCAAGAAGTTATACATGTCACAGGAAAA
ATACATAGAGAAAGTACTTGAACGTTTCAAGATGAGTCAAGTAAAATCAGTTAGTTCTTCTTTACCCAGTCACTTCAAAATGATCAGTAAACAGAGCCCTTCTACAAATA
AAGATGAGGATATGAGTAAGGTCTCGTATGCTTCAGCAGTTGAAAGCCTAATGTATGTCATGGTATGTACTAGACCTGATATTGCTCATGTTGTTGGTGTTGTTAATCGT
TTTATGTCTAATCCAAGAAAACAACATTGGGAGGCAGTCAAGTGCATCATGAGATATTCGAGAGGTACTTCCAGTTTGAGGCTTACTTTTGGGGATGGAAAGTCAATACT
TGTTGGGTATATTGATTCAGATATGGCAGGGGACTTAGCCAGCAGAAAGTCTACTTCCGGTTACTTGATGACATTTGCAGGTGGTTCAGTGTCCTGGCAGTCAAAATTGC
AGAAATGTATTGCCCTTTCTACAACTGAAGCAGAGTATATTGCAGCAGCAGAAGCATGTAAAGAGATGTTGTGGTTGAAGCACTTTGTAAAAGAGCTTGGCTTCAAGCAA
CAACGATATGTGATATACTATGACAATCAGAGTGCTATTCACTTTGGTCAGAATGCTTCATTTCATTCCAGAACAAAGGATATTGATGTGAGATATCACTGGCTTAGAGA
TGCTTTAAATGATGAATTGTTTGAGCTTGAGAAAATACACACTGATCATAATGGATCTGATATGTTGACAAAGAACAAACCAAGATCAAAGCTTGAGTTATATCGCTCCA
CAGTGGGAATGACAACTTCATCCTCAAAGTGA
Protein sequenceShow/hide protein sequence
MKNDDDSDADTITVANEDFFILSDGDVVNLATQQSSWVIDNGASIHGTSKREFFASYSPSDFGNVRTGNDGSANAVVIEDVYLKNRNGSRLILKNVKHIPDTRMNLISTG
KLDDEGFCNTFDNGIWKLTKGSIVIAKGQNFSSLYYMDAKIMDSYINTVNDEANVELWHKRLSHISEKGLKILTKKSYLPDLKSTPLKRYPRCLAGRQMRVTFKSSQHSR
KPNVLELVHSDVCGPVKTKSLGSALYFVTFTNDHSRKIWIYTLKTKDQVLQAFKQFYAFVERETGEKLKCVRTDNGDKDISYSHLRVFGSKAFVHVPKDERSKLDAKTKA
CVFLGYGQDEFGYRLYDPINKKLIRSRDVVFVEEQTIAYIEKIDEPESKHSDILIDLSSTSLTRSSTQIEDEVQNEQFFDTYESSEQLVETVVPTDVSLRRSVRDRRPST
RYSPNEYLLLTDGGEPESYEEAIEDEHKNEWNDAMKDEMESLHANHTFELVKLPKGKTELKNKWVYKIKHEEHTSKPHYKARLVVKEFSQKKGIDFDEIFAPVVKMSSIR
VVLGLATSLDLEVEQMDVKTAFLHGDLDKKSIWNNQRVLIRKEKSIPFAPRQWYKKFESVMGKQGYRKTTSDHCIFVQKIYNDDFVILLLYVDDILIVGRNVSRINNLKK
QLSKSLAMKDLGSAKKVLGIQIVRDRASKKLYMSQEKYIEKVLERFKMSQVKSVSSSLPSHFKMISKQSPSTNKDEDMSKVSYASAVESLMYVMVCTRPDIAHVVGVVNR
FMSNPRKQHWEAVKCIMRYSRGTSSLRLTFGDGKSILVGYIDSDMAGDLASRKSTSGYLMTFAGGSVSWQSKLQKCIALSTTEAEYIAAAEACKEMLWLKHFVKELGFKQ
QRYVIYYDNQSAIHFGQNASFHSRTKDIDVRYHWLRDALNDELFELEKIHTDHNGSDMLTKNKPRSKLELYRSTVGMTTSSSK