; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0003152 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0003152
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr03:7927341..7934619
RNA-Seq ExpressionIVF0003152
SyntenyIVF0003152
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]1.29e-29445.69Show/hide
Query:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVA                                                        AKEAW+ LE AYEGTSKVKISRLQLI
Subjt:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI

Query:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS
        T KFEAL+M+EDESVS YN+ VL+IANES LLGEKIP+SKIV KVL S+SRKFDMKVT IEEAHDITTLKLDELFGSLLTFEMA +DRE+ KGKG++FKS
Subjt:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS

Query:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA
        T+  +      D +ANMDE IALLTKQF+  +R  +                                                 +CPTFLR+QKKN+  
Subjt:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA

Query:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK
        TLSDE++ D  +DD ++NAF    T+ +  D+S CS E  N +LS E+LK L KED +AR IQKE IQDL+EENE LMSVI+SLKLKL+EVQNE DQ +K
Subjt:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK

Query:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------
         V MLNSG +NLDSILK+G N S +YGLGF +S SS  +TSE+                                                         
Subjt:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------

Query:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI
                                                 R M GN S+F  L +CV G+VTFGDGA+G+IIAKGNI+K++LP LNDVRYVDGLKANLI
Subjt:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI

Query:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------
        S++QL  QGY V+F   GCVV +K+NQ+ M G+RQ D CYH                                                           
Subjt:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------

Query:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV
                             LKGK+D V+IC  LCL LQ EK KKI +IRSDH                             EQGIFLGYSQNS+AYRV
Subjt:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV

Query:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------
        +NNRS +V+ETINV++ND +S+ K+  DE+DET NM               +SS  PA  +  D       +  DK D    +                 
Subjt:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------

Query:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD
                           NN   LVAQGY QVEGVDFDE FALVARLEAIRL+L G    +K                      + Q++ K        
Subjt:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD

Query:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL
            SAFLNGYL  EVYVAQP+GF+D ++P+HVYKLNKALYGLKQAPRAWY RLT+YL  +GYSRG  DKTLFIHR  DQL+VAQIYVDDIIF GFPQDL
Subjt:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL

Query:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR
        VNNFI+IM+SEFEMSMVGELSCFLGLQIKQK++ IFISQEK                                  SI+GSLLYLTASR DIAY +GICAR
Subjt:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR

Query:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK
        YQ DPRI+HLE VKRILKYVHG SDFG++Y Y+TT  L+GY+D DW GS+DD+K
Subjt:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK

KAA0053200.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.11e-29945.89Show/hide
Query:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL
        AKEAW+ L+  YEGTSKVKI+RLQLIT KFEAL+M E+ESVS YN+ VLEI NES LL EKIP+SKIV KVL SL RKFDMKV  IEEAHDITTLKLDEL
Subjt:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL

Query:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF
        FG LLTFEMA +D EN K KG+ FKST+  E      D EANMDE             R+ +  +                 R  DY KKKEG+ + FR 
Subjt:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF

Query:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN
        RE GGVGHYQA+CPTF R+QKKN+  TLSDE+  D+ DD+ ++NAFT   T+ +  DDS CS E  N +L  E+L+ L KED +AR IQKERIQDL+EEN
Subjt:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN

Query:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--
        ERLMSVI+SLKLKL+EVQNE DQ +K   MLNSGTENLDSILK+G N S ++GLGF AS SS  +TSE++ +  +     +     +G  T    FG   
Subjt:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--

Query:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----
           G +G   +IIAK NID ++LP LNDVRYVDGLKANLIS+SQL  QGY V+F   GC++  + +Q  +  R+         GK    K  +G+     
Subjt:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----

Query:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---
             +  C+ GK+     S  GK +           + Y  Y  KWD +SEQGIFLGYSQNSR Y V+NNRS +V+ETINV++ND +S  K+  D+   
Subjt:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---

Query:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN
                                                                  +E L         + S      V+       +K D  GC+T 
Subjt:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN

Query:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ
        NKARLVAQGY QVEGVDFDE FA VARLE IRL+L G    +K           LY                        ++  S FLNGYL EEVYVAQ
Subjt:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ

Query:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL
        P+GF+DS++P+HVYKLNKALYGLKQA +AWY+RLT+                                                      EFEMSMVGEL
Subjt:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL

Query:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA
        SCFLGLQIKQK++GIFISQEK+A+++VKKFG        T AATHVK+ KD +G   D+KLY+                            +PRI+HLEA
Subjt:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA

Query:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD
        VKRILKYVH TSDFG++Y YDTT  L+GY DADW GS+DD KSTS G                                S CTQLIWMKNML EYGF QD
Subjt:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD

Query:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLRT----------------------------
         MTLY DN+SAIDISKN VQHSRT HIDIR+HFI ELVE+K +  DHI SNLQLAD F+KPLDA++FE+LR                             
Subjt:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLRT----------------------------

Query:  -----------------------HASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI
                               HA    E  + D+DSDD D+VLL  LLKK + P     +P+ P  +IH QESSSIEGVF+PT G     + +  P+I
Subjt:  -----------------------HASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI

Query:  P
        P
Subjt:  P

KAA0059924.1 Peptidase aspartic, catalytic [Cucumis melo var. makuwa]0.094.74Show/hide
Query:  VAAKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLD
        + AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLD
Subjt:  VAAKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLD

Query:  ELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFS--KVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGR
        ELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQ       RR+        E SN         RSGDYGKKKEGEGR
Subjt:  ELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFS--KVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGR

Query:  FFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLM
        FFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLM
Subjt:  FFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLM

Query:  EENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV----RRMIGNGSFFSELKECVSGYVTFG
        EENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV    RRMIGNGSFFSELKECVSGYVTFG
Subjt:  EENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV----RRMIGNGSFFSELKECVSGYVTFG

Query:  DGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKI
        DGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKI
Subjt:  DGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKI

Query:  IKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEV
        IKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEV
Subjt:  IKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEV

Query:  LKVDAQAD
        LKVDAQAD
Subjt:  LKVDAQAD

TYK00141.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.57e-30045.89Show/hide
Query:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL
        AKEAW+ L+  YEGTSKVKI+RLQLIT KFEAL+M E+ESVS YN+ VLEI NES LL EKIP+SKIV KVL SL RKFDMKV  IEEAHDITTLKLDEL
Subjt:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL

Query:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF
        FG LLTFEMA +D EN K KG+ FKST+  E      D EANMDE             R+ +  +                 R  DY KKKEG+ + FR 
Subjt:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF

Query:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN
        RE GGVGHYQA+CPTF R+QKKN+  TLSDE+  D+ DD+ ++NAFT   T+ +  DDS CS E  N +L  E+L+ L KED +AR IQKERIQDL+EEN
Subjt:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN

Query:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--
        ERLMSVI+SLKLKL+EVQNE DQ +K   MLNSGTENLDSILK+G N S ++GLGF AS SS  +TSE++ +  +     +     +G  T    FG   
Subjt:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--

Query:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----
           G +G   +IIAK NID ++LP LNDVRYVDGLKANLIS+SQL  QGY V+F   GC++  + +Q  +  R+         GK    K  +G+     
Subjt:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----

Query:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---
             +  C+ GK+     S  GK +           + Y  Y  KWD +SEQGIFLGYSQNSR Y V+NNRS +V+ETINV++ND +S  K+  D+   
Subjt:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---

Query:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN
                                                                  +E L         + S      V+       +K DE GC+T 
Subjt:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN

Query:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ
        NKARLVAQGY QVEGVDFDE FA VARLE IRL+L G    +K           LY                        ++  S FLNGYL EEVYVAQ
Subjt:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ

Query:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL
        P+GF+DS++P+HVYKLNK+LYGLKQA +AWY+RLT+                                                      EFEMSMVGEL
Subjt:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL

Query:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA
        SCFLGLQIKQK++GIFISQEK+A+++VKKFG        T AATHVK+ KD +G   D+KLY+                            +PRI+HLEA
Subjt:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA

Query:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD
        VKRILKYVH TSDFG++Y YDTT  L+GY DADW GS+DD KSTS G                                S CTQLIWMKNML EYGF QD
Subjt:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD

Query:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLRT----------------------------
         MTLY DN+SAIDISKN VQHSRT HIDIR+HFI ELVE+K +  DHI SNLQLAD F+KPLDA++FE+LR                             
Subjt:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLRT----------------------------

Query:  -----------------------HASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI
                               HA    E  + D+DSDD D+VLL  LLKK + P     +P+ P  +IH QESSSIEGVF+PT G     + +  P+I
Subjt:  -----------------------HASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI

Query:  P
        P
Subjt:  P

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]1.09e-29445.69Show/hide
Query:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVA                                                        AKEAW+ LE AYEGTSKVKISRLQLI
Subjt:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI

Query:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS
        T KFEAL+M+EDESVS YN+ VL+IANES LLGEKIP+SKIV KVL S+SRKFDMKVT IEEAHDITTLKLDELFGSLLTFEMA +DRE+ KGKG++FKS
Subjt:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS

Query:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA
        T+  +      D +ANMDE IALLTKQF+  +R  +                                                 +CPTFLR+QKKN+  
Subjt:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA

Query:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK
        TLSDE++ D  +DD ++NAF    T+ +  D+S CS E  N +LS E+LK L KED +AR IQKE IQDL+EENE LMSVI+SLKLKL+EVQNE DQ +K
Subjt:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK

Query:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------
         V MLNSG +NLDSILK+G N S +YGLGF +S SS  +TSE+                                                         
Subjt:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------

Query:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI
                                                 R M GN S+F  L +CV G+VTFGDGA+G+IIAKGNI+K++LP LNDVRYVDGLKANLI
Subjt:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI

Query:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------
        S++QL  QGY V+F   GCVV +K+NQ+ M G+RQ D CYH                                                           
Subjt:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------

Query:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV
                             LKGK+D V+IC  LCL LQ EK KKI +IRSDH                             EQGIFLGYSQNS+AYRV
Subjt:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV

Query:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------
        +NNRS +V+ETINV++ND +S+ K+  DE+DET NM               +SS  PA  +  D       +  DK D    +                 
Subjt:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------

Query:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD
                           NN   LVAQGY QVEGVDFDE FALVARLEAIRL+L G    +K                      + Q++ K        
Subjt:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD

Query:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL
            SAFLNGYL  EVYVAQP+GF+D ++P+HVYKLNKALYGLKQAPRAWY RLT+YL  +GYSRG  DKTLFIHR  DQL+VAQIYVDDIIF GFPQDL
Subjt:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL

Query:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR
        VNNFI+IM+SEFEMSMVGELSCFLGLQIKQK++ IFISQEK                                  SI+GSLLYLTASR DIAY +GICAR
Subjt:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR

Query:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK
        YQ DPRI+HLE VKRILKYVHG SDFG++Y Y+TT  L+GY+D DW GS+DD+K
Subjt:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK

TrEMBL top hitse value%identityAlignment
A0A5A7T169 F5J5.17.8e-25545.69Show/hide
Query:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVA                                                        AKEAW+ LE AYEGTSKVKISRLQLI
Subjt:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI

Query:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS
        T KFEAL+M+EDESVS YN+ VL+IANES LLGEKIP+SKIV KVL S+SRKFDMKVT IEEAHDITTLKLDELFGSLLTFEMA +DRE+ KGKG++FKS
Subjt:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS

Query:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA
        T+  +      D +ANMDE IALLTKQF+  +R  +                                                 +CPTFLR+QKKN+  
Subjt:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA

Query:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK
        TLSDE++ D  +DD ++NAF    T+ +  D+S CS E  N +LS E+LK L KED +AR IQKE IQDL+EENE LMSVI+SLKLKL+EVQNE DQ +K
Subjt:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK

Query:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------
         V MLNSG +NLDSILK+G N S +YGLGF +S SS  +TSE+                                                         
Subjt:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------

Query:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI
                                                 R M GN S+F  L +CV G+VTFGDGA+G+IIAKGNI+K++LP LNDVRYVDGLKANLI
Subjt:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI

Query:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------
        S++QL  QGY V+F   GCVV +K+NQ+ M G+RQ D CYH                                                           
Subjt:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------

Query:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV
                             LKGK+D V+IC  LCL LQ EK KKI +IRSDH                             EQGIFLGYSQNS+AYRV
Subjt:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV

Query:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------
        +NNRS +V+ETINV++ND +S+ K+  DE+DET NM               +SS  PA  +  D       +  DK D    +                 
Subjt:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------

Query:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD
                           NN   LVAQGY QVEGVDFDE FALVARLEAIRL+L G    +K                      + Q++ K        
Subjt:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD

Query:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL
            SAFLNGYL  EVYVAQP+GF+D ++P+HVYKLNKALYGLKQAPRAWY RLT+YL  +GYSRG  DKTLFIHR  DQL+VAQIYVDDIIF GFPQDL
Subjt:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL

Query:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR
        VNNFI+IM+SEFEMSMVGELSCFLGLQIKQK++ IFISQE                                  KSI+GSLLYLTASR DIAY +GICAR
Subjt:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR

Query:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK
        YQ DPRI+HLE VKRILKYVHG SDFG++Y Y+TT  L+GY+D DW GS+DD+K
Subjt:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK

A0A5A7UDB5 Gag-pol polyprotein2.0e-25545.37Show/hide
Query:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL
        AKEAW+ L+  YEGTSKVKI+RLQLIT KFEAL+M E+ESVS YN+ VLEI NES LL EKIP+SKIV KVL SL RKFDMKV  IEEAHDITTLKLDEL
Subjt:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL

Query:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF
        FG LLTFEMA +D EN K KG+ FKST+  E      D EANMDE                +K E                 R  DY KKKEG+ + FR 
Subjt:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF

Query:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN
        RE GGVGHYQA+CPTF R+QKKN+  TLSDE+  D+ DD+ ++NAFT   T+ +  DDS CS E  N +L  E+L+ L KED +AR IQKERIQDL+EEN
Subjt:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN

Query:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--
        ERLMSVI+SLKLKL+EVQNE DQ +K   MLNSGTENLDSILK+G N S ++GLGF AS SS  +TSE++ +  +     +     +G  T    FG   
Subjt:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--

Query:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----
           G +G   +IIAK NID ++LP LNDVRYVDGLKANLIS+SQL  QGY V+F   GC++  + +Q  +  R+         GK    K  +G+     
Subjt:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----

Query:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---
             +  C+ GK   +  S  GK +           + Y  Y  KWD +SEQGIFLGYSQNSR Y V+NNRS +V+ETINV++ND +S  K+  D+   
Subjt:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---

Query:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN
                                                                  +E L         + S      V+       +K D  GC+T 
Subjt:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN

Query:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ
        NKARLVAQGY QVEGVDFDE FA VARLE IRL+L G    +K                      + Q++ K            S FLNGYL EEVYVAQ
Subjt:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ

Query:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL
        P+GF+DS++P+HVYKLNKALYGLKQA +AWY+RLT+                                                      EFEMSMVGEL
Subjt:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL

Query:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA
        SCFLGLQIKQK++GIFISQEK+A+++VKKFG        T AATHVK+ KD +G   D+KLY+                            +PRI+HLEA
Subjt:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA

Query:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD
        VKRILKYVH TSDFG++Y YDTT  L+GY DADW GS+DD KSTS G                                S CTQLIWMKNML EYGF QD
Subjt:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD

Query:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLR-----------------------------
         MTLY DN+SAIDISKN VQHSRT HIDIR+HFI ELVE+K +  DHI SNLQLAD F+KPLDA++FE+LR                             
Subjt:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLR-----------------------------

Query:  ----------------------THASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI
                               HA    E  + D+DSDD D+VLL  LLKK + P     +P+ P  +IH QESSSIEGVF+PT G     + +  P+I
Subjt:  ----------------------THASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI

Query:  PFESNVAHASVPGDVSAAPEVRTDVRSDKNELDPPNPD
        P   +    SV    S  P  + D      E  PP  D
Subjt:  PFESNVAHASVPGDVSAAPEVRTDVRSDKNELDPPNPD

A0A5A7UY03 Peptidase aspartic, catalytic0.0e+0094.74Show/hide
Query:  VAAKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLD
        + AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLD
Subjt:  VAAKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLD

Query:  ELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQ--FSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGR
        ELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQ       RR+        E SN         RSGDYGKKKEGEGR
Subjt:  ELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQ--FSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGR

Query:  FFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLM
        FFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLM
Subjt:  FFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLM

Query:  EENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV----RRMIGNGSFFSELKECVSGYVTFG
        EENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV    RRMIGNGSFFSELKECVSGYVTFG
Subjt:  EENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV----RRMIGNGSFFSELKECVSGYVTFG

Query:  DGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKI
        DGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKI
Subjt:  DGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKI

Query:  IKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEV
        IKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEV
Subjt:  IKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEV

Query:  LKVDAQAD
        LKVDAQAD
Subjt:  LKVDAQAD

A0A5D3BJZ7 Gag-pol polyprotein9.2e-25645.37Show/hide
Query:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL
        AKEAW+ L+  YEGTSKVKI+RLQLIT KFEAL+M E+ESVS YN+ VLEI NES LL EKIP+SKIV KVL SL RKFDMKV  IEEAHDITTLKLDEL
Subjt:  AKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDEL

Query:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF
        FG LLTFEMA +D EN K KG+ FKST+  E      D EANMDE                +K E                 R  DY KKKEG+ + FR 
Subjt:  FGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRF

Query:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN
        RE GGVGHYQA+CPTF R+QKKN+  TLSDE+  D+ DD+ ++NAFT   T+ +  DDS CS E  N +L  E+L+ L KED +AR IQKERIQDL+EEN
Subjt:  REYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDS-SMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEEN

Query:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--
        ERLMSVI+SLKLKL+EVQNE DQ +K   MLNSGTENLDSILK+G N S ++GLGF AS SS  +TSE++ +  +     +     +G  T    FG   
Subjt:  ERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVT----FGD--

Query:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----
           G +G   +IIAK NID ++LP LNDVRYVDGLKANLIS+SQL  QGY V+F   GC++  + +Q  +  R+         GK    K  +G+     
Subjt:  ---GARG---RIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGL-----

Query:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---
             +  C+ GK   +  S  GK +           + Y  Y  KWD +SEQGIFLGYSQNSR Y V+NNRS +V+ETINV++ND +S  K+  D+   
Subjt:  ---CLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDE---

Query:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN
                                                                  +E L         + S      V+       +K DE GC+T 
Subjt:  ---------------------------------------------------------DDETLNM------PVDSSTLPAEVLKVDAQADDKTDEAGCMTN

Query:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ
        NKARLVAQGY QVEGVDFDE FA VARLE IRL+L G    +K                      + Q++ K            S FLNGYL EEVYVAQ
Subjt:  NKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQ

Query:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL
        P+GF+DS++P+HVYKLNK+LYGLKQA +AWY+RLT+                                                      EFEMSMVGEL
Subjt:  PEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL

Query:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA
        SCFLGLQIKQK++GIFISQEK+A+++VKKFG        T AATHVK+ KD +G   D+KLY+                            +PRI+HLEA
Subjt:  SCFLGLQIKQKSEGIFISQEKHAKNIVKKFG--------TSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEA

Query:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD
        VKRILKYVH TSDFG++Y YDTT  L+GY DADW GS+DD KSTS G                                S CTQLIWMKNML EYGF QD
Subjt:  VKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQD

Query:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLR-----------------------------
         MTLY DN+SAIDISKN VQHSRT HIDIR+HFI ELVE+K +  DHI SNLQLAD F+KPLDA++FE+LR                             
Subjt:  IMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLR-----------------------------

Query:  ----------------------THASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI
                               HA    E  + D+DSDD D+VLL  LLKK + P     +P+ P  +IH QESSSIEGVF+PT G     + +  P+I
Subjt:  ----------------------THASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSI

Query:  PFESNVAHASVPGDVSAAPEVRTDVRSDKNELDPPNPD
        P   +    SV    S  P  + D      E  PP  D
Subjt:  PFESNVAHASVPGDVSAAPEVRTDVRSDKNELDPPNPD

A0A5D3E2Y4 F5J5.17.8e-25545.69Show/hide
Query:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVA                                                        AKEAW+ LE AYEGTSKVKISRLQLI
Subjt:  MIFFIKTLNGKAWRVLVA--------------------------------------------------------AKEAWRILEFAYEGTSKVKISRLQLI

Query:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS
        T KFEAL+M+EDESVS YN+ VL+IANES LLGEKIP+SKIV KVL S+SRKFDMKVT IEEAHDITTLKLDELFGSLLTFEMA +DRE+ KGKG++FKS
Subjt:  TLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNSKIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKS

Query:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA
        T+  +      D +ANMDE IALLTKQF+  +R  +                                                 +CPTFLR+QKKN+  
Subjt:  TYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFKSILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHA

Query:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK
        TLSDE++ D  +DD ++NAF    T+ +  D+S CS E  N +LS E+LK L KED +AR IQKE IQDL+EENE LMSVI+SLKLKL+EVQNE DQ +K
Subjt:  TLSDEDT-DDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKARAIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIK

Query:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------
         V MLNSG +NLDSILK+G N S +YGLGF +S SS  +TSE+                                                         
Subjt:  YVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEV---------------------------------------------------------

Query:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI
                                                 R M GN S+F  L +CV G+VTFGDGA+G+IIAKGNI+K++LP LNDVRYVDGLKANLI
Subjt:  -----------------------------------------RRMIGNGSFFSELKECVSGYVTFGDGARGRIIAKGNIDKNNLPCLNDVRYVDGLKANLI

Query:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------
        S++QL  QGY V+F   GCVV +K+NQ+ M G+RQ D CYH                                                           
Subjt:  SVSQL--QGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYH-----------------------------------------------------------

Query:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV
                             LKGK+D V+IC  LCL LQ EK KKI +IRSDH                             EQGIFLGYSQNS+AYRV
Subjt:  ---------------------LKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDNEDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRV

Query:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------
        +NNRS +V+ETINV++ND +S+ K+  DE+DET NM               +SS  PA  +  D       +  DK D    +                 
Subjt:  FNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPV-------------DSSTLPAEVLKVD------AQADDKTDEAGCMT----------------

Query:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD
                           NN   LVAQGY QVEGVDFDE FALVARLEAIRL+L G    +K                      + Q++ K        
Subjt:  -------------------NNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKD

Query:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL
            SAFLNGYL  EVYVAQP+GF+D ++P+HVYKLNKALYGLKQAPRAWY RLT+YL  +GYSRG  DKTLFIHR  DQL+VAQIYVDDIIF GFPQDL
Subjt:  INSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDL

Query:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR
        VNNFI+IM+SEFEMSMVGELSCFLGLQIKQK++ IFISQE                                  KSI+GSLLYLTASR DIAY +GICAR
Subjt:  VNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICAR

Query:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK
        YQ DPRI+HLE VKRILKYVHG SDFG++Y Y+TT  L+GY+D DW GS+DD+K
Subjt:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.2e-6433.47Show/hide
Query:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG
        K +E G     KARLVA+G+ Q   +D++E FA VAR+ + R +L                              ++Q N K  Q     ++  +AFLNG
Subjt:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG

Query:  YLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFI--HRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIM
         LKEE+Y+  P+G   S    +V KLNKA+YGLKQA R W+E     L    +     D+ ++I      ++ I   +YVDD++        +NNF   +
Subjt:  YLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFI--HRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIM

Query:  KSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHV------KINKDNDGARADYKL-YKSIIGSLLY-LTASRPDIAYVIGICAR
          +F M+ + E+  F+G++I+ + + I++SQ  + K I+ KF       V      KIN +   +  D     +S+IG L+Y +  +RPD+   + I +R
Subjt:  KSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHV------KINKDNDGARADYKL-YKSIIGSLLY-LTASRPDIAYVIGICAR

Query:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTT--SILIGYYDADWTGSSDDKKSTSEGCFFLENY-LILWFSKKQNCVSLSTTEAEYIATRSACTQL
        Y +       + +KR+L+Y+ GT D  +++  +    + +IGY D+DW GS  D+KST+   F + ++ LI W +K+QN V+ S+TEAEY+A   A  + 
Subjt:  YQADPRISHLEAVKRILKYVHGTSDFGILYFYDTT--SILIGYYDADWTGSSDDKKSTSEGCFFLENY-LILWFSKKQNCVSLSTTEAEYIATRSACTQL

Query:  IWMKNMLDEYGF-TQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLR
        +W+K +L       ++ + +Y DN   I I+ N   H R KHIDI+YHF  E V+N  I  ++I +  QLAD F+KPL A  F  LR
Subjt:  IWMKNMLDEYGF-TQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEHLR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-7436.27Show/hide
Query:  KARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQP
        KARLV +G+ Q +G+DFDE+F+ V ++ +IR +L          AA              LD+ + QL+ K            +AFL+G L+EE+Y+ QP
Subjt:  KARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQP

Query:  EGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHR-THDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL
        EGF  +     V KLNK+LYGLKQAPR WY +   ++  + Y +  +D  ++  R + +  I+  +YVDD++  G  + L+      +   F+M  +G  
Subjt:  EGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHR-THDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGEL

Query:  SCFLGLQI--KQKSEGIFISQEKHAKNIVKKF--------GTSAATHVKINKDNDGARADYK------LYKSIIGSLLY-LTASRPDIAYVIGICARYQA
           LG++I  ++ S  +++SQEK+ + ++++F         T  A H+K++K       + K       Y S +GSL+Y +  +RPDIA+ +G+ +R+  
Subjt:  SCFLGLQI--KQKSEGIFISQEKHAKNIVKKF--------GTSAATHVKINKDNDGARADYK------LYKSIIGSLLY-LTASRPDIAYVIGICARYQA

Query:  DPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNM
        +P   H EAVK IL+Y+ GT+    L F  +  IL GY DAD  G  D++KS++   F      I W SK Q CV+LSTTEAEYIA      ++IW+K  
Subjt:  DPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNM

Query:  LDEYGFTQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFE
        L E G  Q    +Y D+ SAID+SKN + H+RTKHID+RYH+I E+V+++ +    I +N   AD  +K +  N FE
Subjt:  LDEYGFTQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFE

P25600 Putative transposon Ty5-1 protein YCL074W3.6e-3130.49Show/hide
Query:  NSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNN
        ++AFLN  + E +YV QP GF++ + P +V++L   +YGLKQAP  W E +   L   G+ R   +  L+   T D  I   +YVDD++       + + 
Subjt:  NSAFLNGYLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNN

Query:  FIDIMKSEFEMSMVGELSCFLGLQIKQKSEG-IFIS-QEKHAK-------NIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLY-LTASRPDIAY
            +   + M  +G++  FLGL I Q S G I +S Q+  AK       N  K   T       + +       D   Y+SI+G LL+     RPDI+Y
Subjt:  FIDIMKSEFEMSMVGELSCFLGLQIKQKSEG-IFIS-QEKHAK-------NIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLY-LTASRPDIAY

Query:  VIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKK-QNCVSLSTTEAEYIATRS
         + + +R+  +PR  HLE+ +R+L+Y++ T    + Y   +   L  Y DA      D   ST      L    + W SKK +  + + +TEAEYI    
Subjt:  VIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKK-QNCVSLSTTEAEYIATRS

Query:  ACTQL
           ++
Subjt:  ACTQL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.3e-7235.42Show/hide
Query:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG
        K +  G +   KARLVA+GY Q  G+D+ E F+ V +  +IR+VL           A D+        I  LD+                   N+AFL G
Subjt:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG

Query:  YLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKS
         L ++VY++QP GFID   P +V KL KALYGLKQAPRAWY  L  YL   G+    +D +LF+ +    ++   +YVDDI+  G    L++N +D +  
Subjt:  YLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKS

Query:  EFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIV--------KKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQ
         F +    EL  FLG++ K+   G+ +SQ ++  +++        K   T  A   K++  +     D   Y+ I+GSL YL  +RPDI+Y +   +++ 
Subjt:  EFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIV--------KKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQ

Query:  ADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKN
          P   HL+A+KRIL+Y+ GT + GI      T  L  Y DADW G  DD  ST+    +L ++ I W SKKQ  V  S+TEAEY +  +  +++ W+ +
Subjt:  ADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKN

Query:  MLDEYGF-TQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEH
        +L E G        +Y DN+ A  +  N V HSR KHI I YHFI   V++  +   H+ ++ QLAD  +KPL    F++
Subjt:  MLDEYGF-TQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.1e-7234.79Show/hide
Query:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG
        K +  G +   KARLVA+GY Q  G+D+ E F+ V +  +IR+VL           A D+        I  LD+                   N+AFL G
Subjt:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG

Query:  YLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKS
         L +EVY++QP GF+D   P +V +L KA+YGLKQAPRAWY  L  YL   G+    +D +LF+ +    +I   +YVDDI+  G    L+ + +D +  
Subjt:  YLKEEVYVAQPEGFIDSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKS

Query:  EFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIV--------KKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQ
         F +    +L  FLG++ K+  +G+ +SQ ++  +++        K   T  AT  K+   +     D   Y+ I+GSL YL  +RPD++Y +   ++Y 
Subjt:  EFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIV--------KKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQ

Query:  ADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKN
          P   H  A+KR+L+Y+ GT D GI      T  L  Y DADW G +DD  ST+    +L ++ I W SKKQ  V  S+TEAEY +  +  ++L W+ +
Subjt:  ADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKN

Query:  MLDEYGF-TQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEH
        +L E G        +Y DN+ A  +  N V HSR KHI + YHFI   V++  +   H+ ++ QLAD  +KPL    F++
Subjt:  MLDEYGF-TQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNLQLADNFSKPLDANTFEH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.9e-5833.04Show/hide
Query:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG
        K +  G +   KARLVA+GY Q EG+DF E F+ V +L +++L+L                 A+    ++ LDI                   ++AFLNG
Subjt:  KTDEAGCMTNNKARLVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNG

Query:  YLKEEVYVAQPEGFI----DSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFID
         L EE+Y+  P G+     DS  P  V  L K++YGLKQA R W+ + ++ L   G+ +  +D T F+  T    +   +YVDDII        V+    
Subjt:  YLKEEVYVAQPEGFI----DSQYPQHVYKLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFID

Query:  IMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKD--------NDGARADYKLYKSIIGSLLYLTASRPDIAYVIGIC
         +KS F++  +G L  FLGL+I + + GI I Q K+A +++ + G        +  D        + G   D K Y+ +IG L+YL  +R DI++ +   
Subjt:  IMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFGTSAATHVKINKD--------NDGARADYKLYKSIIGSLLYLTASRPDIAYVIGIC

Query:  ARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLI
        +++   PR++H +AV +IL Y+ GT   G+ Y       L  + DA +    D ++ST+  C FL   LI W SKKQ  VS S+ EAEY A   A  +++
Subjt:  ARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLI

Query:  WMKNMLDEYGFTQDIMT-LYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGE
        W+     E        T L+ DN +AI I+ N V H RTKHI+   H + E
Subjt:  WMKNMLDEYGFTQDIMT-LYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.2e-0728.89Show/hide
Query:  LYLTASRPDIAYVIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWF
        +YLT +RPD+ + +   +++ +  R + ++AV ++L YV GT   G+ Y   +   L  + D+DW    D ++S +  C    + + LWF
Subjt:  LYLTASRPDIAYVIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWF

ATMG00810.1 DNA/RNA polymerases superfamily protein1.3e-3134.84Show/hide
Query:  IYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFG------TSAATHVKINKDNDGAR-ADYKLYKSII
        +YVDDI+  G    L+N  I  + S F M  +G +  FLG+QIK    G+F+SQ K+A+ I+   G       S    +K+N     A+  D   ++SI+
Subjt:  IYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAKNIVKKFG------TSAATHVKINKDNDGAR-ADYKLYKSII

Query:  GSLLYLTASRPDIAYVIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCV
        G+L YLT +RPDI+Y + I  +   +P ++  + +KR+L+YV GT   G+    ++   +  + D+DW G +  ++ST+  C FL   +I W +K+Q  V
Subjt:  GSLLYLTASRPDIAYVIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKKSTSEGCFFLENYLILWFSKKQNCV

Query:  SLSTTEAEYIATRSACTQLIW
        S S+TE EY A      +L W
Subjt:  SLSTTEAEYIATRSACTQLIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATAATCAGAGAGGGACCATCAGCTTCTCGTCCTCCTGTACTTGATGGAAAAAATTACTCATATTGGAAGCCTCGTATGATATTTTTTATTAAAACATTAAATGG
AAAAGCTTGGAGAGTCCTTGTTGCTGCCAAAGAAGCATGGAGAATATTGGAATTTGCCTATGAAGGGACTTCTAAAGTCAAGATATCCAGATTACAACTGATAACATTGA
AATTTGAAGCATTGAAAATGTCTGAAGATGAGTCGGTTTCTAAGTACAATGAGGGAGTTCTGGAGATTGCTAATGAATCTCCGTTGCTTGGTGAAAAAATCCCTAATTCT
AAGATTGTGTGGAAAGTGCTTTGGTCTCTATCGAGGAAGTTTGACATGAAGGTCACTACCATAGAGGAAGCACATGATATTACTACATTAAAACTTGATGAGTTATTTGG
GTCCCTACTTACATTTGAAATGGCTATATCTGATAGAGAGAACAATAAAGGCAAGGGGGTCACATTTAAATCCACATATGAAGAAGAGGCAACAGTGAATCATTCTGATA
ATGAAGCTAACATGGATGAGTTAATAGCTCTACTGACAAAGCAGTTCTCTAAAGTGGTCAGAAGATTTCAAAAATATGAATACTACAGGATTGAATGCTCAAACTTCAAA
TCAATATTGAAGAAAAGATGGAGGAGCGGTGATTATGGAAAGAAAAAGGAAGGAGAAGGAAGGTTTTTCAGATTTAGAGAATATGGTGGAGTTGGTCATTACCAAGCTAA
ATGTCCCACATTTTTAAGAAGACAAAAGAAAAATTATCATGCTACCTTGTCAGATGAAGACACTGATGATACTGAAGATGATAGTAGCATGAATGCCTTCACAACATGCT
TTACCGAAATTGATCTCATAGACGATAGTGGGTGTTCTAATGAAGATGGTAATAAAGATCTAAGTTTTGAAGAACTCAAGATGTTGAGGAAAGAAGACATTAAAGCTAGA
GCAATTCAAAAAGAGAGAATTCAAGATCTGATGGAAGAAAATGAACGGTTGATGTCAGTCATAGCATCTCTAAAGCTAAAATTGAAAGAGGTTCAGAATGAGTATGATCA
GACAATTAAATATGTAAACATGTTAAACTCAGGAACTGAAAACTTAGACTCAATACTAAAATCAGGACAGAACAGTTCAAGTAAATATGGTCTTGGTTTTGATGCCTCAG
TAAGTAGTCGTAACTCTACATCTGAAGTAAGACGTATGATTGGTAACGGGTCGTTCTTCTCTGAGTTAAAGGAATGTGTCTCGGGTTATGTTACTTTTGGTGATGGTGCA
AGAGGAAGAATTATAGCTAAAGGAAACATTGATAAAAATAATCTACCCTGTCTAAATGATGTTAGATATGTGGATGGATTAAAGGCGAATTTGATTAGTGTAAGTCAGCT
TCAAGGTTATAGTGTAAATTTTAGCAAAACTGGTTGTGTTGTTACTGACAAAGATAATCAGGTTCTTATGAAGGGTAGGCGACAAGAAGATAAATGTTATCACTTGAAAG
GAAAATCAGATACTGTTAAAATTTGTATCGGTCTATGCTTGAATTTGCAATGTGAAAAAGGGAAAAAGATAATCAAGATCAGAAGTGATCATGGTAAGGAGTTTGATAAT
GAAGATCTAAATAGTTTCTGTCAGTCGGAAGCTTATAAAGAATATCATCGAAAGTGGGATGTGAAATCAGAACAAGGTATCTTTCTTGGATATTCTCAAAATAGCCGAGC
TTATAGAGTCTTTAATAATAGATCTGGTACAGTTATAGAAACGATCAATGTTATGGTAAATGATTTTGAATCAACTACCAAACGAACTTATGATGAGGATGATGAGACTC
TAAATATGCCTGTAGATTCTTCTACGCTTCCTGCAGAAGTACTAAAAGTTGATGCTCAGGCAGATGATAAGACTGATGAAGCAGGGTGTATGACAAATAATAAAGCTCGA
TTAGTTGCTCAAGGTTATGCTCAAGTCGAGGGGGTTGACTTTGATGAAATGTTTGCACTTGTTGCCAGGCTTGAAGCTATTCGCTTAGTGCTTGATGGAACAAAAACATC
AGAGAAACATGCCGCAGCGGAAGATAAGGATCATGCTTTACTTTATATGCATATAAATGATTTAGATATTAACATGTTGCAACTAAACGACAAACCTAAACAAAGAATAG
AGAAGGACATAAATTCCAACAGTGCTTTCCTGAATGGTTATTTAAAAGAAGAAGTCTATGTTGCTCAACCTGAGGGATTTATCGATTCTCAATATCCTCAGCATGTGTAT
AAGCTTAATAAAGCTTTATATGGCCTTAAGCAAGCTCCTAGAGCTTGGTACGAACGATTGACAATTTATTTAAGTTGTAAAGGGTATTCCAGAGGCGGGGCTGACAAAAC
ATTATTTATTCACAGAACACATGATCAACTCATTGTCGCACAAATCTATGTTGATGATATCATTTTTGAGGGGTTTCCTCAAGATCTTGTTAATAACTTCATTGATATCA
TGAAGTCAGAATTTGAGATGAGCATGGTGGGAGAACTATCATGTTTTCTGGGACTTCAAATCAAGCAGAAGAGTGAGGGTATATTCATATCTCAAGAAAAGCATGCCAAG
AACATAGTTAAAAAATTCGGGACTTCAGCTGCAACACACGTTAAAATTAACAAAGATAATGATGGTGCAAGAGCAGATTACAAACTTTATAAAAGCATAATCGGTAGTCT
GTTGTATCTAACTGCCAGTCGACCTGACATTGCTTATGTTATTGGGATATGTGCTCGTTATCAGGCTGATCCTCGAATATCACATTTAGAAGCTGTTAAAAGGATCCTCA
AGTACGTTCATGGAACAAGTGATTTTGGAATTCTGTATTTCTATGACACGACTTCGATTTTGATTGGATATTACGATGCCGATTGGACAGGCTCTTCTGATGATAAGAAA
AGCACCTCTGAAGGCTGTTTCTTTCTTGAGAATTATCTTATCTTATGGTTTAGTAAGAAACAAAATTGTGTTTCCTTGTCTACAACAGAAGCTGAATACATAGCGACAAG
GAGTGCTTGTACCCAGTTGATTTGGATGAAAAATATGTTGGATGAGTATGGATTCACACAAGATATCATGACTTTATATTCTGATAATATGAGTGCCATTGATATATCAA
AAAATCATGTTCAACATAGTCGAACCAAACATATTGATATTAGGTATCATTTTATTGGAGAACTTGTTGAAAATAAGGGTATTACACCGGATCATATTCGATCAAACTTG
CAATTAGCAGATAATTTCTCTAAGCCACTTGATGCAAACACATTTGAGCATTTACGTACACATGCATCGAATGTTCCTGAGACTTTTCTATCTGATATAGATTCAGATGA
CTTGGATGATGTCCTTTTGGCTCAATTGTTGAAGAAGACCACTGTTCCTGAGGTTCCTGTTGTAATGCCTACTGCTCCTTTTATGTCTATTCATTCTCAGGAAAGCTCGT
CCATAGAAGGAGTGTTTGTTCCTACTTTTGGTGTTCATCACACTTTTAATGTTCAACCTGGACCTTCAATTCCATTTGAATCGAATGTTGCTCATGCTTCTGTTCCTGGT
GATGTTTCTGCTGCACCTGAAGTGAGAACTGATGTTCGTAGTGATAAGAATGAGTTGGATCCTCCCAATCCTGACATTCATTCTAAAGAAGTTCCTGTTGATGCTGATAA
TAATCCAAATGTTCCACCTGCGTCACCTGAAGTGCTTGTTGCACCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGATAATCAGAGAGGGACCATCAGCTTCTCGTCCTCCTGTACTTGATGGAAAAAATTACTCATATTGGAAGCCTCGTATGATATTTTTTATTAAAACATTAAATGG
AAAAGCTTGGAGAGTCCTTGTTGCTGCCAAAGAAGCATGGAGAATATTGGAATTTGCCTATGAAGGGACTTCTAAAGTCAAGATATCCAGATTACAACTGATAACATTGA
AATTTGAAGCATTGAAAATGTCTGAAGATGAGTCGGTTTCTAAGTACAATGAGGGAGTTCTGGAGATTGCTAATGAATCTCCGTTGCTTGGTGAAAAAATCCCTAATTCT
AAGATTGTGTGGAAAGTGCTTTGGTCTCTATCGAGGAAGTTTGACATGAAGGTCACTACCATAGAGGAAGCACATGATATTACTACATTAAAACTTGATGAGTTATTTGG
GTCCCTACTTACATTTGAAATGGCTATATCTGATAGAGAGAACAATAAAGGCAAGGGGGTCACATTTAAATCCACATATGAAGAAGAGGCAACAGTGAATCATTCTGATA
ATGAAGCTAACATGGATGAGTTAATAGCTCTACTGACAAAGCAGTTCTCTAAAGTGGTCAGAAGATTTCAAAAATATGAATACTACAGGATTGAATGCTCAAACTTCAAA
TCAATATTGAAGAAAAGATGGAGGAGCGGTGATTATGGAAAGAAAAAGGAAGGAGAAGGAAGGTTTTTCAGATTTAGAGAATATGGTGGAGTTGGTCATTACCAAGCTAA
ATGTCCCACATTTTTAAGAAGACAAAAGAAAAATTATCATGCTACCTTGTCAGATGAAGACACTGATGATACTGAAGATGATAGTAGCATGAATGCCTTCACAACATGCT
TTACCGAAATTGATCTCATAGACGATAGTGGGTGTTCTAATGAAGATGGTAATAAAGATCTAAGTTTTGAAGAACTCAAGATGTTGAGGAAAGAAGACATTAAAGCTAGA
GCAATTCAAAAAGAGAGAATTCAAGATCTGATGGAAGAAAATGAACGGTTGATGTCAGTCATAGCATCTCTAAAGCTAAAATTGAAAGAGGTTCAGAATGAGTATGATCA
GACAATTAAATATGTAAACATGTTAAACTCAGGAACTGAAAACTTAGACTCAATACTAAAATCAGGACAGAACAGTTCAAGTAAATATGGTCTTGGTTTTGATGCCTCAG
TAAGTAGTCGTAACTCTACATCTGAAGTAAGACGTATGATTGGTAACGGGTCGTTCTTCTCTGAGTTAAAGGAATGTGTCTCGGGTTATGTTACTTTTGGTGATGGTGCA
AGAGGAAGAATTATAGCTAAAGGAAACATTGATAAAAATAATCTACCCTGTCTAAATGATGTTAGATATGTGGATGGATTAAAGGCGAATTTGATTAGTGTAAGTCAGCT
TCAAGGTTATAGTGTAAATTTTAGCAAAACTGGTTGTGTTGTTACTGACAAAGATAATCAGGTTCTTATGAAGGGTAGGCGACAAGAAGATAAATGTTATCACTTGAAAG
GAAAATCAGATACTGTTAAAATTTGTATCGGTCTATGCTTGAATTTGCAATGTGAAAAAGGGAAAAAGATAATCAAGATCAGAAGTGATCATGGTAAGGAGTTTGATAAT
GAAGATCTAAATAGTTTCTGTCAGTCGGAAGCTTATAAAGAATATCATCGAAAGTGGGATGTGAAATCAGAACAAGGTATCTTTCTTGGATATTCTCAAAATAGCCGAGC
TTATAGAGTCTTTAATAATAGATCTGGTACAGTTATAGAAACGATCAATGTTATGGTAAATGATTTTGAATCAACTACCAAACGAACTTATGATGAGGATGATGAGACTC
TAAATATGCCTGTAGATTCTTCTACGCTTCCTGCAGAAGTACTAAAAGTTGATGCTCAGGCAGATGATAAGACTGATGAAGCAGGGTGTATGACAAATAATAAAGCTCGA
TTAGTTGCTCAAGGTTATGCTCAAGTCGAGGGGGTTGACTTTGATGAAATGTTTGCACTTGTTGCCAGGCTTGAAGCTATTCGCTTAGTGCTTGATGGAACAAAAACATC
AGAGAAACATGCCGCAGCGGAAGATAAGGATCATGCTTTACTTTATATGCATATAAATGATTTAGATATTAACATGTTGCAACTAAACGACAAACCTAAACAAAGAATAG
AGAAGGACATAAATTCCAACAGTGCTTTCCTGAATGGTTATTTAAAAGAAGAAGTCTATGTTGCTCAACCTGAGGGATTTATCGATTCTCAATATCCTCAGCATGTGTAT
AAGCTTAATAAAGCTTTATATGGCCTTAAGCAAGCTCCTAGAGCTTGGTACGAACGATTGACAATTTATTTAAGTTGTAAAGGGTATTCCAGAGGCGGGGCTGACAAAAC
ATTATTTATTCACAGAACACATGATCAACTCATTGTCGCACAAATCTATGTTGATGATATCATTTTTGAGGGGTTTCCTCAAGATCTTGTTAATAACTTCATTGATATCA
TGAAGTCAGAATTTGAGATGAGCATGGTGGGAGAACTATCATGTTTTCTGGGACTTCAAATCAAGCAGAAGAGTGAGGGTATATTCATATCTCAAGAAAAGCATGCCAAG
AACATAGTTAAAAAATTCGGGACTTCAGCTGCAACACACGTTAAAATTAACAAAGATAATGATGGTGCAAGAGCAGATTACAAACTTTATAAAAGCATAATCGGTAGTCT
GTTGTATCTAACTGCCAGTCGACCTGACATTGCTTATGTTATTGGGATATGTGCTCGTTATCAGGCTGATCCTCGAATATCACATTTAGAAGCTGTTAAAAGGATCCTCA
AGTACGTTCATGGAACAAGTGATTTTGGAATTCTGTATTTCTATGACACGACTTCGATTTTGATTGGATATTACGATGCCGATTGGACAGGCTCTTCTGATGATAAGAAA
AGCACCTCTGAAGGCTGTTTCTTTCTTGAGAATTATCTTATCTTATGGTTTAGTAAGAAACAAAATTGTGTTTCCTTGTCTACAACAGAAGCTGAATACATAGCGACAAG
GAGTGCTTGTACCCAGTTGATTTGGATGAAAAATATGTTGGATGAGTATGGATTCACACAAGATATCATGACTTTATATTCTGATAATATGAGTGCCATTGATATATCAA
AAAATCATGTTCAACATAGTCGAACCAAACATATTGATATTAGGTATCATTTTATTGGAGAACTTGTTGAAAATAAGGGTATTACACCGGATCATATTCGATCAAACTTG
CAATTAGCAGATAATTTCTCTAAGCCACTTGATGCAAACACATTTGAGCATTTACGTACACATGCATCGAATGTTCCTGAGACTTTTCTATCTGATATAGATTCAGATGA
CTTGGATGATGTCCTTTTGGCTCAATTGTTGAAGAAGACCACTGTTCCTGAGGTTCCTGTTGTAATGCCTACTGCTCCTTTTATGTCTATTCATTCTCAGGAAAGCTCGT
CCATAGAAGGAGTGTTTGTTCCTACTTTTGGTGTTCATCACACTTTTAATGTTCAACCTGGACCTTCAATTCCATTTGAATCGAATGTTGCTCATGCTTCTGTTCCTGGT
GATGTTTCTGCTGCACCTGAAGTGAGAACTGATGTTCGTAGTGATAAGAATGAGTTGGATCCTCCCAATCCTGACATTCATTCTAAAGAAGTTCCTGTTGATGCTGATAA
TAATCCAAATGTTCCACCTGCGTCACCTGAAGTGCTTGTTGCACCATAG
Protein sequenceShow/hide protein sequence
MEIIREGPSASRPPVLDGKNYSYWKPRMIFFIKTLNGKAWRVLVAAKEAWRILEFAYEGTSKVKISRLQLITLKFEALKMSEDESVSKYNEGVLEIANESPLLGEKIPNS
KIVWKVLWSLSRKFDMKVTTIEEAHDITTLKLDELFGSLLTFEMAISDRENNKGKGVTFKSTYEEEATVNHSDNEANMDELIALLTKQFSKVVRRFQKYEYYRIECSNFK
SILKKRWRSGDYGKKKEGEGRFFRFREYGGVGHYQAKCPTFLRRQKKNYHATLSDEDTDDTEDDSSMNAFTTCFTEIDLIDDSGCSNEDGNKDLSFEELKMLRKEDIKAR
AIQKERIQDLMEENERLMSVIASLKLKLKEVQNEYDQTIKYVNMLNSGTENLDSILKSGQNSSSKYGLGFDASVSSRNSTSEVRRMIGNGSFFSELKECVSGYVTFGDGA
RGRIIAKGNIDKNNLPCLNDVRYVDGLKANLISVSQLQGYSVNFSKTGCVVTDKDNQVLMKGRRQEDKCYHLKGKSDTVKICIGLCLNLQCEKGKKIIKIRSDHGKEFDN
EDLNSFCQSEAYKEYHRKWDVKSEQGIFLGYSQNSRAYRVFNNRSGTVIETINVMVNDFESTTKRTYDEDDETLNMPVDSSTLPAEVLKVDAQADDKTDEAGCMTNNKAR
LVAQGYAQVEGVDFDEMFALVARLEAIRLVLDGTKTSEKHAAAEDKDHALLYMHINDLDINMLQLNDKPKQRIEKDINSNSAFLNGYLKEEVYVAQPEGFIDSQYPQHVY
KLNKALYGLKQAPRAWYERLTIYLSCKGYSRGGADKTLFIHRTHDQLIVAQIYVDDIIFEGFPQDLVNNFIDIMKSEFEMSMVGELSCFLGLQIKQKSEGIFISQEKHAK
NIVKKFGTSAATHVKINKDNDGARADYKLYKSIIGSLLYLTASRPDIAYVIGICARYQADPRISHLEAVKRILKYVHGTSDFGILYFYDTTSILIGYYDADWTGSSDDKK
STSEGCFFLENYLILWFSKKQNCVSLSTTEAEYIATRSACTQLIWMKNMLDEYGFTQDIMTLYSDNMSAIDISKNHVQHSRTKHIDIRYHFIGELVENKGITPDHIRSNL
QLADNFSKPLDANTFEHLRTHASNVPETFLSDIDSDDLDDVLLAQLLKKTTVPEVPVVMPTAPFMSIHSQESSSIEGVFVPTFGVHHTFNVQPGPSIPFESNVAHASVPG
DVSAAPEVRTDVRSDKNELDPPNPDIHSKEVPVDADNNPNVPPASPEVLVAP