; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00335 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00335
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCytochrome P450 78A7-like
Genome locationClcChr08:454023..458996
RNA-Seq ExpressionClc08G00335
SyntenyClc08G00335
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016787 - hydrolase activity (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR023780 - Chromo domain
IPR021109 - Aspartic peptidase domain superfamily
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR005162 - Retrotransposon gag domain
IPR001584 - Integrase, catalytic core
IPR000953 - Chromo/chromo shadow domain
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050296.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0057.3Show/hide
Query:  EAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEE
        E   TRGR E+ +  +R KSK  A R+ ++TRLT LE+ + D+Q ++GRLS ++EEL QENAE+T V KE+IE++G+TF+ E       V  L+ FVE E
Subjt:  EAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEE

Query:  LNGLRKEVSDIK-------EQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLW
        L+ L  +   ++        +C ++H  S   STS+     GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+A T+LRD+AQLW
Subjt:  LNGLRKEVSDIK-------EQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLW

Query:  WKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDA
        W+ K+ ++ ++  ++H+WEQFK+ELRKHFVPHNAE E++GKLRRLR TG ILDYVKEFTTLML++  LPEK+ALF FKDGLKDWA++EL+RRNVQTLDDA
Subjt:  WKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDA

Query:  IAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEE
        IA  E L DYS Q KGK         K D TK        K K+ Q + GK    ++GESS P KPCFIC GPHWTR+CPN+KA+NALVA+ +E +  E+
Subjt:  IAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEE

Query:  SEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGT
        +   ++GS+QQ+  + +       E+K L+Y    I+G+ A+AM DTGA+HNFMD ++A RLGL      GT+K+VN+  +   GIAK V +KIG WQ  
Subjt:  SEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGT

Query:  LDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITK
        LDF+I PMDDF +VLGL FF K +        +L I+D     ++ VR G+ V+  +LSALQFKR + K+EC++A+++ +  ET E K+ E  +P  I  
Subjt:  LDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITK

Query:  VLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSK--------------APFGASLGKAKYFSKIDL
        VL EY+D+M  ELPKKLP  +EVDH IELEPGAKPP MAPYR+AP E +EL +QLK+LL+AGYIQP+               A     LG A++FSKIDL
Subjt:  VLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSK--------------APFGASLGKAKYFSKIDL

Query:  RSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSF
         SGYYQVRIK GDE KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FVVVYLDDIVVYSQT+EEHV+HLR+VFQV R+NELYIKLEK SF
Subjt:  RSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSF

Query:  AQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPD
        A+  VEFLGHW K+GKLMMD                                 KGYS IVAPLTNLLKK+QTW+WT  CQRAF  LK A+ +EPVM+L D
Subjt:  AQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPD

Query:  HTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEF
        HT+PFE+HT+ASD+AIGGVL++      +  R+ +D                HC              +  D  A +S+ QTQKKL+PKQA+WQDFLAEF
Subjt:  HTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEF

Query:  DFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMW
        DFK+EYK G+AN+VADALS KAE + I  S P   F E+IKEG++ D+LA NLLKLA+EGKTRRF E+N TLLT                          
Subjt:  DFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMW

Query:  AGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANV
                   +V DKYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPA LLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVD FSKYATFIPC  +V
Subjt:  AGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANV

Query:  KVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSE
        KVDEAARLF KNIVK WG+P+SIISDRD RFTGKFWRELFKL G +LNFST+FH QSDGQTER+N LLEQYLRHYVSAHQKDWA LLD+AQFSYNLQRSE
Subjt:  KVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSE

Query:  STGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYE
        STGKSPFEI+  +QPNTP  L + Y GPNPSA+   K+W E+QDI+RACL KAA++MKKWAD  RRPKEY  GE                  +GL+R+YE
Subjt:  STGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYE

Query:  GPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKE
        GPF I+ERVGK AY+V+LP +LKIH++FHVSMLKPFH+D EDP R+ +++AP GV TE+DK ++ ILA R++  +GVPS+ EYL+ W+GL +SEASWE E
Subjt:  GPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKE

Query:  GHLWQFADKISDY
          LWQF  +I  +
Subjt:  GHLWQFADKISDY

KAA0054276.1 polyprotein [Cucumis melo var. makuwa]0.0e+0059.3Show/hide
Query:  KSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDI
        KSK PA R+ ++TRLT LE+ + D+Q ++GRLS ++EEL QENAE+T V KE+IED+G+TF+ E       V  L+ FVE EL+ L  +       +  +
Subjt:  KSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDI

Query:  KEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTW
          +C ++H  S+  STS+     GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+APT+LRD+AQLWW+ K+A++  +  +IH+W
Subjt:  KEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTW

Query:  EQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK-
        EQFK+ELRKHFVPHNAE E++G                           LPEK+ALF FKDGLKDWA++EL+RRNVQTLDDAIAA E L DYS Q KGK 
Subjt:  EQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK-

Query:  --------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADL
                KSD TK+       K K+ Q K GK    ++GESS PPKPCFIC GPHWTR+CPN+KA+NALVA+ +E +  E++   ++GSIQQ+      
Subjt:  --------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADL

Query:  FSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLT
                                               +A RLGL      GT+K+VN+  +   G+AK V +KIG WQ  LDF++ PMDDF +VLGL 
Subjt:  FSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLT

Query:  FFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLP
        FF K +        +L I+D   +  IP+R G+ V+  +LSALQFKR + K++C++A+++ ++ E  E KT E  +P  I KVL EY+D+MP+ELPKKLP
Subjt:  FFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLP

Query:  SRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKA
         R+EVDH IELE GAKPPAMAPYR+APPELEEL +QLK+LL+AGYIQPSKAP+GA                                         LGKA
Subjt:  SRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKA

Query:  KYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELY
        ++FSKIDLRSGYYQVRIK GDE KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FV+VYLDDIVVYSQT+EEHV+HLR+VFQVLR+NELY
Subjt:  KYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELY

Query:  IKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQK
        IKLEK SFA+  VEFLGHW K+GKLMMD  K++AI EW+ PTKVPELRSFLGFVNYYRRFIKGYS + APLTNLLKK+QTW WT  CQRAF  LK AV +
Subjt:  IKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQK

Query:  EPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQAR
        EPVM+L DHT+PFE+HT+ASD+AIGGVL++      +  R+ +D                HC              +  D  A +S+ QTQKKL+PKQAR
Subjt:  EPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQAR

Query:  WQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREII
        WQDFLAEFDFK+EYK G+ANVVADALSRKAE   I  S P  +F E+IKEG++ D+LA NLLKLA+EGKTRRFWE++ TLLT G RLF+P+WG LR++++
Subjt:  WQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREII

Query:  KECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYAT
        +ECHDS+WAGHPGM+RT+ALV DKYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVD FSKYAT
Subjt:  KECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYAT

Query:  FIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQF
        FIPCS +VKVDEAARLF KN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMGT+LNFST+FHPQSDGQTER+NALLEQYLRHYVSAHQKDW  LLD+AQF
Subjt:  FIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQF

Query:  SYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------
        SYNLQRSE+TGKSPFE++  +QPNTP  L + Y GPNPSA+  AK+W E+QDI+RACL KAA++MKKWAD  RRPKEY  G+                  
Subjt:  SYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------

Query:  QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPD
        +GL+R+YEGPF I+ERVGK AY+V+LP RLKIH+VFHVSMLKPFH+D EDP R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W+GLP+
Subjt:  QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPD

Query:  SEASWEKEGHLWQFADKISDY
        SEASWE+E  LWQF  +I  +
Subjt:  SEASWEKEGHLWQFADKISDY

KAA0065760.1 polyprotein [Cucumis melo var. makuwa]0.0e+0061.72Show/hide
Query:  MELLPEAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRK
        +E + E   TRGR E+ +  +R KSK PA R+ ++TRLT LE+ + D+Q ++GRLS ++EEL QENAE+T V KE+IED+G+TF+ E       V  L+ 
Subjt:  MELLPEAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRK

Query:  FVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRD
        FVE EL+ L  +       +  +  +C ++H  S+  STS+     GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+APT+LRD
Subjt:  FVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRD

Query:  SAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQ
        +AQLWW+ K+A++  +  +IH+WEQFK+ELRKHFVPHNAE E++GKLRRLR TG IL+YVKEFTTLMLE+  LPEK+ALF FKDGLKDWA++EL+RRNVQ
Subjt:  SAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQ

Query:  TLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREA
        TLDDAIAA E L DYS Q KGK         KSD TK+       K K+ Q K GK    ++GESS PPKPCFIC GPHWTR+CPN+KA+NALVA+ +E 
Subjt:  TLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREA

Query:  EANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIG
        +  E++   ++GS+QQ+  +        VE+KGL+Y    I+G+ A+AM DTGA+HNFMD ++A RLGL      GT+K+VN+  +   G+AK V +KIG
Subjt:  EANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIG

Query:  HWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLP
         WQ  LDF++ PMDDF +VLGL FF K +        +L I+D   +  IP+R G+ ++  +LSALQFKR + K++C++A+++ L+ E  E KT E  +P
Subjt:  HWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLP

Query:  SCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------
          I KVL EY+D+MP+ELPKKLP R+EVDH IELEPGAKPPAMAPYR+AP ELEEL +QLK+LL+AGYIQPSKAP+GA                      
Subjt:  SCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------

Query:  ------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYS
                           LGKA++FSKIDLRSGYYQVRIK GDE KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FVVVYLDDIVVYS
Subjt:  ------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYS

Query:  QTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKK
        QT+EEHV+HL++VFQVLR+NELYIKLEK SFA+  VEFLGHW K+GKLMMD  +++AI EW+APTKVPELRSFLGFVNYYRRFIKGYS I APLTNLLKK
Subjt:  QTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKK

Query:  SQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PL
        +QTW WT  CQ+AF  LK AV +E VM+L DHT+PFE+HT+ASD+AIGGVLI+      +  R+ +D                HC              +
Subjt:  SQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PL

Query:  PADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESN
          D  A +S+ QTQKKL+PKQARWQDFLAEFDFK+EYK G+ANVVADALSRKAE   I  S P  +F E+IKEG++ D+LA NLLKLA+EGKTRRFWE++
Subjt:  PADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESN

Query:  DTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIV
         TLLT G RLF+ +WG LR+++++ECHDS+WAGHPGM+RT+ALV DKYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPAGLLEPLP+AE+PW+SVTMDFIV
Subjt:  DTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIV

Query:  ALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLE
        ALPKS   G+IMVVVD FSKYATFIPCS +VKVDEAARLF KN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMGT+LNFST+FHPQSDGQTER+NALLE
Subjt:  ALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLE

Query:  QYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKE
        QYLRHYVSAHQKDWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QDI+RACL KAA++MKKWAD  RRPKE
Subjt:  QYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKE

Query:  YNEGE------------------QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAH
        Y  G+                  +GL+R+YEGPF I+ERVGK AY+V+LP RLKIH+VFH+SMLKPFH+D EDP R+ ++RAP GV TE+D+ ++ ILA 
Subjt:  YNEGE------------------QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAH

Query:  RRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKISDY
        R++  +GVPSH EYL+ W+GLP+SEASWE+E  LWQF  +I  +
Subjt:  RRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKISDY

KAF5791879.1 putative nucleotidyltransferase, Ribonuclease H [Helianthus annuus]0.0e+0053.61Show/hide
Query:  TRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAEVAD-LRKFVEEELNGLRKEVSD
        TRGR +     K  K +  +  K LE R+  LE  +  L A +       +E+    +++    K  +E     F+A+V + +++  ++ +  LRK++ D
Subjt:  TRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAEVAD-LRKFVEEELNGLRKEVSD

Query:  IKEQC--------LARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAERE
        +               +  S   +++ +S    T  ++VPKP  F G R A  VD+FL+ +ERYF+ + V DDA +I     YL+D+A LWW+ KHAE E
Subjt:  IKEQC--------LARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAERE

Query:  KDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTD
        K  C ++TW  F  E++K F P NAE EAK +LR+L+QTG I DY+KEFTTL+LE+  + +KD+LFYF DGL+ WA+ ELERR VQ L  AIA  EGLTD
Subjt:  KDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTD

Query:  YSTQKGKKSDSTKDKNKSSQDKRGKGNKGESSKPP--------KP------CFICDGPHWTRECPNKKAMNALVAEMREAEANEESEAKVGSIQQLNALA
        YS ++ +  D      K   +K  +  +  + K P        KP      CFICDGPH+ RECP + +++A++A+      +EE +  +GS+Q L A+ 
Subjt:  YSTQKGKKSDSTKDKNKSSQDKRGKGNKGESSKPP--------KP------CFICDGPHWTRECPNKKAMNALVAEMREAEANEESEAKVGSIQQLNALA

Query:  DLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLG
         +   R   NKGL YV A I G    A++DTGA+HNFM   +A RLG+  + G G +K VN++AKP  G AKDV I+IG W G ++F+I  MDDFK+VLG
Subjt:  DLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLG

Query:  LTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKETLPSCITKVLAEYQDVMPTELPKKL
        + FF K  AFP+P  NSL ILD+ K  ++    G K     LSA+QFK+   K E    +V ++ EE    K KE +P  I KVL E++DVMPTELPKKL
Subjt:  LTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKETLPSCITKVLAEYQDVMPTELPKKL

Query:  PSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGK
        P R+EVDH IELEPG+KPP+ APYR+ PPELEEL KQLK+LL+AGYI+PSKAP+GA                                         LG 
Subjt:  PSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGK

Query:  AKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENEL
        A+YFSK+DLRSGYYQVRI  GDE KT C+TRYG+YEFLVMPFGLTNAPATFCTLMNKLFHP+LDKFVVVYLDDIVVYS+T++EHVEHLR+VFQVLRENEL
Subjt:  AKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENEL

Query:  YIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQ
        Y+KLEK SFAQ  VEFLGH  K+G+LMMD  K+KAIQ W  PTKVPELRSFLG VNYYRRFI+GYSA  APLT+LLKK++ W+W   CQ AF+ LKEAV 
Subjt:  YIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQ

Query:  KEPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQA
         EPV+ LPD ++PFE+HT+ASD+AIGGVLI+      +  R+ +D                HC              +  D  A +S+ QTQKKLSPKQA
Subjt:  KEPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQA

Query:  RWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREI
        RWQDFLAEFD+ +EYK GKANVVADALSRKAE AAI  + P    ++QIKEGL QD  A +L+ LAR+GKT RFW   D L T G RL++P+WGNLRR +
Subjt:  RWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREI

Query:  IKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYA
        +KECHDS WAGHPG+ RT+AL+   +YWP+M D++EAYV TCL+CQQDK EQ+ P GLLEPLP+   PWES++MDFI  LPKS   GSI+VVVD FSKY 
Subjt:  IKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYA

Query:  TFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQ
        TFIP  A+V   E ARLF K++VK WGIP++I+SDRDPRFTG+FW ELFK+MGTELNFST+FHPQ+DGQTERVNALLE YLRHYVSA+Q+DWA LLD+AQ
Subjt:  TFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQ

Query:  FSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGEQ----------------
        FSYN+QRSE+TGKSPFEI+  RQP TP  L + Y G +P+A+K  KEW E+ D+AR  L KAAK+MKKWAD  RR  E+  G+Q                
Subjt:  FSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGEQ----------------

Query:  --GLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLP
          GL+RKYEGPFP++ RVGK +YR+QLP +LKIH VFHVS LKP+H D ED  R +S RAP  V T +D+ V  I++ R +  +GVPS+ EYL+KW+ LP
Subjt:  --GLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLP

Query:  DSEASWEKEGHLWQFADKISDY
        DSEASWE E  LWQF D+I  Y
Subjt:  DSEASWEKEGHLWQFADKISDY

XP_008460615.1 PREDICTED: uncharacterized protein LOC103499392 [Cucumis melo]0.0e+0059.53Show/hide
Query:  DLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQ
        D+Q ++GRLS ++EEL QENAE+T V KE+IED+G+TF+ E       V  L+ FVE EL+ L  +       +  +  +C ++H  S+  STS+     
Subjt:  DLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQ

Query:  GTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGK
        GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+APT+LRD+AQLWW+ K+A++  +  +IH+WEQFK+ELRKHFVPHNAE E++G 
Subjt:  GTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGK

Query:  LRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------K
                                  LPEK+ALF FKDGLKDWA++EL+RRNVQTLDDAIAA E L DYS Q KGK         KSD TK+       K
Subjt:  LRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------K

Query:  NKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPA
         K+ Q K GK    ++GESS PPKPCFIC GPHWTR+CPN+KA+NALVA+ +E +  E++   ++GSIQQ+                             
Subjt:  NKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPA

Query:  SAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRK
                        +A RLGL      GT+K+VN+  +   G+AK V +KIG WQ  LDF++ PMDDF +VLGL FF K +        +L I+D   
Subjt:  SAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRK

Query:  IRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPY
        +  IP+R G+ V+  +LSALQFKR + K++C++A+++ ++ E  E KT E  +P  I KVL EY+D+MP+ELPKKLP R+EVDH IELE GAKPPAMAPY
Subjt:  IRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPY

Query:  RIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEP
        R+APPELEEL +QLK+LL+AGYIQPSKAP+GA                                         LGKA++FSKIDLRSGYYQVRIK GDE 
Subjt:  RIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEP

Query:  KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDG
        KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FV+VYLDDIVVYSQT+EEHV+HLR+VFQVLR+NELYIKLEK SFA+  VEFLGHW K+G
Subjt:  KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDG

Query:  KLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYA
        KLMMD  K++AI EW+ PTKVPELRSFLGFVNYYRRFIKGYS + APLTNLLKK+QTW WT  CQRAF  LK AV +EPVM+L DHT+PFE+HT+ASD+A
Subjt:  KLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYA

Query:  IGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVA
        IGGVL++      +  R+ +D                HC              +  D  A +S+ QTQKKL+PKQARWQDFLAEFDFK+EYK G+ANVVA
Subjt:  IGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVA

Query:  DALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQD
        DALSRKAE   I  S P  +F E+IKEG++ D+LA NLLKLA+EGKTRRFWE++ TLLT G RLF+P+WG LR+++++ECHDS+WAGHPGM+RT+ALV D
Subjt:  DALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQD

Query:  KYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVK
        KYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVD FSKYATFIPCS +VKVDEAARLF KN+VK
Subjt:  KYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVK

Query:  LWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQP
        LWGIP+SIISDRDPRFTGKFWRELFKLMGT+LNFST+FHPQSDGQTER+NALLEQYLRHYVSAHQKDW  LLD+AQFSYNLQRSE+TGKSPFE++  +QP
Subjt:  LWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQP

Query:  NTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYEGPFPILERVGKTAYR
        NTP  L + Y GPNPSA+  AK+W E+QDI+RACL KAA++MKKWAD  RRPKEY  G+                  +GL+R+YEGPF I+ERVGK AY+
Subjt:  NTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYEGPFPILERVGKTAYR

Query:  VQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKI
        V+LP RLKIH+VFHVSMLKPFH+D EDP R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W+GLP+SEASWE+E  LWQF  +I
Subjt:  VQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKI

TrEMBL top hitse value%identityAlignment
A0A1S3CE17 uncharacterized protein LOC1034993920.0e+0059.53Show/hide
Query:  DLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQ
        D+Q ++GRLS ++EEL QENAE+T V KE+IED+G+TF+ E       V  L+ FVE EL+ L  +       +  +  +C ++H  S+  STS+     
Subjt:  DLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQ

Query:  GTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGK
        GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+APT+LRD+AQLWW+ K+A++  +  +IH+WEQFK+ELRKHFVPHNAE E++G 
Subjt:  GTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGK

Query:  LRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------K
                                  LPEK+ALF FKDGLKDWA++EL+RRNVQTLDDAIAA E L DYS Q KGK         KSD TK+       K
Subjt:  LRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------K

Query:  NKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPA
         K+ Q K GK    ++GESS PPKPCFIC GPHWTR+CPN+KA+NALVA+ +E +  E++   ++GSIQQ+                             
Subjt:  NKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPA

Query:  SAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRK
                        +A RLGL      GT+K+VN+  +   G+AK V +KIG WQ  LDF++ PMDDF +VLGL FF K +        +L I+D   
Subjt:  SAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRK

Query:  IRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPY
        +  IP+R G+ V+  +LSALQFKR + K++C++A+++ ++ E  E KT E  +P  I KVL EY+D+MP+ELPKKLP R+EVDH IELE GAKPPAMAPY
Subjt:  IRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPY

Query:  RIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEP
        R+APPELEEL +QLK+LL+AGYIQPSKAP+GA                                         LGKA++FSKIDLRSGYYQVRIK GDE 
Subjt:  RIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEP

Query:  KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDG
        KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FV+VYLDDIVVYSQT+EEHV+HLR+VFQVLR+NELYIKLEK SFA+  VEFLGHW K+G
Subjt:  KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDG

Query:  KLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYA
        KLMMD  K++AI EW+ PTKVPELRSFLGFVNYYRRFIKGYS + APLTNLLKK+QTW WT  CQRAF  LK AV +EPVM+L DHT+PFE+HT+ASD+A
Subjt:  KLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYA

Query:  IGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVA
        IGGVL++      +  R+ +D                HC              +  D  A +S+ QTQKKL+PKQARWQDFLAEFDFK+EYK G+ANVVA
Subjt:  IGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVA

Query:  DALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQD
        DALSRKAE   I  S P  +F E+IKEG++ D+LA NLLKLA+EGKTRRFWE++ TLLT G RLF+P+WG LR+++++ECHDS+WAGHPGM+RT+ALV D
Subjt:  DALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQD

Query:  KYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVK
        KYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVD FSKYATFIPCS +VKVDEAARLF KN+VK
Subjt:  KYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVK

Query:  LWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQP
        LWGIP+SIISDRDPRFTGKFWRELFKLMGT+LNFST+FHPQSDGQTER+NALLEQYLRHYVSAHQKDW  LLD+AQFSYNLQRSE+TGKSPFE++  +QP
Subjt:  LWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQP

Query:  NTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYEGPFPILERVGKTAYR
        NTP  L + Y GPNPSA+  AK+W E+QDI+RACL KAA++MKKWAD  RRPKEY  G+                  +GL+R+YEGPF I+ERVGK AY+
Subjt:  NTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYEGPFPILERVGKTAYR

Query:  VQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKI
        V+LP RLKIH+VFHVSMLKPFH+D EDP R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W+GLP+SEASWE+E  LWQF  +I
Subjt:  VQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKI

A0A5A7UJ25 Polyprotein0.0e+0059.3Show/hide
Query:  KSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDI
        KSK PA R+ ++TRLT LE+ + D+Q ++GRLS ++EEL QENAE+T V KE+IED+G+TF+ E       V  L+ FVE EL+ L  +       +  +
Subjt:  KSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEELNGLRKE-------VSDI

Query:  KEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTW
          +C ++H  S+  STS+     GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+APT+LRD+AQLWW+ K+A++  +  +IH+W
Subjt:  KEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTW

Query:  EQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK-
        EQFK+ELRKHFVPHNAE E++G                           LPEK+ALF FKDGLKDWA++EL+RRNVQTLDDAIAA E L DYS Q KGK 
Subjt:  EQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQ-KGK-

Query:  --------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADL
                KSD TK+       K K+ Q K GK    ++GESS PPKPCFIC GPHWTR+CPN+KA+NALVA+ +E +  E++   ++GSIQQ+      
Subjt:  --------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEA-KVGSIQQLNALADL

Query:  FSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLT
                                               +A RLGL      GT+K+VN+  +   G+AK V +KIG WQ  LDF++ PMDDF +VLGL 
Subjt:  FSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLT

Query:  FFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLP
        FF K +        +L I+D   +  IP+R G+ V+  +LSALQFKR + K++C++A+++ ++ E  E KT E  +P  I KVL EY+D+MP+ELPKKLP
Subjt:  FFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITKVLAEYQDVMPTELPKKLP

Query:  SRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKA
         R+EVDH IELE GAKPPAMAPYR+APPELEEL +QLK+LL+AGYIQPSKAP+GA                                         LGKA
Subjt:  SRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------------------------SLGKA

Query:  KYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELY
        ++FSKIDLRSGYYQVRIK GDE KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FV+VYLDDIVVYSQT+EEHV+HLR+VFQVLR+NELY
Subjt:  KYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELY

Query:  IKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQK
        IKLEK SFA+  VEFLGHW K+GKLMMD  K++AI EW+ PTKVPELRSFLGFVNYYRRFIKGYS + APLTNLLKK+QTW WT  CQRAF  LK AV +
Subjt:  IKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQK

Query:  EPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQAR
        EPVM+L DHT+PFE+HT+ASD+AIGGVL++      +  R+ +D                HC              +  D  A +S+ QTQKKL+PKQAR
Subjt:  EPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQAR

Query:  WQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREII
        WQDFLAEFDFK+EYK G+ANVVADALSRKAE   I  S P  +F E+IKEG++ D+LA NLLKLA+EGKTRRFWE++ TLLT G RLF+P+WG LR++++
Subjt:  WQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREII

Query:  KECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYAT
        +ECHDS+WAGHPGM+RT+ALV DKYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVD FSKYAT
Subjt:  KECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYAT

Query:  FIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQF
        FIPCS +VKVDEAARLF KN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMGT+LNFST+FHPQSDGQTER+NALLEQYLRHYVSAHQKDW  LLD+AQF
Subjt:  FIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQF

Query:  SYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------
        SYNLQRSE+TGKSPFE++  +QPNTP  L + Y GPNPSA+  AK+W E+QDI+RACL KAA++MKKWAD  RRPKEY  G+                  
Subjt:  SYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------

Query:  QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPD
        +GL+R+YEGPF I+ERVGK AY+V+LP RLKIH+VFHVSMLKPFH+D EDP R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W+GLP+
Subjt:  QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPD

Query:  SEASWEKEGHLWQFADKISDY
        SEASWE+E  LWQF  +I  +
Subjt:  SEASWEKEGHLWQFADKISDY

A0A5A7VEX8 Polyprotein0.0e+0061.72Show/hide
Query:  MELLPEAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRK
        +E + E   TRGR E+ +  +R KSK PA R+ ++TRLT LE+ + D+Q ++GRLS ++EEL QENAE+T V KE+IED+G+TF+ E       V  L+ 
Subjt:  MELLPEAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRK

Query:  FVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRD
        FVE EL+ L  +       +  +  +C ++H  S+  STS+     GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+APT+LRD
Subjt:  FVEEELNGLRKE-------VSDIKEQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRD

Query:  SAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQ
        +AQLWW+ K+A++  +  +IH+WEQFK+ELRKHFVPHNAE E++GKLRRLR TG IL+YVKEFTTLMLE+  LPEK+ALF FKDGLKDWA++EL+RRNVQ
Subjt:  SAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQ

Query:  TLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREA
        TLDDAIAA E L DYS Q KGK         KSD TK+       K K+ Q K GK    ++GESS PPKPCFIC GPHWTR+CPN+KA+NALVA+ +E 
Subjt:  TLDDAIAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREA

Query:  EANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIG
        +  E++   ++GS+QQ+  +        VE+KGL+Y    I+G+ A+AM DTGA+HNFMD ++A RLGL      GT+K+VN+  +   G+AK V +KIG
Subjt:  EANEESEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIG

Query:  HWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLP
         WQ  LDF++ PMDDF +VLGL FF K +        +L I+D   +  IP+R G+ ++  +LSALQFKR + K++C++A+++ L+ E  E KT E  +P
Subjt:  HWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLP

Query:  SCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------
          I KVL EY+D+MP+ELPKKLP R+EVDH IELEPGAKPPAMAPYR+AP ELEEL +QLK+LL+AGYIQPSKAP+GA                      
Subjt:  SCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA----------------------

Query:  ------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYS
                           LGKA++FSKIDLRSGYYQVRIK GDE KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FVVVYLDDIVVYS
Subjt:  ------------------SLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYS

Query:  QTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKK
        QT+EEHV+HL++VFQVLR+NELYIKLEK SFA+  VEFLGHW K+GKLMMD  +++AI EW+APTKVPELRSFLGFVNYYRRFIKGYS I APLTNLLKK
Subjt:  QTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKK

Query:  SQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PL
        +QTW WT  CQ+AF  LK AV +E VM+L DHT+PFE+HT+ASD+AIGGVLI+      +  R+ +D                HC              +
Subjt:  SQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PL

Query:  PADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESN
          D  A +S+ QTQKKL+PKQARWQDFLAEFDFK+EYK G+ANVVADALSRKAE   I  S P  +F E+IKEG++ D+LA NLLKLA+EGKTRRFWE++
Subjt:  PADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESN

Query:  DTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIV
         TLLT G RLF+ +WG LR+++++ECHDS+WAGHPGM+RT+ALV DKYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPAGLLEPLP+AE+PW+SVTMDFIV
Subjt:  DTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIV

Query:  ALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLE
        ALPKS   G+IMVVVD FSKYATFIPCS +VKVDEAARLF KN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMGT+LNFST+FHPQSDGQTER+NALLE
Subjt:  ALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLE

Query:  QYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKE
        QYLRHYVSAHQKDWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QDI+RACL KAA++MKKWAD  RRPKE
Subjt:  QYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKE

Query:  YNEGE------------------QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAH
        Y  G+                  +GL+R+YEGPF I+ERVGK AY+V+LP RLKIH+VFH+SMLKPFH+D EDP R+ ++RAP GV TE+D+ ++ ILA 
Subjt:  YNEGE------------------QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAH

Query:  RRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKISDY
        R++  +GVPSH EYL+ W+GLP+SEASWE+E  LWQF  +I  +
Subjt:  RRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKISDY

A0A5D3BWR0 Reverse transcriptase0.0e+0057.3Show/hide
Query:  EAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEE
        E   TRGR E+ +  +R KSK  A R+ ++TRLT LE+ + D+Q ++GRLS ++EEL QENAE+T V KE+IE++G+TF+ E       V  L+ FVE E
Subjt:  EAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAE-------VADLRKFVEEE

Query:  LNGLRKEVSDIK-------EQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLW
        L+ L  +   ++        +C ++H  S   STS+     GT N+KVPKPD ++GVRNAT+VDNFLFGLERYF ALGV DD ARIN+A T+LRD+AQLW
Subjt:  LNGLRKEVSDIK-------EQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLW

Query:  WKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDA
        W+ K+ ++ ++  ++H+WEQFK+ELRKHFVPHNAE E++GKLRRLR TG ILDYVKEFTTLML++  LPEK+ALF FKDGLKDWA++EL+RRNVQTLDDA
Subjt:  WKHKHAEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDA

Query:  IAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEE
        IA  E L DYS Q KGK         K D TK        K K+ Q + GK    ++GESS P KPCFIC GPHWTR+CPN+KA+NALVA+ +E +  E+
Subjt:  IAAVEGLTDYSTQ-KGK---------KSDSTKD-------KNKSSQDKRGK---GNKGESSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEE

Query:  SEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGT
        +   ++GS+QQ+  + +       E+K L+Y    I+G+ A+AM DTGA+HNFMD ++A RLGL      GT+K+VN+  +   GIAK V +KIG WQ  
Subjt:  SEA-KVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGT

Query:  LDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITK
        LDF+I PMDDF +VLGL FF K +        +L I+D     ++ VR G+ V+  +LSALQFKR + K+EC++A+++ +  ET E K+ E  +P  I  
Subjt:  LDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVRELQEETGETKTKE-TLPSCITK

Query:  VLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSK--------------APFGASLGKAKYFSKIDL
        VL EY+D+M  ELPKKLP  +EVDH IELEPGAKPP MAPYR+AP E +EL +QLK+LL+AGYIQP+               A     LG A++FSKIDL
Subjt:  VLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSK--------------APFGASLGKAKYFSKIDL

Query:  RSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSF
         SGYYQVRIK GDE KTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLF P+LD+FVVVYLDDIVVYSQT+EEHV+HLR+VFQV R+NELYIKLEK SF
Subjt:  RSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSF

Query:  AQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPD
        A+  VEFLGHW K+GKLMMD                                 KGYS IVAPLTNLLKK+QTW+WT  CQRAF  LK A+ +EPVM+L D
Subjt:  AQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPD

Query:  HTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEF
        HT+PFE+HT+ASD+AIGGVL++      +  R+ +D                HC              +  D  A +S+ QTQKKL+PKQA+WQDFLAEF
Subjt:  HTRPFEIHTNASDYAIGGVLIE----AIYRPRERDD---------------CHC-------------PLPADMAALSSWKQTQKKLSPKQARWQDFLAEF

Query:  DFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMW
        DFK+EYK G+AN+VADALS KAE + I  S P   F E+IKEG++ D+LA NLLKLA+EGKTRRF E+N TLLT                          
Subjt:  DFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMW

Query:  AGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANV
                   +V DKYYWP+M+D+IE+YV+TCL+CQQDKGEQQLPA LLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVD FSKYATFIPC  +V
Subjt:  AGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANV

Query:  KVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSE
        KVDEAARLF KNIVK WG+P+SIISDRD RFTGKFWRELFKL G +LNFST+FH QSDGQTER+N LLEQYLRHYVSAHQKDWA LLD+AQFSYNLQRSE
Subjt:  KVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSE

Query:  STGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYE
        STGKSPFEI+  +QPNTP  L + Y GPNPSA+   K+W E+QDI+RACL KAA++MKKWAD  RRPKEY  GE                  +GL+R+YE
Subjt:  STGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE------------------QGLIRKYE

Query:  GPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKE
        GPF I+ERVGK AY+V+LP +LKIH++FHVSMLKPFH+D EDP R+ +++AP GV TE+DK ++ ILA R++  +GVPS+ EYL+ W+GL +SEASWE E
Subjt:  GPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKE

Query:  GHLWQFADKISDY
          LWQF  +I  +
Subjt:  GHLWQFADKISDY

A5BX03 Uncharacterized protein0.0e+0050.55Show/hide
Query:  TRGRAEKRAQAKRVKSKSPAARKV---LETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKE---------LIEDLGQTFRAEVADLRKFVEE
        TRGR  +     R + K   +R V   +E RLT +E  + D +  +  +    E+  ++  E  +  +E         +  +   +F+ +V  +   VE 
Subjt:  TRGRAEKRAQAKRVKSKSPAARKV---LETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKE---------LIEDLGQTFRAEVADLRKFVEE

Query:  ELNGLRKEVSDIKEQCLARHQASSPASTSS-TSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKH
         +  L   + D+++Q + +  A    + S+   A      ++VPK  TFSG R+A  +DNFL+ +ERYF+A+ + D+A ++  A  YL D+A LWW+ + 
Subjt:  ELNGLRKEVSDIKEQCLARHQASSPASTSS-TSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKH

Query:  AEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVE
        A+ E+  C+I TW+ FK E+++ F P +    A+  L+RL+ TG I +YVKEF+TLMLE+ ++ E++ LF F D L+ WA  EL RR VQ L  A+A  E
Subjt:  AEREKDQCSIHTWEQFKSELRKHFVPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVE

Query:  GLTDY--------------STQKG---KKSDSTKDKNKSSQDKRGKGNKGESSK----PPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEAK
         L DY              +  KG   K+S     K  SS+   GK  KG+  +    P   CF+CDGPH  R+CP +KA+NA++      E  +E +AK
Subjt:  GLTDY--------------STQKG---KKSDSTKDKNKSSQDKRGKGNKGESSK----PPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEAK

Query:  VGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTI
        +GS+  L AL     P+  ++K LMYV+  + G+   A++DTGATHNF+ E +A RL L  +   G +K VNS AKP+ G+A+ V + IG W+G +DFT+
Subjt:  VGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGRGTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTI

Query:  APMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDE-CFIASVRELQEETGETKTKETLPSCITKVLAEY
        APMDDFKMVLG+ F +K  A P+P   S+ IL++ K  ++P       + P+LSA+Q K+ + ++E  ++A+++E +++     + E +P  I  VL E+
Subjt:  APMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDE-CFIASVRELQEETGETKTKETLPSCITKVLAEY

Query:  QDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA--------------------------------
        +DVM  ELPK+L  R+E +H I+LE GAKP AM PYR+APPELEEL +QLK+LL+AG+IQPSKAP+GA                                
Subjt:  QDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGA--------------------------------

Query:  --------SLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHL
                 LG+A+YF+K+DLRSGYYQVRI  GDEPKT CVTRYG+YEFLVMPFGLTNAPATFCTLMNK+FHPYLDKFVV YLDDIV+YS T++EH EHL
Subjt:  --------SLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHL

Query:  RKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSC
        RKVF++LR+N+LY+K EK SFA+  V FLGH  +DGKLMMD +K+KAIQEW  PTKVP+LRSFLG VNYYRRFIKGYS   APLT+LLKK++ WEW   C
Subjt:  RKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSC

Query:  QRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLI----------------EAIYRPRERD----------------DCHCPLPADMAALSSW
        Q+AF++LK+AV +EPV+ LPDHT+ FE+HT+ASD+AIGGVL+                E  Y  +E++                  H  +  D  A +S+
Subjt:  QRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLI----------------EAIYRPRERD----------------DCHCPLPADMAALSSW

Query:  KQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRL
         QTQKKLSPKQARWQDFLAEFD+ +EYK G AN VADALSRKAE A+I +S P+      ++EGL+ D +A +L+ LA EGKT+RFW  +  L T GRRL
Subjt:  KQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRL

Query:  FIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGS
        ++PKWGN+RR +IKECHD+ WAGHPG  RT AL++  YYWP++RD +EAYV+TCL+CQQDK EQ+ P GLLEPLPVAE PW+SVTMDFI+ LPKS   GS
Subjt:  FIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGS

Query:  IMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAH
        I+VVVD FSKYATFI    +   +E ARLFLK++VK WG+P+ IISDRDPRFTGKFW ELFKLMG+EL+FST+FHPQ+DGQTER NALLE YLRH+VSA+
Subjt:  IMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAH

Query:  QKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE-----
        Q+DWA LLDIAQFSYNLQRSE+T KSPFE+   +QP TP  L   Y+G +P+A+K AK W E+ DIA + L KAAKKMKKWAD  RR  EY  G+     
Subjt:  QKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGE-----

Query:  -------------QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPS
                     +GL+R+YEGPFPIL +VGK +Y+V+LP RLKIH VFHVS L P+H+D +DP R +S RAP  V T YDK VE I+A R +  +GVP 
Subjt:  -------------QGLIRKYEGPFPILERVGKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPS

Query:  HLEYLVKWQGLPDSEASWEKEGHLWQFADKISDYWQE
          EYLVKW+GLP+SEASWE    LWQF ++I  +  E
Subjt:  HLEYLVKWQGLPDSEASWEKEGHLWQFADKISDYWQE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein5.9e-10529.98Show/hide
Query:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ
        Y++P+  P        A +  +  F+K+DL+S Y+ +R++ GDE K A     G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S+
Subjt:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ

Query:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS
        +  EHV+H++ V Q L+   L I   K  F Q  V+F+G+   +         I  + +W+ P    ELR FLG VNY R+FI   S +  PL NLLKK 
Subjt:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS

Query:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ
          W+WT +  +A + +K+ +   PV+   D ++   + T+ASD A+G VL +     +  DD + P                     + A + +L  W+ 
Subjt:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ

Query:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL
          +  + P                       + ARWQ FL +F+F++ Y+ G AN +ADALSR   E   I   +   S            F+ Q+    
Subjt:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL

Query:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ
          D   +NLL    +         +  L+    ++ +P    L R IIK+ H+     HPG++    ++  ++ W  +R  I+ YVQ C  CQ +K    
Subjt:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ

Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
         P G L+P+P +E PWES++MDFI ALP+S    ++ VVVD FSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++      
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK
          + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  K
Subjt:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK

Query:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH
        E     +I          KMKK+ D+  +  +E+  G+  ++++               + GPF +L++ G   Y + LP  +K      FHVS L+ + 
Subjt:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH

Query:  QDME
         + E
Subjt:  QDME

P0CT35 Transposon Tf2-2 polyprotein5.9e-10529.98Show/hide
Query:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ
        Y++P+  P        A +  +  F+K+DL+S Y+ +R++ GDE K A     G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S+
Subjt:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ

Query:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS
        +  EHV+H++ V Q L+   L I   K  F Q  V+F+G+   +         I  + +W+ P    ELR FLG VNY R+FI   S +  PL NLLKK 
Subjt:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS

Query:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ
          W+WT +  +A + +K+ +   PV+   D ++   + T+ASD A+G VL +     +  DD + P                     + A + +L  W+ 
Subjt:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ

Query:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL
          +  + P                       + ARWQ FL +F+F++ Y+ G AN +ADALSR   E   I   +   S            F+ Q+    
Subjt:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL

Query:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ
          D   +NLL    +         +  L+    ++ +P    L R IIK+ H+     HPG++    ++  ++ W  +R  I+ YVQ C  CQ +K    
Subjt:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ

Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
         P G L+P+P +E PWES++MDFI ALP+S    ++ VVVD FSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++      
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK
          + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  K
Subjt:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK

Query:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH
        E     +I          KMKK+ D+  +  +E+  G+  ++++               + GPF +L++ G   Y + LP  +K      FHVS L+ + 
Subjt:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH

Query:  QDME
         + E
Subjt:  QDME

P0CT36 Transposon Tf2-3 polyprotein5.9e-10529.98Show/hide
Query:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ
        Y++P+  P        A +  +  F+K+DL+S Y+ +R++ GDE K A     G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S+
Subjt:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ

Query:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS
        +  EHV+H++ V Q L+   L I   K  F Q  V+F+G+   +         I  + +W+ P    ELR FLG VNY R+FI   S +  PL NLLKK 
Subjt:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS

Query:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ
          W+WT +  +A + +K+ +   PV+   D ++   + T+ASD A+G VL +     +  DD + P                     + A + +L  W+ 
Subjt:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ

Query:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL
          +  + P                       + ARWQ FL +F+F++ Y+ G AN +ADALSR   E   I   +   S            F+ Q+    
Subjt:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL

Query:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ
          D   +NLL    +         +  L+    ++ +P    L R IIK+ H+     HPG++    ++  ++ W  +R  I+ YVQ C  CQ +K    
Subjt:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ

Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
         P G L+P+P +E PWES++MDFI ALP+S    ++ VVVD FSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++      
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK
          + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  K
Subjt:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK

Query:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH
        E     +I          KMKK+ D+  +  +E+  G+  ++++               + GPF +L++ G   Y + LP  +K      FHVS L+ + 
Subjt:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH

Query:  QDME
         + E
Subjt:  QDME

P0CT41 Transposon Tf2-12 polyprotein5.9e-10529.98Show/hide
Query:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ
        Y++P+  P        A +  +  F+K+DL+S Y+ +R++ GDE K A     G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S+
Subjt:  YIQPSKAPFG------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQ

Query:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS
        +  EHV+H++ V Q L+   L I   K  F Q  V+F+G+   +         I  + +W+ P    ELR FLG VNY R+FI   S +  PL NLLKK 
Subjt:  TMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKS

Query:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ
          W+WT +  +A + +K+ +   PV+   D ++   + T+ASD A+G VL +     +  DD + P                     + A + +L  W+ 
Subjt:  QTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIYRPRERDDCHCP---------------------LPADMAALSSWKQ

Query:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL
          +  + P                       + ARWQ FL +F+F++ Y+ G AN +ADALSR   E   I   +   S            F+ Q+    
Subjt:  -TQKKLSP-----------------------KQARWQDFLAEFDFKMEYKLGKANVVADALSRKA-EFAAIAASTPRCS------------FEEQIKEGL

Query:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ
          D   +NLL    +         +  L+    ++ +P    L R IIK+ H+     HPG++    ++  ++ W  +R  I+ YVQ C  CQ +K    
Subjt:  KQDKLAINLLKLAREGKTRRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQ

Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
         P G L+P+P +E PWES++MDFI ALP+S    ++ VVVD FSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++      
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK
          + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  K
Subjt:  TELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAK

Query:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH
        E     +I          KMKK+ D+  +  +E+  G+  ++++               + GPF +L++ G   Y + LP  +K      FHVS L+ + 
Subjt:  EWKEKQDIARACLHKAAKKMKKWADLN-RRPKEYNEGEQGLIRK---------------YEGPFPILERVGKTAYRVQLPARLK--IHDVFHVSMLKPFH

Query:  QDME
         + E
Subjt:  QDME

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.0e-10629.94Show/hide
Query:  GKDECFIASVRELQEETGETKTKETLPSCITKVLAEYQDVMPTELPKKLP--SRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQP
        GK    +++++ ++    +   K+T  +    +  +Y++++  +LP +    +   V H IE++PGA+ P + PY +     +E++K ++ LL+  +I P
Subjt:  GKDECFIASVRELQEETGETKTKETLPSCITKVLAEYQDVMPTELPKKLP--SRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQP

Query:  SKAPFG----------------------------------------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPA
        SK+P                                          + +G A+ F+ +DL SGY+Q+ ++  D  KTA VT  G YE+ VMPFGL NAP+
Subjt:  SKAPFG----------------------------------------ASLGKAKYFSKIDLRSGYYQVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPA

Query:  TFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELR
        TF   M   F     +FV VYLDDI+++S++ EEH +HL  V + L+   L +K +K  FA    EFLG+     K+     K  AI+++  P  V + +
Subjt:  TFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDGKLMMDPTKIKAIQEWRAPTKVPELR

Query:  SFLGFVNYYRRFIKGYSAIVAPLTNLL-KKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVL-------------------
         FLG +NYYRRFI   S I  P+   +  KSQ   WT    +A ++LK A+   PV++  ++   + + T+AS   IG VL                   
Subjt:  SFLGFVNYYRRFIKGYSAIVAPLTNLL-KKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVL-------------------

Query:  ------------------IEAIYRPRER-DDCHCPLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPR
                          I+A++  R      H  L  D  +L S  Q + + + +  RW D LA +DF +EY  G  NVVADA+SR      I   T R
Subjt:  ------------------IEAIYRPRER-DDCHCPLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPR

Query:  CSFEEQIKEGLKQDKLA----INLLKLAREGKT--------------------RRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHD-SMWAGHPGMDR
            E  K   K D L     I++ +L +   T                    R+ +   D ++    RL +P     +  +++  HD +++ GH G+  
Subjt:  CSFEEQIKEGLKQDKLA----INLLKLAREGKT--------------------RRFWESNDTLLTLGRRLFIPKWGNLRREIIKECHD-SMWAGHPGMDR

Query:  TIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVAL-PKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAAR
        T+A +   YYWPK++ +I  Y++TC+ CQ  K  +    GLL+PLP+AE  W  ++MDF+  L P S +   I+VVVD FSK A FI     +   +   
Subjt:  TIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVAL-PKSRSCGSIMVVVDWFSKYATFIPCSANVKVDEAAR

Query:  LFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPF
        L  + I    G PR+I SDRD R T   ++EL K +G +   S+  HPQ+DGQ+ER    L + LR YVS + ++W   L   +F YN   + + GKSPF
Subjt:  LFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLLDIAQFSYNLQRSESTGKSPF

Query:  EIVNWRQPNTPKDLTSD-YSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGEQGLIRK---------------YEGPFPILERV
        EI     PNTP   + D  +  + +A ++AK  K      +  L  A  +M+   +  R+P   N G+  L+ +               Y GPF +++++
Subjt:  EIVNWRQPNTPKDLTSD-YSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGEQGLIRK---------------YEGPFPILERV

Query:  GKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDK
           AY + L +  K H V +V  LK  +  ++   + I+  AP+    E+ K
Subjt:  GKTAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.7e-2242.42Show/hide
Query:  VEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLG--HWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTW
        + HL  V Q+  +++ Y   +K +F QP + +LG  H      +  DP K++A+  W  P    ELR FLG   YYRRF+K Y  IV PLT LLKK+ + 
Subjt:  VEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLG--HWAKDGKLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTW

Query:  EWTTSCQRAFKELKEAVQKEPVMMLPDHTRPF
        +WT     AFK LK AV   PV+ LPD   PF
Subjt:  EWTTSCQRAFKELKEAVQKEPVMMLPDHTRPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTGCTACCCGAGGCGGTCACAACGCGTGGGCGAGCCGAGAAACGCGCCCAGGCCAAACGTGTTAAGTCGAAGAGCCCCGCCGCCCGCAAGGTTTTGGAGACCCG
TCTAACTGCCCTTGAGGAGACTGTAGGCGACCTGCAGGCCTCCATTGGCCGACTAAGCGGCGACTATGAAGAGTTGGCACAAGAGAATGCCGAAGTGACCAAGGTCACCA
AGGAGCTCATCGAAGATCTCGGACAAACCTTCAGGGCCGAGGTAGCGGATCTTCGAAAATTTGTAGAAGAAGAATTAAATGGGCTCCGCAAGGAGGTGAGCGACATCAAG
GAGCAATGCCTCGCTCGCCACCAAGCTAGTAGTCCGGCTTCTACTAGCTCGACTTCCGCCGCGCAAGGTACGTTTAACCTCAAAGTTCCCAAGCCCGATACTTTCAGTGG
AGTTCGGAATGCTACCATTGTGGATAACTTCTTATTTGGGCTTGAGCGATACTTTGATGCTCTTGGGGTGCATGATGATGCGGCACGCATAAACAATGCCCCAACGTACC
TAAGAGACTCCGCTCAGCTGTGGTGGAAACACAAGCATGCTGAGCGAGAGAAAGACCAATGTAGCATCCACACGTGGGAACAATTTAAAAGTGAGCTTCGAAAACATTTT
GTCCCTCACAATGCCGAAGAAGAGGCCAAGGGAAAACTACGAAGGTTGAGGCAAACCGGTGGAATTCTTGACTATGTCAAGGAGTTCACAACCCTTATGCTGGAAGTTGA
TAGCTTGCCCGAAAAGGATGCCCTCTTCTACTTCAAGGATGGACTGAAAGACTGGGCACGCGTTGAACTTGAACGCCGCAACGTCCAGACACTCGACGATGCGATCGCCG
CAGTGGAGGGCTTGACGGATTACTCAACGCAGAAGGGAAAGAAGTCCGATTCCACGAAGGACAAGAACAAATCTTCCCAAGATAAACGCGGGAAAGGGAATAAAGGTGAG
TCCTCCAAACCTCCTAAACCATGCTTTATATGTGATGGGCCTCATTGGACGAGAGAATGCCCAAATAAAAAAGCCATGAATGCTCTCGTCGCCGAGATGCGTGAGGCAGA
AGCAAATGAAGAATCCGAAGCCAAGGTCGGATCGATCCAACAGCTTAATGCCCTTGCCGACCTCTTTTCTCCCCGCAAGGTTGAGAACAAAGGCCTCATGTACGTAGATG
CTACAATCAAAGGCCAACCTGCCTCCGCTATGCTAGACACGGGAGCCACGCACAATTTCATGGATGAAAGGAAGGCTACCCGTCTTGGCCTCAACATCACCCACGGGCGA
GGAACCATTAAGATAGTTAATAGTGACGCCAAGCCTGCTGCGGGCATAGCCAAGGACGTACCCATCAAGATTGGGCATTGGCAAGGTACGCTTGATTTCACTATAGCCCC
TATGGATGATTTCAAGATGGTTCTCGGGCTTACTTTCTTCAAGAAGGCACTAGCCTTCCCTGTCCCCGCATACAACTCCTTAGTAATACTCGATGATCGAAAAATAAGGG
TCATCCCGGTAAGGCACGGGGAGAAAGTAGAAGGCCCCGTCTTATCAGCCTTACAATTCAAAAGAAGGATGGGGAAGGACGAGTGCTTCATTGCATCTGTTCGAGAACTA
CAAGAGGAAACAGGGGAGACAAAAACTAAAGAGACCTTGCCAAGCTGCATAACTAAAGTCCTGGCCGAGTACCAAGACGTTATGCCCACGGAGCTCCCAAAGAAACTTCC
ATCCAGGCAGGAAGTAGACCATGCGATAGAGCTGGAACCCGGTGCTAAGCCTCCAGCAATGGCGCCCTATCGCATTGCGCCACCCGAACTCGAGGAGCTGCACAAGCAAT
TGAAGGACTTGCTGGAAGCCGGTTACATACAGCCCTCGAAGGCCCCATTTGGCGCCTCGCTTGGAAAGGCCAAGTACTTTAGCAAGATCGACTTACGCTCGGGATACTAT
CAAGTGCGGATCAAGGCAGGAGATGAGCCCAAAACAGCTTGTGTGACACGGTATGGGGCATATGAGTTCCTTGTAATGCCCTTTGGCTTGACCAATGCCCCCGCAACCTT
TTGCACCTTGATGAACAAGCTGTTCCATCCGTACTTAGACAAGTTCGTGGTCGTGTATCTGGACGACATTGTGGTGTATAGCCAGACAATGGAAGAACACGTTGAGCACC
TCCGCAAGGTCTTCCAGGTTTTACGAGAAAACGAACTATATATCAAACTAGAAAAGTGGTCGTTTGCTCAACCGGGGGTAGAGTTCCTCGGACATTGGGCCAAGGATGGC
AAGCTAATGATGGATCCCACAAAGATCAAGGCCATCCAAGAGTGGAGAGCACCCACAAAGGTCCCTGAGCTAAGGTCCTTTCTAGGCTTTGTCAACTACTACCGACGGTT
TATCAAGGGATATTCAGCAATCGTTGCCCCTCTCACCAATCTATTAAAGAAAAGCCAAACGTGGGAGTGGACGACAAGCTGTCAGCGAGCCTTTAAAGAATTGAAGGAGG
CAGTACAAAAGGAGCCTGTTATGATGCTTCCAGATCATACCAGACCCTTTGAGATACACACTAATGCCTCCGACTACGCGATTGGAGGAGTACTGATAGAGGCGATATAC
CGTCCAAGAGAAAGAGATGACTGCCATTGTCCATTGCCTGCGGACATGGCGGCACTATCTTCTTGGAAGCAAACGCAAAAAAAACTGTCCCCGAAGCAGGCTCGGTGGCA
GGACTTCCTGGCTGAGTTCGATTTCAAGATGGAGTACAAGCTAGGAAAGGCAAACGTTGTTGCTGACGCTCTCAGCCGCAAGGCTGAGTTCGCAGCCATTGCCGCCAGCA
CGCCCCGATGCTCGTTCGAAGAACAAATCAAAGAGGGGCTAAAACAGGATAAACTAGCCATTAACCTATTAAAATTGGCTAGAGAGGGCAAAACCCGACGCTTCTGGGAG
AGCAACGACACTCTCCTCACTCTAGGAAGAAGACTCTTCATACCTAAGTGGGGAAACCTAAGGAGAGAAATTATAAAAGAGTGCCATGACTCTATGTGGGCAGGACACCC
CGGGATGGACCGCACAATAGCTCTTGTCCAAGACAAGTACTACTGGCCAAAGATGAGGGACAACATCGAAGCTTATGTCCAGACCTGTCTCATTTGCCAACAAGACAAAG
GGGAGCAACAACTGCCTGCTGGTCTCCTAGAACCACTGCCTGTGGCGGAGGAGCCTTGGGAGAGTGTCACTATGGATTTCATCGTGGCACTACCAAAATCCAGAAGCTGC
GGCAGCATAATGGTGGTGGTAGACTGGTTCAGCAAGTATGCAACCTTCATACCTTGCTCTGCAAATGTTAAGGTAGACGAAGCTGCTCGGTTGTTCTTAAAAAATATTGT
GAAACTGTGGGGGATTCCGCGAAGCATCATCAGTGACCGAGACCCACGGTTCACAGGCAAGTTCTGGAGGGAGCTATTCAAGTTGATGGGGACCGAGTTGAACTTCTCAA
CCAACTTTCATCCCCAAAGTGATGGACAAACCGAAAGGGTGAATGCCCTCTTAGAGCAATATTTGAGGCACTATGTTAGTGCTCATCAGAAGGATTGGGCAGACCTACTG
GACATTGCCCAATTCTCTTACAATCTCCAAAGGAGCGAATCGACGGGGAAAAGTCCTTTCGAGATTGTCAATTGGCGGCAACCTAATACGCCCAAGGACCTGACCTCTGA
CTACTCTGGACCAAACCCTTCGGCATATAAGATGGCCAAGGAGTGGAAAGAAAAACAAGACATTGCCAGGGCATGCCTCCACAAAGCTGCCAAGAAAATGAAGAAATGGG
CTGATCTAAATAGGCGCCCCAAGGAGTACAACGAAGGGGAACAGGGGTTAATCAGAAAATACGAAGGCCCATTTCCCATCCTCGAAAGAGTTGGAAAGACAGCTTATCGA
GTACAGTTGCCCGCAAGACTTAAAATCCACGACGTCTTCCATGTGAGCATGCTAAAACCATTCCATCAGGACATGGAAGACCCCGGACGAGCCATCTCGAATAGGGCCCC
AATGGGTGTGACAACTGAGTATGACAAGAATGTTGAAGCCATTCTGGCCCACAGAAGGGTAAGTAGTAAGGGTGTCCCAAGTCACTTAGAGTATCTTGTGAAATGGCAAG
GACTCCCAGACTCGGAGGCAAGCTGGGAGAAAGAAGGACACCTATGGCAATTCGCCGACAAGATCTCAGACTATTGGCAGGAGCATGGCCAAGCAAGGATGGGGAAAATG
CTTGCCGCAGCCCCTTGGGAGGCGCATGCGTCTGTCGCACCCCACCGGCACGTATGGCCCAGGGAGGATGGACGGGAGAAAGATGCGTCGACCCCAGGCACTGGTATGGT
CGACGCATTGCTAGACAAGGCTGCCAGGCGCAAGGCTGCTCGACGCATAGTACCGCAAGCCATACCCGAGAAGCGCATAGGCCTAGACACGTCTGGAAACAAGGAGACAG
GCGGCGAAGGATCCAGAAGGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTGCTACCCGAGGCGGTCACAACGCGTGGGCGAGCCGAGAAACGCGCCCAGGCCAAACGTGTTAAGTCGAAGAGCCCCGCCGCCCGCAAGGTTTTGGAGACCCG
TCTAACTGCCCTTGAGGAGACTGTAGGCGACCTGCAGGCCTCCATTGGCCGACTAAGCGGCGACTATGAAGAGTTGGCACAAGAGAATGCCGAAGTGACCAAGGTCACCA
AGGAGCTCATCGAAGATCTCGGACAAACCTTCAGGGCCGAGGTAGCGGATCTTCGAAAATTTGTAGAAGAAGAATTAAATGGGCTCCGCAAGGAGGTGAGCGACATCAAG
GAGCAATGCCTCGCTCGCCACCAAGCTAGTAGTCCGGCTTCTACTAGCTCGACTTCCGCCGCGCAAGGTACGTTTAACCTCAAAGTTCCCAAGCCCGATACTTTCAGTGG
AGTTCGGAATGCTACCATTGTGGATAACTTCTTATTTGGGCTTGAGCGATACTTTGATGCTCTTGGGGTGCATGATGATGCGGCACGCATAAACAATGCCCCAACGTACC
TAAGAGACTCCGCTCAGCTGTGGTGGAAACACAAGCATGCTGAGCGAGAGAAAGACCAATGTAGCATCCACACGTGGGAACAATTTAAAAGTGAGCTTCGAAAACATTTT
GTCCCTCACAATGCCGAAGAAGAGGCCAAGGGAAAACTACGAAGGTTGAGGCAAACCGGTGGAATTCTTGACTATGTCAAGGAGTTCACAACCCTTATGCTGGAAGTTGA
TAGCTTGCCCGAAAAGGATGCCCTCTTCTACTTCAAGGATGGACTGAAAGACTGGGCACGCGTTGAACTTGAACGCCGCAACGTCCAGACACTCGACGATGCGATCGCCG
CAGTGGAGGGCTTGACGGATTACTCAACGCAGAAGGGAAAGAAGTCCGATTCCACGAAGGACAAGAACAAATCTTCCCAAGATAAACGCGGGAAAGGGAATAAAGGTGAG
TCCTCCAAACCTCCTAAACCATGCTTTATATGTGATGGGCCTCATTGGACGAGAGAATGCCCAAATAAAAAAGCCATGAATGCTCTCGTCGCCGAGATGCGTGAGGCAGA
AGCAAATGAAGAATCCGAAGCCAAGGTCGGATCGATCCAACAGCTTAATGCCCTTGCCGACCTCTTTTCTCCCCGCAAGGTTGAGAACAAAGGCCTCATGTACGTAGATG
CTACAATCAAAGGCCAACCTGCCTCCGCTATGCTAGACACGGGAGCCACGCACAATTTCATGGATGAAAGGAAGGCTACCCGTCTTGGCCTCAACATCACCCACGGGCGA
GGAACCATTAAGATAGTTAATAGTGACGCCAAGCCTGCTGCGGGCATAGCCAAGGACGTACCCATCAAGATTGGGCATTGGCAAGGTACGCTTGATTTCACTATAGCCCC
TATGGATGATTTCAAGATGGTTCTCGGGCTTACTTTCTTCAAGAAGGCACTAGCCTTCCCTGTCCCCGCATACAACTCCTTAGTAATACTCGATGATCGAAAAATAAGGG
TCATCCCGGTAAGGCACGGGGAGAAAGTAGAAGGCCCCGTCTTATCAGCCTTACAATTCAAAAGAAGGATGGGGAAGGACGAGTGCTTCATTGCATCTGTTCGAGAACTA
CAAGAGGAAACAGGGGAGACAAAAACTAAAGAGACCTTGCCAAGCTGCATAACTAAAGTCCTGGCCGAGTACCAAGACGTTATGCCCACGGAGCTCCCAAAGAAACTTCC
ATCCAGGCAGGAAGTAGACCATGCGATAGAGCTGGAACCCGGTGCTAAGCCTCCAGCAATGGCGCCCTATCGCATTGCGCCACCCGAACTCGAGGAGCTGCACAAGCAAT
TGAAGGACTTGCTGGAAGCCGGTTACATACAGCCCTCGAAGGCCCCATTTGGCGCCTCGCTTGGAAAGGCCAAGTACTTTAGCAAGATCGACTTACGCTCGGGATACTAT
CAAGTGCGGATCAAGGCAGGAGATGAGCCCAAAACAGCTTGTGTGACACGGTATGGGGCATATGAGTTCCTTGTAATGCCCTTTGGCTTGACCAATGCCCCCGCAACCTT
TTGCACCTTGATGAACAAGCTGTTCCATCCGTACTTAGACAAGTTCGTGGTCGTGTATCTGGACGACATTGTGGTGTATAGCCAGACAATGGAAGAACACGTTGAGCACC
TCCGCAAGGTCTTCCAGGTTTTACGAGAAAACGAACTATATATCAAACTAGAAAAGTGGTCGTTTGCTCAACCGGGGGTAGAGTTCCTCGGACATTGGGCCAAGGATGGC
AAGCTAATGATGGATCCCACAAAGATCAAGGCCATCCAAGAGTGGAGAGCACCCACAAAGGTCCCTGAGCTAAGGTCCTTTCTAGGCTTTGTCAACTACTACCGACGGTT
TATCAAGGGATATTCAGCAATCGTTGCCCCTCTCACCAATCTATTAAAGAAAAGCCAAACGTGGGAGTGGACGACAAGCTGTCAGCGAGCCTTTAAAGAATTGAAGGAGG
CAGTACAAAAGGAGCCTGTTATGATGCTTCCAGATCATACCAGACCCTTTGAGATACACACTAATGCCTCCGACTACGCGATTGGAGGAGTACTGATAGAGGCGATATAC
CGTCCAAGAGAAAGAGATGACTGCCATTGTCCATTGCCTGCGGACATGGCGGCACTATCTTCTTGGAAGCAAACGCAAAAAAAACTGTCCCCGAAGCAGGCTCGGTGGCA
GGACTTCCTGGCTGAGTTCGATTTCAAGATGGAGTACAAGCTAGGAAAGGCAAACGTTGTTGCTGACGCTCTCAGCCGCAAGGCTGAGTTCGCAGCCATTGCCGCCAGCA
CGCCCCGATGCTCGTTCGAAGAACAAATCAAAGAGGGGCTAAAACAGGATAAACTAGCCATTAACCTATTAAAATTGGCTAGAGAGGGCAAAACCCGACGCTTCTGGGAG
AGCAACGACACTCTCCTCACTCTAGGAAGAAGACTCTTCATACCTAAGTGGGGAAACCTAAGGAGAGAAATTATAAAAGAGTGCCATGACTCTATGTGGGCAGGACACCC
CGGGATGGACCGCACAATAGCTCTTGTCCAAGACAAGTACTACTGGCCAAAGATGAGGGACAACATCGAAGCTTATGTCCAGACCTGTCTCATTTGCCAACAAGACAAAG
GGGAGCAACAACTGCCTGCTGGTCTCCTAGAACCACTGCCTGTGGCGGAGGAGCCTTGGGAGAGTGTCACTATGGATTTCATCGTGGCACTACCAAAATCCAGAAGCTGC
GGCAGCATAATGGTGGTGGTAGACTGGTTCAGCAAGTATGCAACCTTCATACCTTGCTCTGCAAATGTTAAGGTAGACGAAGCTGCTCGGTTGTTCTTAAAAAATATTGT
GAAACTGTGGGGGATTCCGCGAAGCATCATCAGTGACCGAGACCCACGGTTCACAGGCAAGTTCTGGAGGGAGCTATTCAAGTTGATGGGGACCGAGTTGAACTTCTCAA
CCAACTTTCATCCCCAAAGTGATGGACAAACCGAAAGGGTGAATGCCCTCTTAGAGCAATATTTGAGGCACTATGTTAGTGCTCATCAGAAGGATTGGGCAGACCTACTG
GACATTGCCCAATTCTCTTACAATCTCCAAAGGAGCGAATCGACGGGGAAAAGTCCTTTCGAGATTGTCAATTGGCGGCAACCTAATACGCCCAAGGACCTGACCTCTGA
CTACTCTGGACCAAACCCTTCGGCATATAAGATGGCCAAGGAGTGGAAAGAAAAACAAGACATTGCCAGGGCATGCCTCCACAAAGCTGCCAAGAAAATGAAGAAATGGG
CTGATCTAAATAGGCGCCCCAAGGAGTACAACGAAGGGGAACAGGGGTTAATCAGAAAATACGAAGGCCCATTTCCCATCCTCGAAAGAGTTGGAAAGACAGCTTATCGA
GTACAGTTGCCCGCAAGACTTAAAATCCACGACGTCTTCCATGTGAGCATGCTAAAACCATTCCATCAGGACATGGAAGACCCCGGACGAGCCATCTCGAATAGGGCCCC
AATGGGTGTGACAACTGAGTATGACAAGAATGTTGAAGCCATTCTGGCCCACAGAAGGGTAAGTAGTAAGGGTGTCCCAAGTCACTTAGAGTATCTTGTGAAATGGCAAG
GACTCCCAGACTCGGAGGCAAGCTGGGAGAAAGAAGGACACCTATGGCAATTCGCCGACAAGATCTCAGACTATTGGCAGGAGCATGGCCAAGCAAGGATGGGGAAAATG
CTTGCCGCAGCCCCTTGGGAGGCGCATGCGTCTGTCGCACCCCACCGGCACGTATGGCCCAGGGAGGATGGACGGGAGAAAGATGCGTCGACCCCAGGCACTGGTATGGT
CGACGCATTGCTAGACAAGGCTGCCAGGCGCAAGGCTGCTCGACGCATAGTACCGCAAGCCATACCCGAGAAGCGCATAGGCCTAGACACGTCTGGAAACAAGGAGACAG
GCGGCGAAGGATCCAGAAGGCTCTAG
Protein sequenceShow/hide protein sequence
MELLPEAVTTRGRAEKRAQAKRVKSKSPAARKVLETRLTALEETVGDLQASIGRLSGDYEELAQENAEVTKVTKELIEDLGQTFRAEVADLRKFVEEELNGLRKEVSDIK
EQCLARHQASSPASTSSTSAAQGTFNLKVPKPDTFSGVRNATIVDNFLFGLERYFDALGVHDDAARINNAPTYLRDSAQLWWKHKHAEREKDQCSIHTWEQFKSELRKHF
VPHNAEEEAKGKLRRLRQTGGILDYVKEFTTLMLEVDSLPEKDALFYFKDGLKDWARVELERRNVQTLDDAIAAVEGLTDYSTQKGKKSDSTKDKNKSSQDKRGKGNKGE
SSKPPKPCFICDGPHWTRECPNKKAMNALVAEMREAEANEESEAKVGSIQQLNALADLFSPRKVENKGLMYVDATIKGQPASAMLDTGATHNFMDERKATRLGLNITHGR
GTIKIVNSDAKPAAGIAKDVPIKIGHWQGTLDFTIAPMDDFKMVLGLTFFKKALAFPVPAYNSLVILDDRKIRVIPVRHGEKVEGPVLSALQFKRRMGKDECFIASVREL
QEETGETKTKETLPSCITKVLAEYQDVMPTELPKKLPSRQEVDHAIELEPGAKPPAMAPYRIAPPELEELHKQLKDLLEAGYIQPSKAPFGASLGKAKYFSKIDLRSGYY
QVRIKAGDEPKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFHPYLDKFVVVYLDDIVVYSQTMEEHVEHLRKVFQVLRENELYIKLEKWSFAQPGVEFLGHWAKDG
KLMMDPTKIKAIQEWRAPTKVPELRSFLGFVNYYRRFIKGYSAIVAPLTNLLKKSQTWEWTTSCQRAFKELKEAVQKEPVMMLPDHTRPFEIHTNASDYAIGGVLIEAIY
RPRERDDCHCPLPADMAALSSWKQTQKKLSPKQARWQDFLAEFDFKMEYKLGKANVVADALSRKAEFAAIAASTPRCSFEEQIKEGLKQDKLAINLLKLAREGKTRRFWE
SNDTLLTLGRRLFIPKWGNLRREIIKECHDSMWAGHPGMDRTIALVQDKYYWPKMRDNIEAYVQTCLICQQDKGEQQLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSC
GSIMVVVDWFSKYATFIPCSANVKVDEAARLFLKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTNFHPQSDGQTERVNALLEQYLRHYVSAHQKDWADLL
DIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQDIARACLHKAAKKMKKWADLNRRPKEYNEGEQGLIRKYEGPFPILERVGKTAYR
VQLPARLKIHDVFHVSMLKPFHQDMEDPGRAISNRAPMGVTTEYDKNVEAILAHRRVSSKGVPSHLEYLVKWQGLPDSEASWEKEGHLWQFADKISDYWQEHGQARMGKM
LAAAPWEAHASVAPHRHVWPREDGREKDASTPGTGMVDALLDKAARRKAARRIVPQAIPEKRIGLDTSGNKETGGEGSRRL