; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G18010 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G18010
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr02:30554515..30560026
RNA-Seq ExpressionClc02G18010
SyntenyClc02G18010
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN60947.1 hypothetical protein VITISV_015758 [Vitis vinifera]9.8e-25343.34Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRHHLHEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGRN-SYYFHGGHSN-
        IT+ DMLEKTF+TFHASN+LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  G GR  +  +HG +SN 
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGRN-SYYFHGGHSN-

Query:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQ------
                H      T      GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F        
Subjt:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQ------

Query:  ---------------------SNMTNL------------------------------------DV-----------------------------------
                             S  TNL                                    D+                                   
Subjt:  ---------------------SNMTNL------------------------------------DV-----------------------------------

Query:  -----------------------------------------------------ADFFESPEEK-------------------------------------
                                                             A F + P +                                      
Subjt:  -----------------------------------------------------ADFFESPEEK-------------------------------------

Query:  ---------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------
                                   GHAI+H A+L RIRP  YH+YSP QL  G++ NI HLRIFGCAVYVPIAP Q TKMGPQRR G          
Subjt:  ---------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------

Query:  ----------------------------------IKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKI
                                          I +  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ NQLPDAFID KK+TKSHIP AN  ++I
Subjt:  ----------------------------------IKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKI

Query:  DIPTQQVVINESGTRQKR----------------------------------------------------------------------------------
        D+P  Q+  NES  R KR                                                                                  
Subjt:  DIPTQQVVINESGTRQKR----------------------------------------------------------------------------------

Query:  DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----------------------------------------------
        DPEP++VEE RHRNDW +WKE I+AELNSL+KR+VFGPVV+TPE VKPVGYKW                                               
Subjt:  DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----------------------------------------------

Query:  -----------------------------------------------------------------------------------------------KSQSG
                                                                                                       KS++ 
Subjt:  -----------------------------------------------------------------------------------------------KSQSG

Query:  FAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNE
        FAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+ PM VRSLDVKKD FRP   +E
Subjt:  FAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNE

Query:  ELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSG
        ELLGPEVPYL+AIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNGIKHILRYLRGT D+GLFYS +S   L+GYAD+GYLSD HK RSQTGY+F   
Subjt:  ELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSG

Query:  GTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        GTAISWRSVKQT+ ATSSNH+EILAIHEASREC+WLRSM QHIRE+CGLSS K
Subjt:  GTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

CAN66208.1 hypothetical protein VITISV_035070 [Vitis vinifera]2.0e-24240.18Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRH+LHEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR---NSYYFHGGHS
        IT+ DMLEKTF+TFHASN+LLQQQYRE  F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  GRGR    +  +HG +S
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR---NSYYFHGGHS

Query:  N---------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQDN--------DIFDQSNMT
        N         H      T      GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF + D         D F     T
Subjt:  N---------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQDN--------DIFDQSNMT

Query:  NLDVAD---FFE----------------------------------------------------------------------------------------
        +  + D   F E                                                                                        
Subjt:  NLDVAD---FFE----------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------SPEEK------------------------------
                                                                         SP E                               
Subjt:  -----------------------------------------------------------------SPEEK------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------I
                                  GHAI+HAA+L RIRP  YH+YSP QL  G++ NISHLRIFGCAVYVPIAP QRTKMGPQRR G          I
Subjt:  --------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------I

Query:  KKLE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKID
        + LE                                  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID KK+TKSHIP AN  ++ID
Subjt:  KKLE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKID

Query:  IPTQQVVINESGTRQKR-----------------------------------------------------------------------------------
        +P  Q+  NES  R KR                                                                                   
Subjt:  IPTQQVVINESGTRQKR-----------------------------------------------------------------------------------

Query:  ----------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-------------------------------
                        DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPVV+TPE VKPVGYKW                               
Subjt:  ----------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVR
                   KS++GFAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+ PM VR
Subjt:  -----------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVR

Query:  SLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLS
        SLDVKKD FRP   +EELLGPEVPYLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNGIKHILRYLRGT D+ LFYS +S   L+GYAD+GYLS
Subjt:  SLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLS

Query:  DLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        D HK RSQTGY+F   GTAISWRSVKQT+ ATSSNH EILAIHEASREC+WLRSM QHIRE+CGLSS K
Subjt:  DLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

CAN77974.1 hypothetical protein VITISV_006175 [Vitis vinifera]1.7e-24139.81Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRHHLHEGLK EY T+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR-NSYYFHGGHSN-
        IT+ DMLEKTF+TFHASN+LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  GRGR  +  +HG +SN 
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR-NSYYFHGGHSN-

Query:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNL
                H      T      GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F     T+ 
Subjt:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNL

Query:  DVAD---FFE------------------------------------------------------------------------------------------
         + D   F E                                                                                          
Subjt:  DVAD---FFE------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SPEEK--------------------------------
                                                                       SP E                                 
Subjt:  ---------------------------------------------------------------SPEEK--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKK
                                GHA +HAA+L R+RP  YH+YSP QL  G++ NISHLRIFGCAVYVPIAP QRTKMGPQRR G          I+ 
Subjt:  ------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKK

Query:  LE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIP
        LE                                  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID KK+TKSHIP AN  ++ID+P
Subjt:  LE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIP

Query:  TQQVVINESGTRQKR-------------------------------------------------------------------------------------
          Q+  NES  R KR                                                                                     
Subjt:  TQQVVINESGTRQKR-------------------------------------------------------------------------------------

Query:  ------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----
                                                  DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPVV+TPE VKPVGYKW     
Subjt:  ------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNI
                        KS++GFAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+ 
Subjt:  ----------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNI

Query:  PMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYAD
        PM VRSLDVKKD FRP   +EELLGPEVPYLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNGIKHILRYLRGT D+GLFYS +S   L+GYAD
Subjt:  PMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYAD

Query:  SGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        +GYLSD HK RSQTGY+F   G AISWRSVKQT+ ATSSNH+EILAIHEASREC+WLRSM QHI E+CGLSS K
Subjt:  SGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

RVW80615.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.7e-24448.89Show/hide
Query:  KAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEKITDADMLEKTFSTFHASNM
        KA+I LR  +HEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEKIT+ DMLEKTF+TFHASN+
Subjt:  KAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEKITDADMLEKTFSTFHASNM

Query:  LLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGR----GRNSYYFHGGHSN---------HPNFKRTT
        LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPT   PFPEVNA++   RGRGRGR  GR    GRNS Y HG +SN         H      T
Subjt:  LLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGR----GRNSYYFHGGHSN---------HPNFKRTT

Query:  RNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNLDVAD---FFE------
              GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F     T+  + D   F E      
Subjt:  RNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNLDVAD---FFE------

Query:  ------------------------------------------SPEEK--------------------------------------------------GHA
                                                  SP E                                                   GHA
Subjt:  ------------------------------------------SPEEK--------------------------------------------------GHA

Query:  ILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKKLE-------------------------
        I+HAA+L RIRP  YH+YSP QL  G++ NISHLRIFGCAVYVPIAP QRTKMGPQRR G          I+ LE                         
Subjt:  ILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKKLE-------------------------

Query:  ---------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQ------------------------
                 +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID KK+TKSHIP AN  ++ID+P  Q                        
Subjt:  ---------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQ------------------------

Query:  -------------------------VVINESGTRQK----------------------------------------------------------------
                                 + + E+   QK                                                                
Subjt:  -------------------------VVINESGTRQK----------------------------------------------------------------

Query:  -------------RDPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW----KSQSGFAII--------SVYVDDLNIIGT
                      DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPVV+T E VKPVGYKW    K      II         VYVDDLN++GT
Subjt:  -------------RDPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW----KSQSGFAII--------SVYVDDLNIIGT

Query:  PEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGAL
        PEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +KILKRFYMDKAH L+ PM VRSLDVKKD FRP   +EELLGPEVPYLSA+GAL
Subjt:  PEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGAL

Query:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITAT
        MYLAN TRPDIAFSVNLLARYSS+PT+RHWN                               D+GYLSD HK RSQTGY+F   GTAISWRSVKQT+ AT
Subjt:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITAT

Query:  SSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        SSNH+EILAIHEASREC+WLRSM QHIRE+CGLSS K
Subjt:  SSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

RVW94544.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.6e-23442.59Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRHHLHEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR-NSYYFHGGHSNH
        IT+ DMLEKTF+TFHASN+LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  GRGR  +  +HG +SN+
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR-NSYYFHGGHSNH

Query:  P-------NFKRTTRND--DHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNL
                + ++   N+     GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F     T+ 
Subjt:  P-------NFKRTTRND--DHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNL

Query:  DVAD---FFESP----------------EEKGHAIL---------------------HAASLARIRPVAYH---------KYSPL-QLAYGQELNISHLR
         + D   F E                  E  G A +                     +  S   IR   YH         +Y  +  +  GQ+L +  L 
Subjt:  DVAD---FFESP----------------EEKGHAIL---------------------HAASLARIRPVAYH---------KYSPL-QLAYGQELNISHLR

Query:  IFGCAVYVPIAPP---------------------------------------------------------------------------------------
         F   +Y  I  P                                                                                       
Subjt:  IFGCAVYVPIAPP---------------------------------------------------------------------------------------

Query:  --------QRTKMGPQ------------------------------RRGGIKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAK
                 +TK+                                 R   I +  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID K
Subjt:  --------QRTKMGPQ------------------------------RRGGIKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAK

Query:  KLTKSHIPVANVSSKIDIPTQQVVINESGTRQKR------------------------------------------------------------------
        K+TKSHIP AN  ++ID+P  Q+  NES  R KR                                                                  
Subjt:  KLTKSHIPVANVSSKIDIPTQQVVINESGTRQKR------------------------------------------------------------------

Query:  -------------------------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPV
                                                                     DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPV
Subjt:  -------------------------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPV

Query:  VRTPEAVKPVGYKW--------------------------------------------------------------------------------------
        V+TPE VKPVGYKW                                                                                      
Subjt:  VRTPEAVKPVGYKW--------------------------------------------------------------------------------------

Query:  --------------------------------------------------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKT
                                                                KS++GFAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKT
Subjt:  --------------------------------------------------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKT

Query:  KFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSS
        KFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+  M VRSLDVKKD FRP   +EELLGPEVPYLSAIGALMYLAN TRPDIAFSVNLLARYSS
Subjt:  KFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSS

Query:  SPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSM
        SPT+RHWNGIKHILRYLRGT D+GLFYS +S   L+GYAD+GYLSD HK RSQTGY+F   GTAISWRSVKQT+ ATSSNH+EILAIHEASREC+WLRSM
Subjt:  SPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSM

Query:  TQHIRETCGLSSSK
         QHIRE+CGLSS K
Subjt:  TQHIRETCGLSSSK

TrEMBL top hitse value%identityAlignment
A0A2N9IT14 Uncharacterized protein1.9e-24951.34Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG+TIK+GN  S Q +AKAMIF+RHHLHE LK EYLT+KDP ILW NLK+RY+HQKTVILPKARY+WMHLRLQ+FKSVS+YNS LFKISS+L L GEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNF-NNRG-RGRGRDHGRGRNSYYFHGGHSN
        ITD DMLEKT++TFHASN+LLQQQYRE+ F +YSELISCLLVAEQNNELL+KNH+SRPTG+TPFPEVN  ++ NNRG  GRGR HGR R     +    N
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNF-NNRG-RGRGRDHGRGRNSYYFHGGHSN

Query:  HPNFKRTTRNDDHK----GKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQD--------NDIFDQSNMTNLDVAD
           + R    ++ K    G+  Q K  K  E KC RCGM GHW   C   KHLVDLYQAS+KEKGK +E NFAH          +DI ++ ++T+LDV+D
Subjt:  HPNFKRTTRNDDHK----GKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQD--------NDIFDQSNMTNLDVAD

Query:  FFESPEEKGHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGGIKKLEKEIAWNASLLSYLDPRTNQCELEV
        FFE    K   I H                         +   + R   C     I PP    +G ++   + +  +EI+WN S LS+LDPRTNQ ELEV
Subjt:  FFESPEEKGHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGGIKKLEKEIAWNASLLSYLDPRTNQCELEV

Query:  QKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQVV---INESGTRQKR------------------------------------------
        Q+IIHLQ IANQLPD F D+KK+ KSHIP AN  + I++P  Q++    NES TR KR                                          
Subjt:  QKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQVV---INESGTRQKR------------------------------------------

Query:  --------------------------------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSK
                                                                            + EP++VE+ R RNDW +WKEAI+AELNSL K
Subjt:  --------------------------------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSK

Query:  RQVFGPVVRTPEAVKPVGYKW-------------------------------------------------------------------------------
        R+VFGP+V+ PE V P+GYKW                                                                               
Subjt:  RQVFGPVVRTPEAVKPVGYKW-------------------------------------------------------------------------------

Query:  ------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEV
                    + +SGFAI++VYVDDLN++GTPEEL+K  +YLK EFEMKDLGKTKFCLGLQIEHL DGI +HQSTYT K+LK F+MDKAH L+ PM V
Subjt:  ------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEV

Query:  RSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYL
        RSLDVKKD FRP+   EE LGPEVPYLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT RHWNG+KH+LRYLRGT D+GLFY NKSN  LVGYAD+GYL
Subjt:  RSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYL

Query:  SDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        SD HK RSQTGYLFT G TAISWRSVKQTI+ATSSNH+EI+AIHEASRECVWLRS+ QHIRE CGLSS K
Subjt:  SDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

A0A438H801 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-24448.89Show/hide
Query:  KAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEKITDADMLEKTFSTFHASNM
        KA+I LR  +HEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEKIT+ DMLEKTF+TFHASN+
Subjt:  KAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEKITDADMLEKTFSTFHASNM

Query:  LLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGR----GRNSYYFHGGHSN---------HPNFKRTT
        LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPT   PFPEVNA++   RGRGRGR  GR    GRNS Y HG +SN         H      T
Subjt:  LLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGR----GRNSYYFHGGHSN---------HPNFKRTT

Query:  RNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNLDVAD---FFE------
              GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F     T+  + D   F E      
Subjt:  RNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNLDVAD---FFE------

Query:  ------------------------------------------SPEEK--------------------------------------------------GHA
                                                  SP E                                                   GHA
Subjt:  ------------------------------------------SPEEK--------------------------------------------------GHA

Query:  ILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKKLE-------------------------
        I+HAA+L RIRP  YH+YSP QL  G++ NISHLRIFGCAVYVPIAP QRTKMGPQRR G          I+ LE                         
Subjt:  ILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKKLE-------------------------

Query:  ---------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQ------------------------
                 +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID KK+TKSHIP AN  ++ID+P  Q                        
Subjt:  ---------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQ------------------------

Query:  -------------------------VVINESGTRQK----------------------------------------------------------------
                                 + + E+   QK                                                                
Subjt:  -------------------------VVINESGTRQK----------------------------------------------------------------

Query:  -------------RDPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW----KSQSGFAII--------SVYVDDLNIIGT
                      DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPVV+T E VKPVGYKW    K      II         VYVDDLN++GT
Subjt:  -------------RDPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW----KSQSGFAII--------SVYVDDLNIIGT

Query:  PEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGAL
        PEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +KILKRFYMDKAH L+ PM VRSLDVKKD FRP   +EELLGPEVPYLSA+GAL
Subjt:  PEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGAL

Query:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITAT
        MYLAN TRPDIAFSVNLLARYSS+PT+RHWN                               D+GYLSD HK RSQTGY+F   GTAISWRSVKQT+ AT
Subjt:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITAT

Query:  SSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        SSNH+EILAIHEASREC+WLRSM QHIRE+CGLSS K
Subjt:  SSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

A5AJ43 Uncharacterized protein8.4e-24239.81Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRHHLHEGLK EY T+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR-NSYYFHGGHSN-
        IT+ DMLEKTF+TFHASN+LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  GRGR  +  +HG +SN 
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR-NSYYFHGGHSN-

Query:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNL
                H      T      GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F     T+ 
Subjt:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQSNMTNL

Query:  DVAD---FFE------------------------------------------------------------------------------------------
         + D   F E                                                                                          
Subjt:  DVAD---FFE------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SPEEK--------------------------------
                                                                       SP E                                 
Subjt:  ---------------------------------------------------------------SPEEK--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKK
                                GHA +HAA+L R+RP  YH+YSP QL  G++ NISHLRIFGCAVYVPIAP QRTKMGPQRR G          I+ 
Subjt:  ------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------IKK

Query:  LE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIP
        LE                                  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID KK+TKSHIP AN  ++ID+P
Subjt:  LE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIP

Query:  TQQVVINESGTRQKR-------------------------------------------------------------------------------------
          Q+  NES  R KR                                                                                     
Subjt:  TQQVVINESGTRQKR-------------------------------------------------------------------------------------

Query:  ------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----
                                                  DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPVV+TPE VKPVGYKW     
Subjt:  ------------------------------------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNI
                        KS++GFAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+ 
Subjt:  ----------------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNI

Query:  PMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYAD
        PM VRSLDVKKD FRP   +EELLGPEVPYLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNGIKHILRYLRGT D+GLFYS +S   L+GYAD
Subjt:  PMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYAD

Query:  SGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        +GYLSD HK RSQTGY+F   G AISWRSVKQT+ ATSSNH+EILAIHEASREC+WLRSM QHI E+CGLSS K
Subjt:  SGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

A5B4M9 Uncharacterized protein4.7e-25343.34Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRHHLHEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGRN-SYYFHGGHSN-
        IT+ DMLEKTF+TFHASN+LLQQQYRE+ F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  G GR  +  +HG +SN 
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGRN-SYYFHGGHSN-

Query:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQ------
                H      T      GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF         + D D F        
Subjt:  --------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANF--------AHQDNDIFDQ------

Query:  ---------------------SNMTNL------------------------------------DV-----------------------------------
                             S  TNL                                    D+                                   
Subjt:  ---------------------SNMTNL------------------------------------DV-----------------------------------

Query:  -----------------------------------------------------ADFFESPEEK-------------------------------------
                                                             A F + P +                                      
Subjt:  -----------------------------------------------------ADFFESPEEK-------------------------------------

Query:  ---------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------
                                   GHAI+H A+L RIRP  YH+YSP QL  G++ NI HLRIFGCAVYVPIAP Q TKMGPQRR G          
Subjt:  ---------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------

Query:  ----------------------------------IKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKI
                                          I +  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ NQLPDAFID KK+TKSHIP AN  ++I
Subjt:  ----------------------------------IKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKI

Query:  DIPTQQVVINESGTRQKR----------------------------------------------------------------------------------
        D+P  Q+  NES  R KR                                                                                  
Subjt:  DIPTQQVVINESGTRQKR----------------------------------------------------------------------------------

Query:  DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----------------------------------------------
        DPEP++VEE RHRNDW +WKE I+AELNSL+KR+VFGPVV+TPE VKPVGYKW                                               
Subjt:  DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-----------------------------------------------

Query:  -----------------------------------------------------------------------------------------------KSQSG
                                                                                                       KS++ 
Subjt:  -----------------------------------------------------------------------------------------------KSQSG

Query:  FAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNE
        FAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+ PM VRSLDVKKD FRP   +E
Subjt:  FAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNE

Query:  ELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSG
        ELLGPEVPYL+AIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNGIKHILRYLRGT D+GLFYS +S   L+GYAD+GYLSD HK RSQTGY+F   
Subjt:  ELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSG

Query:  GTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        GTAISWRSVKQT+ ATSSNH+EILAIHEASREC+WLRSM QHIRE+CGLSS K
Subjt:  GTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

A5C0A9 Uncharacterized protein9.9e-24340.18Show/hide
Query:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK
        MNLG TIK+GN  S+Q++AKA+IFLRH+LHEGLK EYLT+KDP  LW NLK+RYDHQKTVILPKARY+WMHLRLQ+FK+VS+YNS LFKISS+L LCGEK
Subjt:  MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR---NSYYFHGGHS
        IT+ DMLEKTF+TFHASN+LLQQQYRE  F +YSELISCLLVAEQNNELLM+NH+SRPTG+ PFPEVNA++   RGRGRGR  GRGR    +  +HG +S
Subjt:  ITDADMLEKTFSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGR---NSYYFHGGHS

Query:  N---------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQDN--------DIFDQSNMT
        N         H      T      GK  Q K  K+ E+ C+RCGM GHWSR C T KHLVDLYQAS+K KGK++E NF + D         D F     T
Subjt:  N---------HPNFKRTTRNDDHKGKAPQGKNSKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQDN--------DIFDQSNMT

Query:  NLDVAD---FFE----------------------------------------------------------------------------------------
        +  + D   F E                                                                                        
Subjt:  NLDVAD---FFE----------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------SPEEK------------------------------
                                                                         SP E                               
Subjt:  -----------------------------------------------------------------SPEEK------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------I
                                  GHAI+HAA+L RIRP  YH+YSP QL  G++ NISHLRIFGCAVYVPIAP QRTKMGPQRR G          I
Subjt:  --------------------------GHAILHAASLARIRPVAYHKYSPLQLAYGQELNISHLRIFGCAVYVPIAPPQRTKMGPQRRGG----------I

Query:  KKLE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKID
        + LE                                  +EI+W  S +++LDPRTNQCELEVQ+IIHLQN+ANQLPDAFID KK+TKSHIP AN  ++ID
Subjt:  KKLE----------------------------------KEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKID

Query:  IPTQQVVINESGTRQKR-----------------------------------------------------------------------------------
        +P  Q+  NES  R KR                                                                                   
Subjt:  IPTQQVVINESGTRQKR-----------------------------------------------------------------------------------

Query:  ----------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-------------------------------
                        DPEP++VEE RHRNDW +WKEAI+AELNSL+KR+VFGPVV+TPE VKPVGYKW                               
Subjt:  ----------------DPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKW-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVR
                   KS++GFAII+VYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQIEH  +G+ VHQSTY +K+LKRFYMDKAH L+ PM VR
Subjt:  -----------KSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVR

Query:  SLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLS
        SLDVKKD FRP   +EELLGPEVPYLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNGIKHILRYLRGT D+ LFYS +S   L+GYAD+GYLS
Subjt:  SLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLS

Query:  DLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK
        D HK RSQTGY+F   GTAISWRSVKQT+ ATSSNH EILAIHEASREC+WLRSM QHIRE+CGLSS K
Subjt:  DLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSK

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.5e-3335.48Show/hide
Query:  ISVYVDDLNI-IGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEEL
        + +YVDD+ I  G    ++    YL ++F M DL + K  +G++IE   D I++ QS Y +KIL +F M+  + ++ P+             P   N EL
Subjt:  ISVYVDDLNI-IGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEEL

Query:  LGPE----VPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNF--YLVGYADSGYLSDLHKARSQTGYL
        L  +     P  S IG LMY+   TRPD+  +VN+L+RYSS      W  +K +LRYL+GTID+ L +     F   ++GY DS +       +S TGYL
Subjt:  LGPE----VPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNF--YLVGYADSGYLSDLHKARSQTGYL

Query:  FTS-GGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHI
        F       I W + +Q   A SS  AE +A+ EA RE +WL+ +   I
Subjt:  FTS-GGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHI

P0CV72 Secreted RxLR effector protein 1611.6e-2744.2Show/hide
Query:  VPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISW
        VPYLSA+GA+MYL   TRPD+A +V +L++++S P   HW  +K +LRYL+ T   GL ++      LVGY+D+ +  D+   RS +GYLF   G  +SW
Subjt:  VPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISW

Query:  RSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIR
        RS KQ   A SS   E +A+ EA++E VWL   T+  +
Subjt:  RSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-3838.28Show/hide
Query:  VGYKWKSQSGFAIISVYVDDLNIIGTPEEL-SKAIEYLKKEFEMKDLGKTKFCLGLQI--EHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLD
        V +K  S++ F I+ +YVDD+ I+G  + L +K    L K F+MKDLG  +  LG++I  E  +  +++ Q  Y  ++L+RF M  A  ++ P+      
Subjt:  VGYKWKSQSGFAIISVYVDDLNIIGTPEEL-SKAIEYLKKEFEMKDLGKTKFCLGLQI--EHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLD

Query:  VKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLH
         KK    P    E+    +VPY SA+G+LMY    TRPDIA +V +++R+  +P K HW  +K ILRYLRGT    L +   S+  L GY D+    D+ 
Subjt:  VKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLH

Query:  KARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHI
          +S TGYLFT  G AISW+S  Q   A S+  AE +A  E  +E +WL+   Q +
Subjt:  KARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-2432.77Show/hide
Query:  VYVDDLNIIGT-PEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLG
        VYVDD+ I G  P  L   ++ L + F +KD  +  + LG++ + +  G+ + Q  Y   +L R  M  A  +  PM   +   K  +    Y+  +L  
Subjt:  VYVDDLNIIGT-PEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLG

Query:  PEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAI
        P   Y   +G+L YLA  TRPDI+++VN L+++   PT+ H   +K ILRYL GT + G+F    +   L  Y+D+ +  D     S  GY+   G   I
Subjt:  PEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAI

Query:  SWRSVKQTITATSSNHAEILAIHEASRECVWLRSM
        SW S KQ     SS  AE  ++   S E  W+ S+
Subjt:  SWRSVKQTITATSSNHAEILAIHEASRECVWLRSM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.6e-2632.65Show/hide
Query:  QSGFAII--SVYVDDLNIIGTPEELSK-AIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFR
        Q G +II   VYVDD+ I G    L K  ++ L + F +K+     + LG++ + +  G+ + Q  YT  +L R  M  A  +  PM            +
Subjt:  QSGFAII--SVYVDDLNIIGTPEELSK-AIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFR

Query:  PRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTG
           ++   L     Y   +G+L YLA  TRPD++++VN L++Y   PT  HWN +K +LRYL GT D G+F    +   L  Y+D+ +  D     S  G
Subjt:  PRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTG

Query:  YLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSM
        Y+   G   ISW S KQ     SS  AE  ++   S E  W+ S+
Subjt:  YLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.0e-2632.24Show/hide
Query:  FAIISVYVDDLNIIGTPE-ELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNN
        F  + VYVDD+ I    +  + +    LK  F+++DLG  K+ LGL+I   A GI + Q  Y   +L     D+  LL    +  S+ +   +    ++ 
Subjt:  FAIISVYVDDLNIIGTPE-ELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNN

Query:  EELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTS
         + +  +  Y   IG LMYL   TR DI+F+VN L+++S +P   H   +  IL Y++GT+  GLFYS+++   L  ++D+ + S     RS  GY    
Subjt:  EELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTS

Query:  GGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIR
        G + ISW+S KQ + + SS  AE  A+  A+ E +WL    + ++
Subjt:  GGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIR

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.7e-0940.51Show/hide
Query:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGY
        MYL   TRPD+ F+VN L+++SS+        +  +L Y++GT+  GLFYS  S+  L  +ADS + S     RS TG+
Subjt:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGY

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-2330.3Show/hide
Query:  VYVDDLNIIGTPEE-LSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLG
        +YVDD+ + G+    L+  I  L   F MKDLG   + LG+QI+    G+F+ Q+ Y  +IL    M     ++ P+    L +   +   +Y +     
Subjt:  VYVDDLNIIGTPEE-LSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLADGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLG

Query:  PEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAI
            + S +GAL YL   TRPDI+++VN++ +    PT   ++ +K +LRY++GTI  GL+    S   +  + DS +       RS TG+    G   I
Subjt:  PEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTIDVGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAI

Query:  SWRSVKQTITATSSNHAEILAIHEASRECVW
        SW + +Q   + SS   E  A+   + E  W
Subjt:  SWRSVKQTITATSSNHAEILAIHEASRECVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCTGGGAGAAACAATTAAGGAAGGAAATGCAACATCCATTCAGGAAAAAGCAAAAGCTATGATCTTCCTTCGCCATCATCTTCACGAGGGATTGAAGATGGAATA
TTTAACAATAAAAGATCCATGTATCTTGTGGAAGAATTTAAAACAAAGATATGATCATCAGAAAACAGTCATTCTTCCTAAAGCTCGTTATGAATGGATGCATTTGAGGC
TCCAAAATTTCAAATCAGTAAGTGATTACAACTCCACGTTATTTAAAATCAGTTCAAAATTATTATTGTGCGGAGAGAAAATTACTGATGCGGATATGTTGGAGAAGACA
TTTTCTACATTCCATGCCTCGAATATGCTCCTGCAGCAGCAATATCGAGAGAAAGGTTTTAAACAATATTCTGAATTAATTTCATGTCTTCTCGTGGCTGAACAAAATAA
TGAGCTATTAATGAAGAACCATGAATCTCGACCAACTGGAACAACACCATTCCCTGAAGTGAATGCTGTAAATTTTAATAATCGTGGTCGAGGTCGTGGTCGTGACCATG
GCAGAGGAAGAAATAGTTATTATTTTCATGGTGGTCATTCTAATCATCCAAATTTCAAAAGAACCACCCGAAATGATGATCATAAAGGAAAAGCTCCACAAGGTAAAAAT
TCAAAAGATGATGAGCATAAATGCTTCCGATGCGGGATGACTGGGCACTGGTCTCGTGTTTGCCATACATCAAAACACCTAGTTGATCTCTATCAAGCTTCCCTGAAGGA
AAAAGGGAAAGATGTGGAAGCAAATTTTGCACACCAGGATAATGATATATTTGACCAATCCAATATGACAAATTTGGATGTGGCGGACTTCTTTGAATCTCCTGAAGAGA
AAGGACATGCTATTTTGCATGCTGCGTCACTTGCACGCATTAGACCAGTAGCTTATCATAAATACTCACCATTACAATTAGCTTATGGCCAGGAGCTAAATATTTCCCAT
CTAAGAATTTTTGGATGTGCAGTATATGTTCCAATTGCTCCACCACAACGTACTAAGATGGGTCCTCAAAGGAGGGGAGGAATTAAGAAGTTGGAAAAGGAAATTGCATG
GAATGCATCATTATTATCTTATTTAGATCCTCGTACAAATCAATGTGAACTTGAAGTTCAGAAAATAATTCATTTGCAAAATATAGCAAATCAATTACCAGATGCATTTA
TAGATGCAAAGAAATTAACTAAATCGCATATACCTGTTGCAAATGTTTCGTCGAAAATTGATATCCCAACACAGCAAGTTGTCATTAATGAATCTGGAACACGCCAGAAG
CGTGATCCTGAACCAAAATCTGTTGAAGAATATCGACATAGAAATGATTGGTTTCGCTGGAAAGAAGCAATCGAGGCAGAATTAAACTCACTTTCAAAACGCCAAGTTTT
TGGACCAGTAGTCCGGACACCAGAAGCTGTCAAACCCGTGGGATACAAATGGAAATCACAATCAGGATTTGCTATTATATCTGTATATGTTGATGACCTAAATATAATTG
GAACTCCTGAAGAGCTTTCAAAGGCAATAGAATATCTTAAGAAAGAATTTGAGATGAAAGATCTCGGAAAAACAAAATTTTGCCTTGGATTGCAAATTGAGCATTTAGCA
GACGGGATTTTTGTTCATCAGTCAACCTATACAAGAAAAATTTTAAAAAGATTTTATATGGATAAAGCACATCTATTAAACATTCCAATGGAAGTTCGTTCACTAGATGT
GAAGAAAGATATATTTCGACCTCGATATAATAATGAAGAACTTCTTGGTCCTGAAGTACCATATCTTAGTGCGATTGGTGCACTTATGTATCTTGCTAATAATACAAGAC
CAGATATTGCATTTTCAGTAAATTTATTAGCTAGATATAGTTCTTCTCCAACAAAAAGACATTGGAATGGAATTAAGCATATACTCCGTTATCTTCGAGGAACGATTGAT
GTGGGTTTGTTTTATTCAAATAAATCAAATTTTTACTTAGTTGGTTATGCAGATTCTGGATATTTATCTGATCTACATAAAGCTAGATCTCAAACAGGTTATTTATTCAC
TTCTGGAGGAACTGCTATATCTTGGCGATCAGTGAAACAGACCATAACAGCTACTTCCTCAAATCATGCTGAAATTCTTGCAATCCATGAAGCTAGTAGAGAATGTGTAT
GGCTAAGATCAATGACTCAACACATTCGAGAAACATGTGGCTTGTCTTCTAGTAAAATCCTCCAACAACATTATACAAAGACAAAACAGCATAAACTTAAACAAGAGTTG
ACGAACTGCTCACTCTCCTGCTTTCCAAGAAGTGTAAGAGTTCCCCTGTTCCTTTTCCTCCTGAGAATTTGGATTAATCCCCGACCCATTTCTTCTTCCAAGAAACCAAA
GCTCGCAGAGAACAGAAACAGTCTAAAAACACAGTTTTCTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACCTGGGAGAAACAATTAAGGAAGGAAATGCAACATCCATTCAGGAAAAAGCAAAAGCTATGATCTTCCTTCGCCATCATCTTCACGAGGGATTGAAGATGGAATA
TTTAACAATAAAAGATCCATGTATCTTGTGGAAGAATTTAAAACAAAGATATGATCATCAGAAAACAGTCATTCTTCCTAAAGCTCGTTATGAATGGATGCATTTGAGGC
TCCAAAATTTCAAATCAGTAAGTGATTACAACTCCACGTTATTTAAAATCAGTTCAAAATTATTATTGTGCGGAGAGAAAATTACTGATGCGGATATGTTGGAGAAGACA
TTTTCTACATTCCATGCCTCGAATATGCTCCTGCAGCAGCAATATCGAGAGAAAGGTTTTAAACAATATTCTGAATTAATTTCATGTCTTCTCGTGGCTGAACAAAATAA
TGAGCTATTAATGAAGAACCATGAATCTCGACCAACTGGAACAACACCATTCCCTGAAGTGAATGCTGTAAATTTTAATAATCGTGGTCGAGGTCGTGGTCGTGACCATG
GCAGAGGAAGAAATAGTTATTATTTTCATGGTGGTCATTCTAATCATCCAAATTTCAAAAGAACCACCCGAAATGATGATCATAAAGGAAAAGCTCCACAAGGTAAAAAT
TCAAAAGATGATGAGCATAAATGCTTCCGATGCGGGATGACTGGGCACTGGTCTCGTGTTTGCCATACATCAAAACACCTAGTTGATCTCTATCAAGCTTCCCTGAAGGA
AAAAGGGAAAGATGTGGAAGCAAATTTTGCACACCAGGATAATGATATATTTGACCAATCCAATATGACAAATTTGGATGTGGCGGACTTCTTTGAATCTCCTGAAGAGA
AAGGACATGCTATTTTGCATGCTGCGTCACTTGCACGCATTAGACCAGTAGCTTATCATAAATACTCACCATTACAATTAGCTTATGGCCAGGAGCTAAATATTTCCCAT
CTAAGAATTTTTGGATGTGCAGTATATGTTCCAATTGCTCCACCACAACGTACTAAGATGGGTCCTCAAAGGAGGGGAGGAATTAAGAAGTTGGAAAAGGAAATTGCATG
GAATGCATCATTATTATCTTATTTAGATCCTCGTACAAATCAATGTGAACTTGAAGTTCAGAAAATAATTCATTTGCAAAATATAGCAAATCAATTACCAGATGCATTTA
TAGATGCAAAGAAATTAACTAAATCGCATATACCTGTTGCAAATGTTTCGTCGAAAATTGATATCCCAACACAGCAAGTTGTCATTAATGAATCTGGAACACGCCAGAAG
CGTGATCCTGAACCAAAATCTGTTGAAGAATATCGACATAGAAATGATTGGTTTCGCTGGAAAGAAGCAATCGAGGCAGAATTAAACTCACTTTCAAAACGCCAAGTTTT
TGGACCAGTAGTCCGGACACCAGAAGCTGTCAAACCCGTGGGATACAAATGGAAATCACAATCAGGATTTGCTATTATATCTGTATATGTTGATGACCTAAATATAATTG
GAACTCCTGAAGAGCTTTCAAAGGCAATAGAATATCTTAAGAAAGAATTTGAGATGAAAGATCTCGGAAAAACAAAATTTTGCCTTGGATTGCAAATTGAGCATTTAGCA
GACGGGATTTTTGTTCATCAGTCAACCTATACAAGAAAAATTTTAAAAAGATTTTATATGGATAAAGCACATCTATTAAACATTCCAATGGAAGTTCGTTCACTAGATGT
GAAGAAAGATATATTTCGACCTCGATATAATAATGAAGAACTTCTTGGTCCTGAAGTACCATATCTTAGTGCGATTGGTGCACTTATGTATCTTGCTAATAATACAAGAC
CAGATATTGCATTTTCAGTAAATTTATTAGCTAGATATAGTTCTTCTCCAACAAAAAGACATTGGAATGGAATTAAGCATATACTCCGTTATCTTCGAGGAACGATTGAT
GTGGGTTTGTTTTATTCAAATAAATCAAATTTTTACTTAGTTGGTTATGCAGATTCTGGATATTTATCTGATCTACATAAAGCTAGATCTCAAACAGGTTATTTATTCAC
TTCTGGAGGAACTGCTATATCTTGGCGATCAGTGAAACAGACCATAACAGCTACTTCCTCAAATCATGCTGAAATTCTTGCAATCCATGAAGCTAGTAGAGAATGTGTAT
GGCTAAGATCAATGACTCAACACATTCGAGAAACATGTGGCTTGTCTTCTAGTAAAATCCTCCAACAACATTATACAAAGACAAAACAGCATAAACTTAAACAAGAGTTG
ACGAACTGCTCACTCTCCTGCTTTCCAAGAAGTGTAAGAGTTCCCCTGTTCCTTTTCCTCCTGAGAATTTGGATTAATCCCCGACCCATTTCTTCTTCCAAGAAACCAAA
GCTCGCAGAGAACAGAAACAGTCTAAAAACACAGTTTTCTTCCTAA
Protein sequenceShow/hide protein sequence
MNLGETIKEGNATSIQEKAKAMIFLRHHLHEGLKMEYLTIKDPCILWKNLKQRYDHQKTVILPKARYEWMHLRLQNFKSVSDYNSTLFKISSKLLLCGEKITDADMLEKT
FSTFHASNMLLQQQYREKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRGRGRDHGRGRNSYYFHGGHSNHPNFKRTTRNDDHKGKAPQGKN
SKDDEHKCFRCGMTGHWSRVCHTSKHLVDLYQASLKEKGKDVEANFAHQDNDIFDQSNMTNLDVADFFESPEEKGHAILHAASLARIRPVAYHKYSPLQLAYGQELNISH
LRIFGCAVYVPIAPPQRTKMGPQRRGGIKKLEKEIAWNASLLSYLDPRTNQCELEVQKIIHLQNIANQLPDAFIDAKKLTKSHIPVANVSSKIDIPTQQVVINESGTRQK
RDPEPKSVEEYRHRNDWFRWKEAIEAELNSLSKRQVFGPVVRTPEAVKPVGYKWKSQSGFAIISVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQIEHLA
DGIFVHQSTYTRKILKRFYMDKAHLLNIPMEVRSLDVKKDIFRPRYNNEELLGPEVPYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGIKHILRYLRGTID
VGLFYSNKSNFYLVGYADSGYLSDLHKARSQTGYLFTSGGTAISWRSVKQTITATSSNHAEILAIHEASRECVWLRSMTQHIRETCGLSSSKILQQHYTKTKQHKLKQEL
TNCSLSCFPRSVRVPLFLFLLRIWINPRPISSSKKPKLAENRNSLKTQFSS