; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G007785 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G007785
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionBeta-galactosidase
Genome locationCG_Chr11:10615610..10618500
RNA-Seq ExpressionClCG11G007785
SyntenyClCG11G007785
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]1.8e-10237.92Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  VT FP  + + Y++  +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT E  +P P D  ERLWK EDSL RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-
        GK LLYAATA+D+W+  Q LYSKRQNASRLYTLRK++H C                                                     LN KFD 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-

Query:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------
           RILGQRP+P+LMEVC EVRLEED+++AM ++  P  D  AFS +SS    +K NGK  P                                      
Subjt:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------

Query:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------
        A ISETT  S SQ  +  +  T T +L AI Q                                                                    
Subjt:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL
                                         SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL
Subjt:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL

Query:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        + K  VH++ CAY PQQNGVAERKNRHL+EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

KAA0032571.1 Beta-galactosidase [Cucumis melo var. makuwa]1.2e-10147.85Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  +T FP L  + Y++  +G+S G    EKLNGQNYFSWSQSIKM LEGRH+FG+LT E  +P P D  ERLWK EDS  RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHECLNSKFDAWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDY--DAFSTKSSGTT
        GK LLYAATA+D+W+  Q +YSKRQNASRLYTLRK++H C     D  +       +   +++C E   +    S    I  P + Y  D  S ++ GT 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHECLNSKFDAWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDY--DAFSTKSSGTT

Query:  RNKQ-----------NGKPPPALISETTSGSQSQ--------HHEN------------CLVDTGTSSL--------------------------------
        R+ +           +      L+S   S S+           H N              +D  + S                                 
Subjt:  RNKQ-----------NGKPPPALISETTSGSQSQ--------HHEN------------CLVDTGTSSL--------------------------------

Query:  ---RAIVQSGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKN
           +    SGKRWFVTFIDDHTRLTW +L+ +KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ K  VH++ CAY PQQNGVAE+KN
Subjt:  ---RAIVQSGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKN

Query:  RHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
         HL+EV RSLMLSTSLPSY+WGDA+LTAA+LINRM SR+L  QTPLD LK SYP+TRL+ +VPL V
Subjt:  RHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-10137.92Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  VT FP  + + Y++  +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT E  +P P D  ERLWK EDSL RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-
        GK LLYAATA+D+W+  Q LYSKRQNASRLYTLRK++H C                                                     LN KFD 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-

Query:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKP-----------------------PP---------------
           RILGQRP+P+LMEVC EVRLEED+++AM ++  P  D  AFS +SS    +K NGK                        PP               
Subjt:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKP-----------------------PP---------------

Query:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------
        A ISET   S SQ  +  +  T T +L AI Q                                                                    
Subjt:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL
                                         SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL
Subjt:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL

Query:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        + K  VH++ CAY PQQNGVAERKNRHL+EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]7.6e-10135.03Show/hide
Query:  TTELAVQAAVHQYLQSL-NPDV--TAALFQSLSPPS-QNPSDATLQT-----RPPPTGTFRDS----SPSGFDEQQRFGYLHP--SSADATLYPEAIGFS
        T  +A  AA+ + LQ+L  P +  T  + Q  +PPS Q    A L +      PPP           +PS        G+ HP   S  +  +P     S
Subjt:  TTELAVQAAVHQYLQSL-NPDV--TAALFQSLSPPS-QNPSDATLQT-----RPPPTGTFRDS----SPSGFDEQQRFGYLHP--SSADATLYPEAIGFS

Query:  TLPLTTVAATRFGGFYSDPTTTAKFGGHASTKFFQQLADLQANFQQKIAALGAALSSSPHLGQNTVNSGVSSDLPMYPEYSVTIFPTLTTAPYLSRQMGN
        T+ L+   + +    Y DP     F G+   +  Q  +D++A              SS H           ++LPMY +  VT FP  + + Y++  +G+
Subjt:  TLPLTTVAATRFGGFYSDPTTTAKFGGHASTKFFQQLADLQANFQQKIAALGAALSSSPHLGQNTVNSGVSSDLPMYPEYSVTIFPTLTTAPYLSRQMGN

Query:  SFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLYSKRQNASR
        S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT EI +P P D  ERLWK EDSL RS+LI+SME QIGK LLYA TA+D+W+  Q LYSKRQNASR
Subjt:  SFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLYSKRQNASR

Query:  LYTLRKKIHEC-----------------------------------------------------LNSKFD-AWSRILGQRPIPTLMEVCSEVRLEEDKSS
        LYTLRK++H C                                                     LN KFD    RILGQRP+P+LMEVC EVRLEED+++
Subjt:  LYTLRKKIHEC-----------------------------------------------------LNSKFD-AWSRILGQRPIPTLMEVCSEVRLEEDKSS

Query:  AMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------ALISETTSGSQSQHHENCLVDTGTSSLRA
        AM ++  P  D  AFS +SS    +K NGK  P                                      A ISETT  S SQ  +  +  T T +L A
Subjt:  AMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------ALISETTSGSQSQHHENCLVDTGTSSLRA

Query:  IVQ-------------------------------------------------------------------------------------------------
        I Q                                                                                                 
Subjt:  IVQ-------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLL
            SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ K  VH++ CAY PQQNGVAERKNRHL+
Subjt:  ----SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLL

Query:  EVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  EVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]1.4e-10237.92Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  VT FP  + + Y++  +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT EI +P P D  ERLWK EDSL RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-
        GK LLYA TA+D+W+  Q LYSKRQNASRLYTLRK++H C                                                     LN KFD 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-

Query:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------
           RILGQRP+P+LMEVC EVRLEED+++AM ++  P  D  AFS +SS    +K NGK  P                                      
Subjt:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------

Query:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------
        A ISETT  S SQ  +  +  T T +L AI Q                                                                    
Subjt:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL
                                         SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL
Subjt:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL

Query:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        + K  VH++ CAY PQQNGVAERKNRHL+EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase8.8e-10337.92Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  VT FP  + + Y++  +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT E  +P P D  ERLWK EDSL RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-
        GK LLYAATA+D+W+  Q LYSKRQNASRLYTLRK++H C                                                     LN KFD 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-

Query:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------
           RILGQRP+P+LMEVC EVRLEED+++AM ++  P  D  AFS +SS    +K NGK  P                                      
Subjt:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------

Query:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------
        A ISETT  S SQ  +  +  T T +L AI Q                                                                    
Subjt:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL
                                         SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL
Subjt:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL

Query:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        + K  VH++ CAY PQQNGVAERKNRHL+EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

A0A5A7SSP2 Beta-galactosidase5.7e-10247.85Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  +T FP L  + Y++  +G+S G    EKLNGQNYFSWSQSIKM LEGRH+FG+LT E  +P P D  ERLWK EDS  RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHECLNSKFDAWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDY--DAFSTKSSGTT
        GK LLYAATA+D+W+  Q +YSKRQNASRLYTLRK++H C     D  +       +   +++C E   +    S    I  P + Y  D  S ++ GT 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHECLNSKFDAWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDY--DAFSTKSSGTT

Query:  RNKQ-----------NGKPPPALISETTSGSQSQ--------HHEN------------CLVDTGTSSL--------------------------------
        R+ +           +      L+S   S S+           H N              +D  + S                                 
Subjt:  RNKQ-----------NGKPPPALISETTSGSQSQ--------HHEN------------CLVDTGTSSL--------------------------------

Query:  ---RAIVQSGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKN
           +    SGKRWFVTFIDDHTRLTW +L+ +KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ K  VH++ CAY PQQNGVAE+KN
Subjt:  ---RAIVQSGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKN

Query:  RHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
         HL+EV RSLMLSTSLPSY+WGDA+LTAA+LINRM SR+L  QTPLD LK SYP+TRL+ +VPL V
Subjt:  RHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-10237.92Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  VT FP  + + Y++  +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT E  +P P D  ERLWK EDSL RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-
        GK LLYAATA+D+W+  Q LYSKRQNASRLYTLRK++H C                                                     LN KFD 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-

Query:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKP-----------------------PP---------------
           RILGQRP+P+LMEVC EVRLEED+++AM ++  P  D  AFS +SS    +K NGK                        PP               
Subjt:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKP-----------------------PP---------------

Query:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------
        A ISET   S SQ  +  +  T T +L AI Q                                                                    
Subjt:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL
                                         SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL
Subjt:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL

Query:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        + K  VH++ CAY PQQNGVAERKNRHL+EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

A0A5D3E603 Beta-galactosidase3.7e-10135.03Show/hide
Query:  TTELAVQAAVHQYLQSL-NPDV--TAALFQSLSPPS-QNPSDATLQT-----RPPPTGTFRDS----SPSGFDEQQRFGYLHP--SSADATLYPEAIGFS
        T  +A  AA+ + LQ+L  P +  T  + Q  +PPS Q    A L +      PPP           +PS        G+ HP   S  +  +P     S
Subjt:  TTELAVQAAVHQYLQSL-NPDV--TAALFQSLSPPS-QNPSDATLQT-----RPPPTGTFRDS----SPSGFDEQQRFGYLHP--SSADATLYPEAIGFS

Query:  TLPLTTVAATRFGGFYSDPTTTAKFGGHASTKFFQQLADLQANFQQKIAALGAALSSSPHLGQNTVNSGVSSDLPMYPEYSVTIFPTLTTAPYLSRQMGN
        T+ L+   + +    Y DP     F G+   +  Q  +D++A              SS H           ++LPMY +  VT FP  + + Y++  +G+
Subjt:  TLPLTTVAATRFGGFYSDPTTTAKFGGHASTKFFQQLADLQANFQQKIAALGAALSSSPHLGQNTVNSGVSSDLPMYPEYSVTIFPTLTTAPYLSRQMGN

Query:  SFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLYSKRQNASR
        S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT EI +P P D  ERLWK EDSL RS+LI+SME QIGK LLYA TA+D+W+  Q LYSKRQNASR
Subjt:  SFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLYSKRQNASR

Query:  LYTLRKKIHEC-----------------------------------------------------LNSKFD-AWSRILGQRPIPTLMEVCSEVRLEEDKSS
        LYTLRK++H C                                                     LN KFD    RILGQRP+P+LMEVC EVRLEED+++
Subjt:  LYTLRKKIHEC-----------------------------------------------------LNSKFD-AWSRILGQRPIPTLMEVCSEVRLEEDKSS

Query:  AMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------ALISETTSGSQSQHHENCLVDTGTSSLRA
        AM ++  P  D  AFS +SS    +K NGK  P                                      A ISETT  S SQ  +  +  T T +L A
Subjt:  AMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------ALISETTSGSQSQHHENCLVDTGTSSLRA

Query:  IVQ-------------------------------------------------------------------------------------------------
        I Q                                                                                                 
Subjt:  IVQ-------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLL
            SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ K  VH++ CAY PQQNGVAERKNRHL+
Subjt:  ----SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLL

Query:  EVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  EVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

A0A5D3E6F8 Beta-galactosidase6.7e-10337.92Show/hide
Query:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI
        ++LPMY +  VT FP  + + Y++  +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LT EI +P P D  ERLWK EDSL RS+LI+SME QI
Subjt:  SDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQI

Query:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-
        GK LLYA TA+D+W+  Q LYSKRQNASRLYTLRK++H C                                                     LN KFD 
Subjt:  GKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHEC-----------------------------------------------------LNSKFD-

Query:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------
           RILGQRP+P+LMEVC EVRLEED+++AM ++  P  D  AFS +SS    +K NGK  P                                      
Subjt:  AWSRILGQRPIPTLMEVCSEVRLEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPP--------------------------------------

Query:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------
        A ISETT  S SQ  +  +  T T +L AI Q                                                                    
Subjt:  ALISETTSGSQSQHHENCLVDTGTSSLRAIVQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL
                                         SGKRWFVTFIDDHTRLTW +L+++KS+V SIFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL
Subjt:  ---------------------------------SGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFL

Query:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV
        + K  VH++ CAY PQQNGVAERKNRHL+EV RSLMLSTSLPSYLWGDA+LTAA+LINRM SR+L+ QTPLD LK SYP+TRL+ +VPL V
Subjt:  SIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTTRLIPDVPLCV

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-2145.04Show/hide
Query:  KRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIK-ISVHRSLCAYIPQQNGVAERKNRHLLEVTRS
        K +FV F+D  T     +L+  KS V S+FQ F A  E  FN K+  L  DNGRE+L+N +R+F   K IS H ++  + PQ NGV+ER  R + E  R+
Subjt:  KRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIK-ISVHRSLCAYIPQQNGVAERKNRHLLEVTRS

Query:  LMLSTSLPSYLWGDAVLTAAYLINRMSSRVL
        ++    L    WG+AVLTA YLINR+ SR L
Subjt:  LMLSTSLPSYLWGDAVLTAAYLINRMSSRVL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-2543.38Show/hide
Query:  GKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRS
        G ++FVTFIDD +R  W ++L  K +V  +FQ+F+A +E +   K+  LRSDNG E+ +    E+ S     H       PQ NGVAER NR ++E  RS
Subjt:  GKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRS

Query:  LMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTP
        ++    LP   WG+AV TA YLINR  S  L F+ P
Subjt:  LMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTP

Q12491 Transposon Ty2-B Gag-Pol polyprotein5.3e-1231.06Show/hide
Query:  QSGKRWFVTFIDDHTRLTWFFLLANKSKVS--SIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLE
        +S   +F++F D+ TR  W + L ++ + S  ++F    A I+ QFNA++ +++ D G E+   TL +F + +            + +GVAER NR LL 
Subjt:  QSGKRWFVTFIDDHTRLTWFFLLANKSKVS--SIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLE

Query:  VTRSLMLSTSLPSYLWGDAVLTAAYLINRMSS
          R+L+  + LP++LW  AV  +  + N + S
Subjt:  VTRSLMLSTSLPSYLWGDAVLTAAYLINRMSS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.9e-1836.23Show/hide
Query:  RWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLM
        R++V F+D  TR TW + L  KS+V   F  F   +E +F  +I    SDNG EF+   L E+ S     H +   + P+ NG++ERK+RH++E   +L+
Subjt:  RWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLM

Query:  LSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYL
           S+P   W  A   A YLINR+ + +L  ++P   L
Subjt:  LSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-2038.41Show/hide
Query:  RWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLM
        R++V F+D  TR TW + L  KS+V   F  F + +E +F  +I  L SDNG EF+   LR++LS     H +   + P+ NG++ERK+RH++E+  +L+
Subjt:  RWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLM

Query:  LSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYL
           S+P   W  A   A YLINR+ + +L  Q+P   L
Subjt:  LSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYL

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.1e-0726.4Show/hide
Query:  NYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLY------SKRQNASRLYTLRK-
        NY +W    +  L    KFG++   +PKP P  P  + W++ +++    L++SM  ++ + ++YA TA  +WE +++++         Q   RL TLR+ 
Subjt:  NYFSWSQSIKMVLEGRHKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLY------SKRQNASRLYTLRK-

Query:  --KIHECLNSKFDAWSRILGQRPIP
           + E        W  +    PIP
Subjt:  --KIHECLNSKFDAWSRILGQRPIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTACGCCACTGCCACCACCGAGTTAGCCGTCCAAGCCGCTGTCCATCAGTACCTCCAGTCACTCAACCCAGACGTCACAGCCGCTCTATTTCAGAGTTTGTCGCC
GCCATCGCAAAACCCTAGCGACGCAACCTTACAGACTCGACCGCCGCCGACGGGGACCTTTAGAGATAGTTCTCCGTCGGGTTTTGATGAGCAGCAAAGGTTCGGGTATC
TTCATCCCTCCTCTGCCGATGCCACCCTCTATCCCGAAGCTATTGGTTTTTCGACTCTTCCTCTGACCACTGTGGCTGCCACCAGATTTGGTGGTTTTTACTCAGATCCA
ACCACTACTGCCAAATTTGGTGGGCATGCCTCAACAAAATTCTTTCAACAACTCGCGGACCTTCAAGCAAATTTTCAGCAAAAAATCGCCGCTCTTGGAGCTGCCCTGAG
TTCTTCTCCTCATCTTGGTCAAAATACAGTCAATTCAGGTGTTTCTTCAGATTTACCAATGTATCCAGAGTATTCGGTAACTATATTCCCTACTTTAACGACTGCACCAT
ATTTGTCTAGACAAATGGGTAACTCCTTTGGCTTAATTGTTGGCGAAAAATTGAATGGCCAAAATTATTTCTCTTGGTCTCAATCCATTAAAATGGTACTTGAAGGACGC
CATAAGTTCGGATATTTGACTGACGAAATACCTAAACCCAGACCCGAAGATCCTCAAGAGCGCCTCTGGAAGAGAGAAGATTCATTATTCCGATCTTTGCTAATTCACAG
CATGGAACTTCAAATTGGGAAGCTTCTGTTGTATGCTGCTACAGCTCGGGATATTTGGGAGGTAGTTCAAAAATTGTACTCCAAGAGGCAGAATGCATCACGACTCTACA
CTCTGCGAAAAAAAATCCATGAATGTCTCAATTCCAAGTTTGATGCTTGGAGCCGGATACTGGGACAAAGACCAATACCGACCCTGATGGAAGTATGTTCAGAGGTCCGT
CTAGAAGAGGATAAATCGAGTGCCATGAATATCATTGTTAACCCCATAACTGATTATGACGCCTTTAGTACAAAATCATCTGGAACGACTAGAAACAAGCAGAATGGGAA
ACCACCTCCAGCTCTAATAAGTGAGACTACTAGTGGGTCTCAATCTCAACATCATGAAAACTGCCTAGTTGATACTGGTACTTCCTCTCTGAGGGCGATTGTGCAATCAG
GTAAGCGTTGGTTTGTTACCTTCATTGATGACCACACTCGCCTTACTTGGTTCTTCCTTCTAGCAAATAAGTCGAAGGTCTCATCTATTTTTCAACAGTTTTACGCCACC
ATTGAAACTCAGTTTAATGCCAAAATTGCAATCCTTCGAAGTGACAATGGTCGTGAATTCCTCACCAATACTCTTCGTGAGTTCTTATCCATTAAAATTTCCGTTCACCG
GAGTTTGTGTGCCTATATACCCCAACAAAATGGAGTGGCTGAAAGAAAAAACCGTCATCTCCTAGAAGTTACTCGGTCTCTCATGCTGTCAACCTCTCTTCCGTCCTACC
TATGGGGGGATGCAGTCTTGACTGCCGCTTATCTTATAAATCGGATGTCTTCTCGGGTTCTAAACTTCCAAACTCCTCTTGACTACCTCAAATTGTCTTACCCTACCACT
CGCCTTATACCTGATGTCCCTCTTTGTGTTTTGGATGTACAGCTTTTGTCCATAGCTTCGGCCCCAACCAAACTAAATTTACCCCTTATGCCCAGAAATGTGTCTTCGTT
GGGTATCCTCTCCACCAACGCGGTTATAAATGCTTCCATCCCACTTCCCGGAAATACTTCATCTCCATGGATGTCACCTTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTACGCCACTGCCACCACCGAGTTAGCCGTCCAAGCCGCTGTCCATCAGTACCTCCAGTCACTCAACCCAGACGTCACAGCCGCTCTATTTCAGAGTTTGTCGCC
GCCATCGCAAAACCCTAGCGACGCAACCTTACAGACTCGACCGCCGCCGACGGGGACCTTTAGAGATAGTTCTCCGTCGGGTTTTGATGAGCAGCAAAGGTTCGGGTATC
TTCATCCCTCCTCTGCCGATGCCACCCTCTATCCCGAAGCTATTGGTTTTTCGACTCTTCCTCTGACCACTGTGGCTGCCACCAGATTTGGTGGTTTTTACTCAGATCCA
ACCACTACTGCCAAATTTGGTGGGCATGCCTCAACAAAATTCTTTCAACAACTCGCGGACCTTCAAGCAAATTTTCAGCAAAAAATCGCCGCTCTTGGAGCTGCCCTGAG
TTCTTCTCCTCATCTTGGTCAAAATACAGTCAATTCAGGTGTTTCTTCAGATTTACCAATGTATCCAGAGTATTCGGTAACTATATTCCCTACTTTAACGACTGCACCAT
ATTTGTCTAGACAAATGGGTAACTCCTTTGGCTTAATTGTTGGCGAAAAATTGAATGGCCAAAATTATTTCTCTTGGTCTCAATCCATTAAAATGGTACTTGAAGGACGC
CATAAGTTCGGATATTTGACTGACGAAATACCTAAACCCAGACCCGAAGATCCTCAAGAGCGCCTCTGGAAGAGAGAAGATTCATTATTCCGATCTTTGCTAATTCACAG
CATGGAACTTCAAATTGGGAAGCTTCTGTTGTATGCTGCTACAGCTCGGGATATTTGGGAGGTAGTTCAAAAATTGTACTCCAAGAGGCAGAATGCATCACGACTCTACA
CTCTGCGAAAAAAAATCCATGAATGTCTCAATTCCAAGTTTGATGCTTGGAGCCGGATACTGGGACAAAGACCAATACCGACCCTGATGGAAGTATGTTCAGAGGTCCGT
CTAGAAGAGGATAAATCGAGTGCCATGAATATCATTGTTAACCCCATAACTGATTATGACGCCTTTAGTACAAAATCATCTGGAACGACTAGAAACAAGCAGAATGGGAA
ACCACCTCCAGCTCTAATAAGTGAGACTACTAGTGGGTCTCAATCTCAACATCATGAAAACTGCCTAGTTGATACTGGTACTTCCTCTCTGAGGGCGATTGTGCAATCAG
GTAAGCGTTGGTTTGTTACCTTCATTGATGACCACACTCGCCTTACTTGGTTCTTCCTTCTAGCAAATAAGTCGAAGGTCTCATCTATTTTTCAACAGTTTTACGCCACC
ATTGAAACTCAGTTTAATGCCAAAATTGCAATCCTTCGAAGTGACAATGGTCGTGAATTCCTCACCAATACTCTTCGTGAGTTCTTATCCATTAAAATTTCCGTTCACCG
GAGTTTGTGTGCCTATATACCCCAACAAAATGGAGTGGCTGAAAGAAAAAACCGTCATCTCCTAGAAGTTACTCGGTCTCTCATGCTGTCAACCTCTCTTCCGTCCTACC
TATGGGGGGATGCAGTCTTGACTGCCGCTTATCTTATAAATCGGATGTCTTCTCGGGTTCTAAACTTCCAAACTCCTCTTGACTACCTCAAATTGTCTTACCCTACCACT
CGCCTTATACCTGATGTCCCTCTTTGTGTTTTGGATGTACAGCTTTTGTCCATAGCTTCGGCCCCAACCAAACTAAATTTACCCCTTATGCCCAGAAATGTGTCTTCGTT
GGGTATCCTCTCCACCAACGCGGTTATAAATGCTTCCATCCCACTTCCCGGAAATACTTCATCTCCATGGATGTCACCTTCCTAG
Protein sequenceShow/hide protein sequence
MEYATATTELAVQAAVHQYLQSLNPDVTAALFQSLSPPSQNPSDATLQTRPPPTGTFRDSSPSGFDEQQRFGYLHPSSADATLYPEAIGFSTLPLTTVAATRFGGFYSDP
TTTAKFGGHASTKFFQQLADLQANFQQKIAALGAALSSSPHLGQNTVNSGVSSDLPMYPEYSVTIFPTLTTAPYLSRQMGNSFGLIVGEKLNGQNYFSWSQSIKMVLEGR
HKFGYLTDEIPKPRPEDPQERLWKREDSLFRSLLIHSMELQIGKLLLYAATARDIWEVVQKLYSKRQNASRLYTLRKKIHECLNSKFDAWSRILGQRPIPTLMEVCSEVR
LEEDKSSAMNIIVNPITDYDAFSTKSSGTTRNKQNGKPPPALISETTSGSQSQHHENCLVDTGTSSLRAIVQSGKRWFVTFIDDHTRLTWFFLLANKSKVSSIFQQFYAT
IETQFNAKIAILRSDNGREFLTNTLREFLSIKISVHRSLCAYIPQQNGVAERKNRHLLEVTRSLMLSTSLPSYLWGDAVLTAAYLINRMSSRVLNFQTPLDYLKLSYPTT
RLIPDVPLCVLDVQLLSIASAPTKLNLPLMPRNVSSLGILSTNAVINASIPLPGNTSSPWMSPS