CuGenDBv2

CSPI01G33330 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G33330
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Beta-galactosidase
Genome location: Chr1:28215868..28219837
RNA-Seq Expression: CSPI01G33330
Synteny: CSPI01G33330
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value: 3.5e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

KAA0048203.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value: 3.5e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

TYJ97256.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value: 2.7e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKA IS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+V+VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

TYJ99952.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value: 3.5e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

TYK08054.1 Beta-galactosidase [Cucumis melo var. makuwa]; e value: 3.5e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SM64 Beta-galactosidase; e value: 1.7e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

A0A5A7VLQ7 Beta-galactosidase; e value: 1.7e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

A0A5D3BE37 Beta-galactosidase; e value: 1.3e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKA IS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+V+VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

A0A5D3BJK7 Beta-galactosidase; e value: 1.7e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

A0A5D3C4T4 Beta-galactosidase; e value: 1.7e-212; % identity: 50.86
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I++   DNG E QNH+L++FL+SKGIV+Q+SCAYT QQNGVAERKNRHL+EVAR LMLSTSLPSYLWG AILT AHLINRMPSR+LH + PLDCLKES 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV
        PS  L+ +VPLRVFG TAY H+                      H+  Y                                               PTL+
Subjt:  PSIPLIPKVPLRVFGYTAYGHS----------------------HESSY-----------------------------------------------PTLV

Query:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE
        T+SD +PH I+LPTNQVPWK YYRRNLRKEV S   Q  A VQ+FEP RDQGM +     +NN MSEND  +   ++   ++ ++   E EV  E + +E
Subjt:  TLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQ-TALVQDFEPIRDQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENE

Query:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------
         +  H      YDPSLD+ IALRKGTRSCTKH ICNYVSY NLSPQFRAFTA+LDST IPKNI++AL+CPEWKN VMEE+ +                  
Subjt:  TKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPS------------------

Query:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK
                                                                                   G L E        GFE QFGQ+VCK
Subjt:  --------------------------------------------------------------------------RGGLHES-----SSGFEGQFGQQVCK

Query:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG
        LQKSLYGLKQSP+ W DRFTTFVKSQGYSQ                     VY                   +  +EFEIKDL N+KYFLGMEVARSK G
Subjt:  LQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ---------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSKAG

Query:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK
        ISVS RKYT+DLLTETGMLGCR AD PIEFN KLGN DDQVP                                   APYEKHMEAVNRILRYLK T GK
Subjt:  ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKLGNLDDQVP-----------------------------------APYEKHMEAVNRILRYLKTTRGK

Query:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS
        GLMFRK ++K IEAYTDSDW    +DR STS YCTFVW NL+TWRSKK+SVVARSSAEA+YRA+                   ECE P KLFCDNKAAIS
Subjt:  GLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAAIS

Query:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP
        +ANNPVQHDRTKHVEIDRHFIKERLDS SICI YIP S+Q+ +VLTKGLLRP FD   SKLGLI IY+P
Subjt:  MANNPVQHDRTKHVEIDRHFIKERLDSESICILYIP-SRQVVNVLTKGLLRPSFDFYASKLGLIGIYVP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 8.9e-33; % identity: 29.57
Query:  VCKLQKSLYGLKQSPKTWLDRF------TTFVKSQGYSQVYKAE-----------------------------------NEFEIKDLENMKYFLGMEVAR
        VCKL K++YGLKQ+ + W + F        FV S     +Y  +                                    +F + DL  +K+F+G+ +  
Subjt:  VCKLQKSLYGLKQSPKTWLDRF------TTFVKSQGYSQVYKAE-----------------------------------NEFEIKDLENMKYFLGMEVAR

Query:  SKAGISVSYRKYTIDLLTETGMLGCRLADIPI--EFNYKLGNLDDQVPAPYEKHM---------------EAVN------------------RILRYLKT
         +  I +S   Y   +L++  M  C     P+  + NY+L N D+    P    +                AVN                  R+LRYLK 
Subjt:  SKAGISVSYRKYTIDLLTETGMLGCRLADIPI--EFNYKLGNLDDQVPAPYEKHM---------------EAVN------------------RILRYLKT

Query:  TRGKGLMFRK---IDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWD-NLITWRSKKRSVVARSSAEAKYRALLEC------------------EIPFKLF
        T    L+F+K    + KII  Y DSDW    +DR ST+ Y   ++D NLI W +K+++ VA SS EA+Y AL E                   E P K++
Subjt:  TRGKGLMFRK---IDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWD-NLITWRSKKRSVVARSSAEAKYRALLEC------------------EIPFKLF

Query:  CDNKAAISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPS-RQVVNVLTKGLLRPSFDFYASKLGLI
         DN+  IS+ANNP  H R KH++I  HF +E++ +  IC+ YIP+  Q+ ++ TK L    F     KLGL+
Subjt:  CDNKAAISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPS-RQVVNVLTKGLLRPSFDFYASKLGLI

P04146 Copia protein; e value: 4.3e-11; % identity: 37.93
Query:  DNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVL--HFKIPLDCLKESCPSIPL
        DNG E  ++ + +F   KGI    +  +T Q NGV+ER  R + E AR ++    L    WG+A+LT  +LINR+PSR L    K P +      P +  
Subjt:  DNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVL--HFKIPLDCLKESCPSIPL

Query:  IPKVPLRVFGYTAYGH
             LRVFG T Y H
Subjt:  IPKVPLRVFGYTAYGH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 6.0e-37; % identity: 24.56
Query:  DNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIP----------LDCLK
        DNG E  +    ++ SS GI ++ +   T Q NGVAER NR ++E  R ++    LP   WG+A+ T  +LINR PS  L F+IP             LK
Subjt:  DNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIP----------LDCLK

Query:  E-SCPSIPLIPK----------VPLRVFGY--TAYGHS-----------------HESSYPTLVTLSDPNPHGIV-----LP------------TNQVPW
           C +   +PK          +P    GY    +G+                   ES   T   +S+   +GI+     +P            T++V  
Subjt:  E-SCPSIPLIPK----------VPLRVFGY--TAYGHS-----------------HESSYPTLVTLSDPNPHGIV-----LP------------TNQVPW

Query:  K-------IYYRRNLRKEVESLVVQTALVQDFEPIR--DQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENETKSNHYGNSSR
        +       I     L + VE +   T   +  +P+R  ++  ++S    S   +  +D  E  S+ E +   E   +  + + E  E+  K+  Y     
Subjt:  K-------IYYRRNLRKEVESLVVQTALVQDFEPIR--DQGMIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENETKSNHYGNSSR

Query:  YDPSLDLLIALRKGTR--------SCTKHSICNYVSY-------GNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPSRGG-LH--------
               L+ L KG R           K   C  V Y       G    +   F           +I + L      ++ +E++  +   LH        
Subjt:  YDPSLDLLIALRKGTR--------SCTKHSICNYVSY-------GNLSPQFRAFTASLDSTTIPKNIHSALKCPEWKNVVMEEIPSRGG-LH--------

Query:  -ESSSGFE-GQFGQQVCKLQKSLYGLKQSPKTWLDRFTTFVKSQGYSQVYK---------AEN-------------------------------EFEIKD
         E   GFE       VCKL KSLYGLKQ+P+ W  +F +F+KSQ Y + Y          +EN                                F++KD
Subjt:  -ESSSGFE-GQFGQQVCKLQKSLYGLKQSPKTWLDRFTTFVKSQGYSQVYK---------AEN-------------------------------EFEIKD

Query:  LENMKYFLGMEVARSKAG--ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKL------------GNLDDQVP--------------------------
        L   +  LGM++ R +    + +S  KY   +L    M   +    P+  + KL            GN+  +VP                          
Subjt:  LENMKYFLGMEVARSKAG--ISVSYRKYTIDLLTETGMLGCRLADIPIEFNYKL------------GNLDDQVP--------------------------

Query:  -----APYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRALLEC-----
              P ++H EAV  ILRYL+ T G  L F   D  I++ YTD+D      +R S++ Y        I+W+SK +  VA S+ EA+Y A  E      
Subjt:  -----APYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRALLEC-----

Query:  ------------EIPFKLFCDNKAAISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSRQ-VVNVLTKGLLRPSFDFYASKLGL
                    +  + ++CD+++AI ++ N + H RTKH+++  H+I+E +D ES+ +L I + +   ++LTK + R  F+     +G+
Subjt:  ------------EIPFKLFCDNKAAISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSRQ-VVNVLTKGLLRPSFDFYASKLGL

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 9.6e-19 | % identity: 27.98
Query:  VYKAENEFEIKDLENMKYFLGMEVARSKAGISVSYRKYTIDLLTETGMLGCRLADIPI----------------------------------EFNYKLGN
        +++  + F +KDL  + YFLG+++    +G+ +S  KY   +L   GML C+    P+                                  + +Y +  
Subjt:  VYKAENEFEIKDLENMKYFLGMEVARSKAGISVSYRKYTIDLLTETGMLGCRLADIPI----------------------------------EFNYKLGN

Query:  LDDQVPAPYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL
        +  ++  P     + + R+LRY+K T   GL   K  K  ++A+ DSDW      R ST+ +CTF+  N+I+W +K++  V+RSS E +YRAL
Subjt:  LDDQVPAPYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 9.9e-40 | % identity: 30.05
Query:  VCKLQKSLYGLKQSPKTW----------------LDRFTTFVKSQGYSQVY-----------------------KAENEFEIKDLENMKYFLGMEVARSK
        VCKL+K+LYGLKQ+P+ W                +   + FV  +G S VY                            F +KD E + YFLG+E  R  
Subjt:  VCKLQKSLYGLKQSPKTW----------------LDRFTTFVKSQGYSQVY-----------------------KAENEFEIKDLENMKYFLGMEVARSK

Query:  AGISVSYRKYTIDLLTETGML-------------------GCRLADIPIEF-----------------NYKLGNLDDQVPAPYEKHMEAVNRILRYLKTT
         G+ +S R+Y +DLL  T M+                   G +L D P E+                 +Y +  L   +  P E+H++A+ RILRYL  T
Subjt:  AGISVSYRKYTIDLLTETGML-------------------GCRLADIPIEF-----------------NYKLGNLDDQVPAPYEKHMEAVNRILRYLKTT

Query:  RGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRALLE--------CEI----------PFKLFCDNKA
           G+  +K +   + AY+D+DW     D +ST+ Y  ++  + I+W SKK+  V RSS EA+YR++          C +          P  ++CDN  
Subjt:  RGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRALLE--------CEI----------PFKLFCDNKA

Query:  AISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSR-QVVNVLTKGLLRPSFDFYASKLGL
        A  +  NPV H R KH+ ID HFI+ ++ S ++ ++++ +  Q+ + LTK L R +F  +ASK+G+
Subjt:  AISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSR-QVVNVLTKGLLRPSFDFYASKLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.8e-09 | % identity: 33.61
Query:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC
        +I  F  DNG E    +L ++ S  GI + +S  +T + NG++ERK+RH++E    L+   S+P   W  A     +LINR+P+ +L  + P   L  + 
Subjt:  QISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVLHFKIPLDCLKESC

Query:  PSIPLIPKVPLRVFGYTAY
        P+        LRVFG   Y
Subjt:  PSIPLIPKVPLRVFGYTAY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 8.4e-39 | % identity: 28.53
Query:  VCKLQKSLYGLKQSPKTWLDRFTT----------------FVKSQGYSQVY-----------------------KAENEFEIKDLENMKYFLGMEVARSK
        VC+L+K++YGLKQ+P+ W     T                FV  +G S +Y                            F +K+ E++ YFLG+E  R  
Subjt:  VCKLQKSLYGLKQSPKTWLDRFTT----------------FVKSQGYSQVY-----------------------KAENEFEIKDLENMKYFLGMEVARSK

Query:  AGISVSYRKYTIDLLTETGML-------------------GCRLADIPIEF-----------------NYKLGNLDDQVPAPYEKHMEAVNRILRYLKTT
         G+ +S R+YT+DLL  T ML                   G +L D P E+                 +Y +  L   +  P + H  A+ R+LRYL  T
Subjt:  AGISVSYRKYTIDLLTETGML-------------------GCRLADIPIEF-----------------NYKLGNLDDQVPAPYEKHMEAVNRILRYLKTT

Query:  RGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKA
           G+  +K +   + AY+D+DW     D +ST+ Y  ++  + I+W SKK+  V RSS EA+YR++                  ++   P  ++CDN  
Subjt:  RGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKA

Query:  AISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSR-QVVNVLTKGLLRPSFDFYASKLGLIGIYVPICG
        A  +  NPV H R KH+ +D HFI+ ++ S ++ ++++ +  Q+ + LTK L R +F  ++ K+G+I +  P CG
Subjt:  AISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSR-QVVNVLTKGLLRPSFDFYASKLGLIGIYVPICG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.6e-08 | % identity: 31.82
Query:  IVDGSLAPIAGKGQISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVL
        I+  SL     + +I     DNG E     L  +LS  GI + +S  +T + NG++ERK+RH++E+   L+   S+P   W  A     +LINR+P+ +L
Subjt:  IVDGSLAPIAGKGQISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAILTVAHLINRMPSRVL

Query:  HFKIPLDCLKESCPSIPLIPKVPLRVFGYTAY
          + P   L    P+        L+VFG   Y
Subjt:  HFKIPLDCLKESCPSIPLIPKVPLRVFGYTAY

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 9.8e-43 | % identity: 30.67
Query:  VCKLQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ--------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSK
        VC L+KS+YGLKQ+ + W  +F+  +   G+ Q                    VY                   + ++ F+++DL  +KYFLG+E+ARS 
Subjt:  VCKLQKSLYGLKQSPKTWLDRFTTFVKSQGYSQ--------------------VY-------------------KAENEFEIKDLENMKYFLGMEVARSK

Query:  AGISVSYRKYTIDLLTETGMLGCRLADIP-----------------------------------IEFNYKLGNLDDQVPAPYEKHMEAVNRILRYLKTTR
        AGI++  RKY +DLL ETG+LGC+ + +P                                   ++ ++ +  L     AP   H +AV +IL Y+K T 
Subjt:  AGISVSYRKYTIDLLTETGMLGCRLADIP-----------------------------------IEFNYKLGNLDDQVPAPYEKHMEAVNRILRYLKTTR

Query:  GKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAA
        G+GL +    +  ++ ++D+ +      R ST+ YC F+  +LI+W+SKK+ VV++SSAEA+YRAL                  L    P  LFCDN AA
Subjt:  GKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL------------------LECEIPFKLFCDNKAA

Query:  ISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSRQVVNVLTKGL---LRPSFDFYASKLGLIGIYVPIC
        I +A N V H+RTKH+E D H ++ER   ++       +    +  T+ L   LR +  +  S  GL G+   IC
Subjt:  ISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSRQVVNVLTKGL---LRPSFDFYASKLGLIGIYVPIC

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 6.8e-20 | % identity: 27.98
Query:  VYKAENEFEIKDLENMKYFLGMEVARSKAGISVSYRKYTIDLLTETGMLGCRLADIPI----------------------------------EFNYKLGN
        +++  + F +KDL  + YFLG+++    +G+ +S  KY   +L   GML C+    P+                                  + +Y +  
Subjt:  VYKAENEFEIKDLENMKYFLGMEVARSKAGISVSYRKYTIDLLTETGMLGCRLADIPI----------------------------------EFNYKLGN

Query:  LDDQVPAPYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL
        +  ++  P     + + R+LRY+K T   GL   K  K  ++A+ DSDW      R ST+ +CTF+  N+I+W +K++  V+RSS E +YRAL
Subjt:  LDDQVPAPYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVARSSAEAKYRAL


Sequences
CDS sequence
ATGCAAGCTTCCTCATTGGGAATAAGTCAGCAACAACTGGAAGAGCTTCAACAACAAATTGCAGCAATTGAGGCTATCTTAGGGACGACATCCAACACTCTTGTACCGAT
GTATTCTAAGAATCCGGTAACCTCGTTCCCTATTTTATCTTCTTCCTATGTGAGTGAGAATGCTAGTGCAACCACACTTGGAGCTATTGTCCAGTCAGGTGGTCCAGGGA
GGTCCTGTGTTCAAGCCCCTGCATTGTCGTTTCCTCCCCAATTAAAATCAATTTCCACTTGTTGGGCCTTTCAGATATTTCAAGCCCACAAAAAGATTCTGCCACAAGAA
GTTATCAACGAAAATGTCCTCCAGGCCTTGTTCACAGCAGAATCTGACACCCTTGGGAATAATCAGAATATTAGAATTGTTGATGGATCCTTGGCTCCCATTGCTGGGAA
AGGGCAGATTTCTCTTTTTGACGGTGATAATGGTCATGAGCTTCAAAACCACTCTCTTAACAAGTTCTTATCCTCTAAAGGAATTGTTAACCAAAGTTCCTGTGCTTACA
CCCTTCAACAAAATGGGGTTGCCGAACGAAAAAATCGTCACCTTTTGGAAGTTGCTCGTTTCCTTATGTTGTCCACTTCTCTTCCTTCCTATCTTTGGGGTAAGGCCATT
CTTACTGTCGCTCATCTCATAAACCGCATGCCTTCTCGTGTTCTTCATTTCAAGATACCATTAGATTGCCTCAAAGAATCCTGTCCCTCCATTCCTCTCATCCCTAAAGT
TCCCCTTCGGGTGTTTGGCTACACCGCCTATGGCCATAGCCATGAGTCTTCTTATCCTACTCTGGTTACCTTATCTGACCCTAACCCTCACGGTATAGTCCTACCAACAA
ATCAAGTTCCTTGGAAAATCTACTATAGAAGGAATCTCAGAAAGGAAGTTGAGTCTCTTGTTGTTCAGACGGCTCTAGTGCAGGATTTTGAACCAATAAGAGATCAAGGT
ATGATTGATTCAATTAATTCATATAGTAATAACAGAATGAGTGAGAATGATATGGGTGAGCAGGTCAGTATTGATGAGGCCATTGTAGACAGAGAGGACAGGATTATTGA
GAACGAGGTTGTTGCTGAAAATACTGAAAATGAAACTAAGTCAAATCATTATGGAAATAGTAGCAGGTATGATCCATCTCTTGATCTTCTTATTGCACTGAGGAAAGGTA
CAAGGTCCTGCACAAAGCACTCCATATGTAATTATGTGTCATACGGGAATCTCTCACCGCAGTTCAGAGCTTTTACTGCCAGCCTTGACTCTACCACAATACCGAAAAAT
ATTCACAGTGCGTTAAAATGTCCTGAATGGAAGAATGTTGTCATGGAAGAAATACCTAGTAGAGGAGGTCTACACGAGTCCTCGTCTGGCTTTGAAGGACAGTTTGGTCA
GCAGGTTTGTAAACTCCAAAAATCCTTATATGGTCTGAAACAATCACCCAAAACTTGGTTGGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGTCAAGTTTACA
AAGCAGAGAATGAATTTGAAATCAAGGATTTGGAAAATATGAAATATTTCCTTGGAATGGAGGTGGCTAGATCTAAAGCAGGTATCTCCGTGTCTTATAGAAAATACACT
ATTGATTTGCTAACCGAGACTGGTATGTTAGGATGCCGTCTTGCTGACATTCCTATTGAATTTAACTATAAACTAGGGAACCTTGATGATCAAGTTCCAGCTCCATATGA
GAAACACATGGAAGCTGTCAACAGAATTTTGAGATACTTGAAAACAACACGTGGTAAAGGGCTAATGTTTAGAAAAATAGACAAAAAGATCATTGAGGCATATACTGACT
CGGATTGGACATGGTTTTTTGTTGATAGAATGTCTACTTCTAGTTATTGTACCTTTGTTTGGGACAATCTTATAACTTGGAGGAGTAAGAAGCGAAGTGTTGTGGCCAGG
AGTAGCGCTGAGGCCAAATATAGAGCGCTATTAGAATGTGAGATACCATTCAAACTTTTTTGTGATAATAAAGCTGCTATTAGTATGGCTAACAACCCTGTTCAACATGA
TAGAACTAAACATGTTGAGATTGATCGACATTTCATCAAAGAAAGACTTGACAGTGAGAGCATATGCATTCTATACATCCCTTCAAGACAGGTTGTTAATGTTCTTACCA
AGGGGCTTCTTAGACCAAGCTTCGACTTTTATGCTAGCAAGTTGGGCCTCATTGGTATTTACGTCCCAATATGTGGTCGTTGGAATATTAGAATATTTATGGAAAGATTA
TGGGAATATTTTCCTTAA
mRNA sequence
ATGCAAGCTTCCTCATTGGGAATAAGTCAGCAACAACTGGAAGAGCTTCAACAACAAATTGCAGCAATTGAGGCTATCTTAGGGACGACATCCAACACTCTTGTACCGAT
GTATTCTAAGAATCCGGTAACCTCGTTCCCTATTTTATCTTCTTCCTATGTGAGTGAGAATGCTAGTGCAACCACACTTGGAGCTATTGTCCAGTCAGGTGGTCCAGGGA
GGTCCTGTGTTCAAGCCCCTGCATTGTCGTTTCCTCCCCAATTAAAATCAATTTCCACTTGTTGGGCCTTTCAGATATTTCAAGCCCACAAAAAGATTCTGCCACAAGAA
GTTATCAACGAAAATGTCCTCCAGGCCTTGTTCACAGCAGAATCTGACACCCTTGGGAATAATCAGAATATTAGAATTGTTGATGGATCCTTGGCTCCCATTGCTGGGAA
AGGGCAGATTTCTCTTTTTGACGGTGATAATGGTCATGAGCTTCAAAACCACTCTCTTAACAAGTTCTTATCCTCTAAAGGAATTGTTAACCAAAGTTCCTGTGCTTACA
CCCTTCAACAAAATGGGGTTGCCGAACGAAAAAATCGTCACCTTTTGGAAGTTGCTCGTTTCCTTATGTTGTCCACTTCTCTTCCTTCCTATCTTTGGGGTAAGGCCATT
CTTACTGTCGCTCATCTCATAAACCGCATGCCTTCTCGTGTTCTTCATTTCAAGATACCATTAGATTGCCTCAAAGAATCCTGTCCCTCCATTCCTCTCATCCCTAAAGT
TCCCCTTCGGGTGTTTGGCTACACCGCCTATGGCCATAGCCATGAGTCTTCTTATCCTACTCTGGTTACCTTATCTGACCCTAACCCTCACGGTATAGTCCTACCAACAA
ATCAAGTTCCTTGGAAAATCTACTATAGAAGGAATCTCAGAAAGGAAGTTGAGTCTCTTGTTGTTCAGACGGCTCTAGTGCAGGATTTTGAACCAATAAGAGATCAAGGT
ATGATTGATTCAATTAATTCATATAGTAATAACAGAATGAGTGAGAATGATATGGGTGAGCAGGTCAGTATTGATGAGGCCATTGTAGACAGAGAGGACAGGATTATTGA
GAACGAGGTTGTTGCTGAAAATACTGAAAATGAAACTAAGTCAAATCATTATGGAAATAGTAGCAGGTATGATCCATCTCTTGATCTTCTTATTGCACTGAGGAAAGGTA
CAAGGTCCTGCACAAAGCACTCCATATGTAATTATGTGTCATACGGGAATCTCTCACCGCAGTTCAGAGCTTTTACTGCCAGCCTTGACTCTACCACAATACCGAAAAAT
ATTCACAGTGCGTTAAAATGTCCTGAATGGAAGAATGTTGTCATGGAAGAAATACCTAGTAGAGGAGGTCTACACGAGTCCTCGTCTGGCTTTGAAGGACAGTTTGGTCA
GCAGGTTTGTAAACTCCAAAAATCCTTATATGGTCTGAAACAATCACCCAAAACTTGGTTGGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGTCAAGTTTACA
AAGCAGAGAATGAATTTGAAATCAAGGATTTGGAAAATATGAAATATTTCCTTGGAATGGAGGTGGCTAGATCTAAAGCAGGTATCTCCGTGTCTTATAGAAAATACACT
ATTGATTTGCTAACCGAGACTGGTATGTTAGGATGCCGTCTTGCTGACATTCCTATTGAATTTAACTATAAACTAGGGAACCTTGATGATCAAGTTCCAGCTCCATATGA
GAAACACATGGAAGCTGTCAACAGAATTTTGAGATACTTGAAAACAACACGTGGTAAAGGGCTAATGTTTAGAAAAATAGACAAAAAGATCATTGAGGCATATACTGACT
CGGATTGGACATGGTTTTTTGTTGATAGAATGTCTACTTCTAGTTATTGTACCTTTGTTTGGGACAATCTTATAACTTGGAGGAGTAAGAAGCGAAGTGTTGTGGCCAGG
AGTAGCGCTGAGGCCAAATATAGAGCGCTATTAGAATGTGAGATACCATTCAAACTTTTTTGTGATAATAAAGCTGCTATTAGTATGGCTAACAACCCTGTTCAACATGA
TAGAACTAAACATGTTGAGATTGATCGACATTTCATCAAAGAAAGACTTGACAGTGAGAGCATATGCATTCTATACATCCCTTCAAGACAGGTTGTTAATGTTCTTACCA
AGGGGCTTCTTAGACCAAGCTTCGACTTTTATGCTAGCAAGTTGGGCCTCATTGGTATTTACGTCCCAATATGTGGTCGTTGGAATATTAGAATATTTATGGAAAGATTA
TGGGAATATTTTCCTTAA
Protein sequence
MQASSLGISQQQLEELQQQIAAIEAILGTTSNTLVPMYSKNPVTSFPILSSSYVSENASATTLGAIVQSGGPGRSCVQAPALSFPPQLKSISTCWAFQIFQAHKKILPQE
VINENVLQALFTAESDTLGNNQNIRIVDGSLAPIAGKGQISLFDGDNGHELQNHSLNKFLSSKGIVNQSSCAYTLQQNGVAERKNRHLLEVARFLMLSTSLPSYLWGKAI
LTVAHLINRMPSRVLHFKIPLDCLKESCPSIPLIPKVPLRVFGYTAYGHSHESSYPTLVTLSDPNPHGIVLPTNQVPWKIYYRRNLRKEVESLVVQTALVQDFEPIRDQG
MIDSINSYSNNRMSENDMGEQVSIDEAIVDREDRIIENEVVAENTENETKSNHYGNSSRYDPSLDLLIALRKGTRSCTKHSICNYVSYGNLSPQFRAFTASLDSTTIPKN
IHSALKCPEWKNVVMEEIPSRGGLHESSSGFEGQFGQQVCKLQKSLYGLKQSPKTWLDRFTTFVKSQGYSQVYKAENEFEIKDLENMKYFLGMEVARSKAGISVSYRKYT
IDLLTETGMLGCRLADIPIEFNYKLGNLDDQVPAPYEKHMEAVNRILRYLKTTRGKGLMFRKIDKKIIEAYTDSDWTWFFVDRMSTSSYCTFVWDNLITWRSKKRSVVAR
SSAEAKYRALLECEIPFKLFCDNKAAISMANNPVQHDRTKHVEIDRHFIKERLDSESICILYIPSRQVVNVLTKGLLRPSFDFYASKLGLIGIYVPICGRWNIRIFMERL
WEYFP