; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G14400 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G14400
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBeta-galactosidase
Genome locationChr7:12899717..12903866
RNA-Seq ExpressionCSPI07G14400
SyntenyCSPI07G14400
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050819.1 Beta-galactosidase [Cucumis melo var. makuwa]1.2e-12848.52Show/hide
Query:  MLLFVRTDPTFPLAIFRG--ENLRKEVVPSTSQSPAPVLDSEPPR-------------------DQGMENLIEPYIKQWD--------------------
        MLL ++TDPTFPLAIFRG   NLRKEV   TSQ PAPV D EPP                    +QG    ++ Y    D                    
Subjt:  MLLFVRTDPTFPLAIFRG--ENLRKEVVPSTSQSPAPVLDSEPPR-------------------DQGMENLIEPYIKQWD--------------------

Query:  ----------------------------ANEC--------------------------------------SLKYKADGTLDRHKTRLVAKGFTQTYGVDY
                                    A EC                                      SLKYKADGTLDRHK RLVAKGFTQTYG+DY
Subjt:  ----------------------------ANEC--------------------------------------SLKYKADGTLDRHKTRLVAKGFTQTYGVDY

Query:  SETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ----------------------FTTFVKSQGYSQGHSDHTL
        SETFSPVAK+NT+RVLLS+AVN+D PLYQ DV+N FLNG+L EEVYMSP PGFEAQFGQ                      FTTFVKSQGYSQGHSDHTL
Subjt:  SETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ----------------------FTTFVKSQGYSQGHSDHTL

Query:  FTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIP----------
        FTK SKT KIA                                 DLENLKYFLGMEVARSKEGISVSQR YTLDLL ET MLGC  AD P          
Subjt:  FTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIP----------

Query:  ------------------------------TFLLNS-------------------------------LTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKD
                                       +L N+                                T  YCTFVWGN VTWRSKKQ VVARSS EA+ 
Subjt:  ------------------------------TFLLNS-------------------------------LTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKD

Query:  TTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKL
          MSLGIC+EIWL K LS LHQECETPLKLFCDNK  ISIANN VQ+D+TKHVEIDRHFIKERLDSGS CIPYIPS+Q  A+VLTK L  P+F+ CVSKL
Subjt:  TTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKL

Query:  GLIDIYIP
        GLIDIY+P
Subjt:  GLIDIYIP

TYJ97179.1 Beta-galactosidase [Cucumis melo var. makuwa]6.5e-13061.12Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV
        YTLDLL ET MLGC  AD P                                                              +L   T GYCTFVWGN V
Subjt:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV

Query:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA
        TWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQECETPLKLFCDNK  ISIANNPVQ+D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A
Subjt:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA

Query:  NVLTKRLLKPNFNFCVSKLGLIDIYIP
        +VLTK LL+P+F+ CVSKLG IDIY+P
Subjt:  NVLTKRLLKPNFNFCVSKLGLIDIYIP

TYJ97179.1 Beta-galactosidase [Cucumis melo var. makuwa]4.8e-0837.82Show/hide
Query:  IEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYSKAAFEVGISLAQSKSTAL
        +EKLL  +QKP IY +G           QK  HAP +S  WA A P  H+TAHP  FYA S VQ S+P GH  P+  +    +    V +S   SK    
Subjt:  IEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYSKAAFEVGISLAQSKSTAL

Query:  ------PMYSKNPSSMPQS
              P++S N    PQ+
Subjt:  ------PMYSKNPSSMPQS

TYJ97179.1 Beta-galactosidase [Cucumis melo var. makuwa]1.1e-12940.3Show/hide
Query:  AQSKSTALPMYSKNPSSMPQSLGLISVDGKNLGILESGVTNHLTGLIQGGRALLGTARDCTFLMRIPHVTTSYGKRWFVT--------FIDDHTRLLPVS
        +Q+K+  L   ++  S MPQSLGLISVDGKN  IL+SG T+HLTG ++   +    A    FL    +       R   T         +DD T     S
Subjt:  AQSKSTALPMYSKNPSSMPQSLGLISVDGKNLGILESGVTNHLTGLIQGGRALLGTARDCTFLMRIPHVTTSYGKRWFVT--------FIDDHTRLLPVS

Query:  TLSPINLRIPNHNLSEFLASKGFTK------------------------TCVLVLLNKM-GWLSEKTVIFWKHV------CLLSIPFISAVINVFIPCP-
        +LS ++L     + SE  +SKG                            C L+L   +  +L    ++   H+      C+L +      +    P   
Subjt:  TLSPINLRIPNHNLSEFLASKGFTK------------------------TCVLVLLNKM-GWLSEKTVIFWKHV------CLLSIPFISAVINVFIPCP-

Query:  -----ENTLLLWMLLFVRTDPT------------------FPLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP-----------------
             EN        F   +PT                   P   +   NLRKEV   TSQ PAPV + EPPRDQGMEN  +P                 
Subjt:  -----ENTLLLWMLLFVRTDPT------------------FPLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP-----------------

Query:  ----------------------------------------------------------YIKQ-----------------------WDANECS--------
                                                                  Y+                         + A EC         
Subjt:  ----------------------------------------------------------YIKQ-----------------------WDANECS--------

Query:  ------------------------------LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNL
                                      LKYK DGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLL +AVN+D PLYQLDV N FLNG+L
Subjt:  ------------------------------LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNL

Query:  GEEVYMSPSPGFEAQFGQ----------------------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA------------------------------
         EEVYMSP PGFEAQFGQ                      FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                              
Subjt:  GEEVYMSPSPGFEAQFGQ----------------------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA------------------------------

Query:  ---DLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIP---------------------------------------------------
           DLENLKYFLGMEVARSKEGISVSQR YTLDLL ET MLGC  AD P                                                   
Subjt:  ---DLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIP---------------------------------------------------

Query:  ---------TFLLNSLTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKH
                   +L   T GYCTFVWGN VTWRSKK  VVARSS EA+   MSLGICEEIWLQK LS LHQECETPLKLFCDNK  ISIANNPVQ+D+TKH
Subjt:  ---------TFLLNSLTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKH

Query:  VEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIYIP
        VEIDRHFIKERLDSGS CIPYIPSSQ  A+VLTK LL+P+F+ CVSKLGLIDIY+P
Subjt:  VEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIYIP

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-12760.66Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV
        YTLDLL ET MLGC  AD P                                                              +L   T GYCTFVWGN V
Subjt:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV

Query:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA
        TWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQE ETPLKLFCDNK  ISIA NPVQ+D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A
Subjt:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA

Query:  NVLTKRLLKPNFNFCVSKLGLIDIYIP
        +VL K LL+P+F+ CVSKLGLIDIY+P
Subjt:  NVLTKRLLKPNFNFCVSKLGLIDIYIP

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]2.3e-1844.97Show/hide
Query:  MVSEQSNNETLENSLGKPQTKTE--AIAAAIEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFG
        MVSEQSNNETLEN+LG+ Q +TE  A AAA+EKLL  +QKP IY  G         + QK  HAP +S  WA A P  H+TAHP  FYA S VQ S+P G
Subjt:  MVSEQSNNETLENSLGKPQTKTE--AIAAAIEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFG

Query:  HSQPYEINQIYSKAAFEVGISLAQSKSTAL------PMYSKNPSSMPQS
        H  P+  +    +    V +S   SK          P++S N    PQ+
Subjt:  HSQPYEINQIYSKAAFEVGISLAQSKSTAL------PMYSKNPSSMPQS

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-12760.66Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV
        YTLDLL ET MLGC  AD P                                                              +L   T GYCTFVWGN V
Subjt:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV

Query:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA
        TWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQE ETPLKLFCDNK  ISIA NPVQ+D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A
Subjt:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA

Query:  NVLTKRLLKPNFNFCVSKLGLIDIYIP
        +VL K LL+P+F+ CVSKLGLIDIY+P
Subjt:  NVLTKRLLKPNFNFCVSKLGLIDIYIP

TrEMBL top hitse value%identityAlignment
A0A5A7SW06 Beta-galactosidase3.3e-0259.52Show/hide
Query:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP
        P   +   NLRKEV   TSQ PAPV + EPPRDQGMEN  +P
Subjt:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP

A0A5A7SW06 Beta-galactosidase2.7e-12656.93Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP----------------------------------------------------------------TFLLNS----------
        YTLDLL ET MLGC  AD P                                                                 +L N+          
Subjt:  YTLDLLIETSMLGCCLADIP----------------------------------------------------------------TFLLNS----------

Query:  ---------------------LTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQ
                              T GYCTFVWGN VTWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQECETPLKLFCDNK  ISIANNPVQ
Subjt:  ---------------------LTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQ

Query:  YDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIYIP
        +D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A+VLTK LL+P+F+ CVSKLGLIDIY+P
Subjt:  YDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIYIP

A0A5D3BDV4 Beta-galactosidase3.1e-13061.12Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV
        YTLDLL ET MLGC  AD P                                                              +L   T GYCTFVWGN V
Subjt:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV

Query:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA
        TWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQECETPLKLFCDNK  ISIANNPVQ+D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A
Subjt:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA

Query:  NVLTKRLLKPNFNFCVSKLGLIDIYIP
        +VLTK LL+P+F+ CVSKLG IDIY+P
Subjt:  NVLTKRLLKPNFNFCVSKLGLIDIYIP

A0A5D3BDV4 Beta-galactosidase2.3e-0837.82Show/hide
Query:  IEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYSKAAFEVGISLAQSKSTAL
        +EKLL  +QKP IY +G           QK  HAP +S  WA A P  H+TAHP  FYA S VQ S+P GH  P+  +    +    V +S   SK    
Subjt:  IEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYSKAAFEVGISLAQSKSTAL

Query:  ------PMYSKNPSSMPQS
              P++S N    PQ+
Subjt:  ------PMYSKNPSSMPQS

A0A5D3BDV4 Beta-galactosidase3.3e-0259.52Show/hide
Query:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP
        P   +   NLRKEV   TSQ PAPV + EPPRDQGMEN  +P
Subjt:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP

A0A5D3BDV4 Beta-galactosidase5.4e-13040.3Show/hide
Query:  AQSKSTALPMYSKNPSSMPQSLGLISVDGKNLGILESGVTNHLTGLIQGGRALLGTARDCTFLMRIPHVTTSYGKRWFVT--------FIDDHTRLLPVS
        +Q+K+  L   ++  S MPQSLGLISVDGKN  IL+SG T+HLTG ++   +    A    FL    +       R   T         +DD T     S
Subjt:  AQSKSTALPMYSKNPSSMPQSLGLISVDGKNLGILESGVTNHLTGLIQGGRALLGTARDCTFLMRIPHVTTSYGKRWFVT--------FIDDHTRLLPVS

Query:  TLSPINLRIPNHNLSEFLASKGFTK------------------------TCVLVLLNKM-GWLSEKTVIFWKHV------CLLSIPFISAVINVFIPCP-
        +LS ++L     + SE  +SKG                            C L+L   +  +L    ++   H+      C+L +      +    P   
Subjt:  TLSPINLRIPNHNLSEFLASKGFTK------------------------TCVLVLLNKM-GWLSEKTVIFWKHV------CLLSIPFISAVINVFIPCP-

Query:  -----ENTLLLWMLLFVRTDPT------------------FPLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP-----------------
             EN        F   +PT                   P   +   NLRKEV   TSQ PAPV + EPPRDQGMEN  +P                 
Subjt:  -----ENTLLLWMLLFVRTDPT------------------FPLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP-----------------

Query:  ----------------------------------------------------------YIKQ-----------------------WDANECS--------
                                                                  Y+                         + A EC         
Subjt:  ----------------------------------------------------------YIKQ-----------------------WDANECS--------

Query:  ------------------------------LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNL
                                      LKYK DGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLL +AVN+D PLYQLDV N FLNG+L
Subjt:  ------------------------------LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNL

Query:  GEEVYMSPSPGFEAQFGQ----------------------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA------------------------------
         EEVYMSP PGFEAQFGQ                      FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                              
Subjt:  GEEVYMSPSPGFEAQFGQ----------------------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA------------------------------

Query:  ---DLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIP---------------------------------------------------
           DLENLKYFLGMEVARSKEGISVSQR YTLDLL ET MLGC  AD P                                                   
Subjt:  ---DLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIP---------------------------------------------------

Query:  ---------TFLLNSLTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKH
                   +L   T GYCTFVWGN VTWRSKK  VVARSS EA+   MSLGICEEIWLQK LS LHQECETPLKLFCDNK  ISIANNPVQ+D+TKH
Subjt:  ---------TFLLNSLTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKH

Query:  VEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIYIP
        VEIDRHFIKERLDSGS CIPYIPSSQ  A+VLTK LL+P+F+ CVSKLGLIDIY+P
Subjt:  VEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIYIP

A0A5D3BJK7 Beta-galactosidase8.0e-0937.82Show/hide
Query:  IEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYSKAAFEVGISLAQSKSTAL
        +EKLL  +QKP IY  G         + QK+ HAP +S  WA A P  H+TAHP  FYA S VQ S+P GH  P+  +    +    V +S   SK    
Subjt:  IEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYSKAAFEVGISLAQSKSTAL

Query:  ------PMYSKNPSSMPQS
              P++S N    PQ+
Subjt:  ------PMYSKNPSSMPQS

A0A5D3BJK7 Beta-galactosidase3.3e-0259.52Show/hide
Query:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP
        P   +   NLRKEV   TSQ PAPV + EPPRDQGMEN  +P
Subjt:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP

A0A5D3E603 Beta-galactosidase5.0e-12860.66Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV
        YTLDLL ET MLGC  AD P                                                              +L   T GYCTFVWGN V
Subjt:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV

Query:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA
        TWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQE ETPLKLFCDNK  ISIA NPVQ+D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A
Subjt:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA

Query:  NVLTKRLLKPNFNFCVSKLGLIDIYIP
        +VL K LL+P+F+ CVSKLGLIDIY+P
Subjt:  NVLTKRLLKPNFNFCVSKLGLIDIYIP

A0A5D3E603 Beta-galactosidase1.1e-1844.97Show/hide
Query:  MVSEQSNNETLENSLGKPQTKTE--AIAAAIEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFG
        MVSEQSNNETLEN+LG+ Q +TE  A AAA+EKLL  +QKP IY  G         + QK  HAP +S  WA A P  H+TAHP  FYA S VQ S+P G
Subjt:  MVSEQSNNETLENSLGKPQTKTE--AIAAAIEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFG

Query:  HSQPYEINQIYSKAAFEVGISLAQSKSTAL------PMYSKNPSSMPQS
        H  P+  +    +    V +S   SK          P++S N    PQ+
Subjt:  HSQPYEINQIYSKAAFEVGISLAQSKSTAL------PMYSKNPSSMPQS

A0A5D3E603 Beta-galactosidase3.3e-0259.52Show/hide
Query:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP
        P   +   NLRKEV   TSQ PAPV + EPPRDQGMEN  +P
Subjt:  PLAIFRGENLRKEVVPSTSQSPAPVLDSEPPRDQGMENLIEP

A0A5D3E603 Beta-galactosidase5.0e-12860.66Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------
        SLKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAK+NT+RVLLS+AVN+D PLYQLDV N FLNG+L EEVYMSP PGFEAQFGQ           
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQ-----------

Query:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN
                   FTTFVKSQGYSQGHSDHTLFTK SKTGKIA                                 DL NLKYFLGMEVARSKEGISVSQR 
Subjt:  -----------FTTFVKSQGYSQGHSDHTLFTKVSKTGKIA---------------------------------DLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV
        YTLDLL ET MLGC  AD P                                                              +L   T GYCTFVWGN V
Subjt:  YTLDLLIETSMLGCCLADIP------------------------------------------------------------TFLLNSLTFGYCTFVWGNFV

Query:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA
        TWRSKKQ VVARSS EA+   MSLGICEEIWLQK LS LHQE ETPLKLFCDNK  ISIA NPVQ+D+TKHVEIDRHFIKERLDSGS CIPYIPSSQ  A
Subjt:  TWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFA

Query:  NVLTKRLLKPNFNFCVSKLGLIDIYIP
        +VL K LL+P+F+ CVSKLGLIDIY+P
Subjt:  NVLTKRLLKPNFNFCVSKLGLIDIYIP

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.5e-3123.91Show/hide
Query:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQ--------------
        S+KY   G   R+K RLVA+GFTQ Y +DY ETF+PVA++++ R +LS+ +  +  ++Q+DV   FLNG L EE+YM    G                  
Subjt:  SLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQ--------------

Query:  -------FGQFTTFVKSQGYSQGHSDHTLF------------------TKVSKTG----------------KIADLENLKYFLGMEVARSKEGISVSQRN
               F  F   +K   +     D  ++                    V  TG                ++ DL  +K+F+G+ +   ++ I +SQ  
Subjt:  -------FGQFTTFVKSQGYSQGHSDHTLF------------------TKVSKTG----------------KIADLENLKYFLGMEVARSKEGISVSQRN

Query:  YTLDLLIE--------------------------------TSMLGCCL------------------------------------------ADIPTFLLNS
        Y   +L +                                 S++GC +                                           D+      +
Subjt:  YTLDLLIE--------------------------------TSMLGCCL------------------------------------------ADIPTFLLNS

Query:  LTF-----GYCTFVWG-------------------NFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANN
        L F     GY    W                    N + W +K+Q  VA SS EA+   +   + E +WL+  L+ ++ + E P+K++ DN+  ISIANN
Subjt:  LTF-----GYCTFVWG-------------------NFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANN

Query:  PVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLI
        P  + + KH++I  HF +E++ +   C+ YIP+    A++ TK L    F     KLGL+
Subjt:  PVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-2424.03Show/hide
Query:  LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQ---------------
        LK   D  L R+K RLV KGF Q  G+D+ E FSPV KM ++R +LS+A + D  + QLDV   FL+G+L EE+YM    GFE                 
Subjt:  LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQ---------------

Query:  --------FGQFTTFVKSQGYSQGHSDHTLFTK---------------------------------VSKTGKIADLENLKYFLGMEVARSKEG--ISVSQ
                + +F +F+KSQ Y + +SD  ++ K                                 +SK+  + DL   +  LGM++ R +    + +SQ
Subjt:  --------FGQFTTFVKSQGYSQGHSDHTLFTK---------------------------------VSKTGKIADLENLKYFLGMEVARSKEG--ISVSQ

Query:  RNYTLDLL----------IETSMLG------------------------------------CCLADI-------PTFLLN--------------------
          Y   +L          + T + G                                    C   DI         FL N                    
Subjt:  RNYTLDLL----------IETSMLG------------------------------------CCLADI-------PTFLLN--------------------

Query:  ----------------------------SLTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSI--LHQECETPLKLFCDNKT
                                      + GY     G  ++W+SK Q+ VA S+ EA+    +    E IWL++FL    LHQ+      ++CD+++
Subjt:  ----------------------------SLTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSI--LHQECETPLKLFCDNKT

Query:  GISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGL
         I ++ N + + +TKH+++  H+I+E +D  S  +  I +++  A++LTK + +  F  C   +G+
Subjt:  GISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGL

P92520 Uncharacterized mitochondrial protein AtMg008202.7e-0654.17Show/hide
Query:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIA
        K  +DGTLDR K RLVAKGF Q  G+ + ET+SPV +  T+R +L++A
Subjt:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-3827.03Show/hide
Query:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGF-------------------
        KY +DG+L+R+K RLVAKG+ Q  G+DY+ETFSPV K  ++R++L +AV++  P+ QLDVNN FL G L ++VYMS  PGF                   
Subjt:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGF-------------------

Query:  ----EAQFGQFTTFVKSQGYSQGHSDHTLFT--------------------------------KVSKTGKIADLENLKYFLGMEVARSKEGISVSQRNYT
             A + +   ++ + G+    SD +LF                                  +S+   + D E L YFLG+E  R   G+ +SQR Y 
Subjt:  ----EAQFGQFTTFVKSQGYSQGHSDHTLFT--------------------------------KVSKTGKIADLENLKYFLGMEVARSKEGISVSQRNYT

Query:  LDLLIETSML-------------------GCCLADIPT--------------------FLLNSL------------------------------------
        LDLL  T+M+                   G  L D PT                    + +N L                                    
Subjt:  LDLLIETSML-------------------GCCLADIPT--------------------FLLNSL------------------------------------

Query:  ---------------------TFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQY
                             T GY  ++  + ++W SKKQ+ V RSS EA+  +++    E  W+   L+ L      P  ++CDN     +  NPV +
Subjt:  ---------------------TFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQY

Query:  DKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGL
         + KH+ ID HFI+ ++ SG+  + ++ +    A+ LTK L +  F    SK+G+
Subjt:  DKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.2e-3926.48Show/hide
Query:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGF-------------------
        K+ +DG+L+R+K RLVAKG+ Q  G+DY+ETFSPV K  ++R++L +AV++  P+ QLDVNN FL G L +EVYMS  PGF                   
Subjt:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGF-------------------

Query:  ----EAQFGQFTTFVKSQGYSQGHSDHTLFT--------------------------------KVSKTGKIADLENLKYFLGMEVARSKEGISVSQRNYT
             A + +  T++ + G+    SD +LF                                  +S+   + + E+L YFLG+E  R  +G+ +SQR YT
Subjt:  ----EAQFGQFTTFVKSQGYSQGHSDHTLFT--------------------------------KVSKTGKIADLENLKYFLGMEVARSKEGISVSQRNYT

Query:  LDLLIETSML-----GCCLADIPTFLLNS-----------------------------------------------------------------------
        LDLL  T+ML        +A  P   L+S                                                                       
Subjt:  LDLLIETSML-----GCCLADIPTFLLNS-----------------------------------------------------------------------

Query:  -------------------LTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYD
                            T GY  ++  + ++W SKKQ+ V RSS EA+  +++    E  W+   L+ L  +   P  ++CDN     +  NPV + 
Subjt:  -------------------LTFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYD

Query:  KTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDI
        + KH+ +D HFI+ ++ SG+  + ++ +    A+ LTK L +  F     K+G+I +
Subjt:  KTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-4930.24Show/hide
Query:  LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFG-------------
        +KY +DGT++R+K RLVAKG+TQ  G+D+ ETFSPV K+ +++++L+I+   +  L+QLD++N FLNG+L EE+YM   PG+ A+ G             
Subjt:  LKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFG-------------

Query:  --------------QFTTFVKSQGYSQGHSDHTLFTKVSKT--------------------------------GKIADLENLKYFLGMEVARSKEGISVS
                      +F+  +   G+ Q HSDHT F K++ T                                 K+ DL  LKYFLG+E+ARS  GI++ 
Subjt:  --------------QFTTFVKSQGYSQGHSDHTLFTKVSKT--------------------------------GKIADLENLKYFLGMEVARSKEGISVS

Query:  QRNYTLDLLIETSMLGCCLADIP--------------------------------------TFLLNSL--------------------------------
        QR Y LDLL ET +LGC  + +P                                      +F +N L                                
Subjt:  QRNYTLDLLIETSMLGCCLADIP--------------------------------------TFLLNSL--------------------------------

Query:  -------------------------TFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANN
                                 T GYC F+  + ++W+SKKQ+VV++SS EA+   +S    E +WL +F   L      P  LFCDN   I IA N
Subjt:  -------------------------TFGYCTFVWGNFVTWRSKKQRVVARSSVEAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANN

Query:  PVQYDKTKHVEIDRHFIKER
         V +++TKH+E D H ++ER
Subjt:  PVQYDKTKHVEIDRHFIKER

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-0754.17Show/hide
Query:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIA
        K  +DGTLDR K RLVAKGF Q  G+ + ET+SPV +  T+R +L++A
Subjt:  KYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCAAAGTAACAACGAAACCCTAGAAAACAGTTTAGGAAAACCCCAAACCAAAACAGAGGCTATCGCCGCCGCCATAGAAAAACTACTCTACCGAATTCA
AAAGCCGTCGATCTACACAGTGGGCCAACCCTCGACGTCGTCTGTGCAGTCAAACAGTCAGAAAGTACCTCACGCACCGCCTCTGTCAAACGTGTGGGCTCAGGCGCTGC
CGTTCGTTCATCTCACCGCCCATCCCAGACGCTTCTACGCATCGTCGCCTGTCCAAACGTCTCACCCTTTCGGTCATTCGCAGCCCTACGAGATCAACCAAATCTACAGC
AAAGCTGCTTTTGAAGTTGGTATATCCTTGGCCCAGTCTAAATCGACCGCTCTACCAATGTATTCAAAGAACCCGTCAAGTATGCCTCAATCCCTTGGCCTTATTAGTGT
TGATGGGAAGAATCTCGGGATTTTGGAATCAGGGGTCACAAATCATTTAACAGGACTCATTCAGGGAGGACGGGCACTACTCGGTACAGCAAGGGACTGTACATTCTTGA
TGAGAATACCTCATGTCACCACCTCATATGGGAAACGGTGGTTTGTAACTTTCATTGATGATCATACCCGTCTCTTACCTGTGTCTACCTTATCGCCAATAAATCTGAGA
ATTCCAAACCATAACCTTAGTGAATTCTTAGCCTCCAAGGGGTTCACCAAAACTTGTGTGCTTGTACTCCTCAACAAAATGGGGTGGTTGAGCGAAAAAACCGTCATCTT
CTGGAAGCATGTGTGTTTGTTGAGTATCCCCTTCATTAGCGCGGTTATAAATGTTTTCATCCCCTGTCCAGAAAATACTTTGTTACTATGGATGTTACTTTTTGTGAGGA
CCGACCCTACTTTCCCGTTAGCCATTTTTAGGGGAGAGAACCTGAGAAAGGAAGTTGTGCCCTCTACTAGTCAATCGCCGGCTCCAGTCCTAGACTCTGAACCTCCTCGA
GATCAAGGGATGGAAAACCTTATTGAACCTTACATAAAACAGTGGGATGCAAATGAGTGTTCTCTCAAATACAAAGCAGATGGAACACTTGACAGACACAAGACAAGGTT
AGTTGCAAAGGGATTTACTCAAACCTATGGTGTTGATTATTCAGAAACCTTTTCTCCAGTTGCTAAGATGAACACTATGAGAGTCTTGCTATCTATTGCTGTGAACCAAG
ATTGCCCTCTATACCAGCTAGATGTTAACAATGTCTTTTTGAATGGAAACCTAGGGGAGGAAGTCTACATGAGCCCCTCGCCTGGATTTGAAGCCCAGTTTGGTCAGTTT
ACTACTTTTGTCAAGTCCCAAGGGTACAGTCAGGGACATTCTGACCATACTTTATTTACAAAGGTTTCTAAGACAGGGAAGATTGCAGATTTGGAAAATCTGAAATATTT
CCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGCATCTCCGTGTCTCAAAGAAACTACACCCTTGATTTGCTAATTGAGACAAGTATGTTGGGATGTTGTCTTGCTGACA
TTCCAACATTCCTATTGAATTCATTGACCTTCGGTTATTGTACCTTCGTTTGGGGCAATTTTGTAACTTGGAGGAGTAAAAAGCAACGTGTAGTGGCCAGGAGCAGCGTT
GAGGCCAAAGACACAACTATGAGTTTGGGAATATGTGAGGAAATTTGGCTCCAAAAATTCTTGTCAATTCTTCATCAGGAATGTGAGACACCATTGAAGTTGTTTTGTGA
TAATAAAACTGGTATTAGTATTGCTAACAACCCAGTACAATATGATAAAACTAAACATGTTGAGATTGATCGGCATTTCATCAAAGAAAGACTCGACAGTGGGAGCACAT
GCATTCCGTACATTCCTTCGAGCCAACCGTTCGCTAATGTTCTCACCAAACGGCTTCTCAAACCAAACTTCAACTTTTGTGTTAGCAAGTTGGGCCTTATTGATATTTAC
ATCCCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCAAAGTAACAACGAAACCCTAGAAAACAGTTTAGGAAAACCCCAAACCAAAACAGAGGCTATCGCCGCCGCCATAGAAAAACTACTCTACCGAATTCA
AAAGCCGTCGATCTACACAGTGGGCCAACCCTCGACGTCGTCTGTGCAGTCAAACAGTCAGAAAGTACCTCACGCACCGCCTCTGTCAAACGTGTGGGCTCAGGCGCTGC
CGTTCGTTCATCTCACCGCCCATCCCAGACGCTTCTACGCATCGTCGCCTGTCCAAACGTCTCACCCTTTCGGTCATTCGCAGCCCTACGAGATCAACCAAATCTACAGC
AAAGCTGCTTTTGAAGTTGGTATATCCTTGGCCCAGTCTAAATCGACCGCTCTACCAATGTATTCAAAGAACCCGTCAAGTATGCCTCAATCCCTTGGCCTTATTAGTGT
TGATGGGAAGAATCTCGGGATTTTGGAATCAGGGGTCACAAATCATTTAACAGGACTCATTCAGGGAGGACGGGCACTACTCGGTACAGCAAGGGACTGTACATTCTTGA
TGAGAATACCTCATGTCACCACCTCATATGGGAAACGGTGGTTTGTAACTTTCATTGATGATCATACCCGTCTCTTACCTGTGTCTACCTTATCGCCAATAAATCTGAGA
ATTCCAAACCATAACCTTAGTGAATTCTTAGCCTCCAAGGGGTTCACCAAAACTTGTGTGCTTGTACTCCTCAACAAAATGGGGTGGTTGAGCGAAAAAACCGTCATCTT
CTGGAAGCATGTGTGTTTGTTGAGTATCCCCTTCATTAGCGCGGTTATAAATGTTTTCATCCCCTGTCCAGAAAATACTTTGTTACTATGGATGTTACTTTTTGTGAGGA
CCGACCCTACTTTCCCGTTAGCCATTTTTAGGGGAGAGAACCTGAGAAAGGAAGTTGTGCCCTCTACTAGTCAATCGCCGGCTCCAGTCCTAGACTCTGAACCTCCTCGA
GATCAAGGGATGGAAAACCTTATTGAACCTTACATAAAACAGTGGGATGCAAATGAGTGTTCTCTCAAATACAAAGCAGATGGAACACTTGACAGACACAAGACAAGGTT
AGTTGCAAAGGGATTTACTCAAACCTATGGTGTTGATTATTCAGAAACCTTTTCTCCAGTTGCTAAGATGAACACTATGAGAGTCTTGCTATCTATTGCTGTGAACCAAG
ATTGCCCTCTATACCAGCTAGATGTTAACAATGTCTTTTTGAATGGAAACCTAGGGGAGGAAGTCTACATGAGCCCCTCGCCTGGATTTGAAGCCCAGTTTGGTCAGTTT
ACTACTTTTGTCAAGTCCCAAGGGTACAGTCAGGGACATTCTGACCATACTTTATTTACAAAGGTTTCTAAGACAGGGAAGATTGCAGATTTGGAAAATCTGAAATATTT
CCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGCATCTCCGTGTCTCAAAGAAACTACACCCTTGATTTGCTAATTGAGACAAGTATGTTGGGATGTTGTCTTGCTGACA
TTCCAACATTCCTATTGAATTCATTGACCTTCGGTTATTGTACCTTCGTTTGGGGCAATTTTGTAACTTGGAGGAGTAAAAAGCAACGTGTAGTGGCCAGGAGCAGCGTT
GAGGCCAAAGACACAACTATGAGTTTGGGAATATGTGAGGAAATTTGGCTCCAAAAATTCTTGTCAATTCTTCATCAGGAATGTGAGACACCATTGAAGTTGTTTTGTGA
TAATAAAACTGGTATTAGTATTGCTAACAACCCAGTACAATATGATAAAACTAAACATGTTGAGATTGATCGGCATTTCATCAAAGAAAGACTCGACAGTGGGAGCACAT
GCATTCCGTACATTCCTTCGAGCCAACCGTTCGCTAATGTTCTCACCAAACGGCTTCTCAAACCAAACTTCAACTTTTGTGTTAGCAAGTTGGGCCTTATTGATATTTAC
ATCCCAGCTTGA
Protein sequenceShow/hide protein sequence
MVSEQSNNETLENSLGKPQTKTEAIAAAIEKLLYRIQKPSIYTVGQPSTSSVQSNSQKVPHAPPLSNVWAQALPFVHLTAHPRRFYASSPVQTSHPFGHSQPYEINQIYS
KAAFEVGISLAQSKSTALPMYSKNPSSMPQSLGLISVDGKNLGILESGVTNHLTGLIQGGRALLGTARDCTFLMRIPHVTTSYGKRWFVTFIDDHTRLLPVSTLSPINLR
IPNHNLSEFLASKGFTKTCVLVLLNKMGWLSEKTVIFWKHVCLLSIPFISAVINVFIPCPENTLLLWMLLFVRTDPTFPLAIFRGENLRKEVVPSTSQSPAPVLDSEPPR
DQGMENLIEPYIKQWDANECSLKYKADGTLDRHKTRLVAKGFTQTYGVDYSETFSPVAKMNTMRVLLSIAVNQDCPLYQLDVNNVFLNGNLGEEVYMSPSPGFEAQFGQF
TTFVKSQGYSQGHSDHTLFTKVSKTGKIADLENLKYFLGMEVARSKEGISVSQRNYTLDLLIETSMLGCCLADIPTFLLNSLTFGYCTFVWGNFVTWRSKKQRVVARSSV
EAKDTTMSLGICEEIWLQKFLSILHQECETPLKLFCDNKTGISIANNPVQYDKTKHVEIDRHFIKERLDSGSTCIPYIPSSQPFANVLTKRLLKPNFNFCVSKLGLIDIY
IPA