; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019041 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019041
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPapain family cysteine protease
Genome locationchr5:37919677..37920758
RNA-Seq ExpressionLag0019041
SyntenyLag0019041
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR025660 - Cysteine peptidase, histidine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KDO63024.1 hypothetical protein CISIN_1g018958mg [Citrus sinensis]7.7e-1637.59Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLECMFNEGC---
        TS DWRD   +TP++NQK    CWA  A  A+E +  I  G N +QLS Q L++C    N   +   S+E  +   I   G          +FN  C   
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLECMFNEGC---

Query:  --HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE
          HAV +VGF     G  YW++KNSWG   G+ GY KI ++
Subjt:  --HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE

OAE33067.1 hypothetical protein AXG93_1913s1690 [Marchantia polymorpha subsp. ruderalis]2.1e-1330.98Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTG----------------DLQ
        T  DWR    +T V++Q +   CWA    GA+ESL+ I+ G N + LS Q L++C +  N Y       + G+   +   G                +  
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTG----------------DLQ

Query:  RTNRL--------ECMFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT
        +  R+        + ++N  C     H V VVG+  D  GQ+YW+VKNSWGE  GE GY ++ Q  V    G   +     YPT
Subjt:  RTNRL--------ECMFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT

OBS76496.1 hypothetical protein A6R68_17052, partial [Neotoma lepida]1.2e-1330.3Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI-----------ICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLE
        S DWR H+ +TPV++Q +   CWA  A G++E           + LS Q L++C            +    ++   ++  +    S P    +       
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI-----------ICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLE

Query:  CMFNEGCHAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT
        C  N   HA+ VVG+  +S G+KYW+VKNSWGE+ G  GY K++++  +    + Y I    YPT
Subjt:  CMFNEGCHAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT

XP_006421530.1 ervatamin-B [Citrus clementina]2.7e-1328.28Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCII----------CPNVYEVEIESKEIGYILSIP------------
        TS DWR+   +TP++NQ +   CWA  A  A+E +  I  G N + LS Q +++C I            N ++  I+++ I      P            
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCII----------CPNVYEVEIESKEIGYILSIP------------

Query:  -------------KTGDLQRTNRLECM---------------------FNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI
                     K  D Q   +   M                     FN GC     HAV +VGF     G KYW++KNSWGE+ GE GY +I +++
Subjt:  -------------KTGDLQRTNRLECM---------------------FNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI

XP_012647628.1 Papain family cysteine protease [Babesia microti strain RI]1.2e-1328.06Show/hide
Query:  FDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI----ICP--NVYEVEIESKEI--GYIL--SIPKTGDLQRTNRLEC-
        FDWRD +V++PVR Q+    CWAI AAGAI++++NI++  + +  SPQ+L+NC+     C    V  + IE  ++  G  +   +P   + Q+     C 
Subjt:  FDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI----ICP--NVYEVEIESKEI--GYIL--SIPKTGDLQRTNRLEC-

Query:  ------------------------------------------MFNEGC-----HAVFVVGFDI-DSTGQKYWIVKNSWGEEGGECGYGKISQEIVH
                                                  ++N  C     HA+ + G+   DS   +YWI KNSWG   G+ GY  +S++  H
Subjt:  ------------------------------------------MFNEGC-----HAVFVVGFDI-DSTGQKYWIVKNSWGEEGGECGYGKISQEIVH

TrEMBL top hitse value%identityAlignment
A0A067F6K1 Uncharacterized protein3.7e-1637.59Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLECMFNEGC---
        TS DWRD   +TP++NQK    CWA  A  A+E +  I  G N +QLS Q L++C    N   +   S+E  +   I   G          +FN  C   
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLECMFNEGC---

Query:  --HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE
          HAV +VGF     G  YW++KNSWG   G+ GY KI ++
Subjt:  --HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE

A0A176WIY6 Uncharacterized protein1.0e-1330.98Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTG----------------DLQ
        T  DWR    +T V++Q +   CWA    GA+ESL+ I+ G N + LS Q L++C +  N Y       + G+   +   G                +  
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTG----------------DLQ

Query:  RTNRL--------ECMFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT
        +  R+        + ++N  C     H V VVG+  D  GQ+YW+VKNSWGE  GE GY ++ Q  V    G   +     YPT
Subjt:  RTNRL--------ECMFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT

A0A1A6HE00 Uncharacterized protein (Fragment)5.9e-1430.3Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI-----------ICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLE
        S DWR H+ +TPV++Q +   CWA  A G++E           + LS Q L++C            +    ++   ++  +    S P    +       
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI-----------ICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLE

Query:  CMFNEGCHAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT
        C  N   HA+ VVG+  +S G+KYW+VKNSWGE+ G  GY K++++  +    + Y I    YPT
Subjt:  CMFNEGCHAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPT

A0A7N2LH65 Uncharacterized protein9.2e-1529.53Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKE----IGYILSIPKTGDLQRTNRLECMFNEGC
        + DWR+   +TP++NQ++   CWA  A  A+E +  I++G N + LS Q L++C+   N    ++ S      +  + + P +  ++ ++     ++ G 
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKE----IGYILSIPKTGDLQRTNRLECMFNEGC

Query:  ----------HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE
                  HAV V+G+ +   G KYW++KNSWG   GE GY +I ++
Subjt:  ----------HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE

I7J8U7 Papain family cysteine protease5.9e-1428.06Show/hide
Query:  FDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI----ICP--NVYEVEIESKEI--GYIL--SIPKTGDLQRTNRLEC-
        FDWRD +V++PVR Q+    CWAI AAGAI++++NI++  + +  SPQ+L+NC+     C    V  + IE  ++  G  +   +P   + Q+     C 
Subjt:  FDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCI----ICP--NVYEVEIESKEI--GYIL--SIPKTGDLQRTNRLEC-

Query:  ------------------------------------------MFNEGC-----HAVFVVGFDI-DSTGQKYWIVKNSWGEEGGECGYGKISQEIVH
                                                  ++N  C     HA+ + G+   DS   +YWI KNSWG   G+ GY  +S++  H
Subjt:  ------------------------------------------MFNEGC-----HAVFVVGFDI-DSTGQKYWIVKNSWGEEGGECGYGKISQEIVH

SwissProt top hitse value%identityAlignment
O91466 Viral cathepsin1.4e-1229.53Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN---------------------------VYEVEIESKEIGYI
        + DWRD + +TPV+NQ     CWA      IESL+NI++ +  L LS Q+L+NC    N                            Y  +   K+  + 
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN---------------------------VYEVEIESKEIGYI

Query:  LSI--PKTGDLQRTNRLE----------------------------CMFNEGC-HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE
        LSI   +   LQ  N+L                             C  NEG  HAV +VG+ +      YWI+KNSWG E GE GY ++ ++
Subjt:  LSI--PKTGDLQRTNRLE----------------------------CMFNEGC-HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQE

O97397 Cathepsin L-like proteinase2.4e-1226.05Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCII------CPNVYEV-------------------------------
        S DWR   V+ PVRNQ     CWA+  A AIES   I+ G + + LSPQ L++C        C   + V                               
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCII------CPNVYEV-------------------------------

Query:  -----------EIESKEIGYILSIPKTGDLQRT--------------NRLECMFNEGCHAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVH
                   ++ + E     ++   G +                 +   C+ +   H V VVG+ I++ GQKYWI+KN+WG + GE GY ++ ++  H
Subjt:  -----------EIESKEIGYILSIPKTGDLQRT--------------NRLECMFNEGCHAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVH

Query:  TLRGSKYLIEMLTYP
        +    K    M +YP
Subjt:  TLRGSKYLIEMLTYP

P80884 Ananain4.4e-1426.8Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCII--------CPNVYEVEIESKEIGYILSIP--------KTGD---
        S DWRD   +T V+NQ R   CWA  +   +ES++ I+ G N + LS Q +++C +            Y   I +K +      P        KT     
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCII--------CPNVYEVEIESKEIGYILSIP--------KTGD---

Query:  ---------LQRTNRLECM-------------------------FNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI
                 +QR N    M                         F   C     HA+ ++G+  DS+G+K+WIV+NSWG   GE GY ++++++
Subjt:  ---------LQRTNRLECM-------------------------FNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP14.1e-1227.86Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN-------------------------VYEVEIE------SK
        TS DWR +  +TPV+NQ +   CWA     A+E ++ I        LS Q L++C    N                         VY  +        +K
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN-------------------------VYEVEIE------SK

Query:  EIGYILSIPKTGDLQRTNRLECM---------------------FNEGC----------HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIV
        E   ++SI    D+ + +  + M                     ++EG           H V VVG+     G KYWIVKNSWGEE GE GY ++ + I 
Subjt:  EIGYILSIPKTGDLQRTNRLECM---------------------FNEGC----------HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIV

Query:  H
        H
Subjt:  H

Q9PYY5 Viral cathepsin1.7e-1329.38Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN---------VYEVEIESKEIGYILSIPKTG-----------
        SFDWRD N +T V+ QK    CWA  A   IESL++I+H N +L LS Q L++C    N          +E  I +  I Y    P TG           
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN---------VYEVEIESKEIGYILSIPKTG-----------

Query:  ---------DLQRTNRL----------------------------ECMFNEGC-HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI
                 DL+   +L                             C  + G  H V +VG+       KYW +KNSWG + GE G+ +I +++
Subjt:  ---------DLQRTNRL----------------------------ECMFNEGC-HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI

Arabidopsis top hitse value%identityAlignment
AT1G29080.1 Papain family cysteine protease1.6e-1124.24Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINC-------------------------IICPNVYEVEIES-------
        T+ DWR+   +TPV++Q     CWA  A  A+E L  I  G N + LS Q L++C                         I   N Y  +++        
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINC-------------------------IICPNVYEVEIES-------

Query:  --------------------------KEIGYILSIPKTGDLQRTNRLECMFNEGC---HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI
                                  + +   +   + G +  +  +    N G    HAV +VG+     G KYW+ KNSWG+  GE GY +I +++
Subjt:  --------------------------KEIGYILSIPKTGDLQRTNRLECMFNEGC---HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI

AT3G48340.1 Cysteine proteinases superfamily protein5.5e-1226.63Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN------VYEVEIE-------------------------SK
        +S DWR    +T ++NQ +   CWA     A+E ++ I+  N  + LS Q L++C    N      + E+  E                         SK
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN------VYEVEIE-------------------------SK

Query:  EIGYILSIPKTGDLQRTNR--------------------------LECMFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI
        + G +++I    D+   +                            E +F   C     H V  VG+     G+KYWIV+NSWG E GE GY KI +EI
Subjt:  EIGYILSIPKTGDLQRTNR--------------------------LECMFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEI

AT3G49340.1 Cysteine proteinases superfamily protein4.2e-1226.89Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYIL---------SIPKTGDLQ--RTNRLE
        S DW     +T V++Q++   CWA  A  A+E +  I +G   + LS Q L++C    N     I  K   YI          + P  G  Q   +N L 
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYIL---------SIPKTGDLQ--RTNRLE

Query:  C-------------------------------------------MFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLR
                                                    +FN  C     HAV +VG+ +   G KYW++KNSWGE  GE GY +I ++ V + +
Subjt:  C-------------------------------------------MFNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLR

Query:  GSKYLIEMLTYP
        G   L  +  YP
Subjt:  GSKYLIEMLTYP

AT5G45890.1 senescence-associated gene 122.7e-1126.98Show/hide
Query:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINC------------------------IICPNVYEVEIESKEIGYILSI
        S DWR    +TP++NQ     CWA  A  AIE    I+ G   + LS Q L++C                        +   + Y  + E        + 
Subjt:  SFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINC------------------------IICPNVYEVEIESKEIGYILSI

Query:  PK----TG--DLQRTNRLECM--------------------------FNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHT
        PK    TG  D+   +    M                          F   C     HAV  +G+   + G KYWI+KNSWG + GE GY +I Q+ V  
Subjt:  PK----TG--DLQRTNRLECM--------------------------FNEGC-----HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHT

Query:  LRGSKYLIEMLTYPT
         +G   L    +YPT
Subjt:  LRGSKYLIEMLTYPT

AT5G50260.1 Cysteine proteinases superfamily protein2.9e-1327.86Show/hide
Query:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN-------------------------VYEVEIE------SK
        TS DWR +  +TPV+NQ +   CWA     A+E ++ I        LS Q L++C    N                         VY  +        +K
Subjt:  TSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPN-------------------------VYEVEIE------SK

Query:  EIGYILSIPKTGDLQRTNRLECM---------------------FNEGC----------HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIV
        E   ++SI    D+ + +  + M                     ++EG           H V VVG+     G KYWIVKNSWGEE GE GY ++ + I 
Subjt:  EIGYILSIPKTGDLQRTNRLECM---------------------FNEGC----------HAVFVVGFDIDSTGQKYWIVKNSWGEEGGECGYGKISQEIV

Query:  H
        H
Subjt:  H


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGAGGTAACATCTTTTGATTGGAGAGACCATAATGTTCTTACACCCGTTCGAAATCAGAAACGATCCCGATTTTGCTGGGCTATCGTGGCCGCAGGAGCCATCGA
GTCGTTGCATAACATTGAGCATGGGAACAACAATCTGCAGCTTTCACCTCAGTACTTAATCAATTGTATCATATGTCCTAATGTGTATGAGGTCGAGATAGAGAGTAAGG
AGATTGGTTATATACTGTCAATTCCAAAAACAGGAGATTTACAAAGGACCAACAGATTGGAATGCATGTTTAACGAAGGTTGCCACGCAGTTTTTGTTGTTGGGTTCGAC
ATCGATTCTACTGGCCAGAAATACTGGATTGTCAAGAACTCGTGGGGCGAGGAAGGGGGAGAATGTGGCTATGGGAAAATTAGTCAAGAGATTGTCCATACTCTGAGAGG
CTCCAAATACTTGATAGAGATGTTAACTTATCCAACCGATATTGTGCTCAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGAGGTAACATCTTTTGATTGGAGAGACCATAATGTTCTTACACCCGTTCGAAATCAGAAACGATCCCGATTTTGCTGGGCTATCGTGGCCGCAGGAGCCATCGA
GTCGTTGCATAACATTGAGCATGGGAACAACAATCTGCAGCTTTCACCTCAGTACTTAATCAATTGTATCATATGTCCTAATGTGTATGAGGTCGAGATAGAGAGTAAGG
AGATTGGTTATATACTGTCAATTCCAAAAACAGGAGATTTACAAAGGACCAACAGATTGGAATGCATGTTTAACGAAGGTTGCCACGCAGTTTTTGTTGTTGGGTTCGAC
ATCGATTCTACTGGCCAGAAATACTGGATTGTCAAGAACTCGTGGGGCGAGGAAGGGGGAGAATGTGGCTATGGGAAAATTAGTCAAGAGATTGTCCATACTCTGAGAGG
CTCCAAATACTTGATAGAGATGTTAACTTATCCAACCGATATTGTGCTCAACTAA
Protein sequenceShow/hide protein sequence
MQEVTSFDWRDHNVLTPVRNQKRSRFCWAIVAAGAIESLHNIEHGNNNLQLSPQYLINCIICPNVYEVEIESKEIGYILSIPKTGDLQRTNRLECMFNEGCHAVFVVGFD
IDSTGQKYWIVKNSWGEEGGECGYGKISQEIVHTLRGSKYLIEMLTYPTDIVLN