; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1362 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1362
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProline-rich receptor-like protein kinase PERK14
Genome locationMC04:21654097..21656417
RNA-Seq ExpressionMC04g1362
SyntenyMC04g1362
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016404.1 hypothetical protein SDJN02_21513 [Cucurbita argyrosperma subsp. argyrosperma]5.64e-11486.27Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        M LAVEGGGFFSSS SG SSSLTLLLLGQKSED PMRVLP LVDREPDLN QLAS KTWISWRCASPS RCFR NPA    SPLKKAA+TQRQDS R SP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
         SDNGK+ HVPSSDDDNLARKMVLKSSLKKASDAP VSV NA GNEALG KGS D SHVERRKVQWTDTCGSELAEVKEFEPSEINASDDE+DIGK RCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        C IM
Subjt:  CAIM

XP_004140958.1 uncharacterized protein LOC101221691 [Cucumis sativus]1.05e-11485.92Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR-CFRQNPA-VTNSSPLKKAATTQRQDSLRA
        MLLA+EGGGFFSSS SG SSSL+LLLLGQKSEDK MRVLPLLVDR+PD+NIQLASTKTWISWRCASPSFR CFR NPA  T   PLKK ATTQRQDSLR 
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR-CFRQNPA-VTNSSPLKKAATTQRQDSLRA

Query:  SPRSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRR
        SP SDNGKN HVPSSD+DNLARKMVLKSSLKK SDA I SV NADGNEA GGKGSCD SHVERRKVQWTDTCGS+LAEVKEFEPSEINASDDE+D+GKRR
Subjt:  SPRSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRR

Query:  CLCAIM
        CLC+IM
Subjt:  CLCAIM

XP_022133486.1 uncharacterized protein LOC111006056 [Momordica charantia]1.29e-142100Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
        RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        CAIM
Subjt:  CAIM

XP_023551469.1 uncharacterized protein LOC111809268 isoform X1 [Cucurbita pepo subsp. pepo]1.14e-11386.27Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        M LAVEGGGFFSSS SG SSSLTLLLLGQKSED PMRVLP LVDREPDLN QLAS KTWISWRCASPS RCFR NPA    SPLKKAA+TQRQDS R SP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
         SDNGK+ HVPSSDDDNLARKMVLKSSLKKASDAP VSV NA GNEALG KGS D SHVERRKVQWTDTCGSELAEVKEFEPSEINASDDE+DIGK RCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        C IM
Subjt:  CAIM

XP_038883990.1 uncharacterized protein LOC120074951 [Benincasa hispida]5.01e-11585.85Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR-CFRQNPAVTNSSPLKKAATTQRQDSLRAS
        MLLAVEG GFFSSS SG SSSL+LLLLGQKS+DKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR CFR NPA   +  LKK AT QRQDSLR S
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR-CFRQNPAVTNSSPLKKAATTQRQDSLRAS

Query:  PRSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRC
        P SDNGKN HVPSSD+DNLARKMVLKSSLKKASDA   SV NADGNEA+GGKGSCD SHVERRKVQWTDTCGS+LAEVKEFEPSEINASDDE+D+GKRRC
Subjt:  PRSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRC

Query:  LCAIM
        LC IM
Subjt:  LCAIM

TrEMBL top hitse value%identityAlignment
A0A0A0K9H6 Uncharacterized protein3.27e-11385.92Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR-CFRQNPA-VTNSSPLKKAATTQRQDSLRA
        MLLA+EGGGFFSSS SG SSSL+LLLLGQKSEDK MRVLPLLVDR+PD+NIQLASTKTWISWRCASPSFR CFR NPA  T   PLKK ATTQRQDSLR 
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFR-CFRQNPA-VTNSSPLKKAATTQRQDSLRA

Query:  SPRSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRR
        SP SDNGKN HVPSSD+DNLARKMVLKSSLKK SDA I SV NADGNEA GGKGSCD SHVERRKVQWTDTCGS+LAEVKEFEPSEINASDDE+D+GKRR
Subjt:  SPRSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRR

Query:  CLCAIM
        CLC+IM
Subjt:  CLCAIM

A0A6J1BV87 uncharacterized protein LOC1110060566.23e-143100Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
        RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        CAIM
Subjt:  CAIM

A0A6J1FIB8 uncharacterized protein LOC111445595 isoform X17.48e-11285.29Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        M LAVEGGGFFSSS SG SSSLTLLLLGQKSED PMRVLP LVDREPDLN QLAS KTWISWRCASPS RCF  NPA    SPLKKAA+TQRQDS R SP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
         SDNGK+  VPSSDDDNLARKMVLKSSLKKASDAP VSV NA GNEALG KGS D SHVERRKVQWTDTCGSELAEVKEFEPSEINASDDE+DIGK RCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        C IM
Subjt:  CAIM

A0A6J1GNU0 uncharacterized protein LOC111455621 isoform X28.04e-10982.84Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        MLL  EGGGFFSSS SG SSSLTLL LGQKSE KPMRVLPLL+DREPDLN +LASTKTWISWRCAS S RCFR NPA  NSS LKKAA+TQRQDSL+ SP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
         SDNGKN  V SS+D+NLA KMVLKSSLKKA DA +VSV NADGNEALGGKGSCD SHVERRKVQW DTCGSELAEVKEFEPSEIN SDDE+D GKRRCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        C+IM
Subjt:  CAIM

A0A6J1JSF2 uncharacterized protein LOC111489351 isoform X19.11e-11385.29Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP
        M LAVEGGGFFSSS SG SSSLTLLLLGQKSED PMRVLP LVDREPDLN QLAS KTWISWRCASPS RCFR NPA    SPLKKAA+TQRQDS R SP
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASP

Query:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL
         SDNGK+ HVPSS+DD LARKMVLKSSLKKASD P VSV NADGNEALG KGS D SHVERRKVQWTDTCGSELAEVKEFEPSEINASDDE+DIGK RCL
Subjt:  RSDNGKNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCL

Query:  CAIM
        C IM
Subjt:  CAIM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22790.1 unknown protein5.0e-3143.12Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLP----LLVDREPDLN--IQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQD
        MLLAVEGGG FS+S SG S  LTLL  G K  D+PMRV+P     +VD+EP+ +  +QL S K  +S  CA+ SF CF    A   +    K    Q+Q 
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLP----LLVDREPDLN--IQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQD

Query:  SLRASPR-----SDNGKNNHVPSSDDDNL--ARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINA
           +SP      S+ GK + +  +D+ +   A K+ L+SSLK+ S A   S+ +    E L   GS     + RRKVQW D CGSEL +V+EFEPSE+  
Subjt:  SLRASPR-----SDNGKNNHVPSSDDDNL--ARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINA

Query:  SDDEHDIGKRR-CLCAIM
        SD+E ++G++R C C IM
Subjt:  SDDEHDIGKRR-CLCAIM

AT1G22790.2 unknown protein5.0e-3143.12Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLP----LLVDREPDLN--IQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQD
        MLLAVEGGG FS+S SG S  LTLL  G K  D+PMRV+P     +VD+EP+ +  +QL S K  +S  CA+ SF CF    A   +    K    Q+Q 
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLP----LLVDREPDLN--IQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQD

Query:  SLRASPR-----SDNGKNNHVPSSDDDNL--ARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINA
           +SP      S+ GK + +  +D+ +   A K+ L+SSLK+ S A   S+ +    E L   GS     + RRKVQW D CGSEL +V+EFEPSE+  
Subjt:  SLRASPR-----SDNGKNNHVPSSDDDNL--ARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINA

Query:  SDDEHDIGKRR-CLCAIM
        SD+E ++G++R C C IM
Subjt:  SDDEHDIGKRR-CLCAIM

AT1G34010.1 unknown protein7.3e-3041.31Show/hide
Query:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPL-----LVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDS
        ML A EGGGFFSSS SG S+ L LLLLGQK+E KP++V        LV  + D   +L S+K W+S  C   S  CF +               ++R +S
Subjt:  MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPL-----LVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDS

Query:  LRASPRSDNGKNNHVPSSDDDN---LARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDE-
                 GK +  PS +D N   +  +  LKSSLKK S + +V      G++ +   G  D  H++RRKVQW DTCG E+AEV+EFEPSE++ S+DE 
Subjt:  LRASPRSDNGKNNHVPSSDDDN---LARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDE-

Query:  HDIGKRRCLCAIM
        H    + C+C IM
Subjt:  HDIGKRRCLCAIM

AT1G55475.1 unknown protein3.6e-0532.38Show/hide
Query:  SSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALG------------GKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRC
        S DDD+    +     L  A D   V     +G + +G               S +    E++KVQW D  G ELAE++EFEPS+    D + D GK  C
Subjt:  SSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALG------------GKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRC

Query:  LCAIM
        +C I+
Subjt:  LCAIM

AT3G13480.1 unknown protein8.6e-0736.26Show/hide
Query:  DDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCLCAIM
        ++++L+   +LKSSLKK               E L    S D    E++KVQW D  G ELAE++EFE SE    +D    G + C+C I+
Subjt:  DDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCLCAIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACTGGCAGTAGAAGGGGGAGGGTTCTTTTCTTCTTCAGTTTCTGGTTGTAGCAGTAGTCTGACCCTTCTTCTCTTGGGTCAGAAGAGCGAAGATAAACCC
ATGAGAGTTTTGCCATTGTTGGTTGATCGAGAGCCTGATCTTAACATTCAGCTGGCTTCAACAAAGACGTGGATTTCCTGGAGGTGCGCCTCCCCCTCCTTTCGC
TGCTTTCGTCAGAATCCTGCAGTGACAAATTCATCTCCTCTAAAGAAAGCGGCCACTACACAGCGTCAAGACAGTTTAAGAGCATCTCCTCGTTCTGATAATGGC
AAGAACAATCACGTTCCCAGTTCAGATGATGATAATCTTGCAAGAAAGATGGTGCTTAAAAGTAGCTTGAAAAAGGCATCAGATGCTCCTATAGTTTCTGTTCCG
AATGCTGATGGAAATGAAGCATTGGGAGGAAAGGGTAGCTGTGATCCTAGTCATGTAGAAAGAAGGAAAGTTCAGTGGACAGATACTTGTGGAAGCGAGCTTGCT
GAAGTCAAAGAATTTGAACCAAGCGAAATAAACGCATCTGACGATGAACATGACATTGGGAAAAGAAGGTGTTTATGCGCCATCATGTAA
mRNA sequenceShow/hide mRNA sequence
AGTAATTTGCATATCATGCATCTATCTGTTGCTGTCTTCTTGCTTCGATTCTGCCAAACTGCAGATTAGTAGCAAGAAATTTCAACATCGGAGATTGATTTACCA
TCAGGGGAACTCTCTTAGGAGCTGCTAGATGAGCTCAACTGATGTTACTGGCAGTAGAAGGGGGAGGGTTCTTTTCTTCTTCAGTTTCTGGTTGTAGCAGTAGTC
TGACCCTTCTTCTCTTGGGTCAGAAGAGCGAAGATAAACCCATGAGAGTTTTGCCATTGTTGGTTGATCGAGAGCCTGATCTTAACATTCAGCTGGCTTCAACAA
AGACGTGGATTTCCTGGAGGTGCGCCTCCCCCTCCTTTCGCTGCTTTCGTCAGAATCCTGCAGTGACAAATTCATCTCCTCTAAAGAAAGCGGCCACTACACAGC
GTCAAGACAGTTTAAGAGCATCTCCTCGTTCTGATAATGGCAAGAACAATCACGTTCCCAGTTCAGATGATGATAATCTTGCAAGAAAGATGGTGCTTAAAAGTA
GCTTGAAAAAGGCATCAGATGCTCCTATAGTTTCTGTTCCGAATGCTGATGGAAATGAAGCATTGGGAGGAAAGGGTAGCTGTGATCCTAGTCATGTAGAAAGAA
GGAAAGTTCAGTGGACAGATACTTGTGGAAGCGAGCTTGCTGAAGTCAAAGAATTTGAACCAAGCGAAATAAACGCATCTGACGATGAACATGACATTGGGAAAA
GAAGGTGTTTATGCGCCATCATGTAATTCAAAATCCTTTTTTTTTTCCATTTTGAGGAGTTGTGCTCACGATCCTCAAAGATCTGCCGACTTGGATCAAATCAAC
TTGTTCGCTCAAGAATTTTTCATGGCAGAGGAAGCTGTGTCAGCAATTTTAGACAGGCAGAAACCAGCTCCAGGAATTCTGTTGGTCCCCTCCCCATGTTCTTCA
GAGTGATCAATTTATCCGTGGTAGTGCTGGTTTCAGTAAGTTTTTTTTTTCTTCACATGGATGGCCTCCACTTTTTAGCACTTTTTTGTTGCTTTTTCACTGTAT
CAGTCCCTTTTTTTATTTGTTACTATATATAAACTGTGTTCCTTCTGTGGTTTTCCTCCCTGTATGAACAATTTTTTGGGATATTTTCTGTTTAGTATTTAGAAG
GAAGCTTTTGATAATTTGATAAGGCTTTTGTTGGTTGGTATTAAAAAAAAAGATTAATGTTGCTACATGAATTCTGTTTGTTGAT
Protein sequenceShow/hide protein sequence
MLLAVEGGGFFSSSVSGCSSSLTLLLLGQKSEDKPMRVLPLLVDREPDLNIQLASTKTWISWRCASPSFRCFRQNPAVTNSSPLKKAATTQRQDSLRASPRSDNG
KNNHVPSSDDDNLARKMVLKSSLKKASDAPIVSVPNADGNEALGGKGSCDPSHVERRKVQWTDTCGSELAEVKEFEPSEINASDDEHDIGKRRCLCAIM