CuGenDBv2

Lag0026894 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026894
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: GATA zinc finger domain-containing protein isoform 1
Genome location: chr10:43028400..43034075
RNA-Seq Expression: Lag0026894
Synteny: Lag0026894
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR000668 - Peptidase C1A, papain C-terminal
  IPR023213 - Chloramphenicol acetyltransferase-like domain superfamily
  IPR038765 - Papain-like cysteine peptidase superfamily
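
The Genome location field above packs a chromosome name and a start..end range into one string. A minimal Python sketch for splitting it apart follows. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this page states.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr10:43028400..43034075")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 5676 bp, which is longer than the 1833-nt CDS listed under Sequences.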


Homology
GenBank top hits | e value | % identity
KAG6596578.1 hypothetical protein SDJN03_09758, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-238 | 86.01
Query:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD
        T AAS+MSD    PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLLLSKP D+ NLQS+LHSLQNLHPILRSKI +DPSRRDFSFLTPPSP LHLQILD
Subjt:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD

Query:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG
          AAARA+ASHPDADDPSVSDFHKILEHEIN+ TW DP+HPSY+DTDVMFA+VY +++GQWAVFLRLHTA CDR AA A+LRELLA +SG  EGG FEIG
Subjt:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG

Query:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH
        D+GEIGLGIEDLIPNGK NK LWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD TEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+
Subjt:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH

Query:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE
        +KEKY VVTLNDCRSLL+PPLTTHHLGFYHSAILNTHDISAED LW+VAKRCYFAFSNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE
Subjt:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE

Query:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
        +PI +   PAQ++LGL DYIGCASAHGVGPSIAFFDMIR+G LDCACVYP PLFSRDQMN+I DEMKKILV A+ VVEG
Subjt:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG

KAG7028117.1 hypothetical protein SDJN02_09297, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.0e-238 | 86.01
Query:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD
        T AAS+MSD    PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLLLSKP D+ NLQS+LHSLQNLHPILRSKI +DPSRRDFSFLTPPSP LHLQILD
Subjt:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD

Query:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG
          AAARA+ASHPDADDPSVSDFHKILEHEIN+ TW DP+HPSY+DTDVMFA+VY +++GQWAVFLRLHTA CDR AA A+LRELLA +SG  EGG FEIG
Subjt:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG

Query:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH
        D+GEIGLGIEDLIPNGK NK LWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD TEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+
Subjt:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH

Query:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE
        +KEKY VVTLNDCRSLL+PPLTTHHLGFYHSAILNTHDISAED LW+VAKRCYFAFSNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE
Subjt:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE

Query:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
        +PI +   PAQ++LGL DYIGCASAHGVGPSIAFFDMIR+G LDCACVYP PLFSRDQMN+I DEMKKILV A+ VVEG
Subjt:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG

XP_023005679.1 uncharacterized protein LOC111498610 [Cucurbita maxima] | 5.0e-238 | 85.39
Query:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD
        T AAS+MSD    PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLLLSKP D+PNLQS+LHSLQNLHPIL SKIH+DP RRDFSFL PPSP LHLQILD
Subjt:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD

Query:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG
          A ARA+ASHPDADDPSVSDFHKILEHEIN+ TW DP+HPSY+DTDVMFA+VY +S+GQW VFLRLHTA CDR AA A+LRELLAA+SG  EGG FEIG
Subjt:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG

Query:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH
        D+GEIGLGIEDLIPNGK NK LWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD TEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+
Subjt:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH

Query:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE
        +KEKY VVTLNDCRSLL+PPLTTHHLGFYHSAILNTHD+SAED LW+VAKRCYFAFSNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE
Subjt:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE

Query:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
        +PI +   PAQ+HLGL DYIGCASAHGVGPSIAFFDMIR+G LDCACVYP PLFSRDQMN+I  +MKKILV A++VVEG
Subjt:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG

XP_023540823.1 uncharacterized protein LOC111801084 [Cucurbita pepo subsp. pepo] | 5.4e-240 | 86.43
Query:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD
        T AAS+MSD    PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLLLSKP D+PNLQS+LHSLQNLHPILRSKIH+DPSRRDFSFLTPPSP LHLQILD
Subjt:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD

Query:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG
          AAARA+ASHPDADDPSVSDFHKILEHEIN+ TW DP++PSY+DTDVMFA+VY +++GQWAVFLRLHTA CDR AA A+LRELLAA+SG  EGG FEIG
Subjt:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG

Query:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH
        D+GEIGLGIEDLIPNGK NK LWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD TEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLP +
Subjt:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH

Query:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE
        +KEKY VVTLNDCRSLL+PPLTTHHLGFYHSAILNTHDISAED LWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE
Subjt:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE

Query:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
        +PI +   PAQ++LGL DYIGCASAHGVGPSIAFFDMIR+G LDCACVYP PLFSRDQMN+I DEMKKILV A++VVEG
Subjt:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG

XP_038905440.1 uncharacterized protein LOC120091472 [Benincasa hispida] | 2.2e-241 | 88.03
Query:  MSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILDHAAAAR
        MSDQ P PPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKP DIP+LQSSLH+LQNLHPILRSKIHHDPSRRDFSFL  PSP LHLQILD  A AR
Subjt:  MSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILDHAAAAR

Query:  AVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLREL--LAASSGGIEGGGFEIGDNGE
        A+ASHPDA+DPSVSDFHKI E EIN ATWFDP+HPSY+DTDVMFATVY +S+ QWA+FLRLHTA CDRAAAAA+LREL  L A+ G IEGGGFEIGDNGE
Subjt:  AVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLREL--LAASSGGIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEK
        IGLGIEDLIPNGK NK+LWARGLDMLGYSLNSFRLANLEFKDANS+RFSQMIRLKMNS ET+KLLAGCKLRG+KLCGALAAAGL+ATRCSKDLP H KEK
Subjt:  IGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEK

Query:  YAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIV
        YAVVTLNDCRSLLDPPLT+HHLGFYHSAILNTHDISAED LWEVAKRCYF++SNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE+PI+
Subjt:  YAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIV

Query:  DTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILV-NAMDVVEG
        DTS P QQ+LGL DY GCASAHGVGPSIA FDMIR+G LDCACVYPSPLFSRDQMNRIFDEMKKILV NAM+VVEG
Subjt:  DTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILV-NAMDVVEG

TrEMBL top hits | e value | % identity
A0A0A0LGP2 Uncharacterized protein | 2.7e-237 | 83.7
Query:  SDISRRQYPLSLYSTAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSF
        SD SRR +PL    T   S+MSDQ   PP  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKP DIP+LQSSLH+LQNLHPILRSKIHHDPSRRDFSF
Subjt:  SDISRRQYPLSLYSTAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSF

Query:  LTPPSPSLHLQILDHAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLA
        L PPSP LHLQILD AA ARA+ASHPDADDPSVSDFHKI EHEIN   WFDP+HPSY+DTDVMFATVY VSE QWAVFL LHTA CDRAAAAA+LRELL 
Subjt:  LTPPSPSLHLQILDHAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLA

Query:  ASSGG--IEGGGFEIGDNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALA
         ++GG  IEGGGFE GDNGE+GLGIEDLIPNGK NK+LWARG DMLGYSLNSFRLANLEFKD N++RFSQMIRL+MNSDET+KLLAGCKLRGIKLCGALA
Subjt:  ASSGG--IEGGGFEIGDNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALA

Query:  AAGLIATRCSKD-LPPHNKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENP
        AAGLIATRCSKD LPP+ KEKYAVVTLNDCRSLLDPPLT+HHLGFYHSAILNTHDISAED +WEVA RCYF+FSNAKDNNKHFSDMSDLNFLMC+AIENP
Subjt:  AAGLIATRCSKD-LPPHNKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENP

Query:  GLTPSSSMRTALISVFEEPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVN-AMDVVEG
         LTPSSSMRTALISVFE+PI++ S P QQ+LGL DYIG ASAHGVGPSIA FD IR+G LD ACVYPSPLFSRDQMNRIFD+MKKILVN +++V EG
Subjt:  GLTPSSSMRTALISVFEEPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVN-AMDVVEG

A0A1S3B7G8 uncharacterized protein LOC103486623 | 4.6e-237 | 87
Query:  MSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILDHAAAAR
        MSDQ P PPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKP DIP+LQSSLH+LQNLHPILRSKIHHDP RRDFSFL P SPSLHLQILD AA  R
Subjt:  MSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILDHAAAAR

Query:  AVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLREL--LAASSGGIEGGGFEIGDNGE
        A+ASHPDADDPSVSDFHKI EHEIN   WFDP+HPSY+DTDVMFATVY VSE QWAVFL LHTA CDRAAAAA+LREL  LAA  G IEGG FEIGDNGE
Subjt:  AVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLREL--LAASSGGIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHNKE
        IGLGIEDLIPNGK NK+LWARG DMLGYSLNSFRLANLEFKD NS+RFSQMIRLKMNSDET+KLLAGCKLRGIKLCGALAAAGLIATRCSKD LPP+ KE
Subjt:  IGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHNKE

Query:  KYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPI
        KYAVVTLNDCRSLLDPPLT+HHLGFYHSAILNTHDISAED LWEVA RCYF+FSNAKDNNKHFSDMSDLNFLMC+AIENP LTPSSSMRTALISVFE+PI
Subjt:  KYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPI

Query:  VDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVN-AMDVVEG
        ++TS P  Q+LGL DYIG ASAHGVGPSIA FD IR+G LDCACVYPSPLFSRDQMN+IFDEMKKILVN A++V EG
Subjt:  VDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVN-AMDVVEG

A0A6J1G619 uncharacterized protein LOC111451204 | 4.2e-238 | 85.8
Query:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD
        T AAS+MSD    PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLLLSKP D+ NLQS+LHSLQNLHPILRSKI +DPSRRDFSFLTPPSP LHLQILD
Subjt:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD

Query:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG
          AAARA+ASHPDADDPSVSDFHKILEHEIN+ TW DP+HPSY+DTDVMFA+VY +++GQWAVFLRLHTA CDR AA A+LRELLAA+SG  EGG FEI 
Subjt:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG

Query:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH
        D+GEIGLGIEDLIPNGK NK LWARGLDMLGYSLNSFRLANLEFKDANS+RFSQMIRLKMNSD TEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+
Subjt:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH

Query:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE
        +KEKY VVTLNDCRSLL+PPLTTHHLGFYHSAILNTHDISAED LW+VAKRCYFAFSNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE
Subjt:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE

Query:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
        +PI +   PAQ++LGL DYIGCASAHGVGPSIAFFDMIR+G LDCACVYP PLFSR+QMN+I DEMKKILV A+ VVEG
Subjt:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG

A0A6J1J6C5 uncharacterized protein LOC111481632 | 1.7e-231 | 85.59
Query:  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILDHAAAARAVASHPDADDP
        E + RPVGGTEHSWCRAVPGGTGTTVLGLLLSKP DI +LQ+SLH+LQNLHPILRSKIHHDPSRRDFSFL PPSPS+HLQILD AAAARA+ASHPDADDP
Subjt:  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILDHAAAARAVASHPDADDP

Query:  SVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLREL--LAASSGGIEGGGFEIGDNGEIGLGIEDLIPN
        S+SDFHKILEHEIN A W +PSHPSY+DTDVMFATVYAVS+GQWAVFL LHTAACDR AAA++LREL  L A+ G IEGGGF+IGDNGEIG GIEDLIP+
Subjt:  SVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLREL--LAASSGGIEGGGFEIGDNGEIGLGIEDLIPN

Query:  GKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEKYAVVTLNDCRS
        GK +K LWARGLDMLGYSLNSFR ANLEFKDA+S+RFSQMIRLK+NSDET+KLLAGCK RGIKLCGAL AAGLIATRCSKDLPPH  EKYAVVTL DCRS
Subjt:  GKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEKYAVVTLNDCRS

Query:  LLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIVDTSDPAQQHLG
        LLDPPLTTHHLGFYHSAILNTHDISAED LWEV++RCYF+FSNAKDNNKHF+DMSDLNFLM +AIENPGLTPSSSMRTALIS FE+PI+ TSDPAQQHLG
Subjt:  LLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIVDTSDPAQQHLG

Query:  LQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILV-NAMDVVEG
        + DYIGCASAHGVGPSIA FD+IR+G LDCACVYPSPLFSRDQMN++FDEMKKILV +AM+VVEG
Subjt:  LQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILV-NAMDVVEG

A0A6J1KZY5 uncharacterized protein LOC111498610 | 2.4e-238 | 85.39
Query:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD
        T AAS+MSD    PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLLLSKP D+PNLQS+LHSLQNLHPIL SKIH+DP RRDFSFL PPSP LHLQILD
Subjt:  TAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQILD

Query:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG
          A ARA+ASHPDADDPSVSDFHKILEHEIN+ TW DP+HPSY+DTDVMFA+VY +S+GQW VFLRLHTA CDR AA A+LRELLAA+SG  EGG FEIG
Subjt:  HAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEIG

Query:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH
        D+GEIGLGIEDLIPNGK NK LWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD TEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+
Subjt:  DNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPH

Query:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE
        +KEKY VVTLNDCRSLL+PPLTTHHLGFYHSAILNTHD+SAED LW+VAKRCYFAFSNAKDNNKHFSDMSDLNFLMC+AIENPGLTPSSSMRTALISVFE
Subjt:  NKEKYAVVTLNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFE

Query:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
        +PI +   PAQ+HLGL DYIGCASAHGVGPSIAFFDMIR+G LDCACVYP PLFSRDQMN+I  +MKKILV A++VVEG
Subjt:  EPIVDTSDPAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG

SwissProt top hits | e value | % identity
O45734 Cathepsin L-like | 5.1e-07 | 58.97
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQD
        H VL+VGYG D     YWI+KNSWG GWGE GY +I+++
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQD

P53634 Dipeptidyl peptidase 1 | 1.1e-06 | 64.86
Query:  HVVLVVGYGIDEADHR-YWIIKNSWGEGWGESGYEKI
        H VL+VGYG D A    YWI+KNSWG GWGE+GY +I
Subjt:  HVVLVVGYGIDEADHR-YWIIKNSWGEGWGESGYEKI

P80884 Ananain | 3.0e-07 | 46.15
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQD
        H ++++GYG D +  ++WI++NSWG GWGE GY ++++D
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQD

Q10991 Procathepsin L | 5.1e-07 | 52.5
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDE
        H VLVVGYG +  ++++WI+KNSWG  WG  GY K+++D+
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDE

Q26636 Cathepsin L | 2.3e-07 | 57.5
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDE
        H VLVVGYG DE+   YW++KNSWG  WGE GY K+++++
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDE

Arabidopsis top hits | e value | % identity
AT1G06260.1 Cysteine proteinases superfamily protein | 4.7e-08 | 60.53
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQ
        H V VVGYG+ E D +YWI+KNSWG GWGE GY ++ +
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQ

AT2G27420.1 Cysteine proteinases superfamily protein | 1.5e-09 | 54.35
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITPNG
        H V +VGYG+ E   +YW++KNSWGE WGE+GY +I +D V  P G
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITPNG

AT2G34080.1 Cysteine proteinases superfamily protein | 1.8e-07 | 52.17
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITPNG
        H V  VGYG  +   +YW+ KNSWGE WGE GY +I +D V  P G
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITPNG

AT3G49340.1 Cysteine proteinases superfamily protein | 1.1e-09 | 54.35
Query:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITPNG
        H V +VGYG+ E   +YW++KNSWGE WGE+GY +I +D V +P G
Subjt:  HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITPNG

AT3G52610.1 unknown protein | 1.4e-132 | 52.48
Query:  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQI--LDHAAAARAVASHPDAD
        +S +RPVGGTE+SWCRA+ GGTG  V+ LLLS+   + NLQ++L  LQ  HP LRS I  D S   FSF+   +   H++I   D  + A+ +    D+D
Subjt:  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTPPSPSLHLQI--LDHAAAARAVASHPDAD

Query:  DPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEG--QWAVFLRLHTAACDRAAAAAVLRELL-AASSGGIEGGGFEIGDNGEIGLG--IE
        DP       ILEHE+N  TW +P     +++ V   ++Y +++   Q  +  RL+TAA DR AA  +LRE +   ++ G   G         +GLG  IE
Subjt:  DPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEG--QWAVFLRLHTAACDRAAAAAVLRELL-AASSGGIEGGGFEIGDNGEIGLG--IE

Query:  DLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDA-NSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEKYAVVT
        +LIP+GK +K  WARG+D+LGYSLN+FR +NL F DA NS R SQ++RLK++ D+T KL+AGCK RG+KL  ALA++ LIA   SK+LPP+  EKYAVVT
Subjt:  DLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDA-NSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEKYAVVT

Query:  LNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIVDTS-D
        L+DCRS+L+PPLT++  GFYH+ IL+THD++ E+ LW++AKRCY +F+++K++NK F+DMSDLNFLMC+AIENP LTPSSS+RTA IS+FE+P++D S +
Subjt:  LNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIVDTS-D

Query:  PAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILV
        P    LG+QDYIGCAS HGVGPS+A FD +R+G LDCA VYPSPL SR+QM+ +   MK IL+
Subjt:  PAQQHLGLQDYIGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILV
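
In the alignments above, the unlabeled middle line repeats a residue letter where query and subject are identical, shows + for a conservative substitution, and is blank otherwise. As a hedged sketch, % identity can be estimated as the number of letters on that line divided by the aligned length. The function name percent_identity is hypothetical, and using the length of the query line (gaps included) as the denominator is an assumption about how the listed values were computed. For the O45734 (Cathepsin L-like) hit it reproduces the listed 58.97.

```python
def percent_identity(query, midline):
    """Estimate % identity from a BLAST-style query line and its match line."""
    # Letters on the match line mark identical residues; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    # Assumption: the aligned length equals the length of the query line.
    return 100.0 * identical / len(query)


# Query and match line copied from the O45734 alignment above.
query = "HVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQD"
midline = "H VL+VGYG D     YWI+KNSWG GWGE GY +I+++"
print(round(percent_identity(query, midline), 2))  # 58.97 (23 of 39 positions)
```

For alignments that wrap over several blocks, the letter counts and lengths of all blocks would need to be summed before dividing.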


Sequences
CDS sequence
ATGTACAAGGAGTTGACGAGGACAACCAGAGAGAAATCGGGCTGGGAGATGGACCAAAGAGGGGAAACCGGCAAGTGGGACGGGCCAACAATTCGTCTTCTCCCGCTCTC
AAACAAATTCACTGTTGATTATCACGTGGAGCGAAGGAGATTTATAAAGGTCCAAGCAACCCAGAGGAATTGGTTGAAGAGCCGCGTACATGTGGTCCTTGTTGTTGGAT
ATGGAATCGATGAAGCTGACCACAGATACTGGATAATAAAAAATTCTTGGGGTGAAGGATGGGGAGAGAGCGGTTATGAAAAAATTAGTCAGGATGAAGTTATAACTCCC
AACGGCCCCTTATTGACAAGATCAGATATCTCACGGCGGCAGTACCCTCTCTCTCTGTACTCCACCGCCGCCGCTTCCAAAATGTCGGATCAGCCTCCTCTCCCTCCCGC
CGGCGAGTCCAAGTCTCGTCCCGTCGGCGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACCGTCCTCGGCCTCCTCCTCTCAAAACCTCTCG
ATATTCCCAATCTCCAATCCTCTCTCCACTCTCTCCAAAACCTCCACCCGATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGACTTCTCCTTCCTCACTCCG
CCTTCTCCGTCGCTCCACCTCCAGATCCTCGACCACGCCGCCGCCGCACGCGCCGTCGCCTCTCATCCGGACGCCGACGATCCTTCCGTCTCCGATTTCCACAAGATCCT
CGAGCACGAGATCAACCTCGCCACGTGGTTCGATCCGAGCCATCCGTCGTACACCGACACGGACGTGATGTTCGCCACCGTCTACGCCGTCAGCGAGGGCCAATGGGCGG
TGTTCCTCCGGCTGCACACGGCGGCGTGCGACCGGGCGGCGGCGGCGGCGGTGTTGAGAGAGCTGCTTGCGGCGTCGAGCGGAGGAATCGAGGGCGGAGGATTTGAAATT
GGGGATAATGGAGAGATTGGATTAGGGATTGAGGATCTAATCCCTAATGGGAAAATGAACAAGGCTCTTTGGGCGCGTGGATTGGACATGCTTGGTTACTCATTGAATTC
GTTTCGATTGGCGAATCTGGAATTCAAAGATGCAAATTCTCAGAGATTTAGTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACCGAGAAACTTCTCGCTGGCTGCA
AATTGAGAGGCATTAAGCTGTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAAGACCTTCCTCCTCACAACAAGGAGAAGTATGCTGTTGTTACC
TTAAACGACTGTCGTTCTCTCCTTGATCCTCCTCTCACAACCCATCATCTAGGGTTCTATCACTCTGCCATTCTCAACACACATGACATATCAGCTGAAGACAATCTATG
GGAAGTGGCAAAGCGATGCTATTTTGCCTTCTCAAACGCCAAAGACAACAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAGAGCCATCGAAAACC
CTGGCCTCACTCCGTCGTCGTCCATGAGAACGGCCCTGATCTCGGTGTTCGAGGAACCCATCGTTGACACTTCCGATCCTGCCCAGCAACACCTCGGCTTACAGGACTAC
ATTGGCTGTGCCTCCGCGCACGGCGTCGGGCCTTCGATCGCCTTCTTCGACATGATTCGCAACGGTCATTTGGATTGTGCTTGTGTTTACCCGTCGCCTTTGTTCTCTCG
AGATCAAATGAATCGGATTTTTGATGAGATGAAGAAAATTCTGGTGAATGCCATGGATGTAGTTGAAGGCTAA
mRNA sequence
ATGTACAAGGAGTTGACGAGGACAACCAGAGAGAAATCGGGCTGGGAGATGGACCAAAGAGGGGAAACCGGCAAGTGGGACGGGCCAACAATTCGTCTTCTCCCGCTCTC
AAACAAATTCACTGTTGATTATCACGTGGAGCGAAGGAGATTTATAAAGGTCCAAGCAACCCAGAGGAATTGGTTGAAGAGCCGCGTACATGTGGTCCTTGTTGTTGGAT
ATGGAATCGATGAAGCTGACCACAGATACTGGATAATAAAAAATTCTTGGGGTGAAGGATGGGGAGAGAGCGGTTATGAAAAAATTAGTCAGGATGAAGTTATAACTCCC
AACGGCCCCTTATTGACAAGATCAGATATCTCACGGCGGCAGTACCCTCTCTCTCTGTACTCCACCGCCGCCGCTTCCAAAATGTCGGATCAGCCTCCTCTCCCTCCCGC
CGGCGAGTCCAAGTCTCGTCCCGTCGGCGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACCGTCCTCGGCCTCCTCCTCTCAAAACCTCTCG
ATATTCCCAATCTCCAATCCTCTCTCCACTCTCTCCAAAACCTCCACCCGATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGACTTCTCCTTCCTCACTCCG
CCTTCTCCGTCGCTCCACCTCCAGATCCTCGACCACGCCGCCGCCGCACGCGCCGTCGCCTCTCATCCGGACGCCGACGATCCTTCCGTCTCCGATTTCCACAAGATCCT
CGAGCACGAGATCAACCTCGCCACGTGGTTCGATCCGAGCCATCCGTCGTACACCGACACGGACGTGATGTTCGCCACCGTCTACGCCGTCAGCGAGGGCCAATGGGCGG
TGTTCCTCCGGCTGCACACGGCGGCGTGCGACCGGGCGGCGGCGGCGGCGGTGTTGAGAGAGCTGCTTGCGGCGTCGAGCGGAGGAATCGAGGGCGGAGGATTTGAAATT
GGGGATAATGGAGAGATTGGATTAGGGATTGAGGATCTAATCCCTAATGGGAAAATGAACAAGGCTCTTTGGGCGCGTGGATTGGACATGCTTGGTTACTCATTGAATTC
GTTTCGATTGGCGAATCTGGAATTCAAAGATGCAAATTCTCAGAGATTTAGTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACCGAGAAACTTCTCGCTGGCTGCA
AATTGAGAGGCATTAAGCTGTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAAGACCTTCCTCCTCACAACAAGGAGAAGTATGCTGTTGTTACC
TTAAACGACTGTCGTTCTCTCCTTGATCCTCCTCTCACAACCCATCATCTAGGGTTCTATCACTCTGCCATTCTCAACACACATGACATATCAGCTGAAGACAATCTATG
GGAAGTGGCAAAGCGATGCTATTTTGCCTTCTCAAACGCCAAAGACAACAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAGAGCCATCGAAAACC
CTGGCCTCACTCCGTCGTCGTCCATGAGAACGGCCCTGATCTCGGTGTTCGAGGAACCCATCGTTGACACTTCCGATCCTGCCCAGCAACACCTCGGCTTACAGGACTAC
ATTGGCTGTGCCTCCGCGCACGGCGTCGGGCCTTCGATCGCCTTCTTCGACATGATTCGCAACGGTCATTTGGATTGTGCTTGTGTTTACCCGTCGCCTTTGTTCTCTCG
AGATCAAATGAATCGGATTTTTGATGAGATGAAGAAAATTCTGGTGAATGCCATGGATGTAGTTGAAGGCTAA
Protein sequence
MYKELTRTTREKSGWEMDQRGETGKWDGPTIRLLPLSNKFTVDYHVERRRFIKVQATQRNWLKSRVHVVLVVGYGIDEADHRYWIIKNSWGEGWGESGYEKISQDEVITP
NGPLLTRSDISRRQYPLSLYSTAAASKMSDQPPLPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPLDIPNLQSSLHSLQNLHPILRSKIHHDPSRRDFSFLTP
PSPSLHLQILDHAAAARAVASHPDADDPSVSDFHKILEHEINLATWFDPSHPSYTDTDVMFATVYAVSEGQWAVFLRLHTAACDRAAAAAVLRELLAASSGGIEGGGFEI
GDNGEIGLGIEDLIPNGKMNKALWARGLDMLGYSLNSFRLANLEFKDANSQRFSQMIRLKMNSDETEKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHNKEKYAVVT
LNDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDNLWEVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCRAIENPGLTPSSSMRTALISVFEEPIVDTSDPAQQHLGLQDY
IGCASAHGVGPSIAFFDMIRNGHLDCACVYPSPLFSRDQMNRIFDEMKKILVNAMDVVEG
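
The CDS and protein sequences above should correspond codon by codon. A minimal sketch for checking that follows, assuming the standard genetic code (NCBI translation table 1). The helper name translate is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last eight codons of the CDS listed above.
print(translate("ATGTACAAGGAGTTGACGAGGACAACC"))  # MYKELTRTT
print(translate("GCCATGGATGTAGTTGAAGGCTAA"))     # AMDVVEG
```

Both fragments match the start (MYKELTRTT) and end (AMDVVEG) of the protein sequence. Passing the full CDS, with line breaks removed, would check the whole translation the same way.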