CuGenDBv2

Spg006321 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg006321
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ulp1-like peptidase
Genome location: scaffold4:4426144..4428635
RNA-Seq Expression: Spg006321
Synteny: Spg006321
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
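
The genome location field uses a `scaffold:start..end` form. A minimal sketch of reading it is shown here, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the helper name `parse_location` is hypothetical and not part of any database tooling.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold4:4426144..4428635")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(scaffold, span)  # scaffold4 2492
```

Under that assumption the gene spans 2492 bp on scaffold4.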


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] | e value: 2.7e-38 | % identity: 44.32
Query:  LHNFLRNEDGPYKELSSG--VHPRDLT-YEWT-KASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQE
        + NFLR+ DG Y  + S   +  R  + Y+W  +A ++L Y  G HSD++  W  VDAVYLPYN+GG+HW+++CID + GE++V DS   +     +EQE
Subjt:  LHNFLRNEDGPYKELSSG--VHPRDLT-YEWT-KASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQE

Query:  FKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN
         K +  ++P+++ ++G    +  +P+  W +R   S PQQ   GDC +F   F EYDVT  SFDTLTQ  M  FRRQ+AVQLWAN
Subjt:  FKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia] | e value: 4.9e-32 | % identity: 46.94
Query:  LRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRP
        + Y    HSD+ + W  V+AVYLP+N+ G HWV++CID   GE+VV DSLRA+     +E++ KV+  V+PS+L K   +  R  LP+  W +R   S P
Subjt:  LRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRP

Query:  QQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN
        +Q  SGDC +F  K+ EYDVT +S +TL Q+ M  FRRQ+A QLW+N
Subjt:  QQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] | e value: 1.1e-36 | % identity: 42.7
Query:  LHNFLRNEDGPYKELSSGVHPRDLTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVL
        L N LR  DGPY  +  GV P   TY+W +   + RY  G  SD++  W+  D VY   N+GG HWV++ IDL  G++ V DSL+A+   E +E+  K +
Subjt:  LHNFLRNEDGPYKELSSGVHPRDLTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVL

Query:  CHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF
        C ++P++L   G +  R  LP+  W +R   + PQQ    DC +F  +F EYDV GS  DTL Q  +  FRRQYAVQ+WA  P F
Subjt:  CHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida] | e value: 8.3e-40 | % identity: 49.07
Query:  TYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGR
        T +W+    VL+Y  G+H+D+++PW+ VDAVY+P+NL G+HWVLVC D +V E+++ DSL AL+    +E E +++C   P +L  +GA+     L + R
Subjt:  TYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGR

Query:  WPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF
        W LR +  R QQ +SGDC +F  KF EYDVTGS   TLTQD    FRRQYA+Q+WAN  LF
Subjt:  WPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF

XP_038885861.1 sentrin-specific protease [Benincasa hispida] | e value: 1.1e-39 | % identity: 49.07
Query:  TYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGR
        T +W+K +NV++Y  G+H+D+++PW+ VDA+Y+P+NL  +HWVLVC+D +V E++V DSL  L+    +E E + LC     +L     M+S   L + R
Subjt:  TYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGR

Query:  WPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF
        W LR +   PQQ  SGDC +F CKF EYDVTGS  DTLTQD M  +RRQYA+Q+ AN  LF
Subjt:  WPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1D492 uncharacterized protein LOC111016890 | e value: 4.5e-31 | % identity: 43.42
Query:  LTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVG
        + Y W + + + RY  G  SDHN+PW+  D VY P N+GG HWV++ IDL  G++ V DSL+     + +E+E K +C ++P++L   G    R +LPV 
Subjt:  LTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVG

Query:  RWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYA
         W +R  +  PQQ    DC +F  ++ EYD TGS+ DTLTQD +  FRRQYA
Subjt:  RWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYA

A0A6J1DID7 uncharacterized protein LOC111020782 | e value: 4.5e-31 | % identity: 43.12
Query:  YEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRW
        Y+W +   + RY  G  SD++ PW+  D VY P N+GG HWV++ IDL  G++ V DSL+A+   E +E+  K +C ++P +L   G +  R  L    W
Subjt:  YEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRW

Query:  PLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF
         +R   + PQQ    DC +F  +F EYDVTGS  DTL Q  +  FRRQYAVQ+WA  P F
Subjt:  PLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF

A0A6J1DLV0 uncharacterized protein LOC111021646 | e value: 1.3e-38 | % identity: 44.32
Query:  LHNFLRNEDGPYKELSSG--VHPRDLT-YEWT-KASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQE
        + NFLR+ DG Y  + S   +  R  + Y+W  +A ++L Y  G HSD++  W  VDAVYLPYN+GG+HW+++CID + GE++V DS   +     +EQE
Subjt:  LHNFLRNEDGPYKELSSG--VHPRDLT-YEWT-KASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQE

Query:  FKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN
         K +  ++P+++ ++G    +  +P+  W +R   S PQQ   GDC +F   F EYDVT  SFDTLTQ  M  FRRQ+AVQLWAN
Subjt:  FKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN

A0A6J1DQZ3 uncharacterized protein LOC111023442 | e value: 2.4e-32 | % identity: 46.94
Query:  LRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRP
        + Y    HSD+ + W  V+AVYLP+N+ G HWV++CID   GE+VV DSLRA+     +E++ KV+  V+PS+L K   +  R  LP+  W +R   S P
Subjt:  LRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRP

Query:  QQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN
        +Q  SGDC +F  K+ EYDVT +S +TL Q+ M  FRRQ+A QLW+N
Subjt:  QQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWAN

A0A6J1DY60 uncharacterized protein LOC111025273 | e value: 5.5e-37 | % identity: 42.7
Query:  LHNFLRNEDGPYKELSSGVHPRDLTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVL
        L N LR  DGPY  +  GV P   TY+W +   + RY  G  SD++  W+  D VY   N+GG HWV++ IDL  G++ V DSL+A+   E +E+  K +
Subjt:  LHNFLRNEDGPYKELSSGVHPRDLTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVL

Query:  CHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF
        C ++P++L   G +  R  LP+  W +R   + PQQ    DC +F  +F EYDV GS  DTL Q  +  FRRQYAVQ+WA  P F
Subjt:  CHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF

SwissProt top hits (columns: hit, e value, % identity, alignment)
O65278 Putative ubiquitin-like-specific protease 1B | e value: 2.0e-04 | % identity: 26.19
Query:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY
        D +++P ++  +HW L  I+    + V  DSL       ++    K L   V           S+K + V  W +   + RPQQ++  DC +F+ K++++
Subjt:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY

Query:  DVTGSSFDTLTQDMMPEFRRQYAVQL
           G S    +Q  MP FR + A ++
Subjt:  DVTGSSFDTLTQDMMPEFRRQYAVQL

Q8GYL3 Ubiquitin-like-specific protease 1A | e value: 3.4e-04 | % identity: 24.6
Query:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY
        D +++P ++  +HW L  I+++  +    DS +         +E K+L  +    + ++    S  +L V RW     +  P QR+  DC +F+ K++++
Subjt:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY

Query:  DVTGSSFDTLTQDMMPEFRRQYAVQL
           G      TQ+ MP FR + A ++
Subjt:  DVTGSSFDTLTQDMMPEFRRQYAVQL

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G37020.1 Cysteine proteinases superfamily protein | e value: 1.8e-08 | % identity: 26.99
Query:  DGK---RRKVLKYNPIPEIPEDLSMQFKTWLDSYR--FDLHNFLRNEDGPY-KELSSGVHPRDLTYEWTKASNVLRYARG-EHSD---HNIPWTTVDAVY
        DGK   +R +    P P IPE L       L S      L   +R     Y K  SS       TY WT     + +  G  H+D   +N  +T VD +Y
Subjt:  DGK---RRKVLKYNPIPEIPEDLSMQFKTWLDSYR--FDLHNFLRNEDGPY-KELSSGVHPRDLTYEWTKASNVLRYARG-EHSD---HNIPWTTVDAVY

Query:  LPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTG
            +   HWV +  +L+   + V DS+    +E  + Q+   L  ++P++L +       K+        R+ K  P   D GDC ++  K++E    G
Subjt:  LPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTG

Query:  SSFDTLTQDMMPEFRRQYAVQLWANM
         SFD L    M          LW N+
Subjt:  SSFDTLTQDMMPEFRRQYAVQLWANM

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases | e value: 6.0e-04 | % identity: 32.08
Query:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVP
        D VY+P+N    HWV +C+DL+  ++ + DS   L ++  +  E + L  ++P
Subjt:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVP

AT3G06910.1 UB-like protease 1A | e value: 2.4e-05 | % identity: 24.6
Query:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY
        D +++P ++  +HW L  I+++  +    DS +         +E K+L  +    + ++    S  +L V RW     +  P QR+  DC +F+ K++++
Subjt:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY

Query:  DVTGSSFDTLTQDMMPEFRRQYAVQL
           G      TQ+ MP FR + A ++
Subjt:  DVTGSSFDTLTQDMMPEFRRQYAVQL

AT4G00690.1 UB-like protease 1B | e value: 5.4e-05 | % identity: 25.76
Query:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY
        D +++P ++  +HW L  I+    + V  DSL       ++    K L   V           S+K + V  W +   + RPQQ++  DC +F+ K++++
Subjt:  DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEY

Query:  DVTGSS--FDTLTQDM----MPEFRRQYAVQL
           G S  F  + +D+    MP FR + A ++
Subjt:  DVTGSS--FDTLTQDM----MPEFRRQYAVQL
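
The % identity values in the hit lists can be related to the alignments through the middle line of each block. In BLAST-style output a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below recomputes the identity for the AT2G07240.1 hit from its query and midline rows; the function name `midline_identity` is hypothetical, and the sketch assumes percent identity is identities divided by the aligned length.

```python
def midline_identity(query, midline):
    """Return (identity count, percent identity) from a BLAST-style midline.

    Letters on the midline are identical residues; '+' and spaces are not.
    The aligned length is taken from the query row (gap characters included).
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, 100.0 * identities / len(query)

# Query and midline rows copied from the AT2G07240.1 alignment.
query = "DAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHVVP"
midline = "D VY+P+N    HWV +C+DL+  ++ + DS   L ++  +  E + L  ++P"

identities, percent = midline_identity(query, midline)
print(identities, len(query), round(percent, 2))  # 17 53 32.08
```

The result, 17 identities over 53 aligned positions, agrees with the 32.08 reported for that hit.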


Sequences
CDS sequence
ATGGATACAAGCCTGCTAGGTATAATTGATGAGTGGGAGAGATTTTGCAATGAAGATTGGAGCAAAATCATATTTGATAAGACCATTAAGTCATTGAAGAAAGCTTTGTC
TGGTAAAGCGGCGTCTTACAAGGAGAGGTCGGATGGTAAACAAGAGACGTACAGTCTTTACGGCTTCCCATACGCGTTTCAGGCAAGGGTCACATTGGAACTTGTGGCCA
CAGAGGAAGAGGTTCAATTTATGAACCGTGTGATGGAGCCACCCCATGCCCCACCTCCTCCTCCAGCTCTAGAACTTCTTGGTATGAATGTTGACGATGCAGATGTTGAG
ACTCATGATAGAACGGAGGATGTTGGGACTAGTTCTGAGGCTCCCGACCGAACTTGCCAGAAGTGTAGACTCCTTGATAGCCGTGTTGAGGGCATTGAAAATGCTGTCAA
GGAGTTAAATGGAAATATGAAGGGAATAGAAAGAGACCTGAAGGCAATAAAGAAGTTCATGCGTCGATTGTCTAAGGGAAAGTTCGTTGACGCATGCAAGTATCTGGATC
CCGCGCGAGATGAAGGTCCAGACGATGGTACAGGACCTGGACCTAGCGCGAAAGGAGCAGACACCCCCTTGAATGTGGCTACTGGTTCAGCTGTCCAGACTGACACACAA
CAAAAATCTCCAGTAGTGGAACGGACCACTGGTGGTGGTGGTGGTCCAGTTCAAGTGGTGGAGAAGGGAATCATTGAACACACTGATAGTGCTGAACTTGTAGTTGGCAA
GGAACTACAAGTCACCGAGGATCAAGCTGCCCTGGGTGTCCAGTCTACTTCACAACAAAACGAGCCCATTGAACGACGGGAGACTCGGAAGAGGAAGACTGCATGGAAGT
TGAGAACTCCATGGAAAGACACAAGGGAAGACGGGAAGAGGCGAAAGGTCCTGAAGTACAATCCTATCCCGGAGATCCCTGAAGATTTGTCTATGCAATTCAAGACATGG
CTAGACAGTTATCGATTCGATCTTCATAATTTTTTGAGAAATGAAGACGGGCCGTACAAAGAGCTAAGCAGCGGCGTACACCCCCGGGACTTGACATACGAATGGACCAA
GGCATCAAACGTCTTAAGGTACGCGAGGGGTGAGCATTCAGACCACAACATCCCATGGACCACTGTTGATGCGGTGTACTTGCCTTATAATCTCGGTGGTCTCCATTGGG
TTTTGGTGTGCATTGATTTGGAGGTCGGTGAGGTGGTCGTGTCAGATTCGCTCAGAGCATTGAACAAGGAAGAGGTGGTCGAGCAGGAGTTCAAGGTCCTTTGCCACGTC
GTGCCCAGTGTACTTTGGAAGATCGGGGCTATGGATTCAAGGAAGGAACTCCCTGTCGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGCCGCAACAGCGTGATAGTGG
TGATTGTTGGGTGTTTGTATGTAAATTTTTAGAGTACGATGTAACAGGGTCGTCATTCGACACCCTTACTCAAGATATGATGCCTGAATTTCGAAGGCAATATGCTGTAC
AATTGTGGGCCAATATGCCACTTTTTTAG
mRNA sequence
ATGGATACAAGCCTGCTAGGTATAATTGATGAGTGGGAGAGATTTTGCAATGAAGATTGGAGCAAAATCATATTTGATAAGACCATTAAGTCATTGAAGAAAGCTTTGTC
TGGTAAAGCGGCGTCTTACAAGGAGAGGTCGGATGGTAAACAAGAGACGTACAGTCTTTACGGCTTCCCATACGCGTTTCAGGCAAGGGTCACATTGGAACTTGTGGCCA
CAGAGGAAGAGGTTCAATTTATGAACCGTGTGATGGAGCCACCCCATGCCCCACCTCCTCCTCCAGCTCTAGAACTTCTTGGTATGAATGTTGACGATGCAGATGTTGAG
ACTCATGATAGAACGGAGGATGTTGGGACTAGTTCTGAGGCTCCCGACCGAACTTGCCAGAAGTGTAGACTCCTTGATAGCCGTGTTGAGGGCATTGAAAATGCTGTCAA
GGAGTTAAATGGAAATATGAAGGGAATAGAAAGAGACCTGAAGGCAATAAAGAAGTTCATGCGTCGATTGTCTAAGGGAAAGTTCGTTGACGCATGCAAGTATCTGGATC
CCGCGCGAGATGAAGGTCCAGACGATGGTACAGGACCTGGACCTAGCGCGAAAGGAGCAGACACCCCCTTGAATGTGGCTACTGGTTCAGCTGTCCAGACTGACACACAA
CAAAAATCTCCAGTAGTGGAACGGACCACTGGTGGTGGTGGTGGTCCAGTTCAAGTGGTGGAGAAGGGAATCATTGAACACACTGATAGTGCTGAACTTGTAGTTGGCAA
GGAACTACAAGTCACCGAGGATCAAGCTGCCCTGGGTGTCCAGTCTACTTCACAACAAAACGAGCCCATTGAACGACGGGAGACTCGGAAGAGGAAGACTGCATGGAAGT
TGAGAACTCCATGGAAAGACACAAGGGAAGACGGGAAGAGGCGAAAGGTCCTGAAGTACAATCCTATCCCGGAGATCCCTGAAGATTTGTCTATGCAATTCAAGACATGG
CTAGACAGTTATCGATTCGATCTTCATAATTTTTTGAGAAATGAAGACGGGCCGTACAAAGAGCTAAGCAGCGGCGTACACCCCCGGGACTTGACATACGAATGGACCAA
GGCATCAAACGTCTTAAGGTACGCGAGGGGTGAGCATTCAGACCACAACATCCCATGGACCACTGTTGATGCGGTGTACTTGCCTTATAATCTCGGTGGTCTCCATTGGG
TTTTGGTGTGCATTGATTTGGAGGTCGGTGAGGTGGTCGTGTCAGATTCGCTCAGAGCATTGAACAAGGAAGAGGTGGTCGAGCAGGAGTTCAAGGTCCTTTGCCACGTC
GTGCCCAGTGTACTTTGGAAGATCGGGGCTATGGATTCAAGGAAGGAACTCCCTGTCGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGCCGCAACAGCGTGATAGTGG
TGATTGTTGGGTGTTTGTATGTAAATTTTTAGAGTACGATGTAACAGGGTCGTCATTCGACACCCTTACTCAAGATATGATGCCTGAATTTCGAAGGCAATATGCTGTAC
AATTGTGGGCCAATATGCCACTTTTTTAG
Protein sequence
MDTSLLGIIDEWERFCNEDWSKIIFDKTIKSLKKALSGKAASYKERSDGKQETYSLYGFPYAFQARVTLELVATEEEVQFMNRVMEPPHAPPPPPALELLGMNVDDADVE
THDRTEDVGTSSEAPDRTCQKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKGKFVDACKYLDPARDEGPDDGTGPGPSAKGADTPLNVATGSAVQTDTQ
QKSPVVERTTGGGGGPVQVVEKGIIEHTDSAELVVGKELQVTEDQAALGVQSTSQQNEPIERRETRKRKTAWKLRTPWKDTREDGKRRKVLKYNPIPEIPEDLSMQFKTW
LDSYRFDLHNFLRNEDGPYKELSSGVHPRDLTYEWTKASNVLRYARGEHSDHNIPWTTVDAVYLPYNLGGLHWVLVCIDLEVGEVVVSDSLRALNKEEVVEQEFKVLCHV
VPSVLWKIGAMDSRKELPVGRWPLRLEKSRPQQRDSGDCWVFVCKFLEYDVTGSSFDTLTQDMMPEFRRQYAVQLWANMPLF
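
The protein sequence can be checked against the CDS by translating codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGATACAAGCCTGCTAGGTATAATTGAT"  # first 30 nt of the CDS
cds_end = "ATGCCACTTTTTTAG"                   # last 15 nt of the CDS

print(translate(cds_start))  # MDTSLLGIID
print(translate(cds_end))    # MPLF*
```

Both fragments match the start (`MDTSLLGIID`) and end (`MPLF`) of the protein sequence listed above, with the final `TAG` codon read as the stop.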