; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014994 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014994
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold3:35131086..35133584
RNA-Seq ExpressionSpg014994
SyntenySpg014994
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032137.1 Ulp1-like peptidase [Cucumis melo var. makuwa]9.5e-4035.51Show/hide
Query:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGD-----LVVADFLRRDE
        VY+PM  ++   LD R +AW+ D   D   R+T +  +SK +F+ L     W+++E +DALF+FIR K+ A        F T D     ++V+ +L   E
Subjt:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGD-----LVVADFLRRDE

Query:  VLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKC
         ++           +DW     ++DYV+G+  D   PW+SV  VY PFN+ GNHWVLLC DL + ++ V +S   L+T  E+     P+R L+P LL   
Subjt:  VLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKC

Query:  GVM--KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        G    + + +     W +     +P Q+++ DCGVFT K+ EY      LDTL Q++M++ R+Q A QLW N P +
Subjt:  GVM--KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF

KAA0044249.1 Ulp1-like peptidase [Cucumis melo var. makuwa]6.6e-4137.27Show/hide
Query:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL
        VY+PM  ++   LD R +AW+ D   D   R+T +  +SK +F+ L     W+SDE +DALF+FIR K+ A        F T D +    L     L   
Subjt:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL

Query:  DGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVM--
          + +R   +DW     ++DYV+G+  D   PW+SV  VY PFN+ GNHWVLLC DL + ++ V +S   L T  E+     P+R L+P LL   G    
Subjt:  DGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVM--

Query:  KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        + + +     W +     +P Q+++ DCGVFT K+ EY  T   LDTL Q++M++ R+Q A QLW N P +
Subjt:  KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF

XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia]5.9e-4245.41Show/hide
Query:  GAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY
        GA RKTVY+ ++K WF+ LL P +W + EV+D LFM +RKKL+ RPDLC  KF TGDLV+A++ RR + + A       +      +YDW  R +++M Y
Subjt:  GAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY

Query:  VMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPL
          G H+D+ + W  V A+YIPFN+ G HWV++C DLE GE+VV +S   + TD  ++   K + T++P++L KC VMKV+  LP+
Subjt:  VMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPL

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]3.6e-4743.81Show/hide
Query:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCAD
        MF+  KL  RP+LC+ KF TGD+++++FLR  +    ++++ +    R+  DYDW  R  +++ Y+ G HSD++  W  V AVY+P+N+GG HW+++C D
Subjt:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCAD

Query:  LEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRR
         + GEL+V +SFM +    +L+++ KP+ T++P L+ + GV   K  +PL  WR++R    PQQ   GDCG+F   F EYDVT+   DTL+Q  M+F RR
Subjt:  LEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRR

Query:  QFAVQLWANR
        QFAVQLWAN+
Subjt:  QFAVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]3.0e-4638.65Show/hide
Query:  LDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTV
        +DDP  D   R T    + K WF  LL P   + DE ID+L M   +K++    L + +F  GD+++++ LRR +   A    G       YDW + +T+
Subjt:  LDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTV

Query:  MDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP
          YV+G  SD++  WS    VY   N+GGNHWV++  DL  G+L V +S   +    +L+K  KP+ T++P +L   G++ ++  LP+  WR++R   VP
Subjt:  MDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP

Query:  QQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        QQ    DC +F  +F EYDV  SK+DTL Q +++  RRQ+AVQ+WA RPFF
Subjt:  QQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF

TrEMBL top hitse value%identityAlignment
A0A5A7SRX1 Ulp1-like peptidase4.6e-4035.51Show/hide
Query:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGD-----LVVADFLRRDE
        VY+PM  ++   LD R +AW+ D   D   R+T +  +SK +F+ L     W+++E +DALF+FIR K+ A        F T D     ++V+ +L   E
Subjt:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGD-----LVVADFLRRDE

Query:  VLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKC
         ++           +DW     ++DYV+G+  D   PW+SV  VY PFN+ GNHWVLLC DL + ++ V +S   L+T  E+     P+R L+P LL   
Subjt:  VLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKC

Query:  GVM--KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        G    + + +     W +     +P Q+++ DCGVFT K+ EY      LDTL Q++M++ R+Q A QLW N P +
Subjt:  GVM--KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF

A0A5A7TSP7 Ulp1-like peptidase3.2e-4137.27Show/hide
Query:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL
        VY+PM  ++   LD R +AW+ D   D   R+T +  +SK +F+ L     W+SDE +DALF+FIR K+ A        F T D +    L     L   
Subjt:  VYNPM-CVIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL

Query:  DGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVM--
          + +R   +DW     ++DYV+G+  D   PW+SV  VY PFN+ GNHWVLLC DL + ++ V +S   L T  E+     P+R L+P LL   G    
Subjt:  DGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVM--

Query:  KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        + + +     W +     +P Q+++ DCGVFT K+ EY  T   LDTL Q++M++ R+Q A QLW N P +
Subjt:  KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF

A0A6J1D3R7 uncharacterized protein LOC1110169932.2e-4245.95Show/hide
Query:  GAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY
        GA RKTVY+ ++K WF+ LL P +W + EV+D LFM +RKKL+ RPDLC  KF TGDLV+A++ RR + L A       +      +YDW  R +++M Y
Subjt:  GAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY

Query:  VMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPL
          G H+D+ + W  V A+YIPFN+ G HWV++C DLE GE+VV +S   + TD  ++   K + T++P++L KC VMKV+  LP+
Subjt:  VMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPL

A0A6J1DLV0 uncharacterized protein LOC1110216461.7e-4743.81Show/hide
Query:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCAD
        MF+  KL  RP+LC+ KF TGD+++++FLR  +    ++++ +    R+  DYDW  R  +++ Y+ G HSD++  W  V AVY+P+N+GG HW+++C D
Subjt:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCAD

Query:  LEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRR
         + GEL+V +SFM +    +L+++ KP+ T++P L+ + GV   K  +PL  WR++R    PQQ   GDCG+F   F EYDVT+   DTL+Q  M+F RR
Subjt:  LEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRR

Query:  QFAVQLWANR
        QFAVQLWAN+
Subjt:  QFAVQLWANR

A0A6J1DY60 uncharacterized protein LOC1110252731.5e-4638.65Show/hide
Query:  LDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTV
        +DDP  D   R T    + K WF  LL P   + DE ID+L M   +K++    L + +F  GD+++++ LRR +   A    G       YDW + +T+
Subjt:  LDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTV

Query:  MDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP
          YV+G  SD++  WS    VY   N+GGNHWV++  DL  G+L V +S   +    +L+K  KP+ T++P +L   G++ ++  LP+  WR++R   VP
Subjt:  MDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP

Query:  QQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        QQ    DC +F  +F EYDV  SK+DTL Q +++  RRQ+AVQ+WA RPFF
Subjt:  QQKDSGDCGVFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases1.4e-0925.91Show/hide
Query:  KEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDR-IDDYDWSRWKTVMDYVMGNHSDHNIP-WSSVQ
        KE F  +    H+ S +V+D L  F R  L  R D    + +  D++ + F+ +   L  L  +F + +   D+     ++D ++G    + +  ++   
Subjt:  KEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDR-IDDYDWSRWKTVMDYVMGNHSDHNIP-WSSVQ

Query:  AVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFT
         VY+PFN    HWV LC DL+A ++ + +S +QL  D  L  + +P+  ++P L  +         + L  + L R   +PQ     D GV +
Subjt:  AVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFT

AT5G45570.1 Ulp1 protease family protein4.9e-1030.23Show/hide
Query:  VQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKV-KHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFL
        V  +Y    + GNHWV L  DL    + V +S   L TD E+  Q   V T++P +L+     K  + +    EW  KR   +P+  D GDC +++ K++
Subjt:  VQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKV-KHTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFL

Query:  EYDVTNSKLDTLSQDSMNFCRRQFAVQLW
        E        D L  ++M   R + AV+++
Subjt:  EYDVTNSKLDTLSQDSMNFCRRQFAVQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCAGGCGAGGATCATAGAGTTTCTTCAATCAACAGAGGAGGAGCGTCGATATATGAATCGGATAATGGAACCCCCACGTCCACCAATGCCTTCTCCACCCCCTCC
AGTAGTTGACGTATGTGTTGTCCAGGACATTGAGATGACGGATGTGACTGTGCATGACCCCGCCCAGATAGATGAGGCAGTAGGACCGATGGAAGATAAAGAACAGGACG
AGGGTGAAGAGATTAAAGATAAGAAGAAAAAAAAAAGTTGTGAATGCACTGAGTTGCTACAACGTCTCAATGAACGTGTGGACGCCATGGAAACCGACTTAAAGTCTGAC
TTGAAGGCCATCAAGAAATTCTTACGAAGATTTTCTAAGGGCAAATATGTAGATCCGGAGAAATACTTTGGATCAGATGAGGGTCCTTCTAAAGAAGATGATGAGGGTCC
GTCTAAAGGAGGTGACGTGTCTGAACACCGAGAAAAAGATGATGGGGATGATGATCCGTCTGAAGGAGGTGCTGAGGGTCCTTTGGGCGGGTCCGAACACCCAGAAAAAG
GAGATGAGCCAGCTGAAGAGAACCCAAAAGGAGATGATGAGGGTCCTTTAGGCGGGTCCGAACACCCAGAAAAAGGAGATGAGCCATCGGAAGAGAACCCAAATGGAGGT
GATGAGGGTCCTTTGGACGGGTCCGAACACCCAGAAAAAGGAGATGAGCCAGCTGAAGAGACCCCAAATGAGCCAGCGGAAGAGAACCTGCCTGTTGAGCCCTTACAGAG
TATTGAAGAGATAACGGCATTTGATGGAGTCACACACTCTTTGGTACCCAAAACTGAGCCTGTTGAATTGGACTCACAAAGTGTTGAAGAGACAAACCTTGAACCTGTTG
AACGTCGGGGGAAACGAAAGCGTCAGATCTCATGGAAGCTTCGATCTCCGTGGGCAGACACGAGACCAGATGGAAAAAGAAGAAAAGTTAAGGTGTACAACCCCATGTGC
GTCATACCACCAGAACTTGATCGTCGGTTCCAGGCGTGGTTGGACGACCCATTGTTGGATGGTGCAGAACGGAAGACGGTGTACGCATATAGGAGCAAAGAATGGTTTAA
AACCTTATTGACACCGTCACATTGGATGAGCGATGAGGTCATTGACGCGCTATTCATGTTTATTCGGAAAAAACTAGATGCTCGTCCGGACTTATGCCAATGCAAATTTG
TGACGGGAGACTTGGTTGTCGCGGATTTTCTACGACGAGATGAAGTACTAGAAGCACTGGATGGTAGGTTTGATCGTATCGATGACTATGACTGGAGTAGATGGAAGACC
GTCATGGATTATGTTATGGGCAACCATTCAGACCATAACATTCCTTGGAGTTCAGTTCAAGCGGTCTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGCTGTG
TGCTGACTTAGAGGCCGGCGAGTTGGTGGTGTCCAATTCCTTTATGCAGTTGAATACAGACATCGAATTAAAGAAGCAGTTCAAACCTGTTCGCACACTGATGCCAATCT
TGCTCGCTAAGTGTGGCGTAATGAAGGTGAAGCATACCCTTCCACTAAATGAATGGAGGTTGAAGAGGAACAAGCTAGTGCCACAACAAAAGGATAGTGGGGATTGTGGG
GTATTCACTGCCAAATTTTTGGAATATGATGTAACGAACTCCAAACTGGACACCCTTAGCCAGGATAGCATGAACTTTTGTAGACGTCAATTCGCTGTTCAACTTTGGGC
CAATAGGCCATTCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATCAGGCGAGGATCATAGAGTTTCTTCAATCAACAGAGGAGGAGCGTCGATATATGAATCGGATAATGGAACCCCCACGTCCACCAATGCCTTCTCCACCCCCTCC
AGTAGTTGACGTATGTGTTGTCCAGGACATTGAGATGACGGATGTGACTGTGCATGACCCCGCCCAGATAGATGAGGCAGTAGGACCGATGGAAGATAAAGAACAGGACG
AGGGTGAAGAGATTAAAGATAAGAAGAAAAAAAAAAGTTGTGAATGCACTGAGTTGCTACAACGTCTCAATGAACGTGTGGACGCCATGGAAACCGACTTAAAGTCTGAC
TTGAAGGCCATCAAGAAATTCTTACGAAGATTTTCTAAGGGCAAATATGTAGATCCGGAGAAATACTTTGGATCAGATGAGGGTCCTTCTAAAGAAGATGATGAGGGTCC
GTCTAAAGGAGGTGACGTGTCTGAACACCGAGAAAAAGATGATGGGGATGATGATCCGTCTGAAGGAGGTGCTGAGGGTCCTTTGGGCGGGTCCGAACACCCAGAAAAAG
GAGATGAGCCAGCTGAAGAGAACCCAAAAGGAGATGATGAGGGTCCTTTAGGCGGGTCCGAACACCCAGAAAAAGGAGATGAGCCATCGGAAGAGAACCCAAATGGAGGT
GATGAGGGTCCTTTGGACGGGTCCGAACACCCAGAAAAAGGAGATGAGCCAGCTGAAGAGACCCCAAATGAGCCAGCGGAAGAGAACCTGCCTGTTGAGCCCTTACAGAG
TATTGAAGAGATAACGGCATTTGATGGAGTCACACACTCTTTGGTACCCAAAACTGAGCCTGTTGAATTGGACTCACAAAGTGTTGAAGAGACAAACCTTGAACCTGTTG
AACGTCGGGGGAAACGAAAGCGTCAGATCTCATGGAAGCTTCGATCTCCGTGGGCAGACACGAGACCAGATGGAAAAAGAAGAAAAGTTAAGGTGTACAACCCCATGTGC
GTCATACCACCAGAACTTGATCGTCGGTTCCAGGCGTGGTTGGACGACCCATTGTTGGATGGTGCAGAACGGAAGACGGTGTACGCATATAGGAGCAAAGAATGGTTTAA
AACCTTATTGACACCGTCACATTGGATGAGCGATGAGGTCATTGACGCGCTATTCATGTTTATTCGGAAAAAACTAGATGCTCGTCCGGACTTATGCCAATGCAAATTTG
TGACGGGAGACTTGGTTGTCGCGGATTTTCTACGACGAGATGAAGTACTAGAAGCACTGGATGGTAGGTTTGATCGTATCGATGACTATGACTGGAGTAGATGGAAGACC
GTCATGGATTATGTTATGGGCAACCATTCAGACCATAACATTCCTTGGAGTTCAGTTCAAGCGGTCTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGCTGTG
TGCTGACTTAGAGGCCGGCGAGTTGGTGGTGTCCAATTCCTTTATGCAGTTGAATACAGACATCGAATTAAAGAAGCAGTTCAAACCTGTTCGCACACTGATGCCAATCT
TGCTCGCTAAGTGTGGCGTAATGAAGGTGAAGCATACCCTTCCACTAAATGAATGGAGGTTGAAGAGGAACAAGCTAGTGCCACAACAAAAGGATAGTGGGGATTGTGGG
GTATTCACTGCCAAATTTTTGGAATATGATGTAACGAACTCCAAACTGGACACCCTTAGCCAGGATAGCATGAACTTTTGTAGACGTCAATTCGCTGTTCAACTTTGGGC
CAATAGGCCATTCTTTTAG
Protein sequenceShow/hide protein sequence
MYQARIIEFLQSTEEERRYMNRIMEPPRPPMPSPPPPVVDVCVVQDIEMTDVTVHDPAQIDEAVGPMEDKEQDEGEEIKDKKKKKSCECTELLQRLNERVDAMETDLKSD
LKAIKKFLRRFSKGKYVDPEKYFGSDEGPSKEDDEGPSKGGDVSEHREKDDGDDDPSEGGAEGPLGGSEHPEKGDEPAEENPKGDDEGPLGGSEHPEKGDEPSEENPNGG
DEGPLDGSEHPEKGDEPAEETPNEPAEENLPVEPLQSIEEITAFDGVTHSLVPKTEPVELDSQSVEETNLEPVERRGKRKRQISWKLRSPWADTRPDGKRRKVKVYNPMC
VIPPELDRRFQAWLDDPLLDGAERKTVYAYRSKEWFKTLLTPSHWMSDEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRIDDYDWSRWKT
VMDYVMGNHSDHNIPWSSVQAVYIPFNLGGNHWVLLCADLEAGELVVSNSFMQLNTDIELKKQFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCG
VFTAKFLEYDVTNSKLDTLSQDSMNFCRRQFAVQLWANRPFF