; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026444 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026444
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationscaffold13:16424479..16428808
RNA-Seq ExpressionSpg026444
SyntenySpg026444
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]2.2e-5037.59Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM
        ++G+ LG  N +V VD+I+V  E+  +PIP+KGEIE L+Q+IG  VAWPR LV L ++K +      + +  +S  T    +IK L RY +  +   D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGN-
        ++ +++ IFG ++T+YL PDDI+Q+C M EI  +C+L YIA LW +  ++    RF +VD   I+    ++E+R+++L +        Q+VL+PYN G  
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGN-

Query:  HWVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL
        HW+L V+++ +N VY++D L   +L + + V+N +L+    + + KQ R +  W P+KCP   G +ECGYYV K++RE++ N    ++ L
Subjt:  HWVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]4.3e-5438.73Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM
        V+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWPR LV L ++K+  S   +  +           SIK L RYV   +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
         + +S  IFG ++ +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E     +F IVD   I+P   ++E R ++L      V   Q+VL+PY  G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS
        W+L ++N+ +N VY+LDSL   + +D + V+NT+L++  A+ S  + +  T W  +KCP Q G VECGYYV K++REI+ N    ++ + N++       
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS

Query:  VEEFRVINGAQNWTD
        ++E R+      W D
Subjt:  VEEFRVINGAQNWTD

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]4.3e-5438.73Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM
        V+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWPR LV L ++K+  S   +  +           SIK L RYV   +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
         + +S  IFG ++ +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E     +F IVD   I+P   ++E R ++L      V   Q+VL+PY  G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS
        W+L ++N+ +N VY+LDSL   + +D + V+NT+L++  A+ S  + +  T W  +KCP Q G VECGYYV K++REI+ N    ++ + N++       
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS

Query:  VEEFRVINGAQNWTD
        ++E R+      W D
Subjt:  VEEFRVINGAQNWTD

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]2.8e-5341.64Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM
        V+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWPR LV L ++K+  S   +  +           SIK L RYV   +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
         + +S  IFG ++ +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E     +F IVD   I+P   ++E R ++L      V   Q+VL+PY  G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHN
        W+L ++N+ +N VY+LDSL   + +D + V+NT+L++  A+ S  + +  T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHN

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]5.8e-5139.1Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM
        +N + LG  N +  VD  IV  E+  +PIP K +I+ L Q+IG  VAWPR LV   K+K +   T  K    +S  T    +IK L RY +  +  DD +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
        ++ +S++I G ++T+YL  DDI+Q+C M EI  +C+L YIA LW +  ++    +F IVD   I+     +E R+K+L      V   Q+VL+PYN G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL
        W+L ++N+ +N VY++DSL   +L++ + V+NT+L+   A+ + +Q R +  W P+KCP Q G +ECGYYV K++REI+ N    ++ L
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.2e-4936.9Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM
        ++G+ LG  N +V VD+ +V  E+  +PIP+KG+IE L+Q+IG  VAWPR LV + K+K +   T  + +  +S  T    +IK L RY ++ +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGN-
        ++ +S+ IFG ++T+YL  DDI+Q+C M EI  +C+L YIA LW +  E+    RF +VD   I+    ++E+R+++L          Q+VL+PYN G  
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGN-

Query:  HWVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL
        HW+L ++++ +N VY++D L   +L + + V+N +L+    + + K  R +  W P+KCP   G +ECGYYV K++RE++ N    +  L
Subjt:  HWVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.2e-4936.9Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM
        ++G+ LG  N +V VD+ +V  E+  +PIP+KG+IE L+Q+IG  VAWPR LV + K+K +   T  + +  +S  T    +IK L RY ++ +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTP--PSIKFLYRY-VEKLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGN-
        ++ +S+ IFG ++T+YL  DDI+Q+C M EI  +C+L YIA LW +  E+    RF +VD   I+    ++E+R+++L          Q+VL+PYN G  
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGN-

Query:  HWVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL
        HW+L ++++ +N VY++D L   +L + + V+N +L+    + + K  R +  W P+KCP   G +ECGYYV K++RE++ N    +  L
Subjt:  HWVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQSTSKQQR-KTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.1e-5438.73Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM
        V+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWPR LV L ++K+  S   +  +           SIK L RYV   +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
         + +S  IFG ++ +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E     +F IVD   I+P   ++E R ++L      V   Q+VL+PY  G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS
        W+L ++N+ +N VY+LDSL   + +D + V+NT+L++  A+ S  + +  T W  +KCP Q G VECGYYV K++REI+ N    ++ + N++       
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS

Query:  VEEFRVINGAQNWTD
        ++E R+      W D
Subjt:  VEEFRVINGAQNWTD

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.4e-5341.64Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM
        V+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWPR LV L ++K+  S   +  +           SIK L RYV   +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
         + +S  IFG ++ +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E     +F IVD   I+P   ++E R ++L      V   Q+VL+PY  G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHN
        W+L ++N+ +N VY+LDSL   + +D + V+NT+L++  A+ S  + +  T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHN

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X22.1e-5438.73Show/hide
Query:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM
        V+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWPR LV L ++K+  S   +  +           SIK L RYV   +  +D +
Subjt:  VNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKD--SKHKTVIKHSFPNSATTPPSIKFLYRYVE-KLCKDDPM

Query:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH
         + +S  IFG ++ +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E     +F IVD   I+P   ++E R ++L      V   Q+VL+PY  G H
Subjt:  RVPISDRIFGADRTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNH

Query:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS
        W+L ++N+ +N VY+LDSL   + +D + V+NT+L++  A+ S  + +  T W  +KCP Q G VECGYYV K++REI+ N    ++ + N++       
Subjt:  WVLCVVNVSDNTVYILDSLHPSLLDDIKHVVNTALRVCMAQ-STSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTAL-NSREDRMVVS

Query:  VEEFRVINGAQNWTD
        ++E R+      W D
Subjt:  VEEFRVINGAQNWTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAATGGAGTTCTCCTAGGAAAGCACAATGCCAAAGTGTTTGTTGACATGATCATTGTCGAAAAAGAAAACCCTCGCATTCCAATTCCAGTGAAAGGTGAGATAGA
GTTTCTCTCCCAATCTATAGGTGTTGCAGTTGCTTGGCCTCGTGCTTTGGTTGCTCTATGTAAAGATAAGGACTCGAAACATAAAACAGTGATAAAACACTCATTTCCTA
ATTCGGCAACCACACCTCCATCTATCAAATTTCTGTATCGCTATGTTGAAAAGCTATGCAAGGATGATCCGATGCGAGTGCCTATCAGCGATAGGATATTTGGAGCAGAC
AGAACATTATATCTCATGCCCGATGATATAATGCAATTTTGTAGTATGGTCGAGATATCAAATACTTGTGTATTGGTCTATATTGCGTTCCTTTGGACGCATTTTAAGGA
GACTGGTAGACTAGACAGGTTTAAGATCGTGGACTCAAACGACATTGCACCGGTCTTTGGGACCAAGGAAAGCCGTGCAAAAAGTTTAACTACCGTATTTTCTTCAGTAC
AACCGGGGCAAATGGTACTCCTTCCATATAATCCTGGGAATCACTGGGTATTGTGTGTTGTGAATGTAAGTGACAATACCGTTTATATATTGGACTCCTTACATCCTAGT
CTCTTGGATGACATCAAACATGTTGTAAACACAGCATTGAGGGTTTGTATGGCACAAAGTACATCGAAGCAACAACGAAAGACTTCTTGGATACCTGTAAAGTGTCCTCA
CCAACAAGGTTGCGTTGAATGTGGGTACTACGTGATGAAGTTTATGAGAGAAATTCTACATAATCTAGAGAAGCCCGTCACTGCTCTCAATAGTAGGGAAGATCGTATGG
TGGTGTCCGTTGAAGAATTCAGAGTCATCAATGGAGCTCAAAATTGGACCGATTGCATGGTTGTTTCTTCTCACCAATTGATCACAAACCACCACAAGTGTCTTCCTTGC
TATTCTCAGGCCTTGGAACGGATTGTGGGATCCTCTAGTGGTGGAATTTGGAAGGGAATGAGCTTGGGTTTCAAGCTTTTGGAGAGTTTTGAGTCTTTGGGTTTGAGAAG
CAAAATTGGCAGCATAACACCAAGCTTTTGTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCAATGGAGTTCTCCTAGGAAAGCACAATGCCAAAGTGTTTGTTGACATGATCATTGTCGAAAAAGAAAACCCTCGCATTCCAATTCCAGTGAAAGGTGAGATAGA
GTTTCTCTCCCAATCTATAGGTGTTGCAGTTGCTTGGCCTCGTGCTTTGGTTGCTCTATGTAAAGATAAGGACTCGAAACATAAAACAGTGATAAAACACTCATTTCCTA
ATTCGGCAACCACACCTCCATCTATCAAATTTCTGTATCGCTATGTTGAAAAGCTATGCAAGGATGATCCGATGCGAGTGCCTATCAGCGATAGGATATTTGGAGCAGAC
AGAACATTATATCTCATGCCCGATGATATAATGCAATTTTGTAGTATGGTCGAGATATCAAATACTTGTGTATTGGTCTATATTGCGTTCCTTTGGACGCATTTTAAGGA
GACTGGTAGACTAGACAGGTTTAAGATCGTGGACTCAAACGACATTGCACCGGTCTTTGGGACCAAGGAAAGCCGTGCAAAAAGTTTAACTACCGTATTTTCTTCAGTAC
AACCGGGGCAAATGGTACTCCTTCCATATAATCCTGGGAATCACTGGGTATTGTGTGTTGTGAATGTAAGTGACAATACCGTTTATATATTGGACTCCTTACATCCTAGT
CTCTTGGATGACATCAAACATGTTGTAAACACAGCATTGAGGGTTTGTATGGCACAAAGTACATCGAAGCAACAACGAAAGACTTCTTGGATACCTGTAAAGTGTCCTCA
CCAACAAGGTTGCGTTGAATGTGGGTACTACGTGATGAAGTTTATGAGAGAAATTCTACATAATCTAGAGAAGCCCGTCACTGCTCTCAATAGTAGGGAAGATCGTATGG
TGGTGTCCGTTGAAGAATTCAGAGTCATCAATGGAGCTCAAAATTGGACCGATTGCATGGTTGTTTCTTCTCACCAATTGATCACAAACCACCACAAGTGTCTTCCTTGC
TATTCTCAGGCCTTGGAACGGATTGTGGGATCCTCTAGTGGTGGAATTTGGAAGGGAATGAGCTTGGGTTTCAAGCTTTTGGAGAGTTTTGAGTCTTTGGGTTTGAGAAG
CAAAATTGGCAGCATAACACCAAGCTTTTGTGTTTGA
Protein sequenceShow/hide protein sequence
MVNGVLLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGVAVAWPRALVALCKDKDSKHKTVIKHSFPNSATTPPSIKFLYRYVEKLCKDDPMRVPISDRIFGAD
RTLYLMPDDIMQFCSMVEISNTCVLVYIAFLWTHFKETGRLDRFKIVDSNDIAPVFGTKESRAKSLTTVFSSVQPGQMVLLPYNPGNHWVLCVVNVSDNTVYILDSLHPS
LLDDIKHVVNTALRVCMAQSTSKQQRKTSWIPVKCPHQQGCVECGYYVMKFMREILHNLEKPVTALNSREDRMVVSVEEFRVINGAQNWTDCMVVSSHQLITNHHKCLPC
YSQALERIVGSSSGGIWKGMSLGFKLLESFESLGLRSKIGSITPSFCV