; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037330 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037330
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold8:845018..848973
RNA-Seq ExpressionSpg037330
SyntenySpg037330
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia]8.6e-4346.49Show/hide
Query:  GAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY
        GA RKTVY+ ++K WF+ LL P +W + EV+D LFM +RKKL+ RPDLC  KF TGDLV+A++ RR + + A       +      +YDW  R +++M Y
Subjt:  GAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY

Query:  VMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPL
          G H+D+ + W  V+A+YIPFN+ G HWV++C DLE GE+VV DS   + TD  +E   K + T++P++L KC VMKV+  LP+
Subjt:  VMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPL

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]9.8e-4744.08Show/hide
Query:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCAD
        MF+  KL  RP+LC+ KF TGD+++++FLR  +    ++++ +    R+  DYDW  R  +++ Y+ G HSD++  W  V+AVY+P+N+GG HW+++C D
Subjt:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCAD

Query:  LEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRR
         + GEL+V DSFM +     +E++ K + T++P L+ + GV   K  +PL  WR++R    PQ    GDCG+F   FFEYDVT    DTL+Q  M F RR
Subjt:  LEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRR

Query:  QFAVQLWANRS
        QFAVQLWAN+S
Subjt:  QFAVQLWANRS

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]1.2e-4138.17Show/hide
Query:  RKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTVMDYVMGNHSD
        R T    + K WF  LL P   + +E ID+L M   +K++    L + +F  GD+++++ LRR +   A    G       YDW + +T+  YV+G  SD
Subjt:  RKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTVMDYVMGNHSD

Query:  HNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGV
        ++  WS  + VY   N+GGNHWV++  DL  G+L V DS   +    D+EK  K + T++P +L   G++ ++  LP+  WR++R   VPQ     DC +
Subjt:  HNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGV

Query:  FTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF
        F  +FFEYDV  SK+DTL Q ++   RRQ+AVQ+WA R FF
Subjt:  FTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]1.7e-3846.19Show/hide
Query:  VTGDLVVADFLRRDEVLEA--LDGRFD----RIDDY--DWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLN
        V GD+V+      D V E   +DGR D    R D +  DWS  K V+ YV G H+D+++PWS+V+AVY+PFNL G HWVL+CAD +  EL++ DS + L+
Subjt:  VTGDLVVADFLRRDEVLEA--LDGRFD----RIDDY--DWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLN

Query:  TDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF
         + D+E + + V    P LL    VM+    L ++ W L+R+    Q  +SGDCG+FT KFFEYDVT SK+ TL+QD   + RRQ+A+Q+WANR+ F
Subjt:  TDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF

XP_038885861.1 sentrin-specific protease [Benincasa hispida]4.1e-3745.41Show/hide
Query:  DEVLEA---LDGRFD----RIDDY--DWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHV
        D+V++    +DGR D    R D +  DWS+   V+ YV G H+D+++PWS+V+A+Y+PFNL   HWVL+C D +  EL+V DS + L+ + D+E + + +
Subjt:  DEVLEA---LDGRFD----RIDDY--DWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHV

Query:  RTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF
              LL    VM+    L ++ W L+R+  VPQ   SGDCG+FT KFFEYDVT SK+DTL+QD M + RRQ+A+Q+ ANR+ F
Subjt:  RTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF

TrEMBL top hitse value%identityAlignment
A0A5A7TSP7 Ulp1-like peptidase1.4e-3528.2Show/hide
Query:  MQVWAYEIVSLVIGRVANRVNENAIPHILKWTCSHSPTYYRISTKVFGSRVVSFICHLFFTHLFRYLRLSSYVSGEDHRDIEMADVTVHDPAQIDEAVGS
        +QVWAYE +  +IG   ++VN++AIP +L+W C  SP    IS +VF S +      +  T     L+++S    E+ R   +         ++ E V  
Subjt:  MQVWAYEIVSLVIGRVANRVNENAIPHILKWTCSHSPTYYRISTKVFGSRVVSFICHLFFTHLFRYLRLSSYVSGEDHRDIEMADVTVHDPAQIDEAVGS

Query:  MEEKEQDEGEEIKDKKKKNKRSCECIELLQRLNERVDAMEIDL---KSGLKAIKKFLRRFSKGKYVDPEKYFGSDEGPSKEGDEGPSKGGDMSENPEKDD
              DE E  K KK+K+K   +  + ++ L +RV  +E  L   KS +  +K  +    K              G  ++GDEG  K   +SE  E + 
Subjt:  MEEKEQDEGEEIKDKKKKNKRSCECIELLQRLNERVDAMEIDL---KSGLKAIKKFLRRFSKGKYVDPEKYFGSDEGPSKEGDEGPSKGGDMSENPEKDD

Query:  GDDGPSEGGDKGPLGGSEHSEKGDEPTEENPKGGDEGSLGGSEHP--EKGDEPAEENPKGGDEGPSGGSEHFQ--KEVKGTDEADEGHTNLPVELLQSIE
                         E  +KGDE   E           G E P     DE   E      + P     H +  +  + +       T LP   L S  
Subjt:  GDDGPSEGGDKGPLGGSEHSEKGDEPTEENPKGGDEGSLGGSEHP--EKGDEPAEENPKGGDEGPSGGSEHFQ--KEVKGTDEADEGHTNLPVELLQSIE

Query:  EIAAFDGVTHYLVPK-------------TEEPNNGAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRR
          +  + + +  + K             T++  +   R+T +  +SK +F+ L     W+S+E +DALF+FIR K+ A        F T D +    L  
Subjt:  EIAAFDGVTHYLVPK-------------TEEPNNGAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRR

Query:  DEVLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLS
           L     + +R   +DW     ++DYV+G+  D   PW+SV+ VY PFN+ GNHWVL+C DL + ++ V DS   L T  ++      +R L+P LL 
Subjt:  DEVLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLS

Query:  KCGVM--KVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF
          G    + + +     W +     +P  +++ DCGVFT K+FEY  T   LDTL Q++M + R+Q A QLW N   +
Subjt:  KCGVM--KVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF

A0A5A7UGY3 Ulp1-like peptidase7.6e-3728.69Show/hide
Query:  MQVWAYEIVSLVIGRVANRVNENAIPHILKWTCSHSPTYYRISTKVFGSRVVSFICHLFFTHLFRYLRLSSYVSGEDHRDIEMADVTVHDPAQIDEAVGS
        +QVWAYE +  +IG   ++VN++AIP +L+W C  SP    IS +VF S +      +  T     L+++S    E+ R   +         ++ E V  
Subjt:  MQVWAYEIVSLVIGRVANRVNENAIPHILKWTCSHSPTYYRISTKVFGSRVVSFICHLFFTHLFRYLRLSSYVSGEDHRDIEMADVTVHDPAQIDEAVGS

Query:  MEEKEQDEGEEIKDKKKKNKRSCECIELLQRLNERVDAMEIDLKSGLKAIKKFLRRFSKGKYVDPEKYFGSDEGPSKEGDEGPSK--GGDMSENPEKDDG
         ++ ++ E ++ K K KK  R+         L +RV  +E  L S    I +      KG      K+ G      ++GDEG  K   G +    E  D 
Subjt:  MEEKEQDEGEEIKDKKKKNKRSCECIELLQRLNERVDAMEIDLKSGLKAIKKFLRRFSKGKYVDPEKYFGSDEGPSKEGDEGPSK--GGDMSENPEKDDG

Query:  DDGPSEGGDKGPLGGSEHSEKGDEPTEENPKGGDEGSLGGSEHPEKGDEPAEENPKGGDEGPSGGSEHF---------QKEVKGTDEADEGHTNLPVELL
        D+  +E  D G         K DE  E   K GDE  L      +K D   EE     D+      E F          +  + +       T LP   L
Subjt:  DDGPSEGGDKGPLGGSEHSEKGDEPTEENPKGGDEGSLGGSEHPEKGDEPAEENPKGGDEGPSGGSEHF---------QKEVKGTDEADEGHTNLPVELL

Query:  QSIEEIAAFDGVTHYLVPK-------------TEEPNNGAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVAD
         S    + ++ + +  + K             T++  N   R+T +  +SK +F+ L     W+S+E +DA F+FI  K+ A        F T D +   
Subjt:  QSIEEIAAFDGVTHYLVPK-------------TEEPNNGAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVAD

Query:  FLRRDEVLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMP
         L     L     + +R   +DW     ++DYV+G+  D   PW+SV+ VY PFN+ GNHWVL+C DL + ++ V DS   L T  D+      +R L+P
Subjt:  FLRRDEVLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMP

Query:  ILLSKCGVM--KVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF
         LL   G    + + +     W +     +P  +++ DCGVFT K+FEY      LDTL Q++M + R+Q A QLW N   +
Subjt:  ILLSKCGVM--KVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF

A0A6J1D3R7 uncharacterized protein LOC1110169933.2e-4347.03Show/hide
Query:  GAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY
        GA RKTVY+ ++K WF+ LL P +W + EV+D LFM +RKKL+ RPDLC  KF TGDLV+A++ RR + L A       +      +YDW  R +++M Y
Subjt:  GAERKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRID-----DYDW-SRWKTVMDY

Query:  VMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPL
          G H+D+ + W  V+A+YIPFN+ G HWV++C DLE GE+VV DS   + TD  +E   K + T++P++L KC VMKV+  LP+
Subjt:  VMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPL

A0A6J1DLV0 uncharacterized protein LOC1110216464.7e-4744.08Show/hide
Query:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCAD
        MF+  KL  RP+LC+ KF TGD+++++FLR  +    ++++ +    R+  DYDW  R  +++ Y+ G HSD++  W  V+AVY+P+N+GG HW+++C D
Subjt:  MFIRKKLDARPDLCQCKFVTGDLVVADFLRRDE----VLEALDGRFDRI-DDYDW-SRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLMCAD

Query:  LEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRR
         + GEL+V DSFM +     +E++ K + T++P L+ + GV   K  +PL  WR++R    PQ    GDCG+F   FFEYDVT    DTL+Q  M F RR
Subjt:  LEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRR

Query:  QFAVQLWANRS
        QFAVQLWAN+S
Subjt:  QFAVQLWANRS

A0A6J1DY60 uncharacterized protein LOC1110252736.0e-4238.17Show/hide
Query:  RKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTVMDYVMGNHSD
        R T    + K WF  LL P   + +E ID+L M   +K++    L + +F  GD+++++ LRR +   A    G       YDW + +T+  YV+G  SD
Subjt:  RKTVYAYRSKEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEAL--DGRFDRIDDYDWSRWKTVMDYVMGNHSD

Query:  HNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGV
        ++  WS  + VY   N+GGNHWV++  DL  G+L V DS   +    D+EK  K + T++P +L   G++ ++  LP+  WR++R   VPQ     DC +
Subjt:  HNIPWSSVEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGV

Query:  FTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF
        F  +FFEYDV  SK+DTL Q ++   RRQ+AVQ+WA R FF
Subjt:  FTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLWANRSFF

SwissProt top hitse value%identityAlignment
Q01033 Uncharacterized gene 48 protein4.3e-0541.18Show/hide
Query:  GSDEGP--SKEGDEGPSKG---GDMSENPEKDDGDDGPSEGGDKGPLGGSEHSEKGDEPTEENPKGGDEGSLG---GSEHPEKGDEPAEENPKGGDEGPS
        G DEG     EGDEG  +G   GD  E+  +D+GD+G  EG D+G  G  E  ++GDE  +E    GDEG  G   G E  ++GDE  +E  +G DEG  
Subjt:  GSDEGP--SKEGDEGPSKG---GDMSENPEKDDGDDGPSEGGDKGPLGGSEHSEKGDEPTEENPKGGDEGSLG---GSEHPEKGDEPAEENPKGGDEGPS

Query:  GGSEHFQKEVKGTDEADEG
        G  E  + E +G +  DEG
Subjt:  GGSEHFQKEVKGTDEADEG

Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases3.9e-0924.87Show/hide
Query:  KEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDR-IDDYDWSRWKTVMDYVMGNHSDHNIP-WSSVE
        KE F  +    H+ S++V+D L  F R  L  R D    + +  D++ + F+ +   L  L  +F + +   D+     ++D ++G    + +  ++  +
Subjt:  KEWFKTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDR-IDDYDWSRWKTVMDYVMGNHSDHNIP-WSSVE

Query:  AVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFT
         VY+PFN    HWV +C DL+A ++ + DS +QL  D  +  + + +  ++P L  +         + L  + L R   +PQ+    D GV +
Subjt:  AVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFT

AT4G08430.1 Ulp1 protease family protein1.5e-0828.68Show/hide
Query:  VEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKV-KQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFF
        V+ +Y    + GNHWV +  DL    + V DS   L TD ++  Q   V T++P +LS     K  +++    EW  KR   +P+  D+ DC +++ K+ 
Subjt:  VEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKV-KQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFF

Query:  EYDVTKSKLDTLSQDSMDFCRRQFAVQLW
        E        D L  ++M     + AV+++
Subjt:  EYDVTKSKLDTLSQDSMDFCRRQFAVQLW

AT5G45570.1 Ulp1 protease family protein3.5e-1030.23Show/hide
Query:  VEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKV-KQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFF
        V+ +Y    + GNHWV +  DL    + V DS   L TD ++  Q   V T++P +LS     K  +++    EW  KR   +P+  D GDC +++ K+ 
Subjt:  VEAVYIPFNLGGNHWVLMCADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKV-KQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFF

Query:  EYDVTKSKLDTLSQDSMDFCRRQFAVQLW
        E        D L  ++M   R + AV+++
Subjt:  EYDVTKSKLDTLSQDSMDFCRRQFAVQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGTATGGGCATACGAGATTGTGTCCTTAGTGATAGGGCGTGTTGCAAATCGTGTGAACGAGAATGCCATACCACACATTCTTAAATGGACATGCTCCCACTCGCC
TACGTACTACCGCATAAGCACTAAGGTGTTTGGGTCGAGGGTGGTGAGTTTCATTTGCCACTTGTTTTTTACTCATCTATTTCGATACCTAAGATTAAGTTCATATGTAT
CAGGCGAGGATCACAGAGACATTGAGATGGCGGATGTGACTGTGCACGACCCTGCCCAGATAGATGAGGCAGTAGGATCGATGGAAGAGAAAGAACAGGACGAGGGTGAA
GAGATTAAAGATAAGAAGAAAAAAAATAAGAGAAGTTGCGAATGCATTGAGTTGCTACAACGTCTCAATGAACGTGTGGACGCCATGGAAATCGACTTGAAGTCTGGCTT
GAAGGCCATCAAGAAATTCTTACGAAGATTTTCTAAGGGCAAATATGTAGATCCGGAGAAATACTTTGGATCAGATGAGGGTCCTTCTAAAGAAGGTGATGAGGGTCCGT
CTAAAGGAGGTGACATGTCTGAAAACCCAGAAAAAGATGATGGGGATGATGGTCCTTCTGAAGGAGGTGATAAGGGTCCTTTGGGCGGGTCCGAACACTCTGAAAAAGGA
GATGAGCCAACTGAAGAGAACCCAAAAGGAGGTGATGAGGGTTCTTTGGGTGGGTCCGAACACCCAGAAAAAGGAGATGAGCCAGCTGAAGAGAACCCAAAAGGAGGTGA
TGAGGGTCCTTCGGGCGGGTCCGAACACTTTCAAAAGGAGGTGAAAGGAACTGACGAGGCAGATGAGGGACACACAAACCTGCCTGTTGAGTTGTTACAGAGTATTGAAG
AGATAGCGGCATTTGATGGAGTCACACACTATTTGGTACCCAAAACTGAGGAACCCAACAATGGTGCAGAACGGAAGACGGTATACGCATATAGGAGCAAAGAATGGTTT
AAAACCTTATTGACACCGTCACATTGGATGAGCAATGAGGTCATCGATGCGCTATTCATGTTTATTCGGAAAAAACTAGATGCTCGTCCGGACTTGTGCCAATGCAAATT
TGTGACAGGAGACTTGGTTGTCGCGGATTTTCTACGACGAGATGAAGTACTAGAAGCACTGGATGGTAGGTTTGATCGTATTGATGACTATGACTGGAGTAGATGGAAGA
CCGTCATGGATTATGTTATGGGCAACCATTCAGACCATAACATTCCTTGGAGTTCAGTTGAAGCGGTCTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGATG
TGTGCTGACTTAGAGGCCGGCGAGTTGGTGGTGTCTGATTCTTTTATGCAGTTGAATACAGACATCGACGTAGAGAAACAGTTCAAACATGTTCGCACACTGATGCCAAT
CTTGCTCTCTAAGTGTGGTGTAATGAAGGTGAAGCAGACCCTTCCACTAAATGAATGGAGGTTGAAGAGGAACAAGCTAGTGCCACAACTAAAGGATAGTGGAGATTGTG
GGGTATTCACTGCCAAATTTTTTGAATATGATGTAACGAAATCCAAACTGGACACCCTTAGCCAGGATAGCATGGACTTTTGTAGACGTCAATTCGCTGTTCAACTTTGG
GCCAATAGGTCATTCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGGTATGGGCATACGAGATTGTGTCCTTAGTGATAGGGCGTGTTGCAAATCGTGTGAACGAGAATGCCATACCACACATTCTTAAATGGACATGCTCCCACTCGCC
TACGTACTACCGCATAAGCACTAAGGTGTTTGGGTCGAGGGTGGTGAGTTTCATTTGCCACTTGTTTTTTACTCATCTATTTCGATACCTAAGATTAAGTTCATATGTAT
CAGGCGAGGATCACAGAGACATTGAGATGGCGGATGTGACTGTGCACGACCCTGCCCAGATAGATGAGGCAGTAGGATCGATGGAAGAGAAAGAACAGGACGAGGGTGAA
GAGATTAAAGATAAGAAGAAAAAAAATAAGAGAAGTTGCGAATGCATTGAGTTGCTACAACGTCTCAATGAACGTGTGGACGCCATGGAAATCGACTTGAAGTCTGGCTT
GAAGGCCATCAAGAAATTCTTACGAAGATTTTCTAAGGGCAAATATGTAGATCCGGAGAAATACTTTGGATCAGATGAGGGTCCTTCTAAAGAAGGTGATGAGGGTCCGT
CTAAAGGAGGTGACATGTCTGAAAACCCAGAAAAAGATGATGGGGATGATGGTCCTTCTGAAGGAGGTGATAAGGGTCCTTTGGGCGGGTCCGAACACTCTGAAAAAGGA
GATGAGCCAACTGAAGAGAACCCAAAAGGAGGTGATGAGGGTTCTTTGGGTGGGTCCGAACACCCAGAAAAAGGAGATGAGCCAGCTGAAGAGAACCCAAAAGGAGGTGA
TGAGGGTCCTTCGGGCGGGTCCGAACACTTTCAAAAGGAGGTGAAAGGAACTGACGAGGCAGATGAGGGACACACAAACCTGCCTGTTGAGTTGTTACAGAGTATTGAAG
AGATAGCGGCATTTGATGGAGTCACACACTATTTGGTACCCAAAACTGAGGAACCCAACAATGGTGCAGAACGGAAGACGGTATACGCATATAGGAGCAAAGAATGGTTT
AAAACCTTATTGACACCGTCACATTGGATGAGCAATGAGGTCATCGATGCGCTATTCATGTTTATTCGGAAAAAACTAGATGCTCGTCCGGACTTGTGCCAATGCAAATT
TGTGACAGGAGACTTGGTTGTCGCGGATTTTCTACGACGAGATGAAGTACTAGAAGCACTGGATGGTAGGTTTGATCGTATTGATGACTATGACTGGAGTAGATGGAAGA
CCGTCATGGATTATGTTATGGGCAACCATTCAGACCATAACATTCCTTGGAGTTCAGTTGAAGCGGTCTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGATG
TGTGCTGACTTAGAGGCCGGCGAGTTGGTGGTGTCTGATTCTTTTATGCAGTTGAATACAGACATCGACGTAGAGAAACAGTTCAAACATGTTCGCACACTGATGCCAAT
CTTGCTCTCTAAGTGTGGTGTAATGAAGGTGAAGCAGACCCTTCCACTAAATGAATGGAGGTTGAAGAGGAACAAGCTAGTGCCACAACTAAAGGATAGTGGAGATTGTG
GGGTATTCACTGCCAAATTTTTTGAATATGATGTAACGAAATCCAAACTGGACACCCTTAGCCAGGATAGCATGGACTTTTGTAGACGTCAATTCGCTGTTCAACTTTGG
GCCAATAGGTCATTCTTTTAG
Protein sequenceShow/hide protein sequence
MQVWAYEIVSLVIGRVANRVNENAIPHILKWTCSHSPTYYRISTKVFGSRVVSFICHLFFTHLFRYLRLSSYVSGEDHRDIEMADVTVHDPAQIDEAVGSMEEKEQDEGE
EIKDKKKKNKRSCECIELLQRLNERVDAMEIDLKSGLKAIKKFLRRFSKGKYVDPEKYFGSDEGPSKEGDEGPSKGGDMSENPEKDDGDDGPSEGGDKGPLGGSEHSEKG
DEPTEENPKGGDEGSLGGSEHPEKGDEPAEENPKGGDEGPSGGSEHFQKEVKGTDEADEGHTNLPVELLQSIEEIAAFDGVTHYLVPKTEEPNNGAERKTVYAYRSKEWF
KTLLTPSHWMSNEVIDALFMFIRKKLDARPDLCQCKFVTGDLVVADFLRRDEVLEALDGRFDRIDDYDWSRWKTVMDYVMGNHSDHNIPWSSVEAVYIPFNLGGNHWVLM
CADLEAGELVVSDSFMQLNTDIDVEKQFKHVRTLMPILLSKCGVMKVKQTLPLNEWRLKRNKLVPQLKDSGDCGVFTAKFFEYDVTKSKLDTLSQDSMDFCRRQFAVQLW
ANRSFF