CuGenDBv2

Lag0029042 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029042
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr8:34561495..34568809
RNA-Seq Expression: Lag0029042
Synteny: Lag0029042
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR015410 - Domain of unknown function DUF1985
  IPR038765 - Papain-like cysteine peptidase superfamily
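The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of how such a string could be parsed and the gene span computed is shown here; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr8:34561495..34568809")
print(loc.chrom, loc.length)  # prints: chr8 7315
```

Under that assumption the gene spans 7,315 bp on chr8.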


Homology
GenBank top hits (hit | e value | % identity | alignment)
KAA0054867.1 Ulp1-like peptidase [Cucumis melo var. makuwa] | e value: 1.2e-75 | % identity: 28.36
Query:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ
        KI P+ +F + ++C SHL KT  NIK KL   QL +FR+T FGHFLD +++FNG LIHY LLREV +   D ISF + G   +FGRREF++ITG+    +
Subjt:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ

Query:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV
             V ++RL   +  D   +   +L+ +F  + +E  D+D VK+ + YFIE++++G++R+ ++D       DDW  F N DW +++F +T+  LK+ +
Subjt:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV

Query:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM
          +    K+++  + + Y++ GFP+A Q   YE++ ++ G   +++ND+AIPR+LRW C  SP    +S +VF S M   +   E+   EE+++      
Subjt:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM

Query:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR
                                 A G++   +N       ++ N  ++ V     ++   K+ K  + ++K +K  ++ +++ + V+   E  L  I+
Subjt:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR

Query:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV
          +  L KG      K+I                        GL   G +G         DH+      ++H    EV + G+ +V+E    +     G+
Subjt:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV

Query:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY
         + I VV+ +  + ++    Q  P  R   RKR + + L +P+         +     + +A+ YDP+ +I   LD    R   W+  +  +D +R T +
Subjt:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY

Query:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW
          + K +FRDL    +W+ DE +D++F+FI+ K         + FT  D         +  LYKE      P    +DW     ++ Y  G   D   PW
Subjt:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW

Query:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN
        +++D VY P+N+ G HWVL+C+DL   +V V DS+  L   E +   L  + Q++P LL   G  D R      +  WP+    S P Q ++ DC  F  
Subjt:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN

Query:  LGLKFGGIGPGKHTVTDE
           ++   G G  T+  E
Subjt:  LGLKFGGIGPGKHTVTDE

TYJ95796.1 Ulp1-like peptidase [Cucumis melo var. makuwa] | e value: 1.6e-75 | % identity: 28.24
Query:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ
        KI P+ +F + ++C SHL KT  NIK KL   QL +FR+T FGHFLD +++FNG LIHY LLREV +   D ISF + G   +FGRREF+++TG+    +
Subjt:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ

Query:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV
             V ++RL   +  D   +   +L+ +F  + +E  D+D VK+ + YFIE++++G++R+ ++D       DDW  F N DW +++F +T+  LK+ +
Subjt:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV

Query:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM
          +    K+++  + + Y++ GFP+A Q   YE++ ++ G   +++ND+AIPR+LRW C  SP    +S +VF S M   +   E+   EE+++      
Subjt:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM

Query:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR
                                 A G++   +N       ++ N  ++ V     ++   K+ K  + ++K +K  ++ +++ + V+   E  L  I+
Subjt:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR

Query:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV
          +  L KG      K+I                        GL   G +G         DH+      ++H    EV + G+ +V+E    +     G+
Subjt:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV

Query:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY
         + I VV+ +  + ++    Q  P  R   RKR + + L +P+         +     + +A+ YDP+ +I   LD    R   W+  +  +D +R T +
Subjt:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY

Query:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW
          + K +FRDL    +W+ DE +D++F+FI+ K         + FT  D         +  LYKE      P    +DW     ++ Y  G   D   PW
Subjt:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW

Query:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN
        +++D VY P+N+ G HWVL+C+DL   +V V DS+  L   E +   L  + Q++P LL   G  D R      +  WP+    S P Q ++ DC  F  
Subjt:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN

Query:  LGLKFGGIGPGKHTVTDE
           ++   G G  T+  E
Subjt:  LGLKFGGIGPGKHTVTDE

TYK23325.1 Ulp1-like peptidase [Cucumis melo var. makuwa] | e value: 1.6e-75 | % identity: 28.24
Query:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ
        KI P+ +F + ++C SHL KT  NIK KL   QL +FR+T FGHFLD +++FNG LIHY LLREV +   D ISF + G   +FGRREF+++TG+    +
Subjt:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ

Query:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV
             V ++RL   +  D   +   +L+ +F  + +E  D+D VK+ + YFIE++++G++R+ ++D       DDW  F N DW +++F +T+  LK+ +
Subjt:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV

Query:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM
          +    K+++  + + Y++ GFP+A Q   YE++ ++ G   +++ND+AIPR+LRW C  SP    +S +VF S M   +   E+   EE+++      
Subjt:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM

Query:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR
                                 A G++   +N       ++ N  ++ V     ++   K+ K  + ++K +K  ++ +++ + V+   E  L  I+
Subjt:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR

Query:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV
          +  L KG      K+I                        GL   G +G         DH+      ++H    EV + G+ +V+E    +     G+
Subjt:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV

Query:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY
         + I VV+ +  + ++    Q  P  R   RKR + + L +P+         +     + +A+ YDP+ +I   LD    R   W+  +  +D +R T +
Subjt:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY

Query:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW
          + K +FRDL    +W+ DE +D++F+FI+ K         + FT  D         +  LYKE      P    +DW     ++ Y  G   D   PW
Subjt:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW

Query:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN
        +++D VY P+N+ G HWVL+C+DL   +V V DS+  L   E +   L  + Q++P LL   G  D R      +  WP+    S P Q ++ DC  F  
Subjt:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN

Query:  LGLKFGGIGPGKHTVTDE
           ++   G G  T+  E
Subjt:  LGLKFGGIGPGKHTVTDE

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia] | e value: 1.7e-77 | % identity: 42.62
Query:  DYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FPA LT  +H+ KT   IK +LT TQL MFRQTCFG  LD  ++FNG LIH+ LLREV EPR DVISF++ G++VSFG+REFDLITG+ HR   V  +
Subjt:  DYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTVGGKAVS
        +   RLR  Y  D + +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT+LLG +D W+ FCN DWS +IFD+TI  LK  +  K   
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTVGGKAVS

Query:  YKERTDGKQ---ETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA
        Y+++        ETYSLYGFPYAFQ   YET+S+        L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M RV+ PP  
Subjt:  YKERTDGKQ---ETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA

Query:  ---PSPPPPP-----PPPPPPPP----PPPPAALGDIPVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDL
           P PP  P     P PP  P     P PPA +   P+ED +V+    +     A +G G       +    R+ +R K L++ V  I+D + +     
Subjt:  ---PSPPPPP-----PPPPPPPP----PPPPAALGDIPVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDL

Query:  KVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTDDGG-GGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHG
                 LK I+ ++++L+KGKF D+SKY     G DD G    RP    + DGG       Q   +D     D E   + T  HG
Subjt:  KVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTDDGG-GGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHG

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia] | e value: 4.0e-79 | % identity: 52.86
Query:  MALVPKIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI
        M +  KI   D+FPAAL+  +H+GKT   +K +LT +QL MF QTCFG  L  +++FNG L+H+ LLREV EP+ D+ISF + G +VSFG+REFDLITG+
Subjt:  MALVPKIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI

Query:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL
        RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DEDAVK+ I YFIELAMMG+ERK +MDTSLLG +D W+ FCN DWS +IF++T+  L
Subjt:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL

Query:  KKTVGGKAVSYKERT---DGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE
        K  +  K   YK++        ETYSLY FPYAFQ   YET+S+L+ RVA RLND+AIPR+LRWSC +S     L REVF +  ++V   L A++ E
Subjt:  KKTVGGKAVSYKERT---DGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE

TrEMBL top hits (hit | e value | % identity | alignment)
A0A5A7UJU4 Ulp1-like peptidase | e value: 5.8e-76 | % identity: 28.36
Query:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ
        KI P+ +F + ++C SHL KT  NIK KL   QL +FR+T FGHFLD +++FNG LIHY LLREV +   D ISF + G   +FGRREF++ITG+    +
Subjt:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ

Query:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV
             V ++RL   +  D   +   +L+ +F  + +E  D+D VK+ + YFIE++++G++R+ ++D       DDW  F N DW +++F +T+  LK+ +
Subjt:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV

Query:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM
          +    K+++  + + Y++ GFP+A Q   YE++ ++ G   +++ND+AIPR+LRW C  SP    +S +VF S M   +   E+   EE+++      
Subjt:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM

Query:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR
                                 A G++   +N       ++ N  ++ V     ++   K+ K  + ++K +K  ++ +++ + V+   E  L  I+
Subjt:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR

Query:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV
          +  L KG      K+I                        GL   G +G         DH+      ++H    EV + G+ +V+E    +     G+
Subjt:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV

Query:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY
         + I VV+ +  + ++    Q  P  R   RKR + + L +P+         +     + +A+ YDP+ +I   LD    R   W+  +  +D +R T +
Subjt:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY

Query:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW
          + K +FRDL    +W+ DE +D++F+FI+ K         + FT  D         +  LYKE      P    +DW     ++ Y  G   D   PW
Subjt:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW

Query:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN
        +++D VY P+N+ G HWVL+C+DL   +V V DS+  L   E +   L  + Q++P LL   G  D R      +  WP+    S P Q ++ DC  F  
Subjt:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN

Query:  LGLKFGGIGPGKHTVTDE
           ++   G G  T+  E
Subjt:  LGLKFGGIGPGKHTVTDE

A0A5D3BA80 Ulp1-like peptidase | e value: 7.6e-76 | % identity: 28.24
Query:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ
        KI P+ +F + ++C SHL KT  NIK KL   QL +FR+T FGHFLD +++FNG LIHY LLREV +   D ISF + G   +FGRREF+++TG+    +
Subjt:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ

Query:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV
             V ++RL   +  D   +   +L+ +F  + +E  D+D VK+ + YFIE++++G++R+ ++D       DDW  F N DW +++F +T+  LK+ +
Subjt:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV

Query:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM
          +    K+++  + + Y++ GFP+A Q   YE++ ++ G   +++ND+AIPR+LRW C  SP    +S +VF S M   +   E+   EE+++      
Subjt:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM

Query:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR
                                 A G++   +N       ++ N  ++ V     ++   K+ K  + ++K +K  ++ +++ + V+   E  L  I+
Subjt:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR

Query:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV
          +  L KG      K+I                        GL   G +G         DH+      ++H    EV + G+ +V+E    +     G+
Subjt:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV

Query:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY
         + I VV+ +  + ++    Q  P  R   RKR + + L +P+         +     + +A+ YDP+ +I   LD    R   W+  +  +D +R T +
Subjt:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY

Query:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW
          + K +FRDL    +W+ DE +D++F+FI+ K         + FT  D         +  LYKE      P    +DW     ++ Y  G   D   PW
Subjt:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW

Query:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN
        +++D VY P+N+ G HWVL+C+DL   +V V DS+  L   E +   L  + Q++P LL   G  D R      +  WP+    S P Q ++ DC  F  
Subjt:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN

Query:  LGLKFGGIGPGKHTVTDE
           ++   G G  T+  E
Subjt:  LGLKFGGIGPGKHTVTDE

A0A5D3DIC1 Ulp1-like peptidase | e value: 7.6e-76 | % identity: 28.24
Query:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ
        KI P+ +F + ++C SHL KT  NIK KL   QL +FR+T FGHFLD +++FNG LIHY LLREV +   D ISF + G   +FGRREF+++TG+    +
Subjt:  KIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQ

Query:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV
             V ++RL   +  D   +   +L+ +F  + +E  D+D VK+ + YFIE++++G++R+ ++D       DDW  F N DW +++F +T+  LK+ +
Subjt:  HVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFE-SDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTV

Query:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM
          +    K+++  + + Y++ GFP+A Q   YE++ ++ G   +++ND+AIPR+LRW C  SP    +S +VF S M   +   E+   EE+++      
Subjt:  GGKAVSYKERTDGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDM--ARVTTELVASEEEIQFMDRVM

Query:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR
                                 A G++   +N       ++ N  ++ V     ++   K+ K  + ++K +K  ++ +++ + V+   E  L  I+
Subjt:  QPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDLKVIKSMEKDLKAIR

Query:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV
          +  L KG      K+I                        GL   G +G         DH+      ++H    EV + G+ +V+E    +     G+
Subjt:  KFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEIGEHEVIEIGEHVEAPIEGV

Query:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY
         + I VV+ +  + ++    Q  P  R   RKR + + L +P+         +     + +A+ YDP+ +I   LD    R   W+  +  +D +R T +
Subjt:  GKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWK-------DTQEERKKRKAVKYDPLPQIPHDLDAPFKR---WLDTEDPEDNVRTTAY

Query:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW
          + K +FRDL    +W+ DE +D++F+FI+ K         + FT  D         +  LYKE      P    +DW     ++ Y  G   D   PW
Subjt:  AVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPW

Query:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN
        +++D VY P+N+ G HWVL+C+DL   +V V DS+  L   E +   L  + Q++P LL   G  D R      +  WP+    S P Q ++ DC  F  
Subjt:  STIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQR--WPLRRELSRPQQNSSGDCVSFEN

Query:  LGLKFGGIGPGKHTVTDE
           ++   G G  T+  E
Subjt:  LGLKFGGIGPGKHTVTDE

A0A6J1DJX9 uncharacterized protein LOC111020757 | e value: 8.1e-78 | % identity: 42.62
Query:  DYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN
        D+FPA LT  +H+ KT   IK +LT TQL MFRQTCFG  LD  ++FNG LIH+ LLREV EPR DVISF++ G++VSFG+REFDLITG+ HR   V  +
Subjt:  DYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGN

Query:  VSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTVGGKAVS
        +   RLR  Y  D + +K  EL+++F    F  DED VK+ I YFIELAMMG+ERKQ +DT+LLG +D W+ FCN DWS +IFD+TI  LK  +  K   
Subjt:  VSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTVGGKAVS

Query:  YKERTDGKQ---ETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA
        Y+++        ETYSLYGFPYAFQ   YET+S+        L+D+AIPR+LRWSC +S     L+ EVF +  ++V   L+A++ + Q M RV+ PP  
Subjt:  YKERTDGKQ---ETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRA

Query:  ---PSPPPPP-----PPPPPPPP----PPPPAALGDIPVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDL
           P PP  P     P PP  P     P PPA +   P+ED +V+    +     A +G G       +    R+ +R K L++ V  I+D + +     
Subjt:  ---PSPPPPP-----PPPPPPPP----PPPPAALGDIPVEDNIVEDLGTENPNEVA-EGVG------TSGTNDRVCKRCKVLEDEVKVIKDDVKEIKEDL

Query:  KVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTDDGG-GGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHG
                 LK I+ ++++L+KGKF D+SKY     G DD G    RP    + DGG       Q   +D     D E   + T  HG
Subjt:  KVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTDDGG-GGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHG

A0A6J1DRZ7 uncharacterized protein LOC111023847 | e value: 1.9e-79 | % identity: 52.86
Query:  MALVPKIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI
        M +  KI   D+FPAAL+  +H+GKT   +K +LT +QL MF QTCFG  L  +++FNG L+H+ LLREV EP+ D+ISF + G +VSFG+REFDLITG+
Subjt:  MALVPKIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGI

Query:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL
        RH    V  +V + RLR LY  D  S+K  EL+++F    FE+DEDAVK+ I YFIELAMMG+ERK +MDTSLLG +D W+ FCN DWS +IF++T+  L
Subjt:  RHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMTIFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGL

Query:  KKTVGGKAVSYKERT---DGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE
        K  +  K   YK++        ETYSLY FPYAFQ   YET+S+L+ RVA RLND+AIPR+LRWSC +S     L REVF +  ++V   L A++ E
Subjt:  KKTVGGKAVSYKERT---DGKQETYSLYGFPYAFQ--TYETVSSLTGRVANRLNDNAIPRILRWSCNHSPTLAALSREVFSSDMARVTTELVASEEE

SwissProt top hits (hit | e value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e value | % identity | alignment)
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases | e value: 2.3e-08 | % identity: 24.66
Query:  RKRKTAWKLRSPWKDTQEERKKRKAVKYDPLP-----QIPHDLDAPFKRWLDTEDPEDNVRTTAYAVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWG
        RK K    L S         KK K +   P P     ++  D +  ++R L     + ++     A        D+I   K    +V+D +  F  +   
Subjt:  RKRKTAWKLRSPWKDTQEERKKRKAVKYDPLP-----QIPHDLDAPFKRWLDTEDPEDNVRTTAYAVRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWG

Query:  ERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPWSTIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSM
           D+  +K  + D+  + F  +   L+ + +  + P+D  +   SA   L  G GE ++    ++  D VYMP+N    HWV +C+DL+  ++ + DS 
Subjt:  ERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPWSTIDVVYMPYNIGGLHWVLICIDLEVGEVVVSDSM

Query:  VVLNKDEVIEKELRILCQVLPAL
        + L +D  +  EL+ L  +LP L
Subjt:  VVLNKDEVIEKELRILCQVLPAL


Sequences
CDS sequence
ATGCATTGCAAGGTAATTCCAAGGAGCTTAGTCTTTTATAAAACCAAGATTGGGCTAACAAGTGAAGAAAGAAGAGAAAAAGAAAAAAGAAGAGAAAAGAAAGAAAAAAA
AAAAAGGTCGCCGACGGACGGCGGCGGCGGCGAGCGGTGGCGCCGGCCGCCGGTCGCCGGACATAGGAAGAAGAAGAAGGAGGAAGGAGAAGGTGGAGAAGGTGGAGAAG
ATGGAAAAGAAGAAGAAGAAGAGAGAGAAAATCCATTGCACGCCGCTGTTGAACCACGAACTCGAGTAGCCAGCGTCGATCGTTCTGGAACGATCTGGAACGAGCCGTCT
GGAACTCTCACCGTCAATCGAGTAGCGATCGCGATTTCAGGGACGATCTGGAACGAGTCGTCTGAATACGAGATCTGGAACGAGCTGCTTCTTTCCGGCGTCGATGGCGA
TGAATCCCGGCCGGGATGCATGTTGGGGCACCGAGAAAATGGGCCGGGAATGGATATTTTGAATCCGAAAAACGCGATCTCGGCCGGATTTACCATCATGGCATTGGTAC
CAAAGATTGCACCTGCTGACTACTTTCCTGCTGCTTTGACATGTTGTTCACATCTCGGAAAAACCGTTAAAAATATTAAGGACAAATTAACTGATACCCAATTAGGAATG
TTTAGGCAAACATGTTTTGGACATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCATTATTTTCTTTTGAGGGAAGTGAATGAGCCTAGGATTGATGTTAT
TAGCTTTGAGATTCTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTGACCTTATTACTGGAATTAGGCATAGAACCCAACATGTTAGGGGTAATGTATCTAGTACTA
GACTGAGAAGACTGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTTCCATTAATTTTGAGAGCGATGAGGATGCTGTGAAGATGACC
ATATTTTATTTCATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTG
GAGTAAGTTAATTTTTGATAAGACCATAAAGGGACTCAAGAAGACTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGGATGGCAAACAAGAAACGTACAGTCTAT
ATGGCTTCCCATACGCGTTTCAGACATACGAGACAGTATCTTCTTTGACCGGGCGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACGCATATTAAGATGGTCATGT
AACCACTCACCTACACTTGCAGCGCTGAGTCGAGAGGTGTTTTCTTCAGATATGGCTCGGGTCACAACTGAACTTGTGGCCTCAGAAGAGGAGATCCAATTTATGGATCG
TGTGATGCAGCCACCTCGAGCCCCATCTCCACCGCCACCGCCACCTCCACCTCCACCTCCACCTCCACCTCCGCCCCCAGCAGCTTTGGGAGATATTCCAGTTGAAGATA
ATATCGTTGAGGATCTCGGGACTGAGAATCCAAATGAAGTGGCAGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAG
GTGAAGGTGATTAAAGACGATGTGAAGGAGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACCTGAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAA
GGGTAAATTCGTCGACGCCAGCAAGTACATAGAACCAGATGATGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCTTG
ATCCATCCGGGTCACAAGGAAAAGCAGATGACAACACCCCAATGGCTGACCATGAGGATCCGATGGATACAACAGAACAACATGGTGGTGCTGAGGAAGTAACTGAAATA
GGAGAACATGAAGTAATTGAAATAGGCGAACATGTAGAGGCCCCGATAGAGGGCGTGGGAAAGGATATTCATGTTGTCGAAAGTCAACATTCTCTGGGTGTCCAGTCCAT
TTCTGAACAGAACGAGCCGATAGAAAGACAGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAAGTCCATGGAAAGACACACAGGAAGAACGTAAGAAACGCAAGG
CTGTGAAGTACGATCCTCTTCCCCAGATCCCCCACGATCTGGATGCTCCATTCAAAAGATGGCTTGACACTGAGGATCCAGAAGATAATGTTCGGACAACTGCGTATGCT
GTCCGAGACAAGACGTGGTTTCGTGATCTTATCACTCCATCGAAATGGATGATTGATGAGGTGATCGACTCGATTTTCATGTTCATCCAAAAGAAATGGGGAGAACGACC
AGATTTATGCCGCAAGAAGTTCACCATTGGTGACCTATGTGTAACGAATTTTTTCAGACGCGAAGATGGCCTATACAAGGAAATGAGTAGTGGCATACATCCCCGAGACT
TGACGTACGATTGGAGCAGCGCAGGCAATATCTTGAAGTACGGAAAGGGTGAGCTTGCAGACCATAACATACCATGGAGCACAATTGATGTGGTGTACATGCCGTATAAC
ATCGGTGGGCTGCATTGGGTCCTCATATGCATTGACCTCGAGGTGGGGGAGGTCGTTGTATCCGATTCCATGGTGGTGTTGAATAAGGACGAGGTGATTGAGAAGGAGTT
AAGGATCCTTTGCCAAGTCCTGCCAGCTCTGCTTTGGAAGATCGGGGTCATGGATGTGAGGAAGAATCTTCCCGTTCAAAGATGGCCTCTGCGTCGGGAATTGTCAAGGC
CGCAGCAGAATAGTAGTGGCGACTGCGTCTCGTTTGAGAATCTCGGTCTGAAGTTCGGCGGGATCGGGCCGGGAAAACACACAGTGACTGATGAAGTTCCCCTTATCTCG
TTTGAGGATCTCGGTCCAACATTCGGCCGAGCTCAGCGCCGGGAAAATAACAGTGCACTATCCCATCGTAAGACTCTCTCAAAGATCTCGCTTGAGGATCGCGGTCCGAG
GTTCGGCCGGGATCGAGCCGGGAAGAACACAGTGACTGATGAAGTTCCCCTTATGACTCTGTCAAGGTCTCGTTTGAGGATCTCGGTCCGAGGTCCGGCCGGGATCGAGC
CGGGAAGAAACACAGTGACTGATGAAGTTCCCCTTATGACTATGTCAAGGTCTCGTTTGAGGATCTCGGTACGAGGTTCGACCGGGAAAAATAACAGTGAACTATACCAT
CGTAAGACTCTCTTAAAGGTCTCGTTTGAGGATCTCGTTCCGAGGTTCGGCCGGGATCAAGCCGGGAAAAAACACAGTGACTGA
mRNA sequence
ATGCATTGCAAGGTAATTCCAAGGAGCTTAGTCTTTTATAAAACCAAGATTGGGCTAACAAGTGAAGAAAGAAGAGAAAAAGAAAAAAGAAGAGAAAAGAAAGAAAAAAA
AAAAAGGTCGCCGACGGACGGCGGCGGCGGCGAGCGGTGGCGCCGGCCGCCGGTCGCCGGACATAGGAAGAAGAAGAAGGAGGAAGGAGAAGGTGGAGAAGGTGGAGAAG
ATGGAAAAGAAGAAGAAGAAGAGAGAGAAAATCCATTGCACGCCGCTGTTGAACCACGAACTCGAGTAGCCAGCGTCGATCGTTCTGGAACGATCTGGAACGAGCCGTCT
GGAACTCTCACCGTCAATCGAGTAGCGATCGCGATTTCAGGGACGATCTGGAACGAGTCGTCTGAATACGAGATCTGGAACGAGCTGCTTCTTTCCGGCGTCGATGGCGA
TGAATCCCGGCCGGGATGCATGTTGGGGCACCGAGAAAATGGGCCGGGAATGGATATTTTGAATCCGAAAAACGCGATCTCGGCCGGATTTACCATCATGGCATTGGTAC
CAAAGATTGCACCTGCTGACTACTTTCCTGCTGCTTTGACATGTTGTTCACATCTCGGAAAAACCGTTAAAAATATTAAGGACAAATTAACTGATACCCAATTAGGAATG
TTTAGGCAAACATGTTTTGGACATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCATTATTTTCTTTTGAGGGAAGTGAATGAGCCTAGGATTGATGTTAT
TAGCTTTGAGATTCTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTGACCTTATTACTGGAATTAGGCATAGAACCCAACATGTTAGGGGTAATGTATCTAGTACTA
GACTGAGAAGACTGTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCCTTCCATTAATTTTGAGAGCGATGAGGATGCTGTGAAGATGACC
ATATTTTATTTCATTGAGTTGGCTATGATGGGGAGGGAGAGAAAGCAGCAGATGGACACTAGCCTGCTCGGCTTTATTGATGATTGGCAAAGGTTTTGTAATGAGGATTG
GAGTAAGTTAATTTTTGATAAGACCATAAAGGGACTCAAGAAGACTGTAGGTGGGAAGGCAGTGTCCTATAAAGAGAGGACGGATGGCAAACAAGAAACGTACAGTCTAT
ATGGCTTCCCATACGCGTTTCAGACATACGAGACAGTATCTTCTTTGACCGGGCGTGTGGCTAATCGCTTGAATGACAATGCCATTCCACGCATATTAAGATGGTCATGT
AACCACTCACCTACACTTGCAGCGCTGAGTCGAGAGGTGTTTTCTTCAGATATGGCTCGGGTCACAACTGAACTTGTGGCCTCAGAAGAGGAGATCCAATTTATGGATCG
TGTGATGCAGCCACCTCGAGCCCCATCTCCACCGCCACCGCCACCTCCACCTCCACCTCCACCTCCACCTCCGCCCCCAGCAGCTTTGGGAGATATTCCAGTTGAAGATA
ATATCGTTGAGGATCTCGGGACTGAGAATCCAAATGAAGTGGCAGAGGGTGTTGGGACGTCTGGTACGAATGACAGAGTCTGCAAGAGGTGCAAAGTCCTCGAAGACGAG
GTGAAGGTGATTAAAGACGATGTGAAGGAGATTAAGGAGGATTTGAAGGTCATTAAGTCCATGGAAAAAGACCTGAAGGCGATAAGGAAGTTCATGCGTCGACTTTCGAA
GGGTAAATTCGTCGACGCCAGCAAGTACATAGAACCAGATGATGGTACAGACGATGGTGGTGGTGGATCTCGACCACATTCAAAAGGTCAGGATGATGGTGGTGGTCTTG
ATCCATCCGGGTCACAAGGAAAAGCAGATGACAACACCCCAATGGCTGACCATGAGGATCCGATGGATACAACAGAACAACATGGTGGTGCTGAGGAAGTAACTGAAATA
GGAGAACATGAAGTAATTGAAATAGGCGAACATGTAGAGGCCCCGATAGAGGGCGTGGGAAAGGATATTCATGTTGTCGAAAGTCAACATTCTCTGGGTGTCCAGTCCAT
TTCTGAACAGAACGAGCCGATAGAAAGACAGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAAGTCCATGGAAAGACACACAGGAAGAACGTAAGAAACGCAAGG
CTGTGAAGTACGATCCTCTTCCCCAGATCCCCCACGATCTGGATGCTCCATTCAAAAGATGGCTTGACACTGAGGATCCAGAAGATAATGTTCGGACAACTGCGTATGCT
GTCCGAGACAAGACGTGGTTTCGTGATCTTATCACTCCATCGAAATGGATGATTGATGAGGTGATCGACTCGATTTTCATGTTCATCCAAAAGAAATGGGGAGAACGACC
AGATTTATGCCGCAAGAAGTTCACCATTGGTGACCTATGTGTAACGAATTTTTTCAGACGCGAAGATGGCCTATACAAGGAAATGAGTAGTGGCATACATCCCCGAGACT
TGACGTACGATTGGAGCAGCGCAGGCAATATCTTGAAGTACGGAAAGGGTGAGCTTGCAGACCATAACATACCATGGAGCACAATTGATGTGGTGTACATGCCGTATAAC
ATCGGTGGGCTGCATTGGGTCCTCATATGCATTGACCTCGAGGTGGGGGAGGTCGTTGTATCCGATTCCATGGTGGTGTTGAATAAGGACGAGGTGATTGAGAAGGAGTT
AAGGATCCTTTGCCAAGTCCTGCCAGCTCTGCTTTGGAAGATCGGGGTCATGGATGTGAGGAAGAATCTTCCCGTTCAAAGATGGCCTCTGCGTCGGGAATTGTCAAGGC
CGCAGCAGAATAGTAGTGGCGACTGCGTCTCGTTTGAGAATCTCGGTCTGAAGTTCGGCGGGATCGGGCCGGGAAAACACACAGTGACTGATGAAGTTCCCCTTATCTCG
TTTGAGGATCTCGGTCCAACATTCGGCCGAGCTCAGCGCCGGGAAAATAACAGTGCACTATCCCATCGTAAGACTCTCTCAAAGATCTCGCTTGAGGATCGCGGTCCGAG
GTTCGGCCGGGATCGAGCCGGGAAGAACACAGTGACTGATGAAGTTCCCCTTATGACTCTGTCAAGGTCTCGTTTGAGGATCTCGGTCCGAGGTCCGGCCGGGATCGAGC
CGGGAAGAAACACAGTGACTGATGAAGTTCCCCTTATGACTATGTCAAGGTCTCGTTTGAGGATCTCGGTACGAGGTTCGACCGGGAAAAATAACAGTGAACTATACCAT
CGTAAGACTCTCTTAAAGGTCTCGTTTGAGGATCTCGTTCCGAGGTTCGGCCGGGATCAAGCCGGGAAAAAACACAGTGACTGA
Protein sequence
MHCKVIPRSLVFYKTKIGLTSEERREKEKRREKKEKKKRSPTDGGGGERWRRPPVAGHRKKKKEEGEGGEGGEDGKEEEEERENPLHAAVEPRTRVASVDRSGTIWNEPS
GTLTVNRVAIAISGTIWNESSEYEIWNELLLSGVDGDESRPGCMLGHRENGPGMDILNPKNAISAGFTIMALVPKIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLGM
FRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEILGEKVSFGRREFDLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFPSINFESDEDAVKMT
IFYFIELAMMGRERKQQMDTSLLGFIDDWQRFCNEDWSKLIFDKTIKGLKKTVGGKAVSYKERTDGKQETYSLYGFPYAFQTYETVSSLTGRVANRLNDNAIPRILRWSC
NHSPTLAALSREVFSSDMARVTTELVASEEEIQFMDRVMQPPRAPSPPPPPPPPPPPPPPPPPAALGDIPVEDNIVEDLGTENPNEVAEGVGTSGTNDRVCKRCKVLEDE
VKVIKDDVKEIKEDLKVIKSMEKDLKAIRKFMRRLSKGKFVDASKYIEPDDGTDDGGGGSRPHSKGQDDGGGLDPSGSQGKADDNTPMADHEDPMDTTEQHGGAEEVTEI
GEHEVIEIGEHVEAPIEGVGKDIHVVESQHSLGVQSISEQNEPIERQGTRKRKTAWKLRSPWKDTQEERKKRKAVKYDPLPQIPHDLDAPFKRWLDTEDPEDNVRTTAYA
VRDKTWFRDLITPSKWMIDEVIDSIFMFIQKKWGERPDLCRKKFTIGDLCVTNFFRREDGLYKEMSSGIHPRDLTYDWSSAGNILKYGKGELADHNIPWSTIDVVYMPYN
IGGLHWVLICIDLEVGEVVVSDSMVVLNKDEVIEKELRILCQVLPALLWKIGVMDVRKNLPVQRWPLRRELSRPQQNSSGDCVSFENLGLKFGGIGPGKHTVTDEVPLIS
FEDLGPTFGRAQRRENNSALSHRKTLSKISLEDRGPRFGRDRAGKNTVTDEVPLMTLSRSRLRISVRGPAGIEPGRNTVTDEVPLMTMSRSRLRISVRGSTGKNNSELYH
RKTLLKVSFEDLVPRFGRDQAGKKHSD
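
The protein sequence above reads as the conceptual translation of the CDS sequence listed earlier. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the page itself does not say which table was used. The `translate` function and the constant names are hypothetical, introduced only for illustration.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 24 bases of the CDS sequence shown on this page.
print(translate("ATGCATTGCAAGGTAATTCCAAGG"))  # prints: MHCKVIPR
```

Applied to the start of the CDS, the sketch reproduces the first eight residues of the listed protein (MHCKVIPR); passing the full CDS with its line breaks intact should work the same way, since whitespace is stripped before translation.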