; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032861 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032861
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold11:15999456..16002700
RNA-Seq ExpressionSpg032861
SyntenySpg032861
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004264 - Transposase, Tnp1/En/Spm-like
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]1.4e-20154Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IFDCI+M F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDEP RL  PP KYSHID+K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRQKQPQSDASSATHKKSKGKD------------VVREIPENKEVNYY------------YPSMGSMDNIVAVGTMYESPSHNATI
           QS+A +   ++  + + Q   SS + KK+KGK             VV+E  E  EV               + ++GS+DN+VAVG M+ES     TI
Subjt:  --PQSEASSVKTEAPRQKQPQSDASSATHKKSKGKD------------VVREIPENKEVNYY------------YPSMGSMDNIVAVGTMYESPSHNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAIGNFV WPRKLVI   +K+ P + A +   QSSK+TDVHVTI+LLNRYAM +M+ ED   I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTIN

Query:  MHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML
        + E I GKE +I+L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATIS ++KS+E RSRNL NRL+M +LDQLVLIP+NTG  HW+L
Subjt:  MHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML

Query:  IAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE
        I I  +EN V +++ LRSK+  EF G IN  L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID 
Subjt:  IAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE

Query:  VRIEWANFVAGFV
        VR+EWA FVA FV
Subjt:  VRIEWANFVAGFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]2.9e-20455.67Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E  F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KS             KT+  +
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
          +  ++ S+ + KKSKGK++V    EI     +  E    + ++ S+DNIVAVGT++++     T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT  IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKS+E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN V +L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FV G V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]4.2e-20355.67Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E  F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KS             KT+  +
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
          +  ++ S+ + KKSKGK++V    EI     +  E    + ++ S+DNIVAVGT++++     T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT  IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKS+E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN V +L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FV G V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]9.4e-21156.88Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        M + SSSS DEGNV I+    +   RGPT M  L  +RN+GER TI YN+ GQ VG+NA +MQS+IGVCVRQQIP+TY++WK VP+ELKD IFDCI+M F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFRTFK+ LTQ+Y+ P KDEP RL  PP KYSHID+K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DP NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA + +++   
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSD-------ASSATHKKS------------------KGKDVVREIPENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVD
        + Q   D        SS   KK+                  KGK VV++  E  E    + ++GS+DNIVAVGTM+ES +   +I+ +PLG +NVR +VD
Subjt:  QKQPQSD-------ASSATHKKS------------------KGKDVVREIPENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVD

Query:  MVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PIKAKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLN
        +V+G+D ALPIP  D+++TL QAIGNFV WPRKLVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM SM+ +D   IN+ E+I+GKE +I+L 
Subjt:  MVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PIKAKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLN

Query:  REDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVNILNS
        R+DI+QYCG  EIGYSCIL YI  LW   D EIT KF +VDQATIS +VK +ELRS+NL NRL+MV LDQLVLIP+NTG  HW+LI I  +EN V +++S
Subjt:  REDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVNILNS

Query:  LRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV
        LRSK+ EEF G INT L+ WQAKHSL QYR+ I WK +K                                FNT+ A+ Q EID VR+EWA FVA FV
Subjt:  LRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]6.5e-21256.96Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        M + SSSS DEGNV I+    +   RGPT M  L  +RN+GER TI YN+ GQ VG+NA +MQS+IGVCVRQQIP+TY++WK VP+ELKD IFDCI+M F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFRTFK+ LTQ+Y+ P KDEP RL  PP KYSHID+K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DP NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA + +++   
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSD-------ASSATHKKS------------------KGKDVVREIPENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVD
        + Q   D        SS   KK+                  KGK VV++  E  E    + ++GS+DNIVAVGTM+ES +   +I+ +PLG +NVR +VD
Subjt:  QKQPQSD-------ASSATHKKS------------------KGKDVVREIPENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVD

Query:  MVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PIKAKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLN
        +V+G+D ALPIP  D+++TL QAIGNFV WPRKLVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM SM+ +D   IN+ E+I+GKE +I+L 
Subjt:  MVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEP-PIKAKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLN

Query:  REDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSL
        R+DI+QYCG  EIGYSCIL YI  LW   D EIT KF +VDQATIS +VK +ELRS+NL NRL+MV LDQLVLIP+NTG HW+LI I  +EN V +++SL
Subjt:  REDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSL

Query:  RSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV
        RSK+ EEF G INT L+ WQAKHSL QYR+ I WK +K                                FNT+ A+ Q EID VR+EWA FVA FV
Subjt:  RSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X16.6e-20254Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IFDCI+M F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDEP RL  PP KYSHID+K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRQKQPQSDASSATHKKSKGKD------------VVREIPENKEVNYY------------YPSMGSMDNIVAVGTMYESPSHNATI
           QS+A +   ++  + + Q   SS + KK+KGK             VV+E  E  EV               + ++GS+DN+VAVG M+ES     TI
Subjt:  --PQSEASSVKTEAPRQKQPQSDASSATHKKSKGKD------------VVREIPENKEVNYY------------YPSMGSMDNIVAVGTMYESPSHNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAIGNFV WPRKLVI   +K+ P + A +   QSSK+TDVHVTI+LLNRYAM +M+ ED   I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTIN

Query:  MHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML
        + E I GKE +I+L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATIS ++KS+E RSRNL NRL+M +LDQLVLIP+NTG  HW+L
Subjt:  MHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML

Query:  IAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE
        I I  +EN V +++ LRSK+  EF G IN  L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID 
Subjt:  IAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE

Query:  VRIEWANFVAGFV
        VR+EWA FVA FV
Subjt:  VRIEWANFVAGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein6.6e-20254Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IFDCI+M F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDEP RL  PP KYSHID+K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRQKQPQSDASSATHKKSKGKD------------VVREIPENKEVNYY------------YPSMGSMDNIVAVGTMYESPSHNATI
           QS+A +   ++  + + Q   SS + KK+KGK             VV+E  E  EV               + ++GS+DN+VAVG M+ES     TI
Subjt:  --PQSEASSVKTEAPRQKQPQSDASSATHKKSKGKD------------VVREIPENKEVNYY------------YPSMGSMDNIVAVGTMYESPSHNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAIGNFV WPRKLVI   +K+ P + A +   QSSK+TDVHVTI+LLNRYAM +M+ ED   I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTIN

Query:  MHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML
        + E I GKE +I+L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATIS ++KS+E RSRNL NRL+M +LDQLVLIP+NTG  HW+L
Subjt:  MHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML

Query:  IAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE
        I I  +EN V +++ LRSK+  EF G IN  L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID 
Subjt:  IAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE

Query:  VRIEWANFVAGFV
        VR+EWA FVA FV
Subjt:  VRIEWANFVAGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X11.4e-20455.67Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E  F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KS             KT+  +
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
          +  ++ S+ + KKSKGK++V    EI     +  E    + ++ S+DNIVAVGT++++     T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT  IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKS+E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN V +L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FV G V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X45.4e-19657.58Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E  F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KS             KT+  +
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
          +  ++ S+ + KKSKGK++V    EI     +  E    + ++ S+DNIVAVGT++++     T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT  IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKS+E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN V +L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVK
        WQAKHS+ +YR+   WK +K
Subjt:  WQAKHSLPQYRSAISWKLVK

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X22.0e-20355.67Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E  F
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KS             KT+  +
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
          +  ++ S+ + KKSKGK++V    EI     +  E    + ++ S+DNIVAVGT++++     T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  QKQPQSDASSATHKKSKGKDVV---REI----PENKEVNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q IG FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT  IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAIGNFVGWPRKLVITVDDKEEPPIK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKS+E R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN V +L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FV G V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVAGFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACTGTTACAATGAGTGAACAAAGCAGTAGCAGTGGAGATGAAGGAAATGTGTTTATTCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCT
CAGAAATAATGGAGAACGCTTGACGATTGTTTACAACAACCAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAGAGTTACATAGGGGTTTGTGTGAGACAACAAA
TCCCAATAACATACGAAAATTGGAAGGATGTGCCTAAGGAACTAAAGGATAAGATTTTTGATTGTATAGAGATGTTGTTCGTGGTGGACCCTAGGTCCAAGAGTAGTATA
CTTCAATCTGCGTCTAGAAAATTTCGAACATTCAAGACGTACTTAACGCAGAAGTATGTCAATCCATTGAAAGATGAACCAAAGCGCTTGGCAACTCCTCCTTCCAAATA
TTCACACATTGATCAAAAGGATTGGGAGACATTTGTTAGTAGCAGACTAACATCAGAGTGGGAGGCGTTAAGTAAGGCTCAGAAAGAAAGACGAGAGAGATGCTTGTATA
ACCATCATATCTCTCGTAAGGGATATGCAAATCTTGCCAAAGACTTAGAATTGACAGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGGAAGGGAAAAAAT
AAAGAATATTGCGATGAGGTCACTGTAGCACGTGTCAATCGAATTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCTTGACTGAAGCGTTGGGCACGCCAGAACA
CAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACCACAAAGCGAAGCTTCGAGTG
TCAAGACCGAAGCCCCTCGACAAAAGCAACCACAAAGCGACGCTTCGAGTGCCACGCATAAAAAGTCAAAAGGAAAAGATGTCGTTCGTGAGATACCTGAGAATAAAGAG
GTAAATTATTATTATCCTTCGATGGGCTCTATGGATAACATTGTTGCCGTAGGCACAATGTACGAGTCGCCTTCACACAATGCAACCATCCATGGAGTTCCATTAGGAGT
CGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGCATCAAGCGATCGGTAATTTTGTGG
GATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTATCAAAGCTAAGCCCATAGTACAATCAAGCAAACATACAGATGTCCATGTTACTATTAGG
CTCTTAAATAGATACGCGATGCTTTCGATGAAACAAGAAGATACACCAACGATCAATATGCACGAGCGTATCGTGGGAAAGGAAGCATCAATATTTTTAAATCGCGAAGA
CATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTACCTCTGGACTGTACTTGATCCCGAGATAACAAACAAGTTTTTTGTGG
TTGATCAAGCAACAATCTCATTGTACGTGAAGTCTGAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTT
AACACTGGTCATCATTGGATGTTGATCGCGATCCAGCCTCGGGAAAACACTGTGAATATATTGAATTCTTTGCGCAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAA
TACGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTTTAACACAAAGAATGCATTTACACAAGACGAGA
TCGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGCCGGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ACTGTTACAATGAGTGAACAAAGCAGTAGCAGTGGAGATGAAGGAAATGTGTTTATTCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCT
CAGAAATAATGGAGAACGCTTGACGATTGTTTACAACAACCAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAGAGTTACATAGGGGTTTGTGTGAGACAACAAA
TCCCAATAACATACGAAAATTGGAAGGATGTGCCTAAGGAACTAAAGGATAAGATTTTTGATTGTATAGAGATGTTGTTCGTGGTGGACCCTAGGTCCAAGAGTAGTATA
CTTCAATCTGCGTCTAGAAAATTTCGAACATTCAAGACGTACTTAACGCAGAAGTATGTCAATCCATTGAAAGATGAACCAAAGCGCTTGGCAACTCCTCCTTCCAAATA
TTCACACATTGATCAAAAGGATTGGGAGACATTTGTTAGTAGCAGACTAACATCAGAGTGGGAGGCGTTAAGTAAGGCTCAGAAAGAAAGACGAGAGAGATGCTTGTATA
ACCATCATATCTCTCGTAAGGGATATGCAAATCTTGCCAAAGACTTAGAATTGACAGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGGAAGGGAAAAAAT
AAAGAATATTGCGATGAGGTCACTGTAGCACGTGTCAATCGAATTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCTTGACTGAAGCGTTGGGCACGCCAGAACA
CAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACCACAAAGCGAAGCTTCGAGTG
TCAAGACCGAAGCCCCTCGACAAAAGCAACCACAAAGCGACGCTTCGAGTGCCACGCATAAAAAGTCAAAAGGAAAAGATGTCGTTCGTGAGATACCTGAGAATAAAGAG
GTAAATTATTATTATCCTTCGATGGGCTCTATGGATAACATTGTTGCCGTAGGCACAATGTACGAGTCGCCTTCACACAATGCAACCATCCATGGAGTTCCATTAGGAGT
CGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGCATCAAGCGATCGGTAATTTTGTGG
GATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTATCAAAGCTAAGCCCATAGTACAATCAAGCAAACATACAGATGTCCATGTTACTATTAGG
CTCTTAAATAGATACGCGATGCTTTCGATGAAACAAGAAGATACACCAACGATCAATATGCACGAGCGTATCGTGGGAAAGGAAGCATCAATATTTTTAAATCGCGAAGA
CATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTACCTCTGGACTGTACTTGATCCCGAGATAACAAACAAGTTTTTTGTGG
TTGATCAAGCAACAATCTCATTGTACGTGAAGTCTGAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTT
AACACTGGTCATCATTGGATGTTGATCGCGATCCAGCCTCGGGAAAACACTGTGAATATATTGAATTCTTTGCGCAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAA
TACGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTTTAACACAAAGAATGCATTTACACAAGACGAGA
TCGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGCCGGATTTGTGTAA
Protein sequenceShow/hide protein sequence
TVTMSEQSSSSGDEGNVFIQPRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEMLFVVDPRSKSSI
LQSASRKFRTFKTYLTQKYVNPLKDEPKRLATPPSKYSHIDQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLELTDDPSNRAILWKEARKGKN
KEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRQKQPQSDASSATHKKSKGKDVVREIPENKE
VNYYYPSMGSMDNIVAVGTMYESPSHNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAIGNFVGWPRKLVITVDDKEEPPIKAKPIVQSSKHTDVHVTIR
LLNRYAMLSMKQEDTPTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISLYVKSEELRSRNLSNRLDMVDLDQLVLIPF
NTGHHWMLIAIQPRENTVNILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKFNTKNAFTQDEIDEVRIEWANFVAGFV