CuGenDBv2

Lag0026226 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026226
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposase
Genome location: chr10:32755976..32759358
RNA-Seq Expression: Lag0026226
Synteny: Lag0026226
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily
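
The genome location field uses a `seqid:start..end` layout. The following is a minimal sketch of how such a string could be split into its parts and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which this record does not state; `GenomeLocation` and `parse_location` are illustrative names and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a 'seqid:start..end' string into its three components."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("chr10:32755976..32759358")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3383 bp on chr10.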


Homology
GenBank top hits (e value, %identity, alignment)
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo] (e value: 1.2e-219; %identity: 55.82)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ +G+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IF C++M F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDEP RL  PP KYSHI++K W++FV +RL+ EWE  S AQ+ERR KC+YNHHISRKGYAN A++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RI+ELA + +G+DILTEALGTPE+RGR+RGVGEFV+P+++ NVAR   KLS+Q              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQ--------------

Query:  --PQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDV-----------------------VREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATI
           QS+A +   +++   + Q    S  +KK+KGK V                       V + PE+   G PCHLA+GS+DN+VAVG M+ES  Q  TI
Subjt:  --PQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDV-----------------------VREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL QA+GNFV WPR LVI   +K+ P   A +   QSSK+TDVHVTI+LLNRYAM +M+ ED + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTIN

Query:  MPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML
        + E I GKE +++L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG  HW+L
Subjt:  MPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML

Query:  IAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDE
        I I  +EN VY+++ LRSK+  EF G IN  L+ WQ +HS   YRS I WK +KCPR  GS ECGY+VQKY+RE++ N+ T I  LFNT +A+ Q+EID 
Subjt:  IAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia] (e value: 7.5e-227; %identity: 59.79)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ IG+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF CVE  F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R KCLYNHHISRKGYAN A++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR
        L+ DPSNRAILWKEARKGKN EY D+ T     RI+ELA +++G+DILTEALGT E+ GRVRGVGEFV+PS+Y+NV + KSK  E   ++++   TE S 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR

Query:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
                 +  +KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVG ++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI
        TL Q +G FV WPR LVI  ++K    ++ ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT+ IN+ + I GKE +++L R DIMQYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia] (e value: 1.1e-225; %identity: 59.79)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ IG+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF CVE  F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R KCLYNHHISRKGYAN A++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR
        L+ DPSNRAILWKEARKGKN EY D+ T     RI+ELA +++G+DILTEALGT E+ GRVRGVGEFV+PS+Y+NV + KSK  E   ++++   TE S 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR

Query:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
                 +  +KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVG ++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI
        TL Q +G FV WPR LVI  ++K    ++ ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT+ IN+ + I GKE +++L R DIMQYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] (e value: 2.9e-231; %identity: 59.66)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        M + SSSS DEGNV I+    +   RGPT M  L  +RN+GER TI YN+ GQ +G+NA +MQS+IGVCVRQQIP+TY++WK VP+ELKD IF C++M F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        VVD  SK  ILQSAS+KFRTFK+ LTQ+Y+ P KDEP RL  PP KYSHI++K W++FV +RL+ EWE  S AQ+ERR KC+YNHHISRKGYAN A++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEA---------
        L+ DP NRA LWKEARK KN EY D  T     RI+ELA + +G+DILTEALGTPE+RGR+RGVGEFV+P+++YNVA+ K KL ++ Q+EA         
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEA---------

Query:  ---------------SSIKTEASRRKQPQSDALSAPQKK-SKGKDVVREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVV
                       SS+  + ++RK+ Q       +KK  KGK VV++ PE+   G PCHLA+GS+DNIVAVG M+ES +Q  +I+ +PLG +NVR +V
Subjt:  ---------------SSIKTEASRRKQPQSDALSAPQKK-SKGKDVVREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVV

Query:  DMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFL
        D+V+G+D ALPIP  D+++TL QA+GNFV WPR LVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM SM+ +D + IN+ E+ILGKE +++L
Subjt:  DMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFL

Query:  HREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILN
         R+DI+QYCG  EIGYSCIL YI  LW   D EIT KF +VDQATISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG  HW+LI I  +EN VY+++
Subjt:  HREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILN

Query:  SLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        SLRSK+ EEF G INT L+ WQAKHSL QYR+ I WK +KCPRQ G+ ECGY+VQKYIREI+ NS T I+ LFNT+ A+ Q EID VR+EWA FV  FV
Subjt:  SLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida] (e value: 2.0e-232; %identity: 59.74)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        M + SSSS DEGNV I+    +   RGPT M  L  +RN+GER TI YN+ GQ +G+NA +MQS+IGVCVRQQIP+TY++WK VP+ELKD IF C++M F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        VVD  SK  ILQSAS+KFRTFK+ LTQ+Y+ P KDEP RL  PP KYSHI++K W++FV +RL+ EWE  S AQ+ERR KC+YNHHISRKGYAN A++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEA---------
        L+ DP NRA LWKEARK KN EY D  T     RI+ELA + +G+DILTEALGTPE+RGR+RGVGEFV+P+++YNVA+ K KL ++ Q+EA         
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEA---------

Query:  ---------------SSIKTEASRRKQPQSDALSAPQKK-SKGKDVVREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVV
                       SS+  + ++RK+ Q       +KK  KGK VV++ PE+   G PCHLA+GS+DNIVAVG M+ES +Q  +I+ +PLG +NVR +V
Subjt:  ---------------SSIKTEASRRKQPQSDALSAPQKK-SKGKDVVREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVV

Query:  DMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFL
        D+V+G+D ALPIP  D+++TL QA+GNFV WPR LVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM SM+ +D + IN+ E+ILGKE +++L
Subjt:  DMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEP-PAKAKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFL

Query:  HREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNS
         R+DI+QYCG  EIGYSCIL YI  LW   D EIT KF +VDQATISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG HW+LI I  +EN VY+++S
Subjt:  HREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNS

Query:  LRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        LRSK+ EEF G INT L+ WQAKHSL QYR+ I WK +KCPRQ G+ ECGY+VQKYIREI+ NS T I+ LFNT+ A+ Q EID VR+EWA FV  FV
Subjt:  LRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

TrEMBL top hits (e value, %identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1 (e value: 5.6e-220; %identity: 55.82)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ +G+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IF C++M F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDEP RL  PP KYSHI++K W++FV +RL+ EWE  S AQ+ERR KC+YNHHISRKGYAN A++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RI+ELA + +G+DILTEALGTPE+RGR+RGVGEFV+P+++ NVAR   KLS+Q              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQ--------------

Query:  --PQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDV-----------------------VREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATI
           QS+A +   +++   + Q    S  +KK+KGK V                       V + PE+   G PCHLA+GS+DN+VAVG M+ES  Q  TI
Subjt:  --PQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDV-----------------------VREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL QA+GNFV WPR LVI   +K+ P   A +   QSSK+TDVHVTI+LLNRYAM +M+ ED + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTIN

Query:  MPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML
        + E I GKE +++L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG  HW+L
Subjt:  MPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML

Query:  IAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDE
        I I  +EN VY+++ LRSK+  EF G IN  L+ WQ +HS   YRS I WK +KCPR  GS ECGY+VQKY+RE++ N+ T I  LFNT +A+ Q+EID 
Subjt:  IAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein (e value: 5.6e-220; %identity: 55.82)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ +G+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IF C++M F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDEP RL  PP KYSHI++K W++FV +RL+ EWE  S AQ+ERR KC+YNHHISRKGYAN A++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RI+ELA + +G+DILTEALGTPE+RGR+RGVGEFV+P+++ NVAR   KLS+Q              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQ--------------

Query:  --PQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDV-----------------------VREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATI
           QS+A +   +++   + Q    S  +KK+KGK V                       V + PE+   G PCHLA+GS+DN+VAVG M+ES  Q  TI
Subjt:  --PQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDV-----------------------VREIPEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL QA+GNFV WPR LVI   +K+ P   A +   QSSK+TDVHVTI+LLNRYAM +M+ ED + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKA-KPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTIN

Query:  MPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML
        + E I GKE +++L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG  HW+L
Subjt:  MPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGH-HWML

Query:  IAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDE
        I I  +EN VY+++ LRSK+  EF G IN  L+ WQ +HS   YRS I WK +KCPR  GS ECGY+VQKY+RE++ N+ T I  LFNT +A+ Q+EID 
Subjt:  IAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1 (e value: 3.6e-227; %identity: 59.79)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ IG+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF CVE  F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R KCLYNHHISRKGYAN A++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR
        L+ DPSNRAILWKEARKGKN EY D+ T     RI+ELA +++G+DILTEALGT E+ GRVRGVGEFV+PS+Y+NV + KSK  E   ++++   TE S 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR

Query:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
                 +  +KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVG ++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI
        TL Q +G FV WPR LVI  ++K    ++ ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT+ IN+ + I GKE +++L R DIMQYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3 (e value: 2.1e-214; %identity: 57.58)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ IG+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF CVE  F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R KCLYNHHISRKGYAN A++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR
        L+ DPSNRAILWKEARKGKN EY D+ T     RI+ELA +++G+DILTEALGT E+ GRVRGVGEFV+PS+Y+NV + KSK  E   ++++   TE S 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR

Query:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
                 +  +KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVG ++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI
        TL Q +G FV WPR LVI  ++K    ++ ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT+ IN+ + I GKE +++L R DIMQYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G                       K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2 (e value: 5.2e-226; %identity: 59.79)
Query:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ IG+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF CVE  F
Subjt:  MSEGSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDEP  L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R KCLYNHHISRKGYAN A++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLE

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR
        L+ DPSNRAILWKEARKGKN EY D+ T     RI+ELA +++G+DILTEALGT E+ GRVRGVGEFV+PS+Y+NV + KSK  E   ++++   TE S 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASR

Query:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
                 +  +KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVG ++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDALSAPQKKSKGKDVV---REI---PEDKEAGTPCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI
        TL Q +G FV WPR LVI  ++K    ++ ++   Q SKHTDVHV+I+LLNRY MLSM+ EDT+ IN+ + I GKE +++L R DIMQYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRWLVITVDDKEEPPAK-AKPIVQSSKHTDVHVTIRLLNRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++
Subjt:  LTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRM

Query:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +KCP Q GS ECGY+VQKYIREI+ N++T I+ +FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIEWANFVGGFV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGTGAAGGAAGCAGCAGTAGTGGAGATGAAGGAAATGTGTTTATCCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCTCAGAAATAA
TGGAGAACGGTTGACGATTGTTTACAACAACCAAGGTCAAGCTATTGGAGATAATGCTAACCAGATGCAAAGTTACATAGGGGTTTGTGTGAGACAACAAATCCCAATAA
CATACGAAAACTGGAAGGATGTGCCTAAGGAACTGAAGGATAAAATTTTTTATTGTGTAGAGATGTTGTTCGTGGTGGACCCTAGGTCCAAGAGTAGTATACTTCAATCT
GCGTCTAGAAAATTTCGAACATTCAAGACATACCTAACGCAGAAGTATGTCAATCCATTGAAAGATGAACCAGAGCGTTTGACAACTCCTCCTTCCAAATATTCACACAT
TGAACAAAAGGATTGGAAGACATTTGTTAGTAGTAGACTAACATCAGAGTGGGAGGCGTTAAGTAAGGCTCAGAAAGAAAGAAGAGAGAAATGCTTGTATAATCATCATA
TCTCTCGTAAGGGATATGCAAATCCTGCCAAAGACTTAGAATTGACCGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATAT
TGCGATGAGGTCACTGTAGCACGTGTCAATCGGATTAACGAATTAGCTACATTGAATGAAGGTAAGGACATTTTAACTGAAGCGTTGGGCACGCCAGAAAACAGAGGGCG
TGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTGAGCAACCACAAAGCGAAGCTTCGAGTATCAAGACCG
AAGCCTCTCGACGAAAGCAACCACAAAGCGACGCTTTGAGTGCCCCGCAAAAAAAGTCAAAAGGAAAAGATGTCGTTCGTGAGATACCTGAGGATAAAGAGGCTGGAACA
CCTTGTCACCTAGCGATGGGCTCTATGGATAACATTGTTGCCGTAGGCATAATGTATGAGTCGCCTTCACAAAATGCAACCATCCATGGAGTTCCATTAGGAGTCGAAAA
TGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGTATCAAGCGGTCGGTAATTTTGTGGGATGGC
CTCGCTGGCTTGTTATTACTGTAGACGACAAAGAGGAGCCTCCTGCCAAAGCTAAGCCTATTGTACAATCAAGCAAACATACAGATGTTCATGTTACCATTAGGCTCCTA
AATAGATACGCGATGCTTTCGATGCGACAAGAAGATACACTTACGATCAATATGCCCGAGCGTATCTTGGGAAAGGAAGCATCAGTATTTTTACATCGCGAAGACATCAT
GCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTACCTATGGACTGTACTTGATCCCGAGATAACAAACAAGTTTTTTGTGGTTGATC
AAGCAACAATCTCATCGTACGTGAAGTCTCAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACT
GGTCATCATTGGATGTTGATCGCGATCCAGCCTCGGGAAAACACTGTGTATATATTGAATTCTTTGCGTAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGG
GTTGAGAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTGCCCTCGTCAATCAGGTTCGACAGAGTGTGGGTATTTTG
TGCAAAAATATATAAGAGAAATAATGCACAACTCTACTACCCCTATAACTAAACTTTTTAACACAAAGAATGCATTTACACAAGACGAGATCGACGAGGTTCGTATAGAA
TGGGCTAATTTTGTTGGAGGATTTGTGTAA
mRNA sequence
ATGAGTGAAGGAAGCAGCAGTAGTGGAGATGAAGGAAATGTGTTTATCCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCTCAGAAATAA
TGGAGAACGGTTGACGATTGTTTACAACAACCAAGGTCAAGCTATTGGAGATAATGCTAACCAGATGCAAAGTTACATAGGGGTTTGTGTGAGACAACAAATCCCAATAA
CATACGAAAACTGGAAGGATGTGCCTAAGGAACTGAAGGATAAAATTTTTTATTGTGTAGAGATGTTGTTCGTGGTGGACCCTAGGTCCAAGAGTAGTATACTTCAATCT
GCGTCTAGAAAATTTCGAACATTCAAGACATACCTAACGCAGAAGTATGTCAATCCATTGAAAGATGAACCAGAGCGTTTGACAACTCCTCCTTCCAAATATTCACACAT
TGAACAAAAGGATTGGAAGACATTTGTTAGTAGTAGACTAACATCAGAGTGGGAGGCGTTAAGTAAGGCTCAGAAAGAAAGAAGAGAGAAATGCTTGTATAATCATCATA
TCTCTCGTAAGGGATATGCAAATCCTGCCAAAGACTTAGAATTGACCGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATAT
TGCGATGAGGTCACTGTAGCACGTGTCAATCGGATTAACGAATTAGCTACATTGAATGAAGGTAAGGACATTTTAACTGAAGCGTTGGGCACGCCAGAAAACAGAGGGCG
TGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTGAGCAACCACAAAGCGAAGCTTCGAGTATCAAGACCG
AAGCCTCTCGACGAAAGCAACCACAAAGCGACGCTTTGAGTGCCCCGCAAAAAAAGTCAAAAGGAAAAGATGTCGTTCGTGAGATACCTGAGGATAAAGAGGCTGGAACA
CCTTGTCACCTAGCGATGGGCTCTATGGATAACATTGTTGCCGTAGGCATAATGTATGAGTCGCCTTCACAAAATGCAACCATCCATGGAGTTCCATTAGGAGTCGAAAA
TGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGTATCAAGCGGTCGGTAATTTTGTGGGATGGC
CTCGCTGGCTTGTTATTACTGTAGACGACAAAGAGGAGCCTCCTGCCAAAGCTAAGCCTATTGTACAATCAAGCAAACATACAGATGTTCATGTTACCATTAGGCTCCTA
AATAGATACGCGATGCTTTCGATGCGACAAGAAGATACACTTACGATCAATATGCCCGAGCGTATCTTGGGAAAGGAAGCATCAGTATTTTTACATCGCGAAGACATCAT
GCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTACCTATGGACTGTACTTGATCCCGAGATAACAAACAAGTTTTTTGTGGTTGATC
AAGCAACAATCTCATCGTACGTGAAGTCTCAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACT
GGTCATCATTGGATGTTGATCGCGATCCAGCCTCGGGAAAACACTGTGTATATATTGAATTCTTTGCGTAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGG
GTTGAGAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTGCCCTCGTCAATCAGGTTCGACAGAGTGTGGGTATTTTG
TGCAAAAATATATAAGAGAAATAATGCACAACTCTACTACCCCTATAACTAAACTTTTTAACACAAAGAATGCATTTACACAAGACGAGATCGACGAGGTTCGTATAGAA
TGGGCTAATTTTGTTGGAGGATTTGTGTAA
Protein sequence
MSEGSSSSGDEGNVFIQPRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAIGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFYCVEMLFVVDPRSKSSILQS
ASRKFRTFKTYLTQKYVNPLKDEPERLTTPPSKYSHIEQKDWKTFVSSRLTSEWEALSKAQKERREKCLYNHHISRKGYANPAKDLELTDDPSNRAILWKEARKGKNKEY
CDEVTVARVNRINELATLNEGKDILTEALGTPENRGRVRGVGEFVTPSVYYNVAREKSKLSEQPQSEASSIKTEASRRKQPQSDALSAPQKKSKGKDVVREIPEDKEAGT
PCHLAMGSMDNIVAVGIMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLYQAVGNFVGWPRWLVITVDDKEEPPAKAKPIVQSSKHTDVHVTIRLL
NRYAMLSMRQEDTLTINMPERILGKEASVFLHREDIMQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNT
GHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKCPRQSGSTECGYFVQKYIREIMHNSTTPITKLFNTKNAFTQDEIDEVRIE
WANFVGGFV
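
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `CODON_TABLE` and `translate` are illustrative names, and only the first 36 and last 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 36 and last 30 nucleotides of the CDS shown in this record.
cds_head = "ATGAGTGAAGGAAGCAGCAGTAGTGGAGATGAAGGA"
cds_tail = "TGGGCTAATTTTGTTGGAGGATTTGTGTAA"

print(translate(cds_head))
print(translate(cds_tail))
```

The two fragments translate to the start (`MSEGSSSSGDEG`) and the end (`WANFVGGFV`, followed by a stop codon) of the protein sequence in this record. Passing the full CDS to `translate` would allow the same comparison over the whole protein.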