; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027267 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027267
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold8:8119422..8123188
RNA-Seq ExpressionSpg027267
SyntenySpg027267
Gene Ontology termsNA
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]4.1e-17348.95Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IFDCI++  
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--

Query:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FR+FK+ LTQ Y+ P KDEP RL  PP KYSHID+K WE FV +RL+ EWE                               E
Subjt:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDV-----------------------VREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTI
           QS+A +   ++    + Q   SS + K +KGK V                       V + PEN   G PCHLA+GS+DN+V VG M+ES  Q  TI
Subjt:  --PQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDV-----------------------VREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAI NFV WPRKLVI   +K+ P + A +   QSSK+TDVHVTI+LLNRYAM  MQ ED + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------
        + E I GKE +I+L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG      
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------

Query:  ---QPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDE
             +EN VY+++ LRSK+  EF G IN  L  WQ +HS   YRS I WK +K                                 NT +A+ Q+EID 
Subjt:  ---QPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]7.6e-17550.81Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E   
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---

Query:  --------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W  FV++RL+ EWE                               +
Subjt:  --------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q   S+       
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
             ++ S+ + K SKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q I  FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY ML MQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G          REN VY+L+SLR K++E++   INT L +
Subjt:  LTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIM

Query:  WQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +K                                 NTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]5.8e-17550.88Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E   
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---

Query:  -------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------EL
                      ++FRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W  FV++RL+ EWE                               +L
Subjt:  -------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------EL

Query:  TDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRR
        + DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q   S+        
Subjt:  TDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRR

Query:  KQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQT
            ++ S+ + K SKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++T
Subjt:  KQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQT

Query:  LHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCIL
        L+Q I  FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY ML MQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCIL
Subjt:  LHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCIL

Query:  TYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIMW
        TYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G          REN VY+L+SLR K++E++   INT L +W
Subjt:  TYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIMW

Query:  QAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        QAKHS+ +YR+   WK +K                                 NTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  QAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]8.3e-18252.07Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--
        M + SSSS DEGNV I+    +   RGPT M  L  +RN+GER TI YN+ GQ VG+NA +MQS+IGVCVRQQIP+TY++WK VP+ELKD IFDCI++  
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--

Query:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FRTFK+ LTQ+Y+ P KDEP RL  PP KYSHID+K WE FV +RL+ EWE                               E
Subjt:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------
        L+ DP NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA         
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------

Query:  ---------------SSVKTEAPRRKQPQSDAS-SATHKMSKGKDVVREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVV
                       SSV  +  +RK+ Q   +     K+ KGK VV++ PE    G PCHLA+GS+DNIV VGTM+ES +Q  +I+ +PLG +NVR +V
Subjt:  ---------------SSVKTEAPRRKQPQSDAS-SATHKMSKGKDVVREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVV

Query:  DMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFL
        D+V+G+D ALPIP  D+++TL QAI NFV WPRKLVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM  MQ +D + IN+ E+ILGKE +I+L
Subjt:  DMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFL

Query:  NREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG---------QPRENTVYILN
         R+DI+QYCG  EIGYSCIL YI  LW   D EIT KF +VDQATISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG           +EN VY+++
Subjt:  NREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG---------QPRENTVYILN

Query:  SLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        SLRSK+ EEF G INT L  WQAKHSL QYR+ I WK +K                                 NT+ A+ Q EID VR+EWA FV  FV
Subjt:  SLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]6.4e-18252.15Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--
        M + SSSS DEGNV I+    +   RGPT M  L  +RN+GER TI YN+ GQ VG+NA +MQS+IGVCVRQQIP+TY++WK VP+ELKD IFDCI++  
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--

Query:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FRTFK+ LTQ+Y+ P KDEP RL  PP KYSHID+K WE FV +RL+ EWE                               E
Subjt:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------
        L+ DP NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA         
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------

Query:  ---------------SSVKTEAPRRKQPQSDAS-SATHKMSKGKDVVREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVV
                       SSV  +  +RK+ Q   +     K+ KGK VV++ PE    G PCHLA+GS+DNIV VGTM+ES +Q  +I+ +PLG +NVR +V
Subjt:  ---------------SSVKTEAPRRKQPQSDAS-SATHKMSKGKDVVREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVV

Query:  DMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFL
        D+V+G+D ALPIP  D+++TL QAI NFV WPRKLVIT  +K+ P P  +K I QSSK+TDVHVTI+LLNRYAM  MQ +D + IN+ E+ILGKE +I+L
Subjt:  DMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFL

Query:  NREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNS
         R+DI+QYCG  EIGYSCIL YI  LW   D EIT KF +VDQATISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG          +EN VY+++S
Subjt:  NREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNS

Query:  LRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        LRSK+ EEF G INT L  WQAKHSL QYR+ I WK +K                                 NT+ A+ Q EID VR+EWA FV  FV
Subjt:  LRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X12.0e-17348.95Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IFDCI++  
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--

Query:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FR+FK+ LTQ Y+ P KDEP RL  PP KYSHID+K WE FV +RL+ EWE                               E
Subjt:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDV-----------------------VREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTI
           QS+A +   ++    + Q   SS + K +KGK V                       V + PEN   G PCHLA+GS+DN+V VG M+ES  Q  TI
Subjt:  --PQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDV-----------------------VREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAI NFV WPRKLVI   +K+ P + A +   QSSK+TDVHVTI+LLNRYAM  MQ ED + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------
        + E I GKE +I+L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG      
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------

Query:  ---QPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDE
             +EN VY+++ LRSK+  EF G IN  L  WQ +HS   YRS I WK +K                                 NT +A+ Q+EID 
Subjt:  ---QPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein2.0e-17348.95Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--
        M +  SSS DEGNV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+ELKD IFDCI++  
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEV--

Query:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FR+FK+ LTQ Y+ P KDEP RL  PP KYSHID+K WE FV +RL+ EWE                               E
Subjt:  ---------------QEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDV-----------------------VREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTI
           QS+A +   ++    + Q   SS + K +KGK V                       V + PEN   G PCHLA+GS+DN+V VG M+ES  Q  TI
Subjt:  --PQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDV-----------------------VREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QAI NFV WPRKLVI   +K+ P + A +   QSSK+TDVHVTI+LLNRYAM  MQ ED + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTIN

Query:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------
        + E I GKE +I+L R+DI+QYCG  EIGYSCILTYI  LW V + EIT +F +VDQATISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG      
Subjt:  MHERILGKEASIFLNREDIMQYCGNVEIGYSCILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------

Query:  ---QPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDE
             +EN VY+++ LRSK+  EF G IN  L  WQ +HS   YRS I WK +K                                 NT +A+ Q+EID 
Subjt:  ---QPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X13.7e-17550.81Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E   
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---

Query:  --------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W  FV++RL+ EWE                               +
Subjt:  --------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q   S+       
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
             ++ S+ + K SKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q I  FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY ML MQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIM
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G          REN VY+L+SLR K++E++   INT L +
Subjt:  LTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIM

Query:  WQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +K                                 NTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X33.0e-16950.07Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E   
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---

Query:  --------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E
                       ++FRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W  FV++RL+ EWE                               +
Subjt:  --------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q   S+       
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ
             ++ S+ + K SKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQ

Query:  TLHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCI
        TL+Q I  FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY ML MQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCI
Subjt:  TLHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCI

Query:  LTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGQPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLP
        LTYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G               K++E++   INT L +WQAKHS+ 
Subjt:  LTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGQPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLP

Query:  QYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        +YR+   WK +K                                 NTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  QYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X22.8e-17550.88Show/hide
Query:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---
        MS+ SSSS DE +V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+ELKDKIF+C+E   
Subjt:  MSEQSSSSGDEGNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIE---

Query:  -------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------EL
                      ++FRTFK+ LT+ Y+ P KDEP  L  PP KY HIDQ+ W  FV++RL+ EWE                               +L
Subjt:  -------------VQEFRTFKTYLTQKYVNPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWE-------------------------------EL

Query:  TDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRR
        + DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q   S+        
Subjt:  TDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRR

Query:  KQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQT
            ++ S+ + K SKGK++V    EI    E K  G PCHLA+ S+DNIV VGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++T
Subjt:  KQPQSDASSATHKMSKGKDVV---REI---PENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQT

Query:  LHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCIL
        L+Q I  FV WPR+LVI  ++K     + ++   Q SKHTDVHV+I+LLNRY ML MQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCIL
Subjt:  LHQAICNFVGWPRKLVITVDDKEEPPVK-AKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYSCIL

Query:  TYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIMW
        TYI YLW V + EIT KF +VD ATIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G          REN VY+L+SLR K++E++   INT L +W
Subjt:  TYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------QPRENTVYILNSLRSKVEEEFSGTINTGLIMW

Query:  QAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV
        QAKHS+ +YR+   WK +K                                 NTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  QAKHSLPQYRSAISWKLVKS--------------------------------NTKNAFTQDEIDEVRIEWANFVGGFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGAACAAAGCAGTAGCAGTGGAGATGAAGGAAATGTGTTTATTCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCTCAGAAATAA
TGGAGAACGCTTGACGATTGTTTACAACAACCAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAGAGTTACATAGGGGTTTGTGTGAGACAACAAATCCCAATAA
CATACGAAAACTGGAAGGATGTGCCTAAGGAACTAAAGGATAAGATTTTTGATTGTATAGAGGTCCAAGAATTTCGAACATTCAAGACGTACTTAACGCAGAAGTATGTC
AATCCATTAAAAGATGAACCAGAGCGCTTGGCAACTCCTCCTTCCAAATATTCACACATTGATCAAAAGGATTGGGAGATATTTGTTAGTAGTAGACTAACATCAGAGTG
GGAGGAATTAACAGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATATTGCGATGAGGTCACTGTAGCACGTGTCAATCGAA
TTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCTTGACTGAAGCGTTGGGCACGCCAGAACACAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCT
GTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACCACAAAGCGAAGCTTCGAGTGTCAAGACCGAAGCCCCTCGACGAAAGCAACCACAAAGCGACGC
TTCGAGTGCCACGCATAAAATGTCAAAAGGAAAAGATGTTGTTCGTGAGATACCTGAGAATAAAGAGGCTGGAACACCTTGTCACCTAGCGATGGGCTCTATGGATAACA
TTGTTATCGTAGGCACAATGTACGAGTCGCCTTCACAAAATACAACCATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGAT
GATTGTGCATTACCGATTCCTGTGAACGATGAACTACAGACGTTGCATCAAGCGATCTGTAATTTTGTGGGATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGA
GGAGCCTCCTGTCAAAGCTAAGCCCATAGTACAATCAAGCAAACATACAGATGTCCATGTTACTATTAGGCTCTTAAATAGATACGCGATGCTTTTGATGCAACAAGAAG
ATACACTAACGATCAATATGCACGAGCGTATCTTGGGAAAGGAAGCATCAATATTTTTAAATCGCGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCA
TGCATACTCACGTACATTACGTACCTCTGGACTGTACTTGATCTCGAGATAACAAACAAGTTTTTTGTGGTTGATCAAGCAACAATCTCATCGTACGTGAAGTCTCAAGA
ACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACTGGGCAACCTCGGGAAAACACTGTGTATATATTGA
ATTCTTTGCGCAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGGGTTGATAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGG
AAACTAGTGAAGTCTAACACAAAGAATGCATTTACACAAGACGAGATCGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGAACAAAGCAGTAGCAGTGGAGATGAAGGAAATGTGTTTATTCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCTCAGAAATAA
TGGAGAACGCTTGACGATTGTTTACAACAACCAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAGAGTTACATAGGGGTTTGTGTGAGACAACAAATCCCAATAA
CATACGAAAACTGGAAGGATGTGCCTAAGGAACTAAAGGATAAGATTTTTGATTGTATAGAGGTCCAAGAATTTCGAACATTCAAGACGTACTTAACGCAGAAGTATGTC
AATCCATTAAAAGATGAACCAGAGCGCTTGGCAACTCCTCCTTCCAAATATTCACACATTGATCAAAAGGATTGGGAGATATTTGTTAGTAGTAGACTAACATCAGAGTG
GGAGGAATTAACAGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATATTGCGATGAGGTCACTGTAGCACGTGTCAATCGAA
TTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCTTGACTGAAGCGTTGGGCACGCCAGAACACAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCT
GTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACCACAAAGCGAAGCTTCGAGTGTCAAGACCGAAGCCCCTCGACGAAAGCAACCACAAAGCGACGC
TTCGAGTGCCACGCATAAAATGTCAAAAGGAAAAGATGTTGTTCGTGAGATACCTGAGAATAAAGAGGCTGGAACACCTTGTCACCTAGCGATGGGCTCTATGGATAACA
TTGTTATCGTAGGCACAATGTACGAGTCGCCTTCACAAAATACAACCATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGAT
GATTGTGCATTACCGATTCCTGTGAACGATGAACTACAGACGTTGCATCAAGCGATCTGTAATTTTGTGGGATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGA
GGAGCCTCCTGTCAAAGCTAAGCCCATAGTACAATCAAGCAAACATACAGATGTCCATGTTACTATTAGGCTCTTAAATAGATACGCGATGCTTTTGATGCAACAAGAAG
ATACACTAACGATCAATATGCACGAGCGTATCTTGGGAAAGGAAGCATCAATATTTTTAAATCGCGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCA
TGCATACTCACGTACATTACGTACCTCTGGACTGTACTTGATCTCGAGATAACAAACAAGTTTTTTGTGGTTGATCAAGCAACAATCTCATCGTACGTGAAGTCTCAAGA
ACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACTGGGCAACCTCGGGAAAACACTGTGTATATATTGA
ATTCTTTGCGCAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGGGTTGATAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGG
AAACTAGTGAAGTCTAACACAAAGAATGCATTTACACAAGACGAGATCGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MSEQSSSSGDEGNVFIQPRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKELKDKIFDCIEVQEFRTFKTYLTQKYV
NPLKDEPERLATPPSKYSHIDQKDWEIFVSSRLTSEWEELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPS
VYYNVAREKSKLSQQPQSEASSVKTEAPRRKQPQSDASSATHKMSKGKDVVREIPENKEAGTPCHLAMGSMDNIVIVGTMYESPSQNTTIHGVPLGVENVRVVVDMVIGD
DCALPIPVNDELQTLHQAICNFVGWPRKLVITVDDKEEPPVKAKPIVQSSKHTDVHVTIRLLNRYAMLLMQQEDTLTINMHERILGKEASIFLNREDIMQYCGNVEIGYS
CILTYITYLWTVLDLEITNKFFVVDQATISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTGQPRENTVYILNSLRSKVEEEFSGTINTGLIMWQAKHSLPQYRSAISW
KLVKSNTKNAFTQDEIDEVRIEWANFVGGFV