; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019532 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019532
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold1:38884527..38894033
RNA-Seq ExpressionSpg019532
SyntenySpg019532
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004264 - Transposase, Tnp1/En/Spm-like
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]5.1e-13840.43Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        +S+ SSSS DE +V I     +V  RGPTTM+ L C+RN GKR TI YN++GQ +G+NA +MQS+IGV VRQ+IP+TY +WK+VP+ELKD+IF+C+E   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T  C  RIDELAA+++G+DIL EALGT EH GRVRGVGEFV+PS+Y++V + KSK +Q+ Q   S+ +   P 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW
        +K+ +       H++     + K  G PCHLA+ S+DNIVAVGT++++  +  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q +G FV W
Subjt:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW

Query:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR
        PRRLVI  ++K + +                H +I+LLNRY MLSMQ EDT+ IN+ + I G E +++L R DIMQYC  +EIGYSCILTYI        
Subjt:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR

Query:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH
                                YLW V + EIT  F +VD                                             ATIS YVKSQE  
Subjt:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH

Query:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK-----------------
        +RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++WQAKHS+ +YR+   WK +K                 
Subjt:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK-----------------

Query:  ---------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
                       FNTKNA+ Q+EIDEVRIEW +FVGG V
Subjt:  ---------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]3.9e-13840.49Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        +S+ SSSS DE +V I     +V  RGPTTM+ L C+RN GKR TI YN++GQ +G+NA +MQS+IGV VRQ+IP+TY +WK+VP+ELKD+IF+C+E   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  --------------------------------------------------------------------------------------------------EL
                                                                                                          +L
Subjt:  --------------------------------------------------------------------------------------------------EL

Query:  TDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPRR
        + DPSNRAILWKEARKGKN EY D+ T  C  RIDELAA+++G+DIL EALGT EH GRVRGVGEFV+PS+Y++V + KSK +Q+ Q   S+ +   P +
Subjt:  TDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPRR

Query:  KQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWP
        K+ +       H++     + K  G PCHLA+ S+DNIVAVGT++++  +  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q +G FV WP
Subjt:  KQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWP

Query:  RRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRG
        RRLVI  ++K + +                H +I+LLNRY MLSMQ EDT+ IN+ + I G E +++L R DIMQYC  +EIGYSCILTYI         
Subjt:  RRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRG

Query:  GGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHY
                               YLW V + EIT  F +VD                                             ATIS YVKSQE  +
Subjt:  GGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHY

Query:  RNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK------------------
        RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++WQAKHS+ +YR+   WK +K                  
Subjt:  RNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK------------------

Query:  --------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
                      FNTKNA+ Q+EIDEVRIEW +FVGG V
Subjt:  --------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]2.5e-12940.85Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        +S+ SSSS DE +V I     +V  RGPTTM+ L C+RN GKR TI YN++GQ +G+NA +MQS+IGV VRQ+IP+TY +WK+VP+ELKD+IF+C+E   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T  C  RIDELAA+++G+DIL EALGT EH GRVRGVGEFV+PS+Y++V + KSK +Q+ Q   S+ +   P 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW
        +K+ +       H++     + K  G PCHLA+ S+DNIVAVGT++++  +  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q +G FV W
Subjt:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW

Query:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR
        PRRLVI  ++K + +                H +I+LLNRY MLSMQ EDT+ IN+ + I G E +++L R DIMQYC  +EIGYSCILTYI        
Subjt:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR

Query:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH
                                YLW V + EIT  F +VD                                             ATIS YVKSQE  
Subjt:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH

Query:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK
        +RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++WQAKHS+ +YR+   WK +K
Subjt:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]5.9e-13439.97Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        + + SSSS DEGNV I+    +   RGPT M  L  +RN+G+R TI YN+ GQ VG+NA +MQS+IGV VRQQIP+TY++WK VP+ELKD IFDCI+   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR
        L+ DP NRA LWKEARK KN EY D  T  C  RIDELAA+ +G+DIL EALGTPEHRGR+RGVGEFV+P+++Y+VA+ K KL Q+ Q+EA + +++   
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSD-------TSSATHKKSKGKDV---------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVD
          Q   D        SS   KK+K K V                           G PCHLA+GS+DNIVAVGTM+ES ++  +I+ +PLG +NVR +VD
Subjt:  RKQPQSD-------TSSATHKKSKGKDV---------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVD

Query:  MVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLH
        +V+G+D ALPIP  D+++TL QA+GNFV WPR+LVIT  +K+  +                H TI+LLNRYAM SMQ +D I IN+ ++ILG E +++L 
Subjt:  MVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLH

Query:  REDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSI
        R+DI+QYCG  EIGYSCIL YI                                 LW   D EIT  F +VD                            
Subjt:  REDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSI

Query:  DRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHS
                        QATISS+VK QEL  +NL NRL+MV LDQLVLIP+NTG  HW+LI I  +EN VY+++SLRSK+ EEF G INT L+ WQAKHS
Subjt:  DRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHS

Query:  LPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
        L QYR+ I WK +K                                FNT+ A+ Q EID VR+EW  FV  FV
Subjt:  LPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]4.1e-13540.03Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        + + SSSS DEGNV I+    +   RGPT M  L  +RN+G+R TI YN+ GQ VG+NA +MQS+IGV VRQQIP+TY++WK VP+ELKD IFDCI+   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR
        L+ DP NRA LWKEARK KN EY D  T  C  RIDELAA+ +G+DIL EALGTPEHRGR+RGVGEFV+P+++Y+VA+ K KL Q+ Q+EA + +++   
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSD-------TSSATHKKSKGKDV---------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVD
          Q   D        SS   KK+K K V                           G PCHLA+GS+DNIVAVGTM+ES ++  +I+ +PLG +NVR +VD
Subjt:  RKQPQSD-------TSSATHKKSKGKDV---------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVD

Query:  MVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLH
        +V+G+D ALPIP  D+++TL QA+GNFV WPR+LVIT  +K+  +                H TI+LLNRYAM SMQ +D I IN+ ++ILG E +++L 
Subjt:  MVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLH

Query:  REDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSI
        R+DI+QYCG  EIGYSCIL YI                                 LW   D EIT  F +VD                            
Subjt:  REDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSI

Query:  DRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSL
                        QATISS+VK QEL  +NL NRL+MV LDQLVLIP+NTG HW+LI I  +EN VY+++SLRSK+ EEF G INT L+ WQAKHSL
Subjt:  DRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSL

Query:  PQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
         QYR+ I WK +K                                FNT+ A+ Q EID VR+EW  FV  FV
Subjt:  PQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.5e-12737.94Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        + +  SSS DEGNV I+    R   RGPT M  L  +RN+G+R TI YN++GQ VG+NA +MQS+IGV VRQQIP+TY +WK+VP+ELKD IFDCI+   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T  CV RIDELAA+ +G+DIL EALGTPEHRGR+RGVGEFV+P+++ +VAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKQPQSDTSSATHKKSKGKDV---------------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATI
           QS+A +   ++    + Q   SS + KK+KGK V                                 G PCHLA+GS+DN+VAVG M+ES  +  TI
Subjt:  --PQSEASSVKTEAPRRKQPQSDTSSATHKKSKGKDV---------------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QA+GNFV WPR+LVI   +K+  +                H TI+LLNRYAM +MQ ED I I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITIN

Query:  MHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAIL
        + + I G E +++L R+DI+QYCG  EIGYSCILTYI                                 LW V + EIT  F +VD             
Subjt:  MHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAIL

Query:  DLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFS
                                       QATISS++KSQE   RNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRSK+  EF 
Subjt:  DLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFS

Query:  GTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
        G IN  L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID VR+EW  FV  FV
Subjt:  GTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.5e-12737.94Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        + +  SSS DEGNV I+    R   RGPT M  L  +RN+G+R TI YN++GQ VG+NA +MQS+IGV VRQQIP+TY +WK+VP+ELKD IFDCI+   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQ--------------
        L+ DP NRA LWKEARK KN    D+ T  CV RIDELAA+ +G+DIL EALGTPEHRGR+RGVGEFV+P+++ +VAR   KLSQQ              
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKQPQSDTSSATHKKSKGKDV---------------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATI
           QS+A +   ++    + Q   SS + KK+KGK V                                 G PCHLA+GS+DN+VAVG M+ES  +  TI
Subjt:  --PQSEASSVKTEAPRRKQPQSDTSSATHKKSKGKDV---------------------------------GTPCHLAMGSMDNIVAVGTMYESPSENATI

Query:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITIN
        HG+PLG EN+RV VD+ + +D ALPIP+  +++TL+QA+GNFV WPR+LVI   +K+  +                H TI+LLNRYAM +MQ ED I I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWPRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITIN

Query:  MHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAIL
        + + I G E +++L R+DI+QYCG  EIGYSCILTYI                                 LW V + EIT  F +VD             
Subjt:  MHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAIL

Query:  DLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFS
                                       QATISS++KSQE   RNL NRL+M +LDQLVLIP+NTG  HW+LI I  +EN VY+++ LRSK+  EF 
Subjt:  DLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGH-HWMLIAIQPRENTVYILNSLRSKVEEEFS

Query:  GTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
        G IN  L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID VR+EW  FV  FV
Subjt:  GTINTGLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.5e-13840.43Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        +S+ SSSS DE +V I     +V  RGPTTM+ L C+RN GKR TI YN++GQ +G+NA +MQS+IGV VRQ+IP+TY +WK+VP+ELKD+IF+C+E   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T  C  RIDELAA+++G+DIL EALGT EH GRVRGVGEFV+PS+Y++V + KSK +Q+ Q   S+ +   P 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW
        +K+ +       H++     + K  G PCHLA+ S+DNIVAVGT++++  +  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q +G FV W
Subjt:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW

Query:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR
        PRRLVI  ++K + +                H +I+LLNRY MLSMQ EDT+ IN+ + I G E +++L R DIMQYC  +EIGYSCILTYI        
Subjt:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR

Query:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH
                                YLW V + EIT  F +VD                                             ATIS YVKSQE  
Subjt:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH

Query:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK-----------------
        +RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++WQAKHS+ +YR+   WK +K                 
Subjt:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK-----------------

Query:  ---------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
                       FNTKNA+ Q+EIDEVRIEW +FVGG V
Subjt:  ---------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.2e-12940.85Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        +S+ SSSS DE +V I     +V  RGPTTM+ L C+RN GKR TI YN++GQ +G+NA +MQS+IGV VRQ+IP+TY +WK+VP+ELKD+IF+C+E   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR
        L+ DPSNRAILWKEARKGKN EY D+ T  C  RIDELAA+++G+DIL EALGT EH GRVRGVGEFV+PS+Y++V + KSK +Q+ Q   S+ +   P 
Subjt:  LTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW
        +K+ +       H++     + K  G PCHLA+ S+DNIVAVGT++++  +  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q +G FV W
Subjt:  RKQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGW

Query:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR
        PRRLVI  ++K + +                H +I+LLNRY MLSMQ EDT+ IN+ + I G E +++L R DIMQYC  +EIGYSCILTYI        
Subjt:  PRRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGR

Query:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH
                                YLW V + EIT  F +VD                                             ATIS YVKSQE  
Subjt:  GGGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELH

Query:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK
        +RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++WQAKHS+ +YR+   WK +K
Subjt:  YRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.9e-13840.49Show/hide
Query:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---
        +S+ SSSS DE +V I     +V  RGPTTM+ L C+RN GKR TI YN++GQ +G+NA +MQS+IGV VRQ+IP+TY +WK+VP+ELKD+IF+C+E   
Subjt:  VSEQSSSSGDEGNVFIQ---PRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQQIPITYENWKDVPKELKDEIFDCIE---

Query:  --------------------------------------------------------------------------------------------------EL
                                                                                                          +L
Subjt:  --------------------------------------------------------------------------------------------------EL

Query:  TDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPRR
        + DPSNRAILWKEARKGKN EY D+ T  C  RIDELAA+++G+DIL EALGT EH GRVRGVGEFV+PS+Y++V + KSK +Q+ Q   S+ +   P +
Subjt:  TDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLSQQPQSEASSVKTEAPRR

Query:  KQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWP
        K+ +       H++     + K  G PCHLA+ S+DNIVAVGT++++  +  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+Q +G FV WP
Subjt:  KQPQSDTSSATHKK----SKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFVGWP

Query:  RRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRG
        RRLVI  ++K + +                H +I+LLNRY MLSMQ EDT+ IN+ + I G E +++L R DIMQYC  +EIGYSCILTYI         
Subjt:  RRLVITVDDKEVYA----------------HATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRG

Query:  GGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHY
                               YLW V + EIT  F +VD                                             ATIS YVKSQE  +
Subjt:  GGGVMYYALEVFELMNTCIIFFRYLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHY

Query:  RNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK------------------
        RNL+NRL+MV+L+QLVLIP+ +G HWMLI I  REN VY+L+SLR K++E++   INT L++WQAKHS+ +YR+   WK +K                  
Subjt:  RNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQPRENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVK------------------

Query:  --------------FNTKNAFTQDEIDEVRIEWTNFVGGFV
                      FNTKNA+ Q+EIDEVRIEW +FVGG V
Subjt:  --------------FNTKNAFTQDEIDEVRIEWTNFVGGFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATCCCACAATCCGTTCCAAGGCCTGCGGATAGTAGAGAAGATCCAAGTGGTAGTCCAAAAGTTGTTCGTGTCAATTATAAAGCATCGGATTATACGAATTGGGAGCTTCA
AGAGATTGTTACAGTGAGTGAACAAAGTAGTAGCAGTGGAGATGAAGGAAATGTGTTTATTCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGAATCGATTGGCAT
GCCTCAGAAATAATGGAAAACGCTGGACAATTGTTTACAACAACAAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAGAGTTACATAGGGGTTGGTGTGAGACAA
CAAATCCCAATAACATACGAAAACTGGAAGGATGTGCCTAAGGAACTGAAGGATGAGATTTTTGATTGTATAGAGGAATTGACCGACGATCCTTCCAATCGTGCAATTCT
ATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATATTGCGATGAGGTCACTGTAGCATGTGTCAATCGAATTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCT
TGATTGAAGCGTTGGGCACGCCAGAACACAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAGTGTTGCAAGAGAGAAGTCAAAATTGAGT
CAGCAACCACAAAGTGAAGCTTCGAGTGTCAAGACCGAAGCCCCTCGACGAAAGCAACCACAAAGCGACACTTCGAGTGCCACACATAAAAAGTCAAAAGGAAAAGATGT
TGGAACACCTTGTCACCTAGCGATGGGCTCTATGGATAACATTGTTGCCGTAGGCACAATGTACGAGTCACCTTCAGAAAATGCAACCATCCATGGAGTTCCATTAGGAG
TCGAAAATGTTCGAGTTGTGGTGGACATGGTGATAGGTGATGATTGTGCATTACCGATTCCTGTAAACGATGAACTACAAACATTGCATCAAGCGGTCGGTAATTTTGTG
GGATGGCCTCGCAGGCTTGTTATTACTGTAGATGACAAAGAGGTATATGCCCATGCTACTATTAGGCTCCTAAATAGATACGCGATGCTTTCGATGCAACAAGAAGATAC
AATAACGATCAATATGCACCAGCGTATCTTGGGAAATGAAGCATCAGTATTTTTACATCGCGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCA
TACTCACGTACATTACTTTAGTTGAGTTTGGAGGGAGAGGGGGGGGGGGGGTGATGTATTATGCTTTGGAAGTGTTTGAGTTGATGAATACTTGTATTATTTTCTTTAGG
TACCTCTGGACTGTACTTGATCCCGAGATAACAAACATTTTTTTTGTGGTGGATCATGTTGGTGTTAATATCACACATGCGGAAGCAATTTTGGATCTTAGGCACACAGC
CTTGGAACGGATTGTAGGATCCTCAATTGATAGGAATTGGAAAGGAAAGCTTATGGGTATAAGCAAGCTTGAGAATCAAGCAACAATCTCATCGTACGTGAAGTCTCAAG
AACTTCATTATAGAAATCTCTCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACTGGTCATCATTGGATGTTGATCGCCATCCAGCCT
CGGGAAAATACTGTGTATATATTGAATTCTTTGCGCAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCTTCC
TCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTTTAACACAAAGAATGCATTTACACAAGACGAGATCGACGAGGTTCGTATAGAATGGACAAATTTTGTTGGAG
GATTTGTAAAACTTGTTGATCGTGTCTTTTCGTGGGCCGCTTCCGTCGAAGGAGCGTTGCGTCGGCAGCAATTGATGGCGCCGGCAGCAATTGATGTCGCCGGAGATGGC
CCAAGGCGAGAGGAAATCGAAGGATGGAGAAGGGGAGACGCGACTAGGTGTGTGTGTTCTGCCGAGAGGGAAAAAGTGTACTGA
mRNA sequenceShow/hide mRNA sequence
ATCCCACAATCCGTTCCAAGGCCTGCGGATAGTAGAGAAGATCCAAGTGGTAGTCCAAAAGTTGTTCGTGTCAATTATAAAGCATCGGATTATACGAATTGGGAGCTTCA
AGAGATTGTTACAGTGAGTGAACAAAGTAGTAGCAGTGGAGATGAAGGAAATGTGTTTATTCAACCACGGGTGCCAGGACGAGGGCCTACTACGATGAATCGATTGGCAT
GCCTCAGAAATAATGGAAAACGCTGGACAATTGTTTACAACAACAAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAGAGTTACATAGGGGTTGGTGTGAGACAA
CAAATCCCAATAACATACGAAAACTGGAAGGATGTGCCTAAGGAACTGAAGGATGAGATTTTTGATTGTATAGAGGAATTGACCGACGATCCTTCCAATCGTGCAATTCT
ATGGAAGGAAGCACGAAAGGGAAAAAATAAAGAATATTGCGATGAGGTCACTGTAGCATGTGTCAATCGAATTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCT
TGATTGAAGCGTTGGGCACGCCAGAACACAGAGGGCGTGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAGTGTTGCAAGAGAGAAGTCAAAATTGAGT
CAGCAACCACAAAGTGAAGCTTCGAGTGTCAAGACCGAAGCCCCTCGACGAAAGCAACCACAAAGCGACACTTCGAGTGCCACACATAAAAAGTCAAAAGGAAAAGATGT
TGGAACACCTTGTCACCTAGCGATGGGCTCTATGGATAACATTGTTGCCGTAGGCACAATGTACGAGTCACCTTCAGAAAATGCAACCATCCATGGAGTTCCATTAGGAG
TCGAAAATGTTCGAGTTGTGGTGGACATGGTGATAGGTGATGATTGTGCATTACCGATTCCTGTAAACGATGAACTACAAACATTGCATCAAGCGGTCGGTAATTTTGTG
GGATGGCCTCGCAGGCTTGTTATTACTGTAGATGACAAAGAGGTATATGCCCATGCTACTATTAGGCTCCTAAATAGATACGCGATGCTTTCGATGCAACAAGAAGATAC
AATAACGATCAATATGCACCAGCGTATCTTGGGAAATGAAGCATCAGTATTTTTACATCGCGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCA
TACTCACGTACATTACTTTAGTTGAGTTTGGAGGGAGAGGGGGGGGGGGGGTGATGTATTATGCTTTGGAAGTGTTTGAGTTGATGAATACTTGTATTATTTTCTTTAGG
TACCTCTGGACTGTACTTGATCCCGAGATAACAAACATTTTTTTTGTGGTGGATCATGTTGGTGTTAATATCACACATGCGGAAGCAATTTTGGATCTTAGGCACACAGC
CTTGGAACGGATTGTAGGATCCTCAATTGATAGGAATTGGAAAGGAAAGCTTATGGGTATAAGCAAGCTTGAGAATCAAGCAACAATCTCATCGTACGTGAAGTCTCAAG
AACTTCATTATAGAAATCTCTCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACTGGTCATCATTGGATGTTGATCGCCATCCAGCCT
CGGGAAAATACTGTGTATATATTGAATTCTTTGCGCAGTAAAGTTGAAGAAGAGTTTAGTGGAACTATAAATACGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCTTCC
TCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTTTAACACAAAGAATGCATTTACACAAGACGAGATCGACGAGGTTCGTATAGAATGGACAAATTTTGTTGGAG
GATTTGTAAAACTTGTTGATCGTGTCTTTTCGTGGGCCGCTTCCGTCGAAGGAGCGTTGCGTCGGCAGCAATTGATGGCGCCGGCAGCAATTGATGTCGCCGGAGATGGC
CCAAGGCGAGAGGAAATCGAAGGATGGAGAAGGGGAGACGCGACTAGGTGTGTGTGTTCTGCCGAGAGGGAAAAAGTGTACTGA
Protein sequenceShow/hide protein sequence
IPQSVPRPADSREDPSGSPKVVRVNYKASDYTNWELQEIVTVSEQSSSSGDEGNVFIQPRVPGRGPTTMNRLACLRNNGKRWTIVYNNKGQAVGDNANQMQSYIGVGVRQ
QIPITYENWKDVPKELKDEIFDCIEELTDDPSNRAILWKEARKGKNKEYCDEVTVACVNRIDELAALNEGKDILIEALGTPEHRGRVRGVGEFVTPSVYYSVAREKSKLS
QQPQSEASSVKTEAPRRKQPQSDTSSATHKKSKGKDVGTPCHLAMGSMDNIVAVGTMYESPSENATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQAVGNFV
GWPRRLVITVDDKEVYAHATIRLLNRYAMLSMQQEDTITINMHQRILGNEASVFLHREDIMQYCGNVEIGYSCILTYITLVEFGGRGGGGVMYYALEVFELMNTCIIFFR
YLWTVLDPEITNIFFVVDHVGVNITHAEAILDLRHTALERIVGSSIDRNWKGKLMGISKLENQATISSYVKSQELHYRNLSNRLDMVDLDQLVLIPFNTGHHWMLIAIQP
RENTVYILNSLRSKVEEEFSGTINTGLRMWQAKHSLPQYRSAISWKLVKFNTKNAFTQDEIDEVRIEWTNFVGGFVKLVDRVFSWAASVEGALRRQQLMAPAAIDVAGDG
PRREEIEGWRRGDATRCVCSAEREKVY