; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038289 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038289
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold12:38776784..38780980
RNA-Seq ExpressionSpg038289
SyntenySpg038289
Gene Ontology termsNA
InterPro domainsIPR004264 - Transposase, Tnp1/En/Spm-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.8e-18753.02Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        MS+ SSSS DER+V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+E+KDKIF+C+E  F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDE   L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ D SNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q          P 
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ
        + T  ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVGT+++++ Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ

Query:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI
        TL Q +G FV WPRRLVI  ++K    ++ ++   Q SKHT  HV+IKLLNRY + SM+ +DT+ I + + I GKE + +L   DI QYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM
        LTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G                                    L++
Subjt:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]3.4e-18653.02Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        MS+ SSSS DER+V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+E+KDKIF+C+E  F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDE   L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ D SNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q          P 
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ
        + T  ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVGT+++++ Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ

Query:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI
        TL Q +G FV WPRRLVI  ++K    ++ ++   Q SKHT  HV+IKLLNRY + SM+ +DT+ I + + I GKE + +L   DI QYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM
        LTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G                                    L++
Subjt:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]3.9e-19054.88Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        MS+ SSSS DER+V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+E+KDKIF+C+E  F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDE   L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ D SNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q          P 
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ
        + T  ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVGT+++++ Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ

Query:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI
        TL Q +G FV WPRRLVI  ++K    ++ ++   Q SKHT  HV+IKLLNRY + SM+ +DT+ I + + I GKE + +L   DI QYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------------GLRMWQAKHSLPQYRSAISWKLVK---
        LTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G             L++WQAKHS+ +YR+   WK +K   
Subjt:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------------GLRMWQAKHSLPQYRSAISWKLVK---

Query:  -----------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
                                     FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  -----------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]1.1e-18953.36Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        M + SSSS DE NV I+    +   RGPT M  L  +RN+GER TI YN+ GQ VG+NA +MQS+IGVCVRQQIP+TY++WK VP+E+KD IFDCI+M F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFRTFK+ LTQ+Y+ P KDE  RL  PP KYSHI++K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------
        L+ D  NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA         
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------

Query:  ---------------SSVKTEAPRRKTPQSDASSATHKK-SKGKDVVREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVV
                       SSV  +  +RK  Q   +    KK  KGK VV++  E  E G PCHLA+ S+DNIVAVGTM+ES +Q  +I+ +PLG +NVR +V
Subjt:  ---------------SSVKTEAPRRKTPQSDASSATHKK-SKGKDVVREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVV

Query:  DMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEP-PAKAKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFL
        D+V+G+D  LPIP  D+++TL QA+GNFV WPR+LVIT  +K+ P P  +K I QSSK+T  HVTIKLLNRYA+ SM+  D + I + E+ILGKE + +L
Subjt:  DMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEP-PAKAKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFL

Query:  HCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------------------
          +DI QYCG  EIGYSCIL YI  LW   D EIT KF +VDQ TISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG                    
Subjt:  HCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------------------

Query:  ----------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
                         L+ WQAKHSL QYR+ I WK +K                                FNT+ A+ Q EID VR+EWA FV  FV
Subjt:  ----------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]8.6e-19053.44Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        M + SSSS DE NV I+    +   RGPT M  L  +RN+GER TI YN+ GQ VG+NA +MQS+IGVCVRQQIP+TY++WK VP+E+KD IFDCI+M F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFRTFK+ LTQ+Y+ P KDE  RL  PP KYSHI++K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------
        L+ D  NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA         
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEA---------

Query:  ---------------SSVKTEAPRRKTPQSDASSATHKK-SKGKDVVREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVV
                       SSV  +  +RK  Q   +    KK  KGK VV++  E  E G PCHLA+ S+DNIVAVGTM+ES +Q  +I+ +PLG +NVR +V
Subjt:  ---------------SSVKTEAPRRKTPQSDASSATHKK-SKGKDVVREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVV

Query:  DMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEP-PAKAKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFL
        D+V+G+D  LPIP  D+++TL QA+GNFV WPR+LVIT  +K+ P P  +K I QSSK+T  HVTIKLLNRYA+ SM+  D + I + E+ILGKE + +L
Subjt:  DMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEP-PAKAKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFL

Query:  HCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------------------
          +DI QYCG  EIGYSCIL YI  LW   D EIT KF +VDQ TISS+VK QELRS+NL NRL+MV LDQLVLIP+NTG                    
Subjt:  HCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG--------------------

Query:  ---------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
                        L+ WQAKHSL QYR+ I WK +K                                FNT+ A+ Q EID VR+EWA FV  FV
Subjt:  ---------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.9e-18250.49Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        M +  SSS DE NV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+E+KD IFDCI+M F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDE  RL  PP KYSHI++K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ D  NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKTPQSDASSATHKKSKGKDV-----------------------VREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATI
           QS+A +   ++      Q   SS + KK+KGK V                       V +  EN   G PCHLA+ S+DN+VAVG M+ES  Q  TI
Subjt:  --PQSEASSVKTEAPRRKTPQSDASSATHKKSKGKDV-----------------------VREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEPPAKA-KPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTIT
        HG+PLG EN+RV VD+ + +D  LPIP+  +++TL QA+GNFV WPR+LVI   +K+ P   A +   QSSK+T  HVTIKLLNRYA+ +M+ +D + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEPPAKA-KPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTIT

Query:  MPERILGKEASTFLHCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------
        + E I GKE + +L  +DI QYCG  EIGYSCILTYI  LW V + EIT +F LVDQ TISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG      
Subjt:  MPERILGKEASTFLHCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------

Query:  ------------------------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE
                                       L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID 
Subjt:  ------------------------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.9e-18250.49Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        M +  SSS DE NV I+    R   RGPT M  L  +RN+GER TI YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WK+VP+E+KD IFDCI+M F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        VVD  SK  ILQSAS+KFR+FK+ LTQ Y+ P KDE  RL  PP KYSHI++K WE+FV +RL+ EWE  S AQ+ERR +C+YNHHISRKGYANLA++LE
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------
        L+ D  NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHRGR+RGVGEFV+P+++ NVAR   KLSQQ              
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQ--------------

Query:  --PQSEASSVKTEAPRRKTPQSDASSATHKKSKGKDV-----------------------VREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATI
           QS+A +   ++      Q   SS + KK+KGK V                       V +  EN   G PCHLA+ S+DN+VAVG M+ES  Q  TI
Subjt:  --PQSEASSVKTEAPRRKTPQSDASSATHKKSKGKDV-----------------------VREISENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATI

Query:  HGVPLGVENVRVVVDMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEPPAKA-KPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTIT
        HG+PLG EN+RV VD+ + +D  LPIP+  +++TL QA+GNFV WPR+LVI   +K+ P   A +   QSSK+T  HVTIKLLNRYA+ +M+ +D + I+
Subjt:  HGVPLGVENVRVVVDMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEPPAKA-KPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTIT

Query:  MPERILGKEASTFLHCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------
        + E I GKE + +L  +DI QYCG  EIGYSCILTYI  LW V + EIT +F LVDQ TISS++KSQE RSRNL NRL+M +LDQLVLIP+NTG      
Subjt:  MPERILGKEASTFLHCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------

Query:  ------------------------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE
                                       L+ WQ +HS   YRS I WK +K                                FNT +A+ Q+EID 
Subjt:  ------------------------------GLRMWQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDE

Query:  VRIEWANFVGGFV
        VR+EWA FV  FV
Subjt:  VRIEWANFVGGFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X18.7e-18853.02Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        MS+ SSSS DER+V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+E+KDKIF+C+E  F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDE   L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ D SNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q          P 
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ
        + T  ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVGT+++++ Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ

Query:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI
        TL Q +G FV WPRRLVI  ++K    ++ ++   Q SKHT  HV+IKLLNRY + SM+ +DT+ I + + I GKE + +L   DI QYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM
        LTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G                                    L++
Subjt:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X31.9e-19054.88Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        MS+ SSSS DER+V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+E+KDKIF+C+E  F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDE   L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ D SNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q          P 
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ
        + T  ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVGT+++++ Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ

Query:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI
        TL Q +G FV WPRRLVI  ++K    ++ ++   Q SKHT  HV+IKLLNRY + SM+ +DT+ I + + I GKE + +L   DI QYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------------GLRMWQAKHSLPQYRSAISWKLVK---
        LTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G             L++WQAKHS+ +YR+   WK +K   
Subjt:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG------------GLRMWQAKHSLPQYRSAISWKLVK---

Query:  -----------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
                                     FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  -----------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.6e-18653.02Show/hide
Query:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF
        MS+ SSSS DER+V I     +V  RGPTTMH L  +RN G+R TI YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY +WK+VP+E+KDKIF+C+E  F
Subjt:  MSEESSSSGDERNVFIQ---PRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLF

Query:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE
        V+D RSK  ILQSAS+KFRTFK+ LT+ Y+ P KDE   L  PP KY HI+Q+ W +FV++RL+ EWE LS+A KE R +CLYNHHISRKGYANLA++L+
Subjt:  VVDPRSKSSILQSASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLE

Query:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR
        L+ D SNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH GRVRGVGEFV+PS+Y+NV + KSK +Q+ Q          P 
Subjt:  LTDDHSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPR

Query:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ
        + T  ++ S+ + KKSKGK++V    EI    E K  G PCHLA+ S+DNIVAVGT+++++ Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++
Subjt:  RKTPQSDASSATHKKSKGKDVV---REI---SENKEAGTPCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQ

Query:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI
        TL Q +G FV WPRRLVI  ++K    ++ ++   Q SKHT  HV+IKLLNRY + SM+ +DT+ I + + I GKE + +L   DI QYC  +EIGYSCI
Subjt:  TLYQAVGNFVGWPRRLVITVDDKEEPPAK-AKPIVQSSKHTKAHVTIKLLNRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCI

Query:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM
        LTYI YLW V + EIT KF +VD  TIS YVKSQE R RNL+NRL+MV+L+QLVLIP+ +G                                    L++
Subjt:  LTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNTG-----------------------------------GLRM

Query:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV
        WQAKHS+ +YR+   WK +K                                FNTKNA+ Q+EIDEVRIEWA+FVGG V
Subjt:  WQAKHSLPQYRSAISWKLVK--------------------------------FNTKNAFTQDEIDEVRIEWANFVGGFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGAAGAAAGTAGTAGTAGTGGAGATGAAAGAAATGTGTTTATCCAACCACGAGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCTAAGAAATAA
TGGAGAACGCTTGACGATTGTTTACAACAATCAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAAAGTTACATAGGGGTTTGTGTGAGACAACAAATCCCAATAA
CATACGAAAACTGGAAGGATGTGCCTAAGGAAATGAAGGATAAAATTTTTGATTGTATAGAGATGTTGTTCGTGGTGGACCCTAGGTCCAAGAGTAGTATACTTCAATCT
GCGTCTAGAAAATTTCGAACATTCAAGACATACCTAACGCAGAAGTATGTCAATCCATTAAAAGATGAATCAGAGCGTTTGGCAACTCCTCCTTCTAAATACTCACACAT
TGAACAAAAGGATTGGGAGACATTTGTGAGTAGTAGACTAACATCAGAGTGGGAGGCGTTAAGTAAGGCTCAGAAAGAAAGACGAGAGAGATGCTTGTATAATCATCATA
TCTCTCGTAAGGGATATGCAAATCTTGCCAAAGACTTAGAATTGACCGACGATCATTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAAGGAAAAAATAAAGAATAT
TGCGATGAGGTCACTGTAGCACGTGTCAATCGAATTGACGAATTAGCTGCACTAAATGAAGGTAAGGACATCTTAACTGAAGCGTTGGGCACGCCAGAACACAGAGGGCG
TGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAACGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACCGCAAAGCGAAGCTTCGAGTGTGAAGACCG
AAGCCCCTCGCCGAAAGACACCACAAAGCGACGCTTCGAGTGCCACACATAAGAAGTCAAAAGGAAAAGATGTTGTTCGTGAGATATCTGAGAATAAAGAGGCTGGAACA
CCTTGTCACCTAGCGATGGTCTCTATGGATAACATTGTTGCCGTAGGCACAATGTATGAGTCGCATTCACAAAATGCAACCATCCATGGAGTTCCATTAGGAGTAGAAAA
TGTTCGCGTTGTGGTGGACATGGTGATAGGTGATGATTGTGAATTACCGATTCCTGTGAACGATGAACTACAAACGTTGTATCAAGCGGTCGGTAATTTTGTGGGATGGC
CTCGCAGACTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTGCCAAAGCTAAGCCCATAGTACAATCGAGCAAACACACAAAGGCCCATGTTACTATTAAGCTCCTA
AATAGATATGCGGTGTTTTCGATGAGACAAAAAGATACACTAACGATCACAATGCCCGAGCGTATCTTGGGAAAGGAAGCATCGACATTTTTACATTGCGAAGACATCCA
ACAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTACCTTTGGACTGTACTTGATCCCGAGATAACAAACAAGTTTTTTCTGGTTGATC
AAACAACAATCTCATCGTACGTGAAGTCTCAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACT
GGGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTTTAACACAAAGAATGCATTTACACAAGACGAGAT
TGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGAAGAAAGTAGTAGTAGTGGAGATGAAAGAAATGTGTTTATCCAACCACGAGTGCCAGGACGAGGGCCTACTACGATGCATAGATTGGCACGCCTAAGAAATAA
TGGAGAACGCTTGACGATTGTTTACAACAATCAAGGTCAAGCTGTTGGAGATAATGCTAACCAGATGCAAAGTTACATAGGGGTTTGTGTGAGACAACAAATCCCAATAA
CATACGAAAACTGGAAGGATGTGCCTAAGGAAATGAAGGATAAAATTTTTGATTGTATAGAGATGTTGTTCGTGGTGGACCCTAGGTCCAAGAGTAGTATACTTCAATCT
GCGTCTAGAAAATTTCGAACATTCAAGACATACCTAACGCAGAAGTATGTCAATCCATTAAAAGATGAATCAGAGCGTTTGGCAACTCCTCCTTCTAAATACTCACACAT
TGAACAAAAGGATTGGGAGACATTTGTGAGTAGTAGACTAACATCAGAGTGGGAGGCGTTAAGTAAGGCTCAGAAAGAAAGACGAGAGAGATGCTTGTATAATCATCATA
TCTCTCGTAAGGGATATGCAAATCTTGCCAAAGACTTAGAATTGACCGACGATCATTCCAATCGTGCAATTCTATGGAAGGAAGCACGAAAAGGAAAAAATAAAGAATAT
TGCGATGAGGTCACTGTAGCACGTGTCAATCGAATTGACGAATTAGCTGCACTAAATGAAGGTAAGGACATCTTAACTGAAGCGTTGGGCACGCCAGAACACAGAGGGCG
TGTAAGGGGAGTGGGCGAGTTCGTAACGCCCTCTGTGTACTACAACGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACCGCAAAGCGAAGCTTCGAGTGTGAAGACCG
AAGCCCCTCGCCGAAAGACACCACAAAGCGACGCTTCGAGTGCCACACATAAGAAGTCAAAAGGAAAAGATGTTGTTCGTGAGATATCTGAGAATAAAGAGGCTGGAACA
CCTTGTCACCTAGCGATGGTCTCTATGGATAACATTGTTGCCGTAGGCACAATGTATGAGTCGCATTCACAAAATGCAACCATCCATGGAGTTCCATTAGGAGTAGAAAA
TGTTCGCGTTGTGGTGGACATGGTGATAGGTGATGATTGTGAATTACCGATTCCTGTGAACGATGAACTACAAACGTTGTATCAAGCGGTCGGTAATTTTGTGGGATGGC
CTCGCAGACTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTGCCAAAGCTAAGCCCATAGTACAATCGAGCAAACACACAAAGGCCCATGTTACTATTAAGCTCCTA
AATAGATATGCGGTGTTTTCGATGAGACAAAAAGATACACTAACGATCACAATGCCCGAGCGTATCTTGGGAAAGGAAGCATCGACATTTTTACATTGCGAAGACATCCA
ACAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTACCTTTGGACTGTACTTGATCCCGAGATAACAAACAAGTTTTTTCTGGTTGATC
AAACAACAATCTCATCGTACGTGAAGTCTCAAGAACTTCGTTCTAGAAATCTATCTAACAGACTAGATATGGTTGATTTGGATCAACTAGTTCTCATTCCCTTTAACACT
GGGGGGTTGAGAATGTGGCAAGCTAAGCACTCGCTTCCTCAATATCGATCTGCTATTAGTTGGAAACTAGTGAAGTTTAACACAAAGAATGCATTTACACAAGACGAGAT
TGACGAGGTTCGTATAGAATGGGCAAATTTTGTTGGAGGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MSEESSSSGDERNVFIQPRVPGRGPTTMHRLARLRNNGERLTIVYNNQGQAVGDNANQMQSYIGVCVRQQIPITYENWKDVPKEMKDKIFDCIEMLFVVDPRSKSSILQS
ASRKFRTFKTYLTQKYVNPLKDESERLATPPSKYSHIEQKDWETFVSSRLTSEWEALSKAQKERRERCLYNHHISRKGYANLAKDLELTDDHSNRAILWKEARKGKNKEY
CDEVTVARVNRIDELAALNEGKDILTEALGTPEHRGRVRGVGEFVTPSVYYNVAREKSKLSQQPQSEASSVKTEAPRRKTPQSDASSATHKKSKGKDVVREISENKEAGT
PCHLAMVSMDNIVAVGTMYESHSQNATIHGVPLGVENVRVVVDMVIGDDCELPIPVNDELQTLYQAVGNFVGWPRRLVITVDDKEEPPAKAKPIVQSSKHTKAHVTIKLL
NRYAVFSMRQKDTLTITMPERILGKEASTFLHCEDIQQYCGNVEIGYSCILTYITYLWTVLDPEITNKFFLVDQTTISSYVKSQELRSRNLSNRLDMVDLDQLVLIPFNT
GGLRMWQAKHSLPQYRSAISWKLVKFNTKNAFTQDEIDEVRIEWANFVGGFV