; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001135 (gene) of Snake gourd v1 genome

Gene IDTan0001135
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein HAIKU1-like
Genome locationLG01:25731177..25733372
RNA-Seq ExpressionTan0001135
SyntenyTan0001135
Gene Ontology termsGO:0009960 - endosperm development (biological process)
GO:0080113 - regulation of seed growth (biological process)
InterPro domainsIPR008889 - VQ
IPR039612 - VQ motif-containing protein 5/9/14
IPR039825 - VQ motif-containing protein 5/14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577702.1 Protein HAIKU1, partial [Cucurbita argyrosperma subsp. sororia]6.6e-15788.89Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QDPQ PRAPQNPPKPQS+RLQRIRPPPLTPINRP+MP+
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS

Query:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLLPNP
        PIP PVPVPPPQ L+ NNVPRPAQFAQPPPRQLPPLAPGGDS+W NP AESPISAYMRYLQNSMMNPSP+ANQ   +PQPQIPGQMH PQAPPSGLLPNP
Subjt:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLLPNP

Query:  N------PPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPP
        N      PPVPALP  RLNGP PPVPNFPSP+WN PALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFP+ PQSGILGPGPHPPP
Subjt:  N------PPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPP

Query:  SPGVMFPLSPSGIFPIFSPRWRDQ
        SPGVMFPLSPSG FPI SPRWRDQ
Subjt:  SPGVMFPLSPSGIFPIFSPRWRDQ

KAG7015745.1 Protein HAIKU1, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-15788.34Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QDPQ PRAPQNPPKPQS+RLQRIRPPPLTPINRP+MP+
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS

Query:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL---
        PIP PVPVPPPQ L++NNVPRPAQFAQPPPRQLPPLAPGGDS+W NP AESPISAYMRYLQNSMMNPSP+ANQ   +PQPQIPGQMH PQAPPSGLL   
Subjt:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL---

Query:  -----PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHP
             PNPNPPVPALP  RLNGP PPVPNFPSP+WN PALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFP+ PQSGILGPGPHP
Subjt:  -----PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHP

Query:  PPSPGVMFPLSPSGIFPIFSPRWRDQ
        PPSPGVMFPLSPSG FPI SPRWRDQ
Subjt:  PPSPGVMFPLSPSGIFPIFSPRWRDQ

XP_008448685.1 PREDICTED: protein HAIKU1-like [Cucumis melo]5.1e-15788.54Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QD Q  PR PQNPPK QSMRLQRIRPPPLTPINRP++P
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP

Query:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL--
        +PIP PVPVPPPQ +VNNNVPRP QFAQPPPRQLPP+A GGDSHW NP AESPISAYMRYLQNSMMNPSPV NQ Q VPQPQIPGQ+H P APPSGLL  
Subjt:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL--

Query:  --PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPS
          PNPNPPVPALPSPRLNGP PP+PNFPSPHWNGPALLPSPTSQFLLPSPTGYYN+LSPKSPYPLLSPGIQF+PPLTPNFAFP+MPQSGILGPGPHPPPS
Subjt:  --PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPS

Query:  PGVMFPLSPSGIFPIFSPRWRDQ
        PGV+FPLSPSGIFPI SPRWRDQ
Subjt:  PGVMFPLSPSGIFPIFSPRWRDQ

XP_022923451.1 protein HAIKU1 isoform X1 [Cucurbita moschata]1.9e-15688.04Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QDPQ PR PQNPPKPQS+RLQRIRPPPLTPINRP+MP+
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS

Query:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL---
        PIP PVPVPPPQ L+ NNVPRPAQFAQPPPRQLPPLAPGGDS+W NP AESPISAYMRYLQNSMMNPSP+ANQ   +PQPQIPGQMH PQAPPSGLL   
Subjt:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL---

Query:  -----PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHP
             PNPNPPVPALP  RLNGP PPVPNFPSP+WN PALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFP+ PQSGILGPGPHP
Subjt:  -----PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHP

Query:  PPSPGVMFPLSPSGIFPIFSPRWRDQ
        PPSPGVMFPLSPSG FPI SPRWRDQ
Subjt:  PPSPGVMFPLSPSGIFPIFSPRWRDQ

XP_038905684.1 protein HAIKU1-like [Benincasa hispida]5.1e-15789.03Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP
        MDRNR NENLGVNKMGKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QD Q  PR PQNPPKPQSMRLQRIRPPPLTPINR +MP
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP

Query:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLLPN
        +PIP PVP+PPPQ LVNNNVPRP  FAQPPPRQ PP+APGGDS+W NP AESPISAYMRYLQNSMMNPSPVANQ Q VPQPQIPGQMH P  PPSGLLPN
Subjt:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLLPN

Query:  PNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVM
        PNPPVP LPSPR+NGP PPVPNFPSPHWNGP LLPSPTSQFLLPSPTG+YNLLSPKSPYPLLSPGIQFSPPLTPNFAFP+MPQSGILGPGPHPPPSPGVM
Subjt:  PNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVM

Query:  FPLSPSGIFPIFSPRWRDQ
        FPLSPSG+F + SPRWRDQ
Subjt:  FPLSPSGIFPIFSPRWRDQ

TrEMBL top hitse value%identityAlignment
A0A1S3BKA6 protein HAIKU1-like2.5e-15788.54Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QD Q  PR PQNPPK QSMRLQRIRPPPLTPINRP++P
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP

Query:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL--
        +PIP PVPVPPPQ +VNNNVPRP QFAQPPPRQLPP+A GGDSHW NP AESPISAYMRYLQNSMMNPSPV NQ Q VPQPQIPGQ+H P APPSGLL  
Subjt:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL--

Query:  --PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPS
          PNPNPPVPALPSPRLNGP PP+PNFPSPHWNGPALLPSPTSQFLLPSPTGYYN+LSPKSPYPLLSPGIQF+PPLTPNFAFP+MPQSGILGPGPHPPPS
Subjt:  --PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPS

Query:  PGVMFPLSPSGIFPIFSPRWRDQ
        PGV+FPLSPSGIFPI SPRWRDQ
Subjt:  PGVMFPLSPSGIFPIFSPRWRDQ

A0A5D3CMQ7 Protein HAIKU1-like2.5e-15788.54Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QD Q  PR PQNPPK QSMRLQRIRPPPLTPINRP++P
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQ-APRAPQNPPKPQSMRLQRIRPPPLTPINRPSMP

Query:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL--
        +PIP PVPVPPPQ +VNNNVPRP QFAQPPPRQLPP+A GGDSHW NP AESPISAYMRYLQNSMMNPSPV NQ Q VPQPQIPGQ+H P APPSGLL  
Subjt:  SPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL--

Query:  --PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPS
          PNPNPPVPALPSPRLNGP PP+PNFPSPHWNGPALLPSPTSQFLLPSPTGYYN+LSPKSPYPLLSPGIQF+PPLTPNFAFP+MPQSGILGPGPHPPPS
Subjt:  --PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPS

Query:  PGVMFPLSPSGIFPIFSPRWRDQ
        PGV+FPLSPSGIFPI SPRWRDQ
Subjt:  PGVMFPLSPSGIFPIFSPRWRDQ

A0A6J1CWP9 protein HAIKU1-like7.9e-15689.24Show/hide
Query:  RNRPNENLGVNKMGKNIRKSPVHQPNFG-NNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSP
        RNR NENLGVNKMGKNIRKSP+HQPNFG NNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QDPQ PR PQNPPKPQSMRL RIRPPPLTPINRP++P+P
Subjt:  RNRPNENLGVNKMGKNIRKSPVHQPNFG-NNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSP

Query:  IPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNP
        IPGPVPV PPQG+VN+N+PRPAQF+QPPPRQ PPLA GGDSHW NP AESPISAYMRYLQNSM+NPSPVAN   L PQPQIP QMHPQAPPSGLLPNPNP
Subjt:  IPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNP

Query:  PVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVMFPL
        PVPALP PR+NGP P VPN PSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNF FP+MPQSGILGPGPHPPPSPGV+FPL
Subjt:  PVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVMFPL

Query:  SPSGIFPIFSPRWRDQ
        SPSGIFPI SPRWRDQ
Subjt:  SPSGIFPIFSPRWRDQ

A0A6J1E6U6 protein HAIKU1 isoform X19.3e-15788.04Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QDPQ PR PQNPPKPQS+RLQRIRPPPLTPINRP+MP+
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS

Query:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL---
        PIP PVPVPPPQ L+ NNVPRPAQFAQPPPRQLPPLAPGGDS+W NP AESPISAYMRYLQNSMMNPSP+ANQ   +PQPQIPGQMH PQAPPSGLL   
Subjt:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLL---

Query:  -----PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHP
             PNPNPPVPALP  RLNGP PPVPNFPSP+WN PALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFP+ PQSGILGPGPHP
Subjt:  -----PNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHP

Query:  PPSPGVMFPLSPSGIFPIFSPRWRDQ
        PPSPGVMFPLSPSG FPI SPRWRDQ
Subjt:  PPSPGVMFPLSPSGIFPIFSPRWRDQ

A0A6J1HKT6 protein HAIKU1 isoform X13.5e-15689.31Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS
        MDRNR NENLGVNK+GKNIRKSP+HQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSP+QDP  PRAPQNPPKPQS+RLQRIRPPPLTPINRP+MP+
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPS

Query:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLLPNP
        PIP PVPVPPPQ L+ NNVPRPAQFAQPPPRQLPPLAPGGDS W NP AESPISAYMRYLQNSMMNPSP+ANQ   +PQ QIPGQMH PQAPPSGLLP P
Subjt:  PIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMH-PQAPPSGLLPNP

Query:  NPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVMF
        NPPVPALP  RLNGP PPVPNFPSP+WN PALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNF+FP+ PQSGILGPGPHPPPSPGVMF
Subjt:  NPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVMF

Query:  PLSPSGIFPIFSPRWRDQ
        PLSPSG FPI SPRWRDQ
Subjt:  PLSPSGIFPIFSPRWRDQ

SwissProt top hitse value%identityAlignment
O82170 Protein HAIKU12.3e-6745.45Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFG---NNAARP--QPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPP-KPQSMRLQRIRPPPLTPIN
        MDR R N++LGVN++GKNIRKSP+HQ  F    +N A P  Q QPQ+YNISKNDFR+IVQQLTGSP+++   PR PQN   +PQ+ RLQRIRP PLT +N
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFG---NNAARP--QPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPP-KPQSMRLQRIRPPPLTPIN

Query:  RPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLP-------PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVAN---------------
        RP++P P      + PPQ           QFA+ PP Q P       P+    D  W N  AESP+S YMRYLQ+S+ +  P AN               
Subjt:  RPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLP-------PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVAN---------------

Query:  -----------------------------------------QPQLVPQPQ---IPG-----------QMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPP
                                                 QPQ  PQPQ   +PG           Q +   PP GL+P+P P    LPSPR N P P 
Subjt:  -----------------------------------------QPQLVPQPQ---IPG-----------QMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPP

Query:  VP------------NFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPG-------PHPPPSPGVM
         P             FPSP +NG   L SPTSQFL PSPTGY N+ SP+SPYPLLSPG+Q+  PLTPNF+F  + Q G LGPG       P PPPSPG+M
Subjt:  VP------------NFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPG-------PHPPPSPGVM

Query:  FPLSPSGIFPIFSPRWRD
        FPLSPSG FP+ SPRW D
Subjt:  FPLSPSGIFPIFSPRWRD

Q84ZL0 Formin-like protein 52.9e-0631.16Show/hide
Query:  PQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTP-------INRPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQP
        P P P  Y  S +   ++       P   P  P  P  PP P S  L  I PPP  P         R  +P P   P P PPP+  V  N P PA    P
Subjt:  PQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTP-------INRPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQP

Query:  PPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVPNFPSPHWNG
        PP  L    P      ++P    P                P +  P   P P  P    P AP S    +  PP P  P P L    PP P  P  H N 
Subjt:  PPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVPNFPSPHWNG

Query:  PALLPSPTSQFLL-----PSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPP---SPGVMFPLSPSGIFPIFSP
        P   P P ++F       P PT ++N   P  P P+   G   SPP  P+   P  P     GP P PPP    PG   P  P G  P   P
Subjt:  PALLPSPTSQFLL-----PSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPP---SPGVMFPLSPSGIFPIFSP

Q9FPQ6 Vegetative cell wall protein gp12.9e-0635.8Show/hide
Query:  SPAQDPQAPRAPQNP-PKPQSMRLQRIRPPPLTPINRPS--MPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAY
        SPA    AP +P  P P P S       PP   P + PS   PSP P P P PP     +   P PA  +  PP   PP  P       +P A  P  + 
Subjt:  SPAQDPQAPRAPQNP-PKPQSMRLQRIRPPPLTPINRPS--MPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAY

Query:  MRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSP
                 +P+P +  P + P P  P    P APPS   P+P+PPVP  PSP    P+PPVP  P+P    P + PSP      PSP       SP SP
Subjt:  MRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSP

Query:  YPLLSPGIQFSP-PLTPNFAFPAMPQSGILGPGPHPPPSPGVMFPLSPSGIFPIFSP
         P  SP     P P+ P+ A P+        P P PPPSP    P  P   FP  +P
Subjt:  YPLLSPGIQFSP-PLTPNFAFPAMPQSGILGPGPHPPPSPGVMFPLSPSGIFPIFSP

Q9LK03 Proline-rich receptor-like protein kinase PERK22.5e-0531.78Show/hide
Query:  PQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPV
        P   P P    L    PP   P+  P  P+ +P  +P PPP   +   +P P     PPP  +PP+ P        P    P++        S + PSP 
Subjt:  PQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPV

Query:  ANQPQLVPQPQIPGQMHPQAPPSGLLPNPN-PPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPP
           P L P P  P      +PP  + P+P   P P  PSP    P PP P+ PS     P L PSP       SP      L P SP P  SP    +PP
Subjt:  ANQPQLVPQPQIPGQMHPQAPPSGLLPNPN-PPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPP

Query:  LTPNFAFPAMPQSGILGPGPHPPPSP-GVMFPLSPS
         +P       P   +    P PP SP G   P +PS
Subjt:  LTPNFAFPAMPQSGILGPGPHPPPSP-GVMFPLSPS

Q9M9F0 VQ motif-containing protein 92.6e-1537.63Show/hide
Query:  PVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQD--PQAPRAPQNPPKP-QSMRLQRIRPPPLT-PINRPSMPSPIPGPVPVPPPQGLVNNN
        P  Q N GN     Q QP +YNI+KNDFR++VQ+LTGSPA +     P+ P + PKP QS RL RIRPPPL   INR   P  +     +P     +N N
Subjt:  PVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQD--PQAPRAPQNPPKP-QSMRLQRIRPPPLT-PINRPSMPSPIPGPVPVPPPQGLVNNN

Query:  VPRPAQFAQP--PPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSP
                +P  P   LPPL P      ++  AESP+S+YMRYLQNSM   +  +N+ +                 SGL P      P     + N P  
Subjt:  VPRPAQFAQP--PPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSP

Query:  PVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSP
           +FP PH   P+   S T    +P+P  +    SPKSPY LLSP I  SP  +    FP  P +        P PSP
Subjt:  PVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSP

Arabidopsis top hitse value%identityAlignment
AT1G32610.1 hydroxyproline-rich glycoprotein family protein1.3e-4142.32Show/hide
Query:  ENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVP
        + LGVNK+GKNI+KSP+             PQPQ Y++S NDF +IVQQLT SP+++      P+N  KPQ    Q+IRP     INRP +P P+     
Subjt:  ENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVP

Query:  VPPPQGLVNNNVPRPAQFAQPPPRQLP----PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQ----LVPQPQIPGQMHPQAPPSGLLPNP
                    P     A+PP   LP    P+   GD    N  AES +S YMRY Q+S+ +  P  NQ Q       QPQ+ GQ       S    + 
Subjt:  VPPPQGLVNNNVPRPAQFAQPPPRQLP----PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQ----LVPQPQIPGQMHPQAPPSGLLPNP

Query:  NPPVPALPSPRLNGPSPPVPN--FPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTP-NFAFPAMPQSGILGPGPHPPPSPG
            P LP+P+ +GP   + N   PSP +NG  +LP+PTSQ+   SPT Y NLLSP+SP PLLS G+Q+ PPLTP N+ F +M Q GILGPG  P P   
Subjt:  NPPVPALPSPRLNGPSPPVPN--FPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTP-NFAFPAMPQSGILGPGPHPPPSPG

Query:  VMFPLSPSGIFPIFSPRWR
             SP G+ PI S RWR
Subjt:  VMFPLSPSGIFPIFSPRWR

AT1G32610.2 hydroxyproline-rich glycoprotein family protein1.3e-4142.32Show/hide
Query:  ENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVP
        + LGVNK+GKNI+KSP+             PQPQ Y++S NDF +IVQQLT SP+++      P+N  KPQ    Q+IRP     INRP +P P+     
Subjt:  ENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVP

Query:  VPPPQGLVNNNVPRPAQFAQPPPRQLP----PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQ----LVPQPQIPGQMHPQAPPSGLLPNP
                    P     A+PP   LP    P+   GD    N  AES +S YMRY Q+S+ +  P  NQ Q       QPQ+ GQ       S    + 
Subjt:  VPPPQGLVNNNVPRPAQFAQPPPRQLP----PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQ----LVPQPQIPGQMHPQAPPSGLLPNP

Query:  NPPVPALPSPRLNGPSPPVPN--FPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTP-NFAFPAMPQSGILGPGPHPPPSPG
            P LP+P+ +GP   + N   PSP +NG  +LP+PTSQ+   SPT Y NLLSP+SP PLLS G+Q+ PPLTP N+ F +M Q GILGPG  P P   
Subjt:  NPPVPALPSPRLNGPSPPVPN--FPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTP-NFAFPAMPQSGILGPGPHPPPSPG

Query:  VMFPLSPSGIFPIFSPRWR
             SP G+ PI S RWR
Subjt:  VMFPLSPSGIFPIFSPRWR

AT2G35230.1 VQ motif-containing protein1.6e-6845.45Show/hide
Query:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFG---NNAARP--QPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPP-KPQSMRLQRIRPPPLTPIN
        MDR R N++LGVN++GKNIRKSP+HQ  F    +N A P  Q QPQ+YNISKNDFR+IVQQLTGSP+++   PR PQN   +PQ+ RLQRIRP PLT +N
Subjt:  MDRNRPNENLGVNKMGKNIRKSPVHQPNFG---NNAARP--QPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPP-KPQSMRLQRIRPPPLTPIN

Query:  RPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLP-------PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVAN---------------
        RP++P P      + PPQ           QFA+ PP Q P       P+    D  W N  AESP+S YMRYLQ+S+ +  P AN               
Subjt:  RPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLP-------PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVAN---------------

Query:  -----------------------------------------QPQLVPQPQ---IPG-----------QMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPP
                                                 QPQ  PQPQ   +PG           Q +   PP GL+P+P P    LPSPR N P P 
Subjt:  -----------------------------------------QPQLVPQPQ---IPG-----------QMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPP

Query:  VP------------NFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPG-------PHPPPSPGVM
         P             FPSP +NG   L SPTSQFL PSPTGY N+ SP+SPYPLLSPG+Q+  PLTPNF+F  + Q G LGPG       P PPPSPG+M
Subjt:  VP------------NFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPG-------PHPPPSPGVM

Query:  FPLSPSGIFPIFSPRWRD
        FPLSPSG FP+ SPRW D
Subjt:  FPLSPSGIFPIFSPRWRD

AT2G35230.2 VQ motif-containing protein2.7e-3941.52Show/hide
Query:  QFAQPPPRQLP-------PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVAN--------------------------------------------
        QFA+ PP Q P       P+    D  W N  AESP+S YMRYLQ+S+ +  P AN                                            
Subjt:  QFAQPPPRQLP-------PLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVAN--------------------------------------------

Query:  ------------QPQLVPQPQ---IPG-----------QMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVP------------NFPSPHWNGPALLPS
                    QPQ  PQPQ   +PG           Q +   PP GL+P+P P    LPSPR N P P  P             FPSP +NG   L S
Subjt:  ------------QPQLVPQPQ---IPG-----------QMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVP------------NFPSPHWNGPALLPS

Query:  PTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPG-------PHPPPSPGVMFPLSPSGIFPIFSPRWRD
        PTSQFL PSPTGY N+ SP+SPYPLLSPG+Q+  PLTPNF+F  + Q G LGPG       P PPPSPG+MFPLSPSG FP+ SPRW D
Subjt:  PTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPG-------PHPPPSPGVMFPLSPSGIFPIFSPRWRD

AT5G46780.1 VQ motif-containing protein4.6e-2337.73Show/hide
Query:  NRPNEN---------LGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQ-NPPKPQSMRLQRIRPPPLTPI
        NR N+N         LGVNKMGKNIRK P +Q N   N     PQ  +YNI+K DFR+IVQQLTG  +     P  PQ N PKP + RL ++RP PLT +
Subjt:  NRPNEN---------LGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQ-NPPKPQSMRLQRIRPPPLTPI

Query:  NRPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPS
        N P  P P P PV   P   + +  V    QF+  P                   AESPISAYMRYL  S    SPV N+ Q  PQ Q            
Subjt:  NRPSMPSPIPGPVPVPPPQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPS

Query:  GLLPNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPH--P
            NP  P   L      GP+             P    SP SQF L SP        P+SP+PL SP   FSP                LG      P
Subjt:  GLLPNPNPPVPALPSPRLNGPSPPVPNFPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPH--P

Query:  PPSPGVMFPLSPSGIFPIFSPRWRDQ
        PPSPG  FPL         SP W++Q
Subjt:  PPSPGVMFPLSPSGIFPIFSPRWRDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGGAACAGGCCGAATGAGAATTTGGGTGTGAATAAAATGGGGAAGAATATTAGGAAGAGTCCAGTACACCAGCCAAATTTTGGAAATAATGCTGCTAGGCCTCA
ACCCCAGCCCCAAATTTACAACATAAGTAAGAATGATTTTCGGAACATTGTTCAGCAGCTTACAGGCTCACCAGCTCAGGACCCTCAAGCTCCTAGAGCTCCACAGAATC
CTCCAAAACCCCAAAGCATGCGTTTGCAGAGAATTAGACCTCCCCCGTTAACACCGATCAATCGACCAAGTATGCCTTCTCCTATCCCTGGCCCCGTTCCTGTGCCTCCA
CCGCAGGGTCTAGTCAATAACAATGTACCTAGGCCTGCACAATTTGCTCAGCCACCTCCAAGACAGTTGCCGCCATTGGCACCAGGGGGAGACTCGCATTGGCTGAACCC
TGTGGCTGAGTCTCCCATTTCGGCGTACATGCGTTACCTTCAAAATTCAATGATGAATCCATCTCCAGTAGCAAACCAACCTCAACTTGTACCACAGCCGCAGATTCCTG
GTCAAATGCATCCTCAAGCACCTCCATCTGGTTTATTGCCTAATCCCAATCCACCGGTTCCTGCTCTTCCATCCCCAAGATTAAATGGTCCTTCACCCCCTGTGCCGAAC
TTCCCTTCACCGCATTGGAACGGTCCCGCCCTTTTACCGTCCCCAACTTCCCAGTTTCTATTGCCTTCTCCTACTGGTTACTACAATTTGTTGTCCCCCAAATCACCTTA
TCCATTACTCTCACCAGGGATACAGTTTTCTCCGCCACTGACTCCTAATTTTGCATTTCCAGCCATGCCTCAATCCGGGATCTTAGGGCCAGGGCCTCATCCACCACCTT
CCCCAGGGGTTATGTTTCCATTATCTCCTTCAGGGATTTTCCCCATCTTTAGTCCAAGATGGAGAGATCAATAA
mRNA sequenceShow/hide mRNA sequence
TTTCTTTCGATCGATTGCCAATTGATTTCTTCCAGAAATTTCTCACCAATGCATTCAGAATCATTTGTTAATTTCCCTTCAATTTTCCCGTTGACTTCAAAATCAAATGT
TGGTTTTGATTTGGGTTTTTCTTTTTCTTGATCTGATTGTTGGGTTTTTCTTCACATCTTTGATCTCCTCTTCTTTCTTAGGGTTTGGACCTTTTCTTCGTCGTCTCAGA
TCCTCAATTTCACCGCGTAATTTCTTTCCACTTCCCAGTTCCCACCGCTCGGTCTTCATTTTTCTTTTCCGTTTTCTTCGCGTTTACTCGTTTTTCTTTTTAGTGTTTAA
GACGTCGGAATCTTTGTTCGTTGAGCGAGCACCTATTTAATTGGGTTGGTAACTGCACTTCCTTTTCTGTTCAGACCATTTGGGTGTTTTTGGTTTCTGAGAATGGCTAT
GCGGCTGATTGCTTCTGGGATGTTTAAATTGCGGAGGATCAAGTGAAGGGTATGATTGTGTTTTCGTTTTTGCTTCTTGAGCTTTGTGATTGAATTTGTCTTGCTGTGTT
TATTGGATTGTTCTTACTGTTGAGTTCGGGCTGAATTTTGAAATTTCTGCTGAGATTGGATTTGGAAGAAGGTTTCTAGTTTTGCACTTGGATTCTGTGCAATCCTTGTG
GATTTGTTTTGTCGAAGGATTGTGAATGTTTTGACTTGGTGGGATTTTATGAGAAGTGGGTTTTGATTTAATTGGTGTTTTTCTAGCCTGTGTTTTGGGATAAGTTTGTT
GTGGAGATGGATAGGAACAGGCCGAATGAGAATTTGGGTGTGAATAAAATGGGGAAGAATATTAGGAAGAGTCCAGTACACCAGCCAAATTTTGGAAATAATGCTGCTAG
GCCTCAACCCCAGCCCCAAATTTACAACATAAGTAAGAATGATTTTCGGAACATTGTTCAGCAGCTTACAGGCTCACCAGCTCAGGACCCTCAAGCTCCTAGAGCTCCAC
AGAATCCTCCAAAACCCCAAAGCATGCGTTTGCAGAGAATTAGACCTCCCCCGTTAACACCGATCAATCGACCAAGTATGCCTTCTCCTATCCCTGGCCCCGTTCCTGTG
CCTCCACCGCAGGGTCTAGTCAATAACAATGTACCTAGGCCTGCACAATTTGCTCAGCCACCTCCAAGACAGTTGCCGCCATTGGCACCAGGGGGAGACTCGCATTGGCT
GAACCCTGTGGCTGAGTCTCCCATTTCGGCGTACATGCGTTACCTTCAAAATTCAATGATGAATCCATCTCCAGTAGCAAACCAACCTCAACTTGTACCACAGCCGCAGA
TTCCTGGTCAAATGCATCCTCAAGCACCTCCATCTGGTTTATTGCCTAATCCCAATCCACCGGTTCCTGCTCTTCCATCCCCAAGATTAAATGGTCCTTCACCCCCTGTG
CCGAACTTCCCTTCACCGCATTGGAACGGTCCCGCCCTTTTACCGTCCCCAACTTCCCAGTTTCTATTGCCTTCTCCTACTGGTTACTACAATTTGTTGTCCCCCAAATC
ACCTTATCCATTACTCTCACCAGGGATACAGTTTTCTCCGCCACTGACTCCTAATTTTGCATTTCCAGCCATGCCTCAATCCGGGATCTTAGGGCCAGGGCCTCATCCAC
CACCTTCCCCAGGGGTTATGTTTCCATTATCTCCTTCAGGGATTTTCCCCATCTTTAGTCCAAGATGGAGAGATCAATAACCCGCCCTCCTCAGAGCGTTTTCGTGTTTT
AAGAGCTCCATGCCTGCTGACATTTTCATCTACTTGTGAAGATGGTTACCAAACTCATGGTGTAAGCTTGCTGCCCCATTGCTGTTTCATATCTTTCCATTAGTTTAATA
TTTTTTTTAACACTTTAAATTGTAAAATCTTTTTACTTCCCATTTTCTTTTAGGCAATGTAGTGAAGATCAAGTTACCAAAATTGTATTACCATATGTAACAGGAAGAAC
AGAGCTGTAACCTTAGACGTAGGATCATAGATGTCAAGTTGATAACTTGGTCGGTCGGTCGGTCTAGATGAATAATAGTCCAATTCTGTAAGATGGTGGAATCAACATTA
TTTGGTATCTGTGCTCTTTTTTTAGCTGGTTTATAGAAGGAAATGTAGAAGTAGTGCAGTTTTATTCCTTTTTATTGTTTTTTCCTAATTAAGCACAACATTTCAT
Protein sequenceShow/hide protein sequence
MDRNRPNENLGVNKMGKNIRKSPVHQPNFGNNAARPQPQPQIYNISKNDFRNIVQQLTGSPAQDPQAPRAPQNPPKPQSMRLQRIRPPPLTPINRPSMPSPIPGPVPVPP
PQGLVNNNVPRPAQFAQPPPRQLPPLAPGGDSHWLNPVAESPISAYMRYLQNSMMNPSPVANQPQLVPQPQIPGQMHPQAPPSGLLPNPNPPVPALPSPRLNGPSPPVPN
FPSPHWNGPALLPSPTSQFLLPSPTGYYNLLSPKSPYPLLSPGIQFSPPLTPNFAFPAMPQSGILGPGPHPPPSPGVMFPLSPSGIFPIFSPRWRDQ