CuGenDBv2

CcUC06G118460 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC06G118460
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: protein tesmin/TSO1-like CXC 5
Genome location: CicolChr06:13091995..13126611
RNA-Seq Expression: CcUC06G118460
Synteny: CcUC06G118460
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001876 - Zinc finger, RanBP2-type
IPR028307 - Lin-54 family
IPR036443 - Zinc finger, RanBP2-type superfamily
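
The genome location field in the record above can be split into its parts programmatically. The following is a minimal sketch, assuming the string has the layout `<sequence id>:<start>..<end>` and that the coordinates are 1-based and inclusive; the record itself does not state the coordinate convention, and the names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Assumed layout: <sequence id>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (sequence id, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CicolChr06:13091995..13126611")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.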


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004142516.1 uncharacterized protein LOC101209122 isoform X1 [Cucumis sativus] (e-value: 9.7e-186; identity: 85.08%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYE                           GSVSP RRRDVHRY+S+FDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R REFGGGRDL RYRDTSPHY RR+SGGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT
        G PSLHSPPRRFAAHP+ERSPGRTL+EYRSPPRSWARDG R++AAGGLAPPRYESRY DHLRRDRVDYLEDSFRGRSKFDRP+PSADW LRDNGRDDFIT
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT

Query:  ERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        ERKGFERRPPSPPL LLPQRGRW+R+VR+RSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  ERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_004142517.1 uncharacterized protein LOC101209122 isoform X2 [Cucumis sativus] (e-value: 4.7e-188; identity: 89.78%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYE       GSVSP RRRDVHRY+S+FDHS GLTR REFGGGRDL RYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        Y RR+SGGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY G PSLHSPPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQR
        PGRTL+EYRSPPRSWARDG R++AAGGLAPPRYESRY DHLRRDRVDYLEDSFRGRSKFDRP+PSADW LRDNGRDDFITERKGFERRPPSPPL LLPQR
Subjt:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQR

Query:  GRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        GRW+R+VR+RSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  GRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_008462734.1 PREDICTED: uncharacterized protein LOC103501026 isoform X2 [Cucumis melo] (e-value: 4.0e-187; identity: 89.5%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE       GSVSP RRRDVHRYIS+F+HS GL R REFGGGRDL RYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        Y RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY G PSLH+PPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQR
        PGRTL+EYRSPPRSWARDG R++AAGGLAPPRYESRY DHLRRDRVDYLEDSFRGRSKFDRP+PSADW LRDNGRDDFITERKGFERRPPSPPL LLPQR
Subjt:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQR

Query:  GRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        GRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  GRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida] (e-value: 1.8e-187; identity: 87.47%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSPTRRRDVHRYISDFDHSG LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R R+FGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGP  A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYV
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT
        G PSLHSPPRRFAAHP+ERSPGRTL+EYRSPPRSWARDGPR+IAAGGLAPPRYESRY DHLRRDRVDYL+DSFRGRSKFDRP+PSADW LRDNGRDDFI+
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT

Query:  ERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        ERKGFERRPPS PPLSLLPQRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
Subjt:  ERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_038880195.1 uncharacterized protein LOC120071861 isoform X3 [Benincasa hispida] (e-value: 8.5e-190; identity: 92.29%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYE       GSVSPTRRRDVHRYISDFDHSG LTR R+FGGGRDLGRYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        YSRRISGGRPFGRGVDGP  A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYVG PSLHSPPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPS-PPLSLLPQ
        PGRTL+EYRSPPRSWARDGPR+IAAGGLAPPRYESRY DHLRRDRVDYL+DSFRGRSKFDRP+PSADW LRDNGRDDFI+ERKGFERRPPS PPLSLLPQ
Subjt:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPS-PPLSLLPQ

Query:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        RGRWAR+VRERSRSPIRGPVRSPLRV LRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
Subjt:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X1 (e-value: 4.0e-185; identity: 84.82%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSP RRRDVHRYIS+F+HS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R REFGGGRDL RYRDTSPHY RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT
        G PSLH+PPRRFAAHP+ERSPGRTL+EYRSPPRSWARDG R++AAGGLAPPRYESRY DHLRRDRVDYLEDSFRGRSKFDRP+PSADW LRDNGRDDFIT
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT

Query:  ERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        ERKGFERRPPSPPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  ERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A1S3CHP1 uncharacterized protein LOC103501026 isoform X2 (e-value: 1.9e-187; identity: 89.5%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE       GSVSP RRRDVHRYIS+F+HS GL R REFGGGRDL RYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        Y RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY G PSLH+PPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQR
        PGRTL+EYRSPPRSWARDG R++AAGGLAPPRYESRY DHLRRDRVDYLEDSFRGRSKFDRP+PSADW LRDNGRDDFITERKGFERRPPSPPL LLPQR
Subjt:  PGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQR

Query:  GRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        GRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  GRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A5D3E321 TATA-binding protein-associated factor 2N (e-value: 4.0e-185; identity: 84.82%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSP RRRDVHRYIS+F+HS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R REFGGGRDL RYRDTSPHY RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT
        G PSLH+PPRRFAAHP+ERSPGRTL+EYRSPPRSWARDG R++AAGGLAPPRYESRY DHLRRDRVDYLEDSFRGRSKFDRP+PSADW LRDNGRDDFIT
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFIT

Query:  ERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        ERKGFERRPPSPPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  ERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X7 (e-value: 1.7e-183; identity: 85.68%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSPTRRRD HRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        R REFGGGRDLGRYRDTSPHYSRR+SGGRPFGRG DGPG A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDF
        VG PSLHSPPRRFAAHP+ERSPGRT++EYRSPPR WARDGPR+IAAGGLAPPRYESRY D HLRRDRVDYLEDSFRGRSKFDR +P +DW+LRDNGRDDF
Subjt:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDF

Query:  ITERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        I ERKGFERRPP PPLSLLPQRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDG
Subjt:  ITERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X6 (e-value: 1.6e-181; identity: 85.45%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSPTRRRD HRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLT

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        R REFGGGRDLGRYRDTSPHYSRR+SGGRPFGRG DGPG A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDF
        VG PSLHSPPRRFAAHP+ERSPGRT++EYRSPPR WARDGPR+IAAGGLAPPRYESRY D HLRRDRVDYLEDSFRGRSKFDR +P +DW+LRDNGRDDF
Subjt:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDF

Query:  ITERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        I ERKGFERRP S PPLSLLPQRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDG
Subjt:  ITERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

SwissProt top hits (e-value, % identity, alignment)
F4JY84 Protein tesmin/TSO1-like CXC 7 (e-value: 1.6e-29; identity: 33.76%)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSI-PQFHQANNVMSSGTST
        ANILCSENC+C DCKNFEGSEER+AL HG   ++  YIQQ  NAA+  AI  S Y   P S+KRK  ++       DS ++S   Q+ +AN+V  +G ++
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSI-PQFHQANNVMSSGTST

Query:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNR
            P       + A SG +  ++RS  ++  QPH ++ELCS+LV  S +VA K++++    K    P  +  A    +E                  N 
Subjt:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNR

Query:  SDRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSE-VYIEQERIVLTKFRDCLNKLITLGEIKETKFTSR
        S   V D +  D       +P+SP T ALMCDE   +      ++        TS    D   S  +Y+EQER +L+ FRD L +L           ++R
Subjt:  SDRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSE-VYIEQERIVLTKFRDCLNKLITLGEIKETKFTSR

Query:  SEVGNENLSNN
        + +  +N+ +N
Subjt:  SEVGNENLSNN

Q28009 RNA-binding protein FUS (e-value: 2.7e-05; identity: 33.57%)
Query:  GRPFGRGVDGPGLASGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTL
        G P GRG  G G + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G GG P   ++G + +   RR      +R   R  
Subjt:  GRPFGRGVDGPGLASGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTL

Query:  SEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDR
           R   R   R G      GG  P + +SR  +H R+DR
Subjt:  SEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDR

Q92804 TATA-binding protein-associated factor 2N (e-value: 1.4e-06; identity: 42.67%)
Query:  GRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-RGYVG
        G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN PR      +GG  R RGY G
Subjt:  GRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-RGYVG

Q9SL70 Protein tesmin/TSO1-like CXC 6 (e-value: 8.0e-58; identity: 48.12%)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS
        ANILCSENCKC+DCKNFEGSE RQ+LFHG+H++NLAY+Q  ANAAITGAIGSSG+A  P  K+RKG E+ F    KDS   S  +  QANN  ++ + T 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS

Query:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS
        S         G  A+ GPSK  ++SLLA++I+P D+K LCSVLV  + E AK + E+R A ++     + S ASS  D+                 +N++
Subjt:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS

Query:  DRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKET
        ++S  ++SN+DG   +K R +SP TLALMCDERDTM M A     S       +S +P+    +VY EQE++VLTKFRDCLN++I+ GE+KE+
Subjt:  DRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKET

Q9SZD1 Protein tesmin/TSO1-like CXC 5 (e-value: 1.2e-72; identity: 51.81%)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST
        ANILCSENCKC+DCKNFEGSEERQALFHG+H+N++AY+QQAANAAITGA+GSSG+A  P  K+RKG E+ F    KDS  + +  F Q NN  + G TS 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST

Query:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-
        +SP PV+  G    A+S PSKF +RSLLAD+IQPHD++ LCSVLV  + E AK   ++RN  E +++D  + S ASS  D+ Q       AAD E  ++ 
Subjt:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-

Query:  -NRSDRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK
         N++D+S  + SNSDG+D +K  P+SP TLALMCDE+DT+FM  A   +GS   + C  +S    +  SE+Y EQER+VLTKFRDCLN+LI+  EIKE+K
Subjt:  -NRSDRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK

Query:  FTS--RSEVGNENLSNNFTSNNGCHQRSISNG
          S  R  +    ++   T N    Q  I NG
Subjt:  FTS--RSEVGNENLSNNFTSNNGCHQRSISNG

Arabidopsis top hits (e-value, % identity, alignment)
AT2G20110.1 Tesmin/TSO1-like CXC domain-containing protein (e-value: 5.7e-59; identity: 48.12%)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS
        ANILCSENCKC+DCKNFEGSE RQ+LFHG+H++NLAY+Q  ANAAITGAIGSSG+A  P  K+RKG E+ F    KDS   S  +  QANN  ++ + T 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS

Query:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS
        S         G  A+ GPSK  ++SLLA++I+P D+K LCSVLV  + E AK + E+R A ++     + S ASS  D+                 +N++
Subjt:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS

Query:  DRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKET
        ++S  ++SN+DG   +K R +SP TLALMCDERDTM M A     S       +S +P+    +VY EQE++VLTKFRDCLN++I+ GE+KE+
Subjt:  DRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKET

AT2G20110.2 Tesmin/TSO1-like CXC domain-containing protein (e-value: 2.8e-58; identity: 48.11%)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS
        ANILCSENCKC+DCKNFEGSE RQ+LFHG+H++NLAY+Q  ANAAITGAIGSSG+A  P  K+RKG E+ F    KDS   S  +  QANN  ++ + T 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS

Query:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS
        S         G  A+ GPSK  ++SLLA++I+P D+K LCSVLV  + E AK + E+R A ++     + S ASS  D+                 +N++
Subjt:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS

Query:  DRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIK
        ++S  ++SN+DG   +K R +SP TLALMCDERDTM M A     S       +S +P+    +VY EQE++VLTKFRDCLN++I+ GE+K
Subjt:  DRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIK

AT4G28990.1 RNA-binding protein-related (e-value: 2.8e-45; identity: 39.27%)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR----------------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLTRSREF
        MGS +K+ TT HH P +SSLVVRPS S   + G    G    G V R                G      S SP RR  + H++ SD +HSG   R RE 
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR----------------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLTRSREF

Query:  GGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPS
           R+  GR+RD SP  +R  +G RP+ RG+DGP    G  R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP        
Subjt:  GGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPS

Query:  LHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKG
           P  R    P+  SP R  + YRSPPR W RD P         PPR++        RDR  Y +  +    +      ++DW   +         +  
Subjt:  LHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKG

Query:  FERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDG
        ++RRPP  P    P+ GRW R +RERSRS P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G
Subjt:  FERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDG

AT4G28990.2 RNA-binding protein-related (e-value: 1.0e-39; identity: 34.88%)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR-----------------------------------------------------
        MGS +K+ TT HH P +SSLVVRPS S   + G    G    G V R                                                     
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR-----------------------------------------------------

Query:  -----------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLTRSREFGGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PN
                   G      S SP RR  + H++ SD +HSG   R RE    R+  GR+RD SP  +R  +G RP+ RG+DGP    G  R   S+NN   
Subjt:  -----------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLTRSREFGGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PN

Query:  VRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESR
        V+PR+GDWYC DPLC NLNFARRE C  C R R     SP           P  R    P+  SP R  + YRSPPR W RD P         PPR++  
Subjt:  VRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPRDIAAGGLAPPRYESR

Query:  YPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSGL
              RDR  Y +  +    +      ++DW   +         +  ++RRPP  P    P+ GRW R +RERSRS P+R     PLR      L  G 
Subjt:  YPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSGL

Query:  PP--KDFRRDVFVERE-RDDRRGLGRDRDG
        PP  +D+RRD   +RE RDD RG GR R G
Subjt:  PP--KDFRRDVFVERE-RDDRRGLGRDRDG

AT4G29000.1 Tesmin/TSO1-like CXC domain-containing protein (e-value: 8.2e-74; identity: 51.81%)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST
        ANILCSENCKC+DCKNFEGSEERQALFHG+H+N++AY+QQAANAAITGA+GSSG+A  P  K+RKG E+ F    KDS  + +  F Q NN  + G TS 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST

Query:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-
        +SP PV+  G    A+S PSKF +RSLLAD+IQPHD++ LCSVLV  + E AK   ++RN  E +++D  + S ASS  D+ Q       AAD E  ++ 
Subjt:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-

Query:  -NRSDRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK
         N++D+S  + SNSDG+D +K  P+SP TLALMCDE+DT+FM  A   +GS   + C  +S    +  SE+Y EQER+VLTKFRDCLN+LI+  EIKE+K
Subjt:  -NRSDRSVHDNSNSDGLDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK

Query:  FTS--RSEVGNENLSNNFTSNNGCHQRSISNG
          S  R  +    ++   T N    Q  I NG
Subjt:  FTS--RSEVGNENLSNNFTSNNGCHQRSISNG


Sequences
CDS sequence
ATGAATGCAGAAGAATTAGGCAAACAAGGCGAAAGGATGGGTTCAAGGGATAAAGACTCTACGACTCACCATCAGCCGCTATTGAGCAGCCTTGTTGTCCGGCCT
TCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTGGCGGCCGCGTTGGTCGTGGAAGCGATTACGAGCCCGGTTCAGTTTCTCCAACACGCCGTCGG
GATGTTCACCGATATATTTCTGATTTTGATCATTCTGGTGGTCTCACTCGCAGTCGTGAATTTGGTGGTGGGAGGGATCTTGGTAGATATCGAGATACTTCACCT
CATTACAGTCGAAGAATAAGTGGTGGCAGGCCATTTGGGAGAGGTGTTGATGGCCCTGGACTTGCTTCTGGGCCATTTCGTGGGGAACGTAGTAAAAATAATCCA
AATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGCAATAGACCCCGCACT
GGAGCTGGTGGAAGTCCTCGAAGAGGTTATGTTGGTCCATCCCTGCATTCTCCTCCTAGACGCTTTGCTGCGCACCCAGTTGAACGTTCTCCTGGCAGGACTCTT
AGTGAATATAGGTCTCCTCCCCGTAGTTGGGCCAGGGATGGTCCTAGGGATATTGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTATCCCGATCAC
CTGCGAAGAGACAGGGTGGACTATCTAGAAGATAGCTTCAGAGGAAGATCGAAGTTCGATAGGCCAGTTCCTTCAGCAGATTGGACCCTTAGAGACAATGGAAGG
GATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAAGGCCACCATCCCCACCACTGTCGTTGCTTCCTCAGCGTGGGCGCTGGGCGCGTGAAGTGAGGGAGAGG
AGCCGTTCCCCAATCAGAGGTCCCGTCAGATCTCCATTAAGAGTCTCGCTACGGTCTCCATTAAGTAGTGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTC
GTTGAAAGGGAGCGGGATGATAGGCGTGGCCTAGGACGAGATCGCGATGGAGTCTTGCCGGAGCATCCTCATTTTCAGTCTCAGCCTCCGTGTCAACAATCAGAG
TCACGGGCCGTCATGGTGGTGCAGAGTCAATCGCAACCACAGTCCCCACAACATTTGACGGCCAATATTCTCTGCTCTGAAAACTGCAAGTGCATGGACTGTAAG
AATTTTGAAGGCAGCGAAGAGAGACAGGCTCTTTTCCATGGTGACCATGCCAACAACTTGGCTTATATTCAACAGGCAGCAAATGCTGCAATAACTGGAGCTATT
GGATCCTCTGGTTATGCTTGCCTTCCCACTTCAAAGAAAAGAAAAGGTCCAGAGCTATGCTTTGGCCCGGTAGGGAAGGATTCCCCCCTCAACAGCATACCGCAA
TTTCATCAGGCAAATAATGTAATGTCTTCAGGCACGTCTACTTCATCTCCCTTCCCAGTTGCTCATGTTGGCAATGGCAGCCCTGCAGCATCGGGGCCTTCAAAG
TTCTCATTCAGGTCCTTATTAGCTGACCTCATCCAACCGCATGACTTGAAGGAGCTTTGCTCAGTTTTAGTGGTGTTTTCACGTGAAGTTGCCAAGAAAATAGCA
GAACAAAGAAATGCTGAGAAACAGATAAATGACCCACCACAGATTTCTCGTGCTTCATCTACTGTTGATGAGTCACAGCATCAGAAGGCCGAAGAGAAAGCTGCA
GATGGCGAGTGTGGAAGTTCAAACCGAAGTGATAGGAGTGTACACGATAATTCAAATTCAGATGGTTTGGATATTACAAAAGCAAGGCCGATGTCCCCTGGAACT
TTAGCTTTAATGTGTGATGAACGAGATACGATGTTCATGGGAGCTGGTCTAGCTGATGGGTCGGCAGGCCATGATTGCAACACATCATCACATATGCCTGATAAG
TCTGTGTCAGAAGTCTACATAGAGCAAGAACGGATTGTGCTGACAAAGTTTCGAGATTGTCTTAATAAACTCATCACTCTAGGAGAGATAAAAGAAACAAAATTC
ACTTCTAGAAGCGAAGTGGGGAATGAAAATCTCAGCAACAATTTCACCTCAAATAATGGGTGCCACCAGAGATCTATTAGCAATGGGGTTGTAAAAAATGTCGCT
CTCTCGGCACACAGAATAACGCCAGTCGGCACTGCAGCTCGTCATCCGAATAGCGATCTCCTACTCAAAATTCTACCAATTCCTAAAAATAGTAAGAGTAAACCA
CAAGTTGACAGAGAAGTCTAG
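
A coding sequence such as the one above can be translated to protein with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies and that the input starts in frame; `translate` and `CODON_TABLE` are names introduced here for illustration. The short fragment in the example is copied from the first line of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


fragment = "ATGGGTTCAAGGGATAAAGAC"
print(translate(fragment))  # MGSRDKD
```

The translated fragment corresponds to the residues that open the Query rows in the homology section.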
mRNA sequence
ATGAATGCAGAAGAATTAGGCAAACAAGGCGAAAGGATGGGTTCAAGGGATAAAGACTCTACGACTCACCATCAGCCGCTATTGAGCAGCCTTGTTGTCCGGCCT
TCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTGGCGGCCGCGTTGGTCGTGGAAGCGATTACGAGCCCGGTTCAGTTTCTCCAACACGCCGTCGG
GATGTTCACCGATATATTTCTGATTTTGATCATTCTGGTGGTCTCACTCGCAGTCGTGAATTTGGTGGTGGGAGGGATCTTGGTAGATATCGAGATACTTCACCT
CATTACAGTCGAAGAATAAGTGGTGGCAGGCCATTTGGGAGAGGTGTTGATGGCCCTGGACTTGCTTCTGGGCCATTTCGTGGGGAACGTAGTAAAAATAATCCA
AATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGCAATAGACCCCGCACT
GGAGCTGGTGGAAGTCCTCGAAGAGGTTATGTTGGTCCATCCCTGCATTCTCCTCCTAGACGCTTTGCTGCGCACCCAGTTGAACGTTCTCCTGGCAGGACTCTT
AGTGAATATAGGTCTCCTCCCCGTAGTTGGGCCAGGGATGGTCCTAGGGATATTGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTATCCCGATCAC
CTGCGAAGAGACAGGGTGGACTATCTAGAAGATAGCTTCAGAGGAAGATCGAAGTTCGATAGGCCAGTTCCTTCAGCAGATTGGACCCTTAGAGACAATGGAAGG
GATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAAGGCCACCATCCCCACCACTGTCGTTGCTTCCTCAGCGTGGGCGCTGGGCGCGTGAAGTGAGGGAGAGG
AGCCGTTCCCCAATCAGAGGTCCCGTCAGATCTCCATTAAGAGTCTCGCTACGGTCTCCATTAAGTAGTGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTC
GTTGAAAGGGAGCGGGATGATAGGCGTGGCCTAGGACGAGATCGCGATGGAGTCTTGCCGGAGCATCCTCATTTTCAGTCTCAGCCTCCGTGTCAACAATCAGAG
TCACGGGCCGTCATGGTGGTGCAGAGTCAATCGCAACCACAGTCCCCACAACATTTGACGGCCAATATTCTCTGCTCTGAAAACTGCAAGTGCATGGACTGTAAG
AATTTTGAAGGCAGCGAAGAGAGACAGGCTCTTTTCCATGGTGACCATGCCAACAACTTGGCTTATATTCAACAGGCAGCAAATGCTGCAATAACTGGAGCTATT
GGATCCTCTGGTTATGCTTGCCTTCCCACTTCAAAGAAAAGAAAAGGTCCAGAGCTATGCTTTGGCCCGGTAGGGAAGGATTCCCCCCTCAACAGCATACCGCAA
TTTCATCAGGCAAATAATGTAATGTCTTCAGGCACGTCTACTTCATCTCCCTTCCCAGTTGCTCATGTTGGCAATGGCAGCCCTGCAGCATCGGGGCCTTCAAAG
TTCTCATTCAGGTCCTTATTAGCTGACCTCATCCAACCGCATGACTTGAAGGAGCTTTGCTCAGTTTTAGTGGTGTTTTCACGTGAAGTTGCCAAGAAAATAGCA
GAACAAAGAAATGCTGAGAAACAGATAAATGACCCACCACAGATTTCTCGTGCTTCATCTACTGTTGATGAGTCACAGCATCAGAAGGCCGAAGAGAAAGCTGCA
GATGGCGAGTGTGGAAGTTCAAACCGAAGTGATAGGAGTGTACACGATAATTCAAATTCAGATGGTTTGGATATTACAAAAGCAAGGCCGATGTCCCCTGGAACT
TTAGCTTTAATGTGTGATGAACGAGATACGATGTTCATGGGAGCTGGTCTAGCTGATGGGTCGGCAGGCCATGATTGCAACACATCATCACATATGCCTGATAAG
TCTGTGTCAGAAGTCTACATAGAGCAAGAACGGATTGTGCTGACAAAGTTTCGAGATTGTCTTAATAAACTCATCACTCTAGGAGAGATAAAAGAAACAAAATTC
ACTTCTAGAAGCGAAGTGGGGAATGAAAATCTCAGCAACAATTTCACCTCAAATAATGGGTGCCACCAGAGATCTATTAGCAATGGGGTTGTAAAAAATGTCGCT
CTCTCGGCACACAGAATAACGCCAGTCGGCACTGCAGCTCGTCATCCGAATAGCGATCTCCTACTCAAAATTCTACCAATTCCTAAAAATAGTAAGAGTAAACCA
CAAGTTGACAGAGAAGTCTAGAAAAAGCCCAAGAAAGCAACTGGCGTGTTGCTCCATCTCATTGTTCATTCAGGTTTAATAAAAACAAAAATTAGGTTTAATGAC
AATGATTAATGAGAAGAAATGAGAAGAGAAGAGAAGATTTTATACCTTAAACAATGGCAGAAAGGTAGAAGAAGAAGAAAGAAGAAGAAGAATCGGAAAAAGAAG
AGGAAGAAGAAGTAGATGATAGAAGAATATGAAGAAGAATAGGAATAAGAAGGAGGAGGAGGAGGAGGAGGAGAAGGAGAAAAATGAGACAAAGGAGAAAAAGTA
GCAGACGAAGAAGATGAAGAAGACGAAAAGGCAAAACGAAGGGAAAATTATTAGACATTAATAGAACACCATCACTATCTATCGTAAAATTTAAGATTAATGTTT
ATACCTTTGTTTTTTGTTTATCTTTCTGGTTTTCAGTTTTTTATTTTTATTTTTCCTTTGCTTGATGAGTCTCGGATGACTATTTTTCTTTTGATTTTGCTTACC
ATTTGTTTATAAAATATAGGTTTAATAAGAACAAAAATTAGTCCTTAGCACGGCCCTGTAAAGAGAAAGGAAAAAGTCTTGATTTAATAGTATCATTAGAAGCAC
TATTCATTTTTACCATTGCGCATATTTCTAAGAAGGACTGTAACTGCTTGTGAGGATCTTCATTAGTTCTTCCTCTAAAGGCTAGCTCTCTAGCCATTTGGATCA
ACCCTGATTTCAACTCAAAGTTGTTGACATTGATGGACACATTCATTATATTGAGTTGATTTGCCGGTAATGTAAGTTGGAAGTAGTCTCGAATTGCTTTTGAAA
GCTCCTCCGTCATCTCCTTGATGTGATTGTGTTGAACCCTTAGGTTTCTTCTACATGTCCTTTCAATCTCGGGATCAAGAGGGAGAATATTAGTATTGTCACGAG
GCATAAACCGACAAAATTATCAACCAAATAATACTATACACCTCAAATTAAGACCTACCAATTCTAATGAATATTAAATAACAAAAACTCTTCTTCGGATCACTG
AAAACAACGCCAAAAACTTGATTGCAACACTTTAAGTGTTAATCGCAAGTGTACGGTTCAAGTTATAATATAAGACGATAAAGTTCGAATATCGTTTTACTAAGG
ATCCAATTGTGAAGGTTAAAGTTATTTAGCTATTTTAGAGTTGAATCAATTTTATTTGAGGATTGGAATTGAAATTGTAAACTATGGAAAATATAAACTAAATAA
GGAAAGTAATAAAGTGAGAAACACAAGAGTCAAAGAGAGTTCTAGGGTAATGAATTTTTGTTTATTTCATCATGTAATATTCTAGGGTTCATGGAGTTTATTGTA
ATTCATATGCCTAAGTTTAGAAACTTAATTCTAAGTTATTTTCTCTCAACAACTCTTTCTCATGCAACTTAATTGATTATCAATCTCTTGAAACCAATACAAATT
ACATGATATTAAGAATAGTCAACTTTTATTGTCAACCTAACTACTCTCGTGATAGATTAACAACAATTCATTATTATAGATCTAGTCAAACATACAATCTCTCAA
TCATACATCTAACCTAACATGTAAAGATGACCAATCCACAAGAATTAAACACAATAACAATGAAAAACATAATCATTCCAACAAAACCATGAAAGTCTACAATAT
TAACATGAATTCATAAACAATTCACCAAAACCCTAAAACTAAAGAGTTTAGCCATTCATAGCTTGAAGCAACATGTTCAAATAATACAAACAAGGAAAGAGGGTA
AGAAAGGAAAACTCTATTGAAAAAAGAGGCTCTTGATTGAGGTTGCACCGTCACAATCTCTTTGGCTCTCACGTCAGAACTTGAACGATGTTGATTCATGATTAG
GGCTGCTGAAAACCCCCAAATTTATAGAGTTGGCTGCCCAAAAGACTTTCAAAAAATAACAAAATTTCAATTAAGGAAGACTGACGTTGCGGCGCTTGTCTATAG
CGCCATGACATTGCCAATTATGGAAAGACAGCACTGTGGCGCTGCTGCTGTGTTGCGGCAGACCTCTGGGGAAATAGCGCCGCGGCGCTGCTCTGTTTTGGCAAT
ATTGTTGTTTTTCATCTTTTTTTCACTTTATTCTGGCTCGGTTTTGATCTTTTGGTCTTCTTTTCTAATTTTCCTCATTTTAACCTGAAAATCACTAGCAAACAA
GTGAAATCTAATATAAATTCTCTTAATTTGAGAGAATAGAAAGCACTTTTCAAGCGCTTTTCAATAGACAATGAACGTGTTGTAGTATTGGTATCTCGGATAGAT
AGTGATAGTGTTCTATCAATGTCTATCTAGGATAGACTTTGATGGTATATCTGGGATAAACTTTGATAGACTACTATCAAAGTCTATCCCAGATAGATAGTGAAC
TTGTTGTAGTATTAGTATCTTGAATAGACAGTGATAGTGTCCTATCAATGTCTATCTATGATAGACAGTAATATTTATATGATAGGTTTAATGACATATACTATC
ATTGTTTATATATATGCTAATGCTTCTATCACTGATATACATGTTATTGCTATTACTTCACTCTTGTTCTAGTATTTGATATGTGTTGTAGTATTAGTATCTCAG
ATAGAAAGTGATAGTGTTCTATCAATGTCTATCTGGGATAGACAATATTAGTGTTCTATCAATGTCTATCAAGAATAGATAGTGATAGACTACTATTAGGTTCTA
TCAACAGTAGAAAACAAATAATAATTTTGTTCTTGCATTACCATAATGTATTGAAATAGTAGTTCCTAATATATTATTTTTAATATACCTTTTCAACCGGTGGAA
CCTGAGTTATTAGTTGAAAATGACAACAAAAACTTAAGGATAACATTATACTTAAATTTGGAAGTCTTAAAGATCATTAAGAAGAAATTACATGAACAAAGTTTA
ATAGAAGTTAGAACGATCCTTTTATATACTTTCAAACACTATTAATTGGTGTTGAGTTGATGACCATGGTAATAGAAAACACATAATTACACAAGGACAAACAAA
CGATTGATTATGTGGTATAGGTTCAAAGAAGAAAAAAAAAACAACAAAAAACCATCAGGAAAGAAAAAATAAACTGATAGACAAAGATAACTCTATCACTATCTA
TCAATAGTTTTCTATCAAAGTTTATTAGTAAGATAGAGAACGATAGATTGCTATCTTTGTCAATCACTGATAGATAGTGTGAACTCTACTTTGAAATTTTTGGTT
TCAGCAAAAATATAATGAAGAAGAAGAAGAAAATCAATGAAAAAAAAAATTGATAATAATAAGAAGAAGAAGATCAATGAAGACAAACTAATTATTAATAATAAT
AATAATAAGAAGGCAATAATGGCAAAAGAAGAAGAAGAAGAAGAAGAAGAAAAAAAAAGATCTTGAGAAGAAAATGAAGAAAATCTAAGAAGAAGAAGAAGAAAA
AGAAAAAAAAGAAGAAGAAGAAAACCTAGAAGACGATGATGAAGAGAGACTGGCGTAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGACATGGAGGAAGAAGCATC
AACAAAATCTTCGATCGAATTGTAAATCATGTTGAATCTGATTGCAGGAAAAAAGAAGAATGGAAAGAAGAAGACGAAAATCGCCAGAAAAACGTGCGCAACTTA
AAATTGAAGGAAAGAAAATAGGAAAAGATGAAGAGGAAAGAAGGAGATATAAAGACATGGTCTCTGAATGCGCGGGTAAATTAAAGATGACACGGCCAACTTATC
GAGTCTTGCAAATATGGAACCATATCAGCATCAATGTTAATGGTGCTTCAAACCCCCACAAATGGCTTCTTCTTCCTTAATTCCTGTGCAGCACAACCATTTTAT
CTCCTGCAACACTCTCTGTAAGTTCTCTTCAGCCCTTACTACCTCCAATTCCCCTTGTTTATCGCGTTTTCCCACTTTCTCTTCTCCCAAACACACAGGTCGTTT
AGTTAGAGTAGGAAATGGAGTTGCGGGAGAATTTTCTACTAGCAAAAACCGTAAACCTTCCAATAGACCTTTGTCTTCGATGACCCAGAAGCTAAACAAAAATGG
GTCTCCTCATTTAGGGTATGGAAATGAAACACTTGTTAAGAAATCTGTGGAAGACCTTGTTAATTTAGAGATGGAGGAGAGAAATGATAAAAGGCTTGAAAAGGG
TAAATACGCTAGGAATGGCTCTGGCTCTGTTAATGGAATGAAAACGAGTGCTGGGATTTCTTTCCTTAAGACTGGAAGTGATAGTGGTTCACTGAAGGTTTCTGG
TAAGATTAAGGAAAAGAGCATGCGAAATCAGGTGGTAGCGGAGAAGGAGAAATTGTCAAAAGGGAGTAACGAAATCCCATTTAGAGCAAATTTGGATATGTGCTC
AAAGACAGGGGATTTCATGGGCGCAATTAAATTGTACGAATGGGCTCAAAGGGAAGGAATTAAGCTGGAGCAATATCATTATGCTGTCATTTTGTATCTTTGTTC
CTCAGCGGCTTTAGGTGCTATTCAGCCCGCTATGAGTGGTAGTGGTAACCGAACTTCAAATTCATTAACTATGTTTAAGGTGGATACTTATGAAAATCCCATTAT
ACTGGATGAACAACATTCTTCCAAGACTAGTTATGTTTCAAAGAGAGAGAGTTGTGGAAGAACAGATTTGAGTGCCAAAAATGATAGAAATAATTCAGGTGGGAT
GATTGACAATAAGGAGAATATAGTTCATACCAACGGATCTATGGTGCCAAAAGCTTGGATATTGGATGAGAAGAGCCATTCTAATATTTTGGTAGATGAGGATTT
TAAAAAATATGCTCTCAAGAGGGGATTCGAGATCTATGAGAAAATGTGTGCAGAAAAGATCCCAATGAATGAAGCAACCTTGACATCTGTGGCTCGAATGGCAAT
GTCCATGGGTGATGGTGACATGGCATTTGATATGGTGAAGAAGATGAAGCCATTAGGACTTAATCCTAGATTGCGCTCTTATGGCCCTGCCCTTTCTGCTTTCTG
CAAAAATGGGGAATTAGATAAAGCATTTTCAGTTGAGAAGCACATGTTGGAGCATGGTGTCTATCCAGAAGAGCCTGAGTTGGCAGCACTTTTAAGAGTAAGTAT
AGATGCGTCTAACGGTGAGAAGGTATATTATTTGTTGCATAAACTAAGAACAAGTGTAAGGCAAGTCTCGCCCTCAACAGCCAATCTTATTATTACCTGGTTCAA
GAGCAAAGATGCTGCAAGAGTGGGAAAAGTAAAATTGGATAGAAAAAGAATAAAGAAAGCAATGGAAAATGGTGGCGGAGGTTGGCATGGACTGGGATGGTTGGG
AAGGGGAAAGTGGAGTGTATCATCTACAAATGTTGGAAAGGATGGCTTGTGTAAATCCTGTGGGGAAAAATTGGCAACAATTGATCTTGATCCTATTGAAACTGA
GAATTTTGCTGAATCTGTTGCAGCTATAGCCGCACAAAAAGAGAAAAATTCAAGTTTTCAGAAATTTCAAAAATGGCTTGAATATTATGGACCATTTGATGCAGT
GATTGATGCTGCTAATGTAGGCCTATTTAGCCAAAGAAAATTCACACCATCTAAAGTCAATTTAATTGCCAACGGTATACGGCAGAAGCTTCCTTCAAAGAAGTG
GCCACTTATTGTATTGCATAACAGACGAATCAATGGACGAAAGATGGAAGAGCCAGTAAATAAAACCTTGATTGAGAAGTGGAAAAATGCTGATGCACTATATGC
AACGCCTACAGGATCAAATGATGATTGGTACTGGTTGTATGCAGCAATCAAATTCAAGTGCTTAATCGTGACAAATGATGAGATGAGAGATCACACATTCCAACT
TCTTGGAAATGATTTCTTCCCTAGATGGAAAGAAAGGCATCAAGTGCATTTTAGTTTCTCTGCTACTGGTCCAGTATTTCACATGCCTCCACCCTGTTCTGTCAT
AATTCAGGAATCAGAGAAAGGGCATTGGCATGTGCCCCTTGCATCAGAGCATAGTTATGAAGAAGATAGAAAATGGTTGTGTATTACAAGGGGGAATTCACAATC
AAACATGATGAGGCCAGGGCCTCCACTCAAAGTTGAAGAACCGCAGCCTCTTCTTCTCAGCAAAGGAAATCTAGGAGCTCAAGCTGACGTTGAGATCAAGAAACA
ACCCTCAAGTCAAGTCCACACGAGAAATTCTTCACAGGAAAATTACAAAAGTCTCAAACAAATACTCTCTGCAGCTGTGTTCTTGAACGAATGCAACTTACTATC
AGAAATAGAGGCCGCAGAGAAGCTTGGTGGCTGCACCATAGACTTTCAAATTTGAAAAAAAAAAAGGAAAAAAGAGAGATATCTTCCATGGTTGGCCACCATACC
AAATTTTCCTTTTCCATTTGCGATCTCAATGATGTAACTGTAACTCAGATATTCTTCACCTAACATATCAATGAGTAGTGAAAAGAAAGGAAAAAAAGGTTGATC
ATAAGATCAGTGGTCACGAGCATGCACGTTTCAGAATTTTGGCATTCAGCAACGAAACCCATTCATAGAATAGCTTCTTGAAATGGCACGGTGAGGAGAATGTTG
AGCTCTATATCTTCTGGTGGTTTGAATGCAGAAGCGGAGGCTGAGGAAGCTAGTAGCTTTCAGACTGCTCTCCTGGAACTAAAAGGGTTGCGTTCCCAGCTTCAT
CAAGCGGCAGACTACTGTGAAACAACGTTCCTGAAAACTAAAGAGAAGAATGAAGTGGTGGAAAACACGAAAGAGTACGTATGCAGGGCCATGGTAACTATGGTT
GATCATCTTGGAAATGTCACTTCCAATTTAGAGAGCTGCATTTCTCAGACTAATGCCTTCAACGAGGTCGAGCTTCGATTAAACTGCTTAAACCAAAGGCTCCTC
TCCTGCAAACAGTATGCTCAAAAACTTGAACTATCCCGACTACGTTGGAGCGAAATCCTCCCTAGGTATCATGCGCGCTACATCTCTCCCGTTAGCAAAAATGCT
GAGCAATTAACTCGAAGTTCGAGTATTGGCCAGTGGAGAGACTCGAATGACCTTCCATTTGGGAAGACTGTTGGTAACTGCAAGCCAGACATTTTTACAGAGAAG
CCAATGCCAGTTCTGTTGTACAAATTCTACACCTATAATCTATCTCCTTCCAAAAACTTGAGTCATGGATTCACTTCAAAGAAAGATGACAAGGAGAAAACATTA
GGTACACTTCAAAGAGCAATATTACCAGCTGGTGATAATGGGTTTTCAGTGCGATCGAAAGGTCCCAATTCTACATTTTATTTCCAGGTTTCTAATCAGAAGCGT
GGACGCCAAAGAAAATCAAAGCTTGGGAGTGATATATTTTCCCTTCTAAAACGAACTAAACAAATAGCTTAACAAAAGATAAGCTTTGAGAATGCATTCCAATAA
TTCCTTGAGCATCCATACATTTTTGTAAATATATTATAACCTTTATCTTTAAGGGTTGACTGAATATTAGCGCAAGATGGAACCAAAATTTGATAACTATTTGAT
TTTTTAGTTTTTAAAAATTAACATGTTTCTTTGTTTTATTATTCACTTTA
Protein sequence
MNAEELGKQGERMGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEPGSVSPTRRRDVHRYISDFDHSGGLTRSREFGGGRDLGRYRDTSP
HYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTL
SEYRSPPRSWARDGPRDIAAGGLAPPRYESRYPDHLRRDRVDYLEDSFRGRSKFDRPVPSADWTLRDNGRDDFITERKGFERRPPSPPLSLLPQRGRWAREVRER
SRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGVLPEHPHFQSQPPCQQSESRAVMVVQSQSQPQSPQHLTANILCSENCKCMDCK
NFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTSSPFPVAHVGNGSPAASGPSK
FSFRSLLADLIQPHDLKELCSVLVVFSREVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRSDRSVHDNSNSDGLDITKARPMSPGT
LALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETKFTSRSEVGNENLSNNFTSNNGCHQRSISNGVVKNVA
LSAHRITPVGTAARHPNSDLLLKILPIPKNSKSKPQVDREV