CuGenDBv2

Clc06G09510 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G09510
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein tesmin/TSO1-like CXC 5
Genome location: ClcChr06:12373619..12389389
RNA-Seq Expression: Clc06G09510
Synteny: Clc06G09510
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001876 - Zinc finger, RanBP2-type
IPR028307 - Lin-54 family
IPR036443 - Zinc finger, RanBP2-type superfamily
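
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span of the locus could be derived as follows. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr06:12373619..12389389")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)  # ClcChr06 15771
```

Under that assumption the gene locus covers 15,771 bp, which is much longer than the 2,334 nt coding sequence listed further down, consistent with a gene that contains introns.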


Homology
GenBank top hits (e value, % identity, alignment)
XP_004142517.1 uncharacterized protein LOC101209122 isoform X2 [Cucumis sativus] (e value: 2.3e-187; % identity: 90.08)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYE       GSVSP RRRDVHRY+S+FDHS GL R REFGGGRDL RYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        Y RR+SGGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY G PSLHSPPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQ
        PGRTL+EYRSPPRSWARDG RE+AAGGLAPPRYE SRYSDHLRRDRVDYLEDSFRGRSKFDRP+PSADWALRDNGRDDFITERKGFERRPPSPPL LLPQ
Subjt:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQ

Query:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        RGRW+R+VR+RSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_008462733.1 PREDICTED: uncharacterized protein LOC103501026 isoform X1 [Cucumis melo] (e value: 3.7e-185; % identity: 85.64)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSP RRRDVHRYIS+F+HS GLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R REFGGGRDL RYRDTSPHY RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI
        G PSLH+PPRRFAAHP+ERSPGRTL+EYRSPPRSWARDG RE+AAGGLAPPRYE SRYSDHLRRDRVDYLEDSFRGRSKFDRP+PSADWALRDNGRDDFI
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI

Query:  TERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        TERKGFERRPPSPPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  TERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_008462734.1 PREDICTED: uncharacterized protein LOC103501026 isoform X2 [Cucumis melo] (e value: 1.8e-187; % identity: 90.36)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE       GSVSP RRRDVHRYIS+F+HS GLAR REFGGGRDL RYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        Y RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY G PSLH+PPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQ
        PGRTL+EYRSPPRSWARDG RE+AAGGLAPPRYE SRYSDHLRRDRVDYLEDSFRGRSKFDRP+PSADWALRDNGRDDFITERKGFERRPPSPPL LLPQ
Subjt:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQ

Query:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        RGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida] (e value: 8.8e-187; % identity: 87.76)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSPTRRRDVHRYISDFDHSG L 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R R+FGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGP  A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYV
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI
        G PSLHSPPRRFAAHP+ERSPGRTL+EYRSPPRSWARDGPREIAAGGLAPPRYE SRYSDHLRRDRVDYL+DSFRGRSKFDRP+PSADWALRDNGRDDFI
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI

Query:  TERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        +ERKGFERRPPS PPLSLLPQRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
Subjt:  TERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

XP_038880195.1 uncharacterized protein LOC120071861 isoform X3 [Benincasa hispida] (e value: 4.2e-189; % identity: 92.58)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYE       GSVSPTRRRDVHRYISDFDHSG L R R+FGGGRDLGRYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        YSRRISGGRPFGRGVDGP  A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYVG PSLHSPPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPS-PPLSLLP
        PGRTL+EYRSPPRSWARDGPREIAAGGLAPPRYE SRYSDHLRRDRVDYL+DSFRGRSKFDRP+PSADWALRDNGRDDFI+ERKGFERRPPS PPLSLLP
Subjt:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPS-PPLSLLP

Query:  QRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        QRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
Subjt:  QRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X1 (e value: 1.8e-185; % identity: 85.64)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSP RRRDVHRYIS+F+HS GLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R REFGGGRDL RYRDTSPHY RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI
        G PSLH+PPRRFAAHP+ERSPGRTL+EYRSPPRSWARDG RE+AAGGLAPPRYE SRYSDHLRRDRVDYLEDSFRGRSKFDRP+PSADWALRDNGRDDFI
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI

Query:  TERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        TERKGFERRPPSPPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  TERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A1S3CHP1 uncharacterized protein LOC103501026 isoform X2 (e value: 8.6e-188; % identity: 90.36)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE       GSVSP RRRDVHRYIS+F+HS GLAR REFGGGRDL RYRDTSPH
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE------PGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS
        Y RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY G PSLH+PPRRFAAHP+ERS
Subjt:  YSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVG-PSLHSPPRRFAAHPVERS

Query:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQ
        PGRTL+EYRSPPRSWARDG RE+AAGGLAPPRYE SRYSDHLRRDRVDYLEDSFRGRSKFDRP+PSADWALRDNGRDDFITERKGFERRPPSPPL LLPQ
Subjt:  PGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQ

Query:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        RGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  RGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A5D3E321 TATA-binding protein-associated factor 2N (e value: 1.8e-185; % identity: 85.64)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSP RRRDVHRYIS+F+HS GLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        R REFGGGRDL RYRDTSPHY RR+ GGRPFGRGVDGP LA GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI
        G PSLH+PPRRFAAHP+ERSPGRTL+EYRSPPRSWARDG RE+AAGGLAPPRYE SRYSDHLRRDRVDYLEDSFRGRSKFDRP+PSADWALRDNGRDDFI
Subjt:  G-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFI

Query:  TERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        TERKGFERRPPSPPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRV LRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+G
Subjt:  TERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X7 (e value: 2.4e-182; % identity: 85.71)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSPTRRRD HRYISDFDHS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        R REFGGGRDLGRYRDTSPHYSRR+SGGRPFGRG DGPG A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDD
        VG PSLHSPPRRFAAHP+ERSPGRT++EYRSPPR WARDGPREIAAGGLAPPRYE SRYSD HLRRDRVDYLEDSFRGRSKFDR +P +DW+LRDNGRDD
Subjt:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDD

Query:  FITERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        FI ERKGFERRPP PPLSLLPQRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDG
Subjt:  FITERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X6 (e value: 2.3e-180; % identity: 85.49)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE                           GSVSPTRRRD HRYISDFDHS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYE--------------------------PGSVSPTRRRDVHRYISDFDHSGGLA

Query:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        R REFGGGRDLGRYRDTSPHYSRR+SGGRPFGRG DGPG A GPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RSREFGGGRDLGRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDD
        VG PSLHSPPRRFAAHP+ERSPGRT++EYRSPPR WARDGPREIAAGGLAPPRYE SRYSD HLRRDRVDYLEDSFRGRSKFDR +P +DW+LRDNGRDD
Subjt:  VG-PSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSD-HLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDD

Query:  FITERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG
        FI ERKGFERRP S PPLSLLPQRGRWAR+VRERSRSPIRGPVRSPLRV LRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDG
Subjt:  FITERKGFERRPPS-PPLSLLPQRGRWAREVRERSRSPIRGPVRSPLRVSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDG

SwissProt top hits (e value, % identity, alignment)
F4JY84 Protein tesmin/TSO1-like CXC 7 (e value: 6.0e-29; % identity: 33.76)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSI-PQFHQANNVMSSGTST
        ANILCSENC+C DCKNFEGSEER+AL HG   ++  YIQQ  NAA+  AI  S Y   P S+KRK  ++       DS ++S   Q+ +AN+V  +G ++
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSI-PQFHQANNVMSSGTST

Query:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNR
            P       + A SG +  ++RS  ++  QPH ++ELCS+LV  S +VA K++++    K    P  +  A    +E                  N 
Subjt:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNR

Query:  SDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSE-VYIEQERIVLTKFRDCLNKLITLGEIKETKFTCR
        S   V D +  D       +P+SP T ALMCDE   +      ++        TS    D   S  +Y+EQER +L+ FRD L +L           + R
Subjt:  SDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSE-VYIEQERIVLTKFRDCLNKLITLGEIKETKFTCR

Query:  SEVGNENLSNN
        + +  +N+ +N
Subjt:  SEVGNENLSNN

Q28009 RNA-binding protein FUS (e value: 3.5e-05; % identity: 31.91)
Query:  GRPFGRGVDGPGLASGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTL
        G P GRG  G G + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G GG P   ++G + +   RR      +R   R  
Subjt:  GRPFGRGVDGPGLASGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTL

Query:  SEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDR
           R   R     G R    GG  P + +S       RR+R
Subjt:  SEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDR

Q92804 TATA-binding protein-associated factor 2N (e value: 1.4e-06; % identity: 42.67)
Query:  GRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-RGYVG
        G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN PR      +GG  R RGY G
Subjt:  GRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-RGYVG
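
The % identity column can be checked against the alignment itself. The sketch below assumes the usual BLAST convention: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and identity is the number of identical columns divided by the alignment length, gaps included. `percent_identity` is a hypothetical helper, and the two strings are copied from the Q92804 alignment above.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter (identical residue)."""
    columns = len(query)  # gap characters ('-') in the query still count as columns
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / columns, 2)


query = "GRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-RGYVG"
midline = "G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN PR      +GG  R RGY G"
print(percent_identity(query, midline))  # 42.67
```

Here 32 identical residues over 75 alignment columns reproduce the 42.67 listed for this hit. Because only the letters on the middle line are counted, the result does not depend on the exact spacing of that line.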

Q9SL70 Protein tesmin/TSO1-like CXC 6 (e value: 8.9e-57; % identity: 44.92)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS
        ANILCSENCKC+DCKNFEGSE RQ+LFHG+H++NLAY+Q  ANAAITGAIGSSG+A  P  K+RKG E+ F    KDS   S  +  QANN  ++ + T 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS

Query:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS
        S         G  A+ GPSK  ++SLLA++I+P D+K LCSVLV  + E AK + E+R A ++     + S ASS  D+                 +N++
Subjt:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS

Query:  DRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETKFTCRSE
        ++S  ++SN+D S   K R +SP TLALMCDERDTM M A     S       +S +P+    +VY EQE++VLTKFRDCLN++I+ GE+KE+  +    
Subjt:  DRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETKFTCRSE

Query:  VGNENLSNNFTSNNGCQQRSISNGV
          +  +      +   QQ  ++NGV
Subjt:  VGNENLSNNFTSNNGCQQRSISNGV

Q9SZD1 Protein tesmin/TSO1-like CXC 5 (e value: 1.3e-71; % identity: 51.51)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST
        ANILCSENCKC+DCKNFEGSEERQALFHG+H+N++AY+QQAANAAITGA+GSSG+A  P  K+RKG E+ F    KDS  + +  F Q NN  + G TS 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST

Query:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-
        +SP PV+  G    A+S PSKF +RSLLAD+IQPHD++ LCSVLV  + E AK   ++RN  E +++D  + S ASS  D+ Q       AAD E  ++ 
Subjt:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-

Query:  -NRSDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK
         N++D+S  + SNSD  D +K  P+SP TLALMCDE+DT+FM  A   +GS   + C  +S    +  SE+Y EQER+VLTKFRDCLN+LI+  EIKE+K
Subjt:  -NRSDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK

Query:  --FTCRSEVGNENLSNNFTSNNGCQQRSISNG
             R  +    ++   T N   QQ  I NG
Subjt:  --FTCRSEVGNENLSNNFTSNNGCQQRSISNG

Arabidopsis top hits (e value, % identity, alignment)
AT2G20110.1 Tesmin/TSO1-like CXC domain-containing protein (e value: 6.3e-58; % identity: 44.92)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS
        ANILCSENCKC+DCKNFEGSE RQ+LFHG+H++NLAY+Q  ANAAITGAIGSSG+A  P  K+RKG E+ F    KDS   S  +  QANN  ++ + T 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS

Query:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS
        S         G  A+ GPSK  ++SLLA++I+P D+K LCSVLV  + E AK + E+R A ++     + S ASS  D+                 +N++
Subjt:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS

Query:  DRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETKFTCRSE
        ++S  ++SN+D S   K R +SP TLALMCDERDTM M A     S       +S +P+    +VY EQE++VLTKFRDCLN++I+ GE+KE+  +    
Subjt:  DRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETKFTCRSE

Query:  VGNENLSNNFTSNNGCQQRSISNGV
          +  +      +   QQ  ++NGV
Subjt:  VGNENLSNNFTSNNGCQQRSISNGV

AT2G20110.2 Tesmin/TSO1-like CXC domain-containing protein (e value: 1.1e-57; % identity: 48.11)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS
        ANILCSENCKC+DCKNFEGSE RQ+LFHG+H++NLAY+Q  ANAAITGAIGSSG+A  P  K+RKG E+ F    KDS   S  +  QANN  ++ + T 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTS

Query:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS
        S         G  A+ GPSK  ++SLLA++I+P D+K LCSVLV  + E AK + E+R A ++     + S ASS  D+                 +N++
Subjt:  SPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRS

Query:  DRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIK
        ++S  ++SN+D S   K R +SP TLALMCDERDTM M A     S       +S +P+    +VY EQE++VLTKFRDCLN++I+ GE+K
Subjt:  DRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIK

AT4G28990.1 RNA-binding protein-related (e value: 2.5e-46; % identity: 39.69)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR----------------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLARSREF
        MGS +K+ TT HH P +SSLVVRPS S   + G    G    G V R                G      S SP RR  + H++ SD +HSG   R RE 
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR----------------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLARSREF

Query:  GGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPS
           R+  GR+RD SP  +R  +G RP+ RG+DGP    G  R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP        
Subjt:  GGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPS

Query:  LHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERK
           P  R    P+  SP R  + YRSPPR W RD P         PPR++   + D   RDR  Y +  +    +      ++DWA  +         + 
Subjt:  LHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERK

Query:  GFERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDG
         ++RRPP  P    P+ GRW R +RERSRS P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G
Subjt:  GFERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDG

AT4G28990.2 RNA-binding protein-related (e value: 9.2e-41; % identity: 35.27)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR-----------------------------------------------------
        MGS +K+ TT HH P +SSLVVRPS S   + G    G    G V R                                                     
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNS---DGGGGGVGGTSGGRVGR-----------------------------------------------------

Query:  -----------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLARSREFGGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PN
                   G      S SP RR  + H++ SD +HSG   R RE    R+  GR+RD SP  +R  +G RP+ RG+DGP    G  R   S+NN   
Subjt:  -----------GSDYEPGSVSPTRR-RDVHRYISDFDHSGGLARSREFGGGRDL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLASGPFRGERSKNN-PN

Query:  VRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESS
        V+PR+GDWYC DPLC NLNFARRE C  C R R     SP           P  R    P+  SP R  + YRSPPR W RD P         PPR++  
Subjt:  VRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSWARDGPREIAAGGLAPPRYESS

Query:  RYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSG
         + D   RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+ GRW R +RERSRS P+R     PLR      L  G
Subjt:  RYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQRGRWAREVRERSRS-PIRGPVRSPLRVSLRSPLSSG

Query:  LPP--KDFRRDVFVERE-RDDRRGLGRDRDG
         PP  +D+RRD   +RE RDD RG GR R G
Subjt:  LPP--KDFRRDVFVERE-RDDRRGLGRDRDG

AT4G29000.1 Tesmin/TSO1-like CXC domain-containing protein (e value: 9.1e-73; % identity: 51.51)
Query:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST
        ANILCSENCKC+DCKNFEGSEERQALFHG+H+N++AY+QQAANAAITGA+GSSG+A  P  K+RKG E+ F    KDS  + +  F Q NN  + G TS 
Subjt:  ANILCSENCKCMDCKNFEGSEERQALFHGDHANNLAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSG-TST

Query:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-
        +SP PV+  G    A+S PSKF +RSLLAD+IQPHD++ LCSVLV  + E AK   ++RN  E +++D  + S ASS  D+ Q       AAD E  ++ 
Subjt:  SSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLVVFSSEVAKKIAEQRN-AEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSS-

Query:  -NRSDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK
         N++D+S  + SNSD  D +K  P+SP TLALMCDE+DT+FM  A   +GS   + C  +S    +  SE+Y EQER+VLTKFRDCLN+LI+  EIKE+K
Subjt:  -NRSDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFM-GAGLADGSAG-HDCNTSSHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETK

Query:  --FTCRSEVGNENLSNNFTSNNGCQQRSISNG
             R  +    ++   T N   QQ  I NG
Subjt:  --FTCRSEVGNENLSNNFTSNNGCQQRSISNG


Sequences
CDS sequence
ATGAATGCAGAAGAATTAGGCAAACTAGGCGAAAGGATGGGTTCAAGGGATAAAGACTCTACGACTCACCATCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAA
TAGTGACGGAGGTGGTGGTGGAGTCGGTGGAACCAGTGGCGGCCGCGTTGGTCGTGGAAGCGATTATGAGCCCGGTTCAGTTTCTCCAACACGCCGTCGGGATGTTCACC
GATATATTTCTGATTTTGATCATTCTGGTGGTCTCGCTCGCAGTCGTGAATTTGGTGGTGGGAGGGATCTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGA
ATAAGTGGTGGCAGGCCATTTGGAAGAGGTGTTGATGGCCCTGGACTTGCTTCTGGGCCATTTCGTGGGGAACGTAGTAAAAACAATCCAAATGTGCGTCCTAGGGATGG
GGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGCAATAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAG
GTTATGTTGGTCCATCCCTGCATTCTCCTCCTAGGCGCTTTGCTGCGCACCCAGTTGAACGTTCTCCTGGCAGGACTCTTAGTGAATATAGGTCTCCTCCCCGTAGTTGG
GCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGCCGGTATTCCGATCACCTGCGAAGAGACAGGGTGGACTATCTAGAAGA
CAGCTTCAGAGGAAGATCGAAGTTCGATAGGCCAGTTCCTTCAGCAGATTGGGCCCTTAGAGACAATGGAAGGGATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAA
GGCCACCATCCCCACCACTGTCGTTGCTTCCTCAGCGTGGGCGCTGGGCGCGTGAAGTGAGGGAGAGGAGCCGTTCCCCAATCAGAGGTCCCGTCAGATCTCCATTAAGA
GTCTCGCTACGGTCTCCATTAAGTAGTGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTCGTTGAAAGGGAGCGGGATGATAGGCGTGGCCTAGGACGAGATCGCGA
TGGAGTCTTGCCGGAGCATCCTCATTTTCAGTCTCAGCCTCCGTGTCAACAATCAGAGTCACCGGCCGTCATGGTGGTGCAGAGTCAGTCGCAGCCACAGTCCCCACAAC
ATTTGACGGCCAATATTCTCTGCTCTGAAAACTGCAAGTGCATGGACTGTAAGAATTTTGAAGGCAGTGAAGAGAGACAGGCTCTTTTCCATGGTGACCATGCCAACAAC
TTGGCTTATATTCAACAGGCAGCAAATGCTGCAATAACTGGAGCTATTGGATCCTCTGGTTATGCTTGCCTTCCCACTTCAAAGAAAAGAAAAGGTCCAGAGCTATGCTT
TGGCCCTGTAGGGAAGGATTCCCCCCTCAACAGCATACCACAATTTCATCAGGCAAATAATGTAATGTCTTCAGGCACGTCTACTTCATCTCCCTTCCCAGTTGCTCATG
TTGGAAATGGCAGCCCTGCAGCATCGGGGCCTTCAAAGTTCTCATTCAGGTCCTTATTAGCTGACCTCATCCAACCGCATGACTTGAAGGAGCTTTGCTCAGTTTTAGTG
GTGTTTTCAAGTGAAGTTGCCAAGAAAATAGCAGAACAAAGAAATGCTGAGAAACAGATAAATGACCCACCACAGATTTCTCGTGCTTCATCTACTGTTGATGAGTCACA
GCATCAGAAGGCCGAAGAGAAAGCTGCAGATGGCGAGTGTGGAAGTTCAAACCGAAGTGATAGGAGTGTACATGATAATTCCAATTCAGATAGTTCGGATATTACAAAAG
CAAGGCCGATGTCCCCTGGAACTTTAGCTTTAATGTGTGATGAACGAGATACAATGTTCATGGGAGCTGGTCTAGCTGATGGGTCGGCAGGCCATGATTGCAACACATCA
TCACACATGCCTGATAAGTCTGTGTCAGAAGTCTACATCGAGCAAGAACGGATTGTGTTGACAAAGTTTCGAGATTGTCTTAATAAACTCATCACTCTGGGAGAGATAAA
AGAAACAAAATTCACTTGTAGAAGCGAAGTGGGGAATGAAAATCTCAGCAACAATTTCACCTCAAATAATGGGTGCCAACAGAGATCTATTAGCAATGGGGTTGTAAAAA
ATGTGGCTCTCTCGGCACACAGAATACCGCCAGTCGGCGCTGCAGCTCGTCATCCAAATAACGATCTCCTACTCAAAATTCTACCACTTCCTAAAAATAGTAAGAGTAAA
CCACAAGTTGACAGAGAAGTCTAG
mRNA sequence
ATGAATGCAGAAGAATTAGGCAAACTAGGCGAAAGGATGGGTTCAAGGGATAAAGACTCTACGACTCACCATCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAA
TAGTGACGGAGGTGGTGGTGGAGTCGGTGGAACCAGTGGCGGCCGCGTTGGTCGTGGAAGCGATTATGAGCCCGGTTCAGTTTCTCCAACACGCCGTCGGGATGTTCACC
GATATATTTCTGATTTTGATCATTCTGGTGGTCTCGCTCGCAGTCGTGAATTTGGTGGTGGGAGGGATCTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGA
ATAAGTGGTGGCAGGCCATTTGGAAGAGGTGTTGATGGCCCTGGACTTGCTTCTGGGCCATTTCGTGGGGAACGTAGTAAAAACAATCCAAATGTGCGTCCTAGGGATGG
GGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGCAATAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAG
GTTATGTTGGTCCATCCCTGCATTCTCCTCCTAGGCGCTTTGCTGCGCACCCAGTTGAACGTTCTCCTGGCAGGACTCTTAGTGAATATAGGTCTCCTCCCCGTAGTTGG
GCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGCCGGTATTCCGATCACCTGCGAAGAGACAGGGTGGACTATCTAGAAGA
CAGCTTCAGAGGAAGATCGAAGTTCGATAGGCCAGTTCCTTCAGCAGATTGGGCCCTTAGAGACAATGGAAGGGATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAA
GGCCACCATCCCCACCACTGTCGTTGCTTCCTCAGCGTGGGCGCTGGGCGCGTGAAGTGAGGGAGAGGAGCCGTTCCCCAATCAGAGGTCCCGTCAGATCTCCATTAAGA
GTCTCGCTACGGTCTCCATTAAGTAGTGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTCGTTGAAAGGGAGCGGGATGATAGGCGTGGCCTAGGACGAGATCGCGA
TGGAGTCTTGCCGGAGCATCCTCATTTTCAGTCTCAGCCTCCGTGTCAACAATCAGAGTCACCGGCCGTCATGGTGGTGCAGAGTCAGTCGCAGCCACAGTCCCCACAAC
ATTTGACGGCCAATATTCTCTGCTCTGAAAACTGCAAGTGCATGGACTGTAAGAATTTTGAAGGCAGTGAAGAGAGACAGGCTCTTTTCCATGGTGACCATGCCAACAAC
TTGGCTTATATTCAACAGGCAGCAAATGCTGCAATAACTGGAGCTATTGGATCCTCTGGTTATGCTTGCCTTCCCACTTCAAAGAAAAGAAAAGGTCCAGAGCTATGCTT
TGGCCCTGTAGGGAAGGATTCCCCCCTCAACAGCATACCACAATTTCATCAGGCAAATAATGTAATGTCTTCAGGCACGTCTACTTCATCTCCCTTCCCAGTTGCTCATG
TTGGAAATGGCAGCCCTGCAGCATCGGGGCCTTCAAAGTTCTCATTCAGGTCCTTATTAGCTGACCTCATCCAACCGCATGACTTGAAGGAGCTTTGCTCAGTTTTAGTG
GTGTTTTCAAGTGAAGTTGCCAAGAAAATAGCAGAACAAAGAAATGCTGAGAAACAGATAAATGACCCACCACAGATTTCTCGTGCTTCATCTACTGTTGATGAGTCACA
GCATCAGAAGGCCGAAGAGAAAGCTGCAGATGGCGAGTGTGGAAGTTCAAACCGAAGTGATAGGAGTGTACATGATAATTCCAATTCAGATAGTTCGGATATTACAAAAG
CAAGGCCGATGTCCCCTGGAACTTTAGCTTTAATGTGTGATGAACGAGATACAATGTTCATGGGAGCTGGTCTAGCTGATGGGTCGGCAGGCCATGATTGCAACACATCA
TCACACATGCCTGATAAGTCTGTGTCAGAAGTCTACATCGAGCAAGAACGGATTGTGTTGACAAAGTTTCGAGATTGTCTTAATAAACTCATCACTCTGGGAGAGATAAA
AGAAACAAAATTCACTTGTAGAAGCGAAGTGGGGAATGAAAATCTCAGCAACAATTTCACCTCAAATAATGGGTGCCAACAGAGATCTATTAGCAATGGGGTTGTAAAAA
ATGTGGCTCTCTCGGCACACAGAATACCGCCAGTCGGCGCTGCAGCTCGTCATCCAAATAACGATCTCCTACTCAAAATTCTACCACTTCCTAAAAATAGTAAGAGTAAA
CCACAAGTTGACAGAGAAGTCTAG
Protein sequence
MNAEELGKLGERMGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEPGSVSPTRRRDVHRYISDFDHSGGLARSREFGGGRDLGRYRDTSPHYSRR
ISGGRPFGRGVDGPGLASGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPSLHSPPRRFAAHPVERSPGRTLSEYRSPPRSW
ARDGPREIAAGGLAPPRYESSRYSDHLRRDRVDYLEDSFRGRSKFDRPVPSADWALRDNGRDDFITERKGFERRPPSPPLSLLPQRGRWAREVRERSRSPIRGPVRSPLR
VSLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGVLPEHPHFQSQPPCQQSESPAVMVVQSQSQPQSPQHLTANILCSENCKCMDCKNFEGSEERQALFHGDHANN
LAYIQQAANAAITGAIGSSGYACLPTSKKRKGPELCFGPVGKDSPLNSIPQFHQANNVMSSGTSTSSPFPVAHVGNGSPAASGPSKFSFRSLLADLIQPHDLKELCSVLV
VFSSEVAKKIAEQRNAEKQINDPPQISRASSTVDESQHQKAEEKAADGECGSSNRSDRSVHDNSNSDSSDITKARPMSPGTLALMCDERDTMFMGAGLADGSAGHDCNTS
SHMPDKSVSEVYIEQERIVLTKFRDCLNKLITLGEIKETKFTCRSEVGNENLSNNFTSNNGCQQRSISNGVVKNVALSAHRIPPVGAAARHPNNDLLLKILPLPKNSKSK
PQVDREV
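
The protein sequence corresponds to the CDS read codon by codon. As a rough sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the first and last stretches of the CDS above can be translated and compared with the start and end of the protein. `translate` is a hypothetical helper, not a CuGenDB tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and the final 24 nt of the CDS listed above.
print(translate("ATGAATGCAGAAGAATTAGGCAAACTAGGC"))  # MNAEELGKLG
print(translate("CCACAAGTTGACAGAGAAGTCTAG"))        # PQVDREV*
```

Both fragments match the listed protein, which begins `MNAEELGKLG` and ends `PQVDREV`; the trailing `*` is the TAG stop codon, which is not written in the protein sequence.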