; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G33850 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G33850
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptiondapper homolog 3 isoform X1
Genome locationChr1:28797595..28803763
RNA-Seq ExpressionCSPI01G33850
SyntenyCSPI01G33850
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142516.1 uncharacterized protein LOC101209122 isoform X1 [Cucumis sativus]1.5e-22799.74Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVR+RSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_004142517.1 uncharacterized protein LOC101209122 isoform X2 [Cucumis sativus]3.3e-21194.55Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEA                    GYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVR+RSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_008462733.1 PREDICTED: uncharacterized protein LOC103501026 isoform X1 [Cucumis melo]2.9e-22397.66Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_008462734.1 PREDICTED: uncharacterized protein LOC103501026 isoform X2 [Cucumis melo]1.0e-20792.73Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEA                    GYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida]3.1e-20992.49Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRG+DYEAGE+ RDPPQYSRLDRYSDDLGYRIHAGSVSP RRRDVHRY+S+FDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGR+FGGGRDL RYRDTSPHY RR+SGGRPFGRGVDGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR G  GSPRRGY 
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAPPRYESRYSDHLRRDRVDYL+DSFRGRSKFDRPLPSADWALRDNGRDDFI+
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPS PPL LLPQRGRW+RDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

TrEMBL top hitse value%identityAlignment
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X11.4e-22397.66Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A1S3CHP1 uncharacterized protein LOC103501026 isoform X24.9e-20892.73Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEA                    GYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A5D3E321 TATA-binding protein-associated factor 2N1.4e-22397.66Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X77.7e-20691.21Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSP RRRD HRY+S+FDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY
        RGREFGGGRDL RYRDTSPHY RRVSGGRPFGRG DGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRR GY
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY

Query:  AGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
         GPPSLHSPPRRFAAHPIERSPGRT+NEYRSPPR WARDG RE+AAGGLAPPRYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  AGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        I ERKGFERRPP PPL LLPQRGRW+RDVRERSRSPIRGP+RSPLRVPLRSPLS GLPPKD+RRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X67.3e-20490.98Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSP RRRD HRY+S+FDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY
        RGREFGGGRDL RYRDTSPHY RRVSGGRPFGRG DGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRR GY
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY

Query:  AGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
         GPPSLHSPPRRFAAHPIERSPGRT+NEYRSPPR WARDG RE+AAGGLAPPRYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  AGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        I ERKGFERRP S PPL LLPQRGRW+RDVRERSRSPIRGP+RSPLRVPLRSPLS GLPPKD+RRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ITERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

SwissProt top hitse value%identityAlignment
P56959 RNA-binding protein FUS1.9e-0435.14Show/hide
Query:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG
        G+EF G         R  D  R      G R  GG P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G
Subjt:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG

Query:  -GGSPRRGYAG
         GG P   + G
Subjt:  -GGSPRRGYAG

Q01844 RNA-binding protein EWS1.9e-0441.86Show/hide
Query:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGP
        GR  GRG D     P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+  G   P        RG  GP
Subjt:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGP

Q28009 RNA-binding protein FUS3.3e-0432Show/hide
Query:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG
        G+EF G         R  D  R      G R  GG P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G
Subjt:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG

Query:  -GGSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR
         GG P   + G    +   RR      +R   R     R   R   R G      GG  P + +SR  +H R+DR
Subjt:  -GGSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR

Q61545 RNA-binding protein EWS1.9e-0441.86Show/hide
Query:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGP
        GR  GRG D     P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+  G   P        RG  GP
Subjt:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGP

Q92804 TATA-binding protein-associated factor 2N8.5e-0836.19Show/hide
Query:  GREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRP-----RTGGGGSPR
        G+EF G      +    P + R        G G  G R   G +RG          P+ GDW C +P C N+NFARR  CN CN P     R  GG    
Subjt:  GREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRP-----RTGGGGSPR

Query:  RGYAG
        RGY G
Subjt:  RGYAG

Arabidopsis top hitse value%identityAlignment
AT4G28990.1 RNA-binding protein-related9.5e-5541.31Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRY-SDDLGYRIHAGSVSPPRR-RDVHRYVSNFDHSE
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D+ G+R  A S SP RR  + H++ S+ +HS 
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRY-SDDLGYRIHAGSVSPPRR-RDVHRYVSNFDHSE

Query:  GLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP
           RGRE    R+   R+RD SP   R  +G RP+ RG+DGP   P   R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP
Subjt:  GLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP

Query:  RRGYAGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR
                    P  R    P+  SP R  N YRSPPR W RD           PPR++        RDR  Y +  +    +      ++DWA  +   
Subjt:  RRGYAGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR

Query:  DDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF
              +  ++RRPP  P    P+ GRW R +RERSRSP   P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  DDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF

AT4G28990.2 RNA-binding protein-related7.0e-5037.08Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDD-------------------------------
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDD-------------------------------

Query:  ------------------LGYRIHAGSVSPPRR-RDVHRYVSNFDHSEGLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGE
                          LG+R  A S SP RR  + H++ S+ +HS    RGRE    R+   R+RD SP   R  +G RP+ RG+DGP   P   R  
Subjt:  ------------------LGYRIHAGSVSPPRR-RDVHRYVSNFDHSEGLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGE

Query:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGG
         S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP            P  R    P+  SP R  N YRSPPR W RD         
Subjt:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGG

Query:  LAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPL
          PPR++        RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+ GRW R +RERSRSP   P+R     PL
Subjt:  LAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPL

Query:  RS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF
        R      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  RS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF

AT5G58470.1 TBP-associated factor 15B1.5e-0432.93Show/hide
Query:  EGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        +G   G  +GGG      R  S    YG R   G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  EGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP

AT5G58470.2 TBP-associated factor 15B1.5e-0432.93Show/hide
Query:  EGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        +G   G  +GGG      R  S    YG R   G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  EGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSLHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCCAGGGATAAAGATTCTACCACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGCCCTTCCAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGCGTTGGTCGTGGAACCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATAC
ATGCAGGTTCAGTTTCTCCACCACGCCGTCGGGATGTTCATCGATACGTTTCTAATTTTGATCATTCTGAGGGTCTCACACGAGGTCGTGAATTTGGTGGTGGGAGGGAT
CTTGATAGATATCGTGATACTTCACCTCACTATGGTCGAAGAGTAAGTGGTGGTAGGCCATTTGGGAGAGGTGTGGATGGCCCTAGACTTGCTCCTGGGCCATTTCGAGG
GGAACGCAGTAAAAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTCTGTAACAACT
GCAACAGACCTCGCACTGGAGGTGGTGGAAGTCCTCGAAGAGGCTATGCTGGTCCACCATCCCTGCATTCTCCTCCTAGACGTTTCGCTGCCCACCCAATTGAACGTTCT
CCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAGCTGGGCGAGGGATGGTTCTAGGGAGATGGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTA
TTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTCGAAGACAGCTTCAGAGGAAGATCCAAGTTCGATAGACCACTTCCTTCAGCAGATTGGGCCCTTAGAGACAATG
GAAGGGATGACTTCATCACAGAGAGGAAGGGATTCGAAAGAAGGCCACCATCCCCACCACTGCCGTTGCTTCCTCAGCGTGGACGCTGGTCACGTGATGTTAGAGAGAGG
AGCCGTTCTCCCATCAGAGGTCCAATCAGATCTCCACTAAGAGTCCCATTACGGTCTCCATTAAGTAGTGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTCGGCGA
AAGGGAGCGCGATGATAGGCGTGGCCTAGGACGAGATCGTGAGGGAGGTCCATTTTAG
mRNA sequenceShow/hide mRNA sequence
GCAGCGGAGGCGTACTCCCTAGCGAAGGAGACGTACCCTTTTACGTAATTTCTCATGCTGAATTCCTCACATCTAGAATTTAGGGTTTATGAATATACACCCCACCTTTT
TCAACCGATTGATCGGCGTTCACACTTAGGACCTACCATTTTCGTCGCTGCTCATCGTAGCAGGCAGAAGGATTAACCAAACAACGCCGAATAATGGGTTCCAGGGATAA
AGATTCTACCACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGCCCTTCCAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTGGCGGCCGCGTTGGTC
GTGGAACCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATACATGCAGGTTCAGTTTCT
CCACCACGCCGTCGGGATGTTCATCGATACGTTTCTAATTTTGATCATTCTGAGGGTCTCACACGAGGTCGTGAATTTGGTGGTGGGAGGGATCTTGATAGATATCGTGA
TACTTCACCTCACTATGGTCGAAGAGTAAGTGGTGGTAGGCCATTTGGGAGAGGTGTGGATGGCCCTAGACTTGCTCCTGGGCCATTTCGAGGGGAACGCAGTAAAAATA
ATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTCTGTAACAACTGCAACAGACCTCGCACT
GGAGGTGGTGGAAGTCCTCGAAGAGGCTATGCTGGTCCACCATCCCTGCATTCTCCTCCTAGACGTTTCGCTGCCCACCCAATTGAACGTTCTCCTGGCAGGACTCTTAA
TGAATATAGGTCTCCTCCCCGTAGCTGGGCGAGGGATGGTTCTAGGGAGATGGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTATTCCGATCACCTGCGGA
GAGATAGGGTGGACTATCTCGAAGACAGCTTCAGAGGAAGATCCAAGTTCGATAGACCACTTCCTTCAGCAGATTGGGCCCTTAGAGACAATGGAAGGGATGACTTCATC
ACAGAGAGGAAGGGATTCGAAAGAAGGCCACCATCCCCACCACTGCCGTTGCTTCCTCAGCGTGGACGCTGGTCACGTGATGTTAGAGAGAGGAGCCGTTCTCCCATCAG
AGGTCCAATCAGATCTCCACTAAGAGTCCCATTACGGTCTCCATTAAGTAGTGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTCGGCGAAAGGGAGCGCGATGATA
GGCGTGGCCTAGGACGAGATCGTGAGGGAGGTCCATTTTAGTGTTGAAACTTGGTTACCCATGTAATTTTTTGGTTAGAACATAATGGGCAACCTGTAATATGTGTTTAG
AGCTGTGGAAAGAGGATGGTAAGAAGAGGGTACATGATTATTCTTGACTGGTCTGGTGTGTGGAAACTTGCTGATGGGGCGTTTTATCTGTTGCTCGTGTGGTTTTCGTG
TTTTTACCATTTTATGTAGTTAGCAGCCAAGCAGAAATAGTTGAGTAAATGCTTTAAAATACAGTATCTGTCAACTTCATTTGCCGTCTTTGGTTTATATTGGCTTCTGG
GCAATGGTTTTGAACCATCATTATCTTTGCACAATTTTAGATTTTTTAGATTTTTTTAGATGTGGC
Protein sequenceShow/hide protein sequence
MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPPRRRDVHRYVSNFDHSEGLTRGREFGGGRD
LDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSLHSPPRRFAAHPIERS
PGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSRDVRER
SRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF