CuGenDBv2

Chy2G041160 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy2G041160
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: TATA-binding protein-associated factor 2N
Genome location: chrH02:20794959..20800582
RNA-Seq Expression: Chy2G041160
Synteny: Chy2G041160
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004142516.1 uncharacterized protein LOC101209122 isoform X1 [Cucumis sativus] (e value: 4.88e-288, % identity: 97.92)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSP RRRDVHRY+SNFDHS+GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+HSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VR+RSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

XP_004142517.1 uncharacterized protein LOC101209122 isoform X2 [Cucumis sativus] (e value: 1.01e-266, % identity: 92.73)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEA                    GYR+HAGSVSP RRRDVHRY+SNFDHS+GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+HSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VR+RSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

XP_008462733.1 PREDICTED: uncharacterized protein LOC103501026 isoform X1 [Cucumis melo] (e value: 9.45e-286, % identity: 97.4)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYR+HAGSVSPARRRDVHRYISNF+HSDGL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+H+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

XP_008462734.1 PREDICTED: uncharacterized protein LOC103501026 isoform X2 [Cucumis melo] (e value: 1.68e-265, % identity: 92.47)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEA                    GYR+HAGSVSPARRRDVHRYISNF+HSDGL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+H+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida] (e value: 2.08e-266, % identity: 91.71)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRG+DYEAGE+ RDPPQYSRLDRYSDDLGYR+HAGSVSP RRRDVHRYIS+FDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGR+FGGGRDL RYRDTSPHY RR+SGGRPFGRGVDGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR G  GSPRRGY 
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+HSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAPPRYESRYSDHLRRDRVDYL+DSFRGRSKFDRPLPSADWALRDNGRDDFI+
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPP-LPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPP L LLPQRGRW+R+VRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRD +GGPF
Subjt:  ERKGFERRPPSPP-LPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X1 (e value: 5.3e-223, % identity: 97.4)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYR+HAGSVSPARRRDVHRYISNF+HSDGL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+H+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

A0A1S3CHP1 uncharacterized protein LOC103501026 isoform X2 (e value: 1.4e-207, % identity: 92.47)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEA                    GYR+HAGSVSPARRRDVHRYISNF+HSDGL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+H+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

A0A5D3E321 TATA-binding protein-associated factor 2N (e value: 5.3e-223, % identity: 97.4)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYR+HAGSVSPARRRDVHRYISNF+HSDGL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPS+H+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        ERKGFERRPPSPPLPLLPQRGRWSR+VRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRD EGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X7 (e value: 1.1e-204, % identity: 90.7)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSP RRRD HRYIS+FDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY
        RGREFGGGRDL RYRDTSPHY RRVSGGRPFGRG DGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRR GY
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY

Query:  AGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
         GPPS+HSPPRRFAAHPIERSPGRT+NEYRSPPR WARDG RE+AAGGLAPPRYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  AGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        I ERKGFERRPP PPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRVPLRSPLS GLPPKD+RRDVF ERERDDRRGLGRD +GGPF
Subjt:  ITERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X6 (e value: 1.0e-202, % identity: 90.46)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSP RRRD HRYIS+FDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLT

Query:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY
        RGREFGGGRDL RYRDTSPHY RRVSGGRPFGRG DGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRR GY
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY

Query:  AGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
         GPPS+HSPPRRFAAHPIERSPGRT+NEYRSPPR WARDG RE+AAGGLAPPRYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  AGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPS-PPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF
        I ERKGFERRP S PPL LLPQRGRW+R+VRERSRSPIRGP+RSPLRVPLRSPLS GLPPKD+RRDVF ERERDDRRGLGRD +GGPF
Subjt:  ITERKGFERRPPS-PPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF

SwissProt top hits (e value, % identity, alignment)
P35637 RNA-binding protein FUS (e value: 3.3e-04, % identity: 32)
Query:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG
        G+EF G         R  D  R      G R  GG P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G
Subjt:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG

Query:  -GGSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR
         GG P   + G    +   RR      +R   R     R   R   R G      GG  P + +SR  +H R+DR
Subjt:  -GGSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR

Q01844 RNA-binding protein EWS (e value: 1.1e-04, % identity: 41.57)
Query:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGPPSM
        GR  GRG D     P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+  G   P        RG  GP  M
Subjt:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGPPSM

Q28009 RNA-binding protein FUS (e value: 3.3e-04, % identity: 32)
Query:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG
        G+EF G         R  D  R      G R  GG P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G
Subjt:  GREFGGG--------RDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG

Query:  -GGSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR
         GG P   + G    +   RR      +R   R     R   R   R G      GG  P + +SR  +H R+DR
Subjt:  -GGSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR

Q61545 RNA-binding protein EWS (e value: 1.1e-04, % identity: 41.57)
Query:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGPPSM
        GR  GRG D     P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+  G   P        RG  GP  M
Subjt:  GRPFGRGVDGPRLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP-------RRGYAGPPSM

Q92804 TATA-binding protein-associated factor 2N (e value: 8.5e-08, % identity: 36.19)
Query:  GREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRP-----RTGGGGSPR
        G+EF G      +    P + R        G G  G R   G +RG          P+ GDW C +P C N+NFARR  CN CN P     R  GG    
Subjt:  GREFGGGRDLDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRP-----RTGGGGSPR

Query:  RGYAG
        RGY G
Subjt:  RGYAG

Arabidopsis top hits (e value, % identity, alignment)
AT4G28990.1 RNA-binding protein-related (e value: 2.1e-54, % identity: 41.06)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRY-SDDLGYRLHAGSVSPARR-RDVHRYISNFDHSD
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D+ G+R  A S SP RR  + H++ S+ +HS 
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRY-SDDLGYRLHAGSVSPARR-RDVHRYISNFDHSD

Query:  GLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP
           RGRE    R+   R+RD SP   R  +G RP+ RG+DGP   P   R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP
Subjt:  GLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP

Query:  RRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR
                    P  R    P+  SP R  N YRSPPR W RD           PPR++        RDR  Y +  +    +      ++DWA  +   
Subjt:  RRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR

Query:  DDFITERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDHEGGPF
              +  ++RRPP  P    P+ GRW R +RERSRSP   P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR   G  +
Subjt:  DDFITERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPLRS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDHEGGPF

AT4G28990.2 RNA-binding protein-related (e value: 1.6e-49, % identity: 36.85)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDD-------------------------------
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDD-------------------------------

Query:  ------------------LGYRLHAGSVSPARR-RDVHRYISNFDHSDGLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGE
                          LG+R  A S SP RR  + H++ S+ +HS    RGRE    R+   R+RD SP   R  +G RP+ RG+DGP   P   R  
Subjt:  ------------------LGYRLHAGSVSPARR-RDVHRYISNFDHSDGLTRGREFGGGRDL-DRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGE

Query:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGG
         S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP            P  R    P+  SP R  N YRSPPR W RD         
Subjt:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGG

Query:  LAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPL
          PPR++        RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+ GRW R +RERSRSP   P+R     PL
Subjt:  LAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSREVRERSRSPIRGPIRSPLRVPL

Query:  RS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDHEGGPF
        R      L  G PP  +D+RRD   +RE RDD RG GR   G  +
Subjt:  RS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDHEGGPF

AT5G58470.1 TBP-associated factor 15B (e value: 6.9e-05, % identity: 33.54)
Query:  DGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        DG   G  +GGG      R  S    YG R   G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  DGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP

AT5G58470.2 TBP-associated factor 15B (e value: 6.9e-05, % identity: 33.54)
Query:  DGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        DG   G  +GGG      R  S    YG R   G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  DGLTRGREFGGGRDLDRYRDTS--PHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSMHSPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP


Sequences
CDS sequence
ATGGGTTCCAGGGATAAAGATTCTACCACTCACCACCAGCCCTTATTGAGCAGCCTTGTTGTCCGGCCTTCCAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGCGTTGGTCGTGGAACCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGATTAC
ATGCAGGTTCAGTTTCTCCAGCACGTCGTCGGGATGTTCATCGATACATTTCTAATTTTGATCATTCTGATGGTCTCACTCGAGGTCGTGAATTTGGTGGTGGGAGGGAT
CTTGATAGATATCGTGATACTTCACCTCACTATGGTCGAAGAGTAAGTGGTGGTAGGCCATTTGGGAGAGGTGTGGATGGCCCTAGACTTGCTCCTGGGCCATTTCGAGG
GGAACGCAGTAAAAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTCTGTAACAACT
GCAACAGACCTCGCACTGGAGGTGGTGGAAGTCCTCGAAGAGGCTATGCTGGTCCACCATCCATGCATTCTCCTCCTAGACGTTTCGCTGCCCACCCAATTGAACGTTCT
CCTGGCAGGACTCTTAACGAATATAGGTCTCCTCCCCGTAGCTGGGCGAGGGATGGTTCTAGGGAGATGGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTA
TTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCCAAGTTCGATAGACCACTGCCTTCAGCAGATTGGGCCCTTAGAGACAATG
GAAGGGATGACTTCATCACAGAGAGGAAGGGATTCGAAAGAAGGCCACCATCCCCACCACTGCCATTGCTTCCTCAGCGTGGACGCTGGTCGCGTGAAGTGAGAGAGAGA
AGCCGTTCTCCCATCAGAGGTCCAATCAGATCTCCACTAAGAGTCCCACTAAGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTCGGCGA
AAGGGAGCGCGATGATAGGCGCGGCCTAGGACGAGATCATGAGGGAGGTCCATTTTAG
mRNA sequence
ATGGGTTCCAGGGATAAAGATTCTACCACTCACCACCAGCCCTTATTGAGCAGCCTTGTTGTCCGGCCTTCCAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGCGTTGGTCGTGGAACCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGATTAC
ATGCAGGTTCAGTTTCTCCAGCACGTCGTCGGGATGTTCATCGATACATTTCTAATTTTGATCATTCTGATGGTCTCACTCGAGGTCGTGAATTTGGTGGTGGGAGGGAT
CTTGATAGATATCGTGATACTTCACCTCACTATGGTCGAAGAGTAAGTGGTGGTAGGCCATTTGGGAGAGGTGTGGATGGCCCTAGACTTGCTCCTGGGCCATTTCGAGG
GGAACGCAGTAAAAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTCTGTAACAACT
GCAACAGACCTCGCACTGGAGGTGGTGGAAGTCCTCGAAGAGGCTATGCTGGTCCACCATCCATGCATTCTCCTCCTAGACGTTTCGCTGCCCACCCAATTGAACGTTCT
CCTGGCAGGACTCTTAACGAATATAGGTCTCCTCCCCGTAGCTGGGCGAGGGATGGTTCTAGGGAGATGGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTA
TTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCCAAGTTCGATAGACCACTGCCTTCAGCAGATTGGGCCCTTAGAGACAATG
GAAGGGATGACTTCATCACAGAGAGGAAGGGATTCGAAAGAAGGCCACCATCCCCACCACTGCCATTGCTTCCTCAGCGTGGACGCTGGTCGCGTGAAGTGAGAGAGAGA
AGCCGTTCTCCCATCAGAGGTCCAATCAGATCTCCACTAAGAGTCCCACTAAGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTCGGCGA
AAGGGAGCGCGATGATAGGCGCGGCCTAGGACGAGATCATGAGGGAGGTCCATTTTAG
Protein sequence
MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGTDYEAGEVPRDPPQYSRLDRYSDDLGYRLHAGSVSPARRRDVHRYISNFDHSDGLTRGREFGGGRD
LDRYRDTSPHYGRRVSGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSMHSPPRRFAAHPIERS
PGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSREVRER
SRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDHEGGPF