; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007929 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007929
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptiondapper homolog 3 isoform X1
Genome locationChr10:17451346..17456425
RNA-Seq ExpressionHG10007929
SyntenyHG10007929
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142516.1 uncharacterized protein LOC101209122 isoform X1 [Cucumis sativus]3.8e-21594.03Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSP RRRDVHRY+S+FDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        RGREFGGGR+L RYRDTSPHY RR+SGGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHSPPRRF AHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        ERKGFERRPPSPPL +LPQRGRWSRDVR+RSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_008462733.1 PREDICTED: uncharacterized protein LOC103501026 isoform X1 [Cucumis melo]2.1e-21393.51Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRYIS+F+HS  L 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        RGREFGGGR+L RYRDTSPHY RR+ GGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRF AHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        ERKGFERRPPSPPL +LPQRGRWSRDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_022961321.1 uncharacterized protein LOC111461847 isoform X6 [Cucurbita moschata]5.3e-20993.3Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRD HRYISDFDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        RGREFGGGR+LGRYRDTSPHYSRR+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
        VGPPSLHSPPRRF AHPIERSPGRT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPS-PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        I ERKGFERRP S PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPS-PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_022961322.1 uncharacterized protein LOC111461847 isoform X7 [Cucurbita moschata]5.7e-21193.54Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRD HRYISDFDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        RGREFGGGR+LGRYRDTSPHYSRR+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
        VGPPSLHSPPRRF AHPIERSPGRT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        I ERKGFERRPP PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida]7.6e-21695.6Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGE+ RDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSG+LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        RGR+FGGGR+LGRYRDTSPHYSRRISGGRPFGRGVDGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYV
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHSPPRRF AHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAP RYESRYSDHLRRDRVDYL+DSFRGRSKFDRPLPSADWALRDNGRDDFI+
Subjt:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPS-PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        ERKGFERRPPS PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
Subjt:  ERKGFERRPPS-PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

TrEMBL top hitse value%identityAlignment
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X11.0e-21393.51Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRYIS+F+HS  L 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        RGREFGGGR+L RYRDTSPHY RR+ GGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRF AHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        ERKGFERRPPSPPL +LPQRGRWSRDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A5D3E321 TATA-binding protein-associated factor 2N1.0e-21393.51Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRYIS+F+HS  L 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV
        RGREFGGGR+L RYRDTSPHY RR+ GGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY 
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYV

Query:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRF AHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        ERKGFERRPPSPPL +LPQRGRWSRDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X72.7e-21193.54Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRD HRYISDFDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        RGREFGGGR+LGRYRDTSPHYSRR+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
        VGPPSLHSPPRRF AHPIERSPGRT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        I ERKGFERRPP PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X62.6e-20993.3Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRD HRYISDFDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        RGREFGGGR+LGRYRDTSPHYSRR+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
        VGPPSLHSPPRRF AHPIERSPGRT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPS-PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        I ERKGFERRP S PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPS-PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1HBT6 uncharacterized protein LOC111461847 isoform X25.4e-20788.73Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRD HRYISDFDHS  LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLT

Query:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY
        RGREFGGGR+LGRYRDTSPHYSRR+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GY
Subjt:  RGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GY

Query:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
        VGPPSLHSPPRRF AHPIERSPGRT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  VGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPS---------------------PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLG
        I ERKGFERRP S                     PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLG
Subjt:  ITERKGFERRPPS---------------------PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLG

Query:  RDRDGGPF
        RDRDGGPF
Subjt:  RDRDGGPF

SwissProt top hitse value%identityAlignment
P35637 RNA-binding protein FUS1.8e-0530.46Show/hide
Query:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA
        G+EF G      +      ++R    GR       P GRG  G G + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA

Query:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR
        GG P   ++G    +   RR      +R   R     R   R     G R    GG  P + +SR  +H R+DR
Subjt:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR

Q01844 RNA-binding protein EWS1.3e-0542.5Show/hide
Query:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP
        GR  GRG D  G  P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+         G++ PP
Subjt:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP

Q28009 RNA-binding protein FUS1.8e-0530.46Show/hide
Query:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA
        G+EF G      +      ++R    GR       P GRG  G G + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA

Query:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR
        GG P   ++G    +   RR      +R   R     R   R     G R    GG  P + +SR  +H R+DR
Subjt:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR

Q61545 RNA-binding protein EWS1.3e-0542.5Show/hide
Query:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP
        GR  GRG D  G  P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+         G++ PP
Subjt:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP

Q92804 TATA-binding protein-associated factor 2N7.1e-0736.19Show/hide
Query:  GREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-
        G+EF G      +    P + R        G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN PR      +GG  R 
Subjt:  GREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-

Query:  RGYVG
        RGY G
Subjt:  RGYVG

Arabidopsis top hitse value%identityAlignment
AT4G28990.1 RNA-binding protein-related5.9e-5742.13Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRY-SDDLGYRIHAGSVSPTRR-RDVHRYISDFDHSG
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D+ G+R  A S SP RR  + H++ SD +HSG
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRY-SDDLGYRIHAGSVSPTRR-RDVHRYISDFDHSG

Query:  SLTRGREFGGGRNL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSP
           RGRE    R   GR+RD SP  +R  +G RP+ RG+DGP   P   R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP
Subjt:  SLTRGREFGGGRNL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSP

Query:  RRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR
                    P  R    P+  SP R  N YRSPPR W RD P         P R++        RDR  Y +  +    +      ++DWA  +   
Subjt:  RRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR

Query:  DDFITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRS-PIRGPVRSPLRVPLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF
              +  ++RRPP  P    P+ GRW R +RERSRS P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  DDFITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRS-PIRGPVRSPLRVPLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF

AT4G28990.2 RNA-binding protein-related4.4e-5237.78Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDD-------------------------------
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDD-------------------------------

Query:  ------------------LGYRIHAGSVSPTRR-RDVHRYISDFDHSGSLTRGREFGGGRNL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGE
                          LG+R  A S SP RR  + H++ SD +HSG   RGRE    R   GR+RD SP  +R  +G RP+ RG+DGP   P   R  
Subjt:  ------------------LGYRIHAGSVSPTRR-RDVHRYISDFDHSGSLTRGREFGGGRNL-GRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGE

Query:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGG
         S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP            P  R    P+  SP R  N YRSPPR W RD P       
Subjt:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGG

Query:  LAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRS-PIRGPVRSPLRVP
          P R++        RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+ GRW R +RERSRS P+R     PLR  
Subjt:  LAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRS-PIRGPVRSPLRVP

Query:  LRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF
            L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  LRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF

AT5G58470.1 TBP-associated factor 15B8.9e-0537.62Show/hide
Query:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR
        G  +GGG     GR       Y  R   G   GRG  G G   G   G+R         RDGDW C +P C N+NFARR  CN C    P   + G+  R
Subjt:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR

Query:  G
        G
Subjt:  G

AT5G58470.2 TBP-associated factor 15B8.9e-0537.62Show/hide
Query:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR
        G  +GGG     GR       Y  R   G   GRG  G G   G   G+R         RDGDW C +P C N+NFARR  CN C    P   + G+  R
Subjt:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR

Query:  G
        G
Subjt:  G


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCCCGAGATAAAGACTCTACGACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGCGTTGGTCGTGGAAGCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATAC
ATGCAGGTTCAGTTTCTCCAACGCGCCGTCGGGATGTTCACCGATATATTTCTGATTTTGATCATTCTGGTAGTCTCACTCGCGGTCGTGAATTTGGTGGTGGGAGGAAT
CTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGAATAAGTGGTGGCAGGCCATTTGGGAGAGGTGTTGATGGCCCTGGACTTGCTCCTGGGCCATTTCGAGG
GGAACGCAGTAAAAATAATCCAAATGTGCGTCCAAGAGATGGGGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAGTTTTGTAACAACT
GCAACAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAGGCTATGTTGGTCCACCATCCCTGCATTCTCCTCCTAGACGCTTCACTGCCCACCCAATTGAACGTTCT
CCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAGTTGGGCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGCACCTTCGAGGTATGAAAGCAGGTA
TTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCTAAGTTCGATAGGCCACTTCCTTCAGCAGATTGGGCCCTTAGAGACAATG
GAAGGGATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAAGGCCACCATCACCACCACTGTCGATGCTTCCTCAGCGTGGGCGCTGGTCGCGTGATGTGAGAGAGAGG
AGCCGTTCCCCAATCAGAGGTCCTGTCAGATCTCCATTAAGAGTCCCGCTACGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTTGTTGA
AAGGGAGCGCGATGATAGGCGTGGCCTAGGGCGAGATCGCGATGGAGGTCCATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCCCGAGATAAAGACTCTACGACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGCGTTGGTCGTGGAAGCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATAC
ATGCAGGTTCAGTTTCTCCAACGCGCCGTCGGGATGTTCACCGATATATTTCTGATTTTGATCATTCTGGTAGTCTCACTCGCGGTCGTGAATTTGGTGGTGGGAGGAAT
CTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGAATAAGTGGTGGCAGGCCATTTGGGAGAGGTGTTGATGGCCCTGGACTTGCTCCTGGGCCATTTCGAGG
GGAACGCAGTAAAAATAATCCAAATGTGCGTCCAAGAGATGGGGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAGTTTTGTAACAACT
GCAACAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAGGCTATGTTGGTCCACCATCCCTGCATTCTCCTCCTAGACGCTTCACTGCCCACCCAATTGAACGTTCT
CCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAGTTGGGCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGCACCTTCGAGGTATGAAAGCAGGTA
TTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCTAAGTTCGATAGGCCACTTCCTTCAGCAGATTGGGCCCTTAGAGACAATG
GAAGGGATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAAGGCCACCATCACCACCACTGTCGATGCTTCCTCAGCGTGGGCGCTGGTCGCGTGATGTGAGAGAGAGG
AGCCGTTCCCCAATCAGAGGTCCTGTCAGATCTCCATTAAGAGTCCCGCTACGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTCCGTAGAGATGTTTTTGTTGA
AAGGGAGCGCGATGATAGGCGTGGCCTAGGGCGAGATCGCGATGGAGGTCCATTTTAG
Protein sequenceShow/hide protein sequence
MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRN
LGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERS
PGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRWSRDVRER
SRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF