; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034445 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034445
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiondapper homolog 3 isoform X1
Genome locationchr3:7399363..7405678
RNA-Seq ExpressionLag0034445
SyntenyLag0034445
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022961315.1 uncharacterized protein LOC111461847 isoform X2 [Cucurbita moschata]2.4e-20788.24Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG
        I ERKGFERRP S                     PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLG
Subjt:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG

Query:  RDRDGGPF
        RDRDGGPF
Subjt:  RDRDGGPF

XP_022961316.1 uncharacterized protein LOC111461847 isoform X3 [Cucurbita moschata]2.4e-20788.24Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG
        I ERKGFERRP S                     PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLG
Subjt:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG

Query:  RDRDGGPF
        RDRDGGPF
Subjt:  RDRDGGPF

XP_022961321.1 uncharacterized protein LOC111461847 isoform X6 [Cucurbita moschata]1.1e-20992.78Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS-PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF
        I ERKGFERRP S PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPS-PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF

XP_022961322.1 uncharacterized protein LOC111461847 isoform X7 [Cucurbita moschata]1.2e-21193.02Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF
        I ERKGFERRPP PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida]1.5e-20692.25Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGE+ RDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRD HRYISDFDHSG LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRRGYV
        RGR+F GGRDLGRYRDTSPHYSRRISGGRPFGRG DGP  APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYV
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRRGYV

Query:  APPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDFI
         PPSLHSPPRRFAAHPIERSPGR+LN YRSPPR WAR+GPREIAAGGLAPPRYESRYSD HLRRDRVDYL+DSFRGRSKFDRP+PS DWALRD+GRDDFI
Subjt:  APPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDFI

Query:  TERKGFERRPPS-PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF
        +ERKGFERRPPS PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLS GLPPKDFRRDVFVER+RDDRRGLGRDRDGGPF
Subjt:  TERKGFERRPPS-PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF

TrEMBL top hitse value%identityAlignment
A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X75.8e-21293.02Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF
        I ERKGFERRPP PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF

A0A6J1HA13 uncharacterized protein LOC111461847 isoform X31.1e-20788.24Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG
        I ERKGFERRP S                     PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLG
Subjt:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG

Query:  RDRDGGPF
        RDRDGGPF
Subjt:  RDRDGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X65.5e-21092.78Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS-PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF
        I ERKGFERRP S PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLGRDRDGGPF
Subjt:  ITERKGFERRPPS-PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF

A0A6J1HBT6 uncharacterized protein LOC111461847 isoform X21.1e-20788.24Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG
        I ERKGFERRP S                     PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVER+RDDRRGLG
Subjt:  ITERKGFERRPPS---------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLG

Query:  RDRDGGPF
        RDRDGGPF
Subjt:  RDRDGGPF

A0A6J1HDP1 uncharacterized protein LOC111461847 isoform X11.6e-20686.12Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYR+HAGSVSPTRRRDDHRYISDFDHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLT

Query:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY
        RGREF GGRDLGRYRDTSPHYSRR+SGGRPFGRGFDGPG APGPFRGERSKNNPNVRPR+GDWYCSDPLCDNLNFARREFCNNCNRPR GAGGSPRR GY
Subjt:  RGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRR-GY

Query:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF
        V PPSLHSPPRRFAAHPIERSPGR++N YRSPPRGWAR+GPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDR +P +DW+LRD+GRDDF
Subjt:  VAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDF

Query:  ITERKGFERRPPS-------------------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVE
        I ERKGFERRP S                               PPLSLLPQRGRW RDVRERSRSPIR PVRSPLRV LRSPLSGGLPPKD+RRDVFVE
Subjt:  ITERKGFERRPPS-------------------------------PPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVE

Query:  RDRDDRRGLGRDRDGGPF
        R+RDDRRGLGRDRDGGPF
Subjt:  RDRDDRRGLGRDRDGGPF

SwissProt top hitse value%identityAlignment
P35637 RNA-binding protein FUS5.2e-0830.23Show/hide
Query:  DPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRG--ERSKN
        DPP       + D   +  +   VS   RR      +DF+  GG  RG         GR R            G P GRG  G G + G  RG       
Subjt:  DPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRG--ERSKN

Query:  NPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP-GAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPP
            + R GDW C +P C+N+NF+ R  CN C  P+P G GG P   ++     +   RR      +R   R   G R   RG    G R    GG  P 
Subjt:  NPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP-GAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPP

Query:  RYESRYSDHHLRRDR
        + +SR      RR+R
Subjt:  RYESRYSDHHLRRDR

P56959 RNA-binding protein FUS7.5e-0730.23Show/hide
Query:  DPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRG--ERSKN
        DPP       + D   +  +   VS   RR      +DF+  GG  RG         GR R            G P GRG  G G + G  RG       
Subjt:  DPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRG--ERSKN

Query:  NPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP-GAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPP
            + R GDW C +P C+N+NF+ R  CN C  P+P G GG P   ++     +   RR      +R   R   G R   RG    G R    GG  P 
Subjt:  NPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP-GAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPP

Query:  RYESRYSDHHLRRDR
        + +SR      RR+R
Subjt:  RYESRYSDHHLRRDR

Q01844 RNA-binding protein EWS1.3e-0643.75Show/hide
Query:  GRPFGRGFDGPGLAPGPFRGERSKNNP----NVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRRGYVAPP
        GR  GRG D  G  P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+P        G++ PP
Subjt:  GRPFGRGFDGPGLAPGPFRGERSKNNP----NVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRRGYVAPP

Q28009 RNA-binding protein FUS5.2e-0830.23Show/hide
Query:  DPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRG--ERSKN
        DPP       + D   +  +   VS   RR      +DF+  GG  RG         GR R            G P GRG  G G + G  RG       
Subjt:  DPPQYSRLDRYSDDLGYRIHAGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRG--ERSKN

Query:  NPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP-GAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPP
            + R GDW C +P C+N+NF+ R  CN C  P+P G GG P   ++     +   RR      +R   R   G R   RG    G R    GG  P 
Subjt:  NPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP-GAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPP

Query:  RYESRYSDHHLRRDR
        + +SR      RR+R
Subjt:  RYESRYSDHHLRRDR

Q92804 TATA-binding protein-associated factor 2N2.6e-0736.89Show/hide
Query:  GREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP----GAGGSPR-
        G+EF G      +    P + R        G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN PRP     +GG  R 
Subjt:  GREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP----GAGGSPR-

Query:  RGY
        RGY
Subjt:  RGY

Arabidopsis top hitse value%identityAlignment
AT4G28990.1 RNA-binding protein-related1.6e-6043.43Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRY-SDDLGYRIHAGSVSPTRR-RDDHRYISDFDHSG
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D+ G+R  A S SP RR  +DH++ SD +HSG
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRY-SDDLGYRIHAGSVSPTRR-RDDHRYISDFDHSG

Query:  GLTRGREFAGGRDL-GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNN-PNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSP
           RGRE +  R+  GR+RD SP  +R  +G RP+ RG DGP   P   R   S+NN   V+PREGDWYC DPLC NLNFARRE C  C R R     SP
Subjt:  GLTRGREFAGGRDL-GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNN-PNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSP

Query:  RRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYE-SRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDH
            + PP            P+  SP R  NGYRSPPRGW R+ P         PPR++   + D    RDR  Y +  +    +      ++DWA  + 
Subjt:  RRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYE-SRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDH

Query:  GRDDFITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRS-PIRAPVRSPLRVSLRSPLSGGLPP--KDFRRDVFVERD-RDDRRGLGRDRDGGPF
                +  ++RRPP  P    P+ GRWGR +RERSRS P+R     PLR      L GG PP  +D+RRD   +R+ RDD RG GR R G  +
Subjt:  GRDDFITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRS-PIRAPVRSPLRVSLRSPLSGGLPP--KDFRRDVFVERD-RDDRRGLGRDRDGGPF

AT4G28990.2 RNA-binding protein-related1.2e-5538.96Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDD-------------------------------
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDD-------------------------------

Query:  ------------------LGYRIHAGSVSPTRR-RDDHRYISDFDHSGGLTRGREFAGGRDL-GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGE
                          LG+R  A S SP RR  +DH++ SD +HSG   RGRE +  R+  GR+RD SP  +R  +G RP+ RG DGP   P   R  
Subjt:  ------------------LGYRIHAGSVSPTRR-RDDHRYISDFDHSGGLTRGREFAGGRDL-GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGE

Query:  RSKNN-PNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGG
         S+NN   V+PREGDWYC DPLC NLNFARRE C  C R R     SP    + PP            P+  SP R  NGYRSPPRGW R+ P       
Subjt:  RSKNN-PNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRPGAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGG

Query:  LAPPRYE-SRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDFITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRS-PIRAPVRSPLR
          PPR++   + D    RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+ GRWGR +RERSRS P+R     PLR
Subjt:  LAPPRYE-SRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDHGRDDFITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRS-PIRAPVRSPLR

Query:  VSLRSPLSGGLPP--KDFRRDVFVERD-RDDRRGLGRDRDGGPF
              L GG PP  +D+RRD   +R+ RDD RG GR R G  +
Subjt:  VSLRSPLSGGLPP--KDFRRDVFVERD-RDDRRGLGRDRDGGPF

AT5G25490.1 Ran BP2/NZF zinc finger-like superfamily protein6.1e-0456.25Show/hide
Query:  REGDWYCSDPLCDNLNFARREFCNNCNRPRPG
        R GDW C   LC +LNF RR+ C  C  PRPG
Subjt:  REGDWYCSDPLCDNLNFARREFCNNCNRPRPG

AT5G58470.1 TBP-associated factor 15B2.1e-0436.19Show/hide
Query:  GGRDL---GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP----------GAGG
        GG D    GR       Y  R   G   GRG  G G   G   G+R         R+GDW C +P C N+NFARR  CN C    P          G GG
Subjt:  GGRDL---GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP----------GAGG

Query:  SPRRG
          R G
Subjt:  SPRRG

AT5G58470.2 TBP-associated factor 15B2.1e-0436.19Show/hide
Query:  GGRDL---GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP----------GAGG
        GG D    GR       Y  R   G   GRG  G G   G   G+R         R+GDW C +P C N+NFARR  CN C    P          G GG
Subjt:  GGRDL---GRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNCNRPRP----------GAGG

Query:  SPRRG
          R G
Subjt:  SPRRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACACCGGTGTGGTGCTAGCCACACCGCCTCCGATGGTCAAGAATTCGAAGGCGTTTCGGGACAAACCAGGAGGAACCGGGGTGCCTAGAGGCGGTAGGGACCAAAC
GGAGCCGGACAAGCTCGGCCCGCGCAAGCGGGCCGAGCAGGGGGTCGGGCCAAAAGCCCGACCCCTTCGGTCTTGGTCCGTCCCACTTTGTCGGTTTCGTCTCTGGGGTC
CATCTCCCAGCCTCAGTTTTGCCCAGTTGTCCTCGTTAGCTCTCTGTACATCGGAGTGGTCCAAAATTACCTATAACAGTTGGAACGACAGAGAATTTAGGGTTTATGAA
TGTACACCCCACATTTTCAACGATTTATCGGCGTTCAGACTTCAGACCGACCAGTTTCGTCGCTGCTCGTCGAAGCAGGCTGAAGAATTAGGAAAACAAGGCGAAAGGAT
GGGTTCGAGGGATAAAGACTCTACGACTCACCACCAGCCGTTATTGAGCAGCCTTGTAGTACGCCCATCGAATAGCGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTG
GCGGCCGCGTTGGTCGTGGAAGCGATTATGAGGCCGGTGAGGTTCCCCGCGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATACAT
GCAGGTTCGGTTTCTCCAACACGTCGTCGGGATGATCACCGATATATTTCTGATTTTGATCATTCTGGTGGTCTCACACGGGGCCGTGAATTTGCTGGTGGGAGGGATCT
TGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGAATAAGTGGTGGCCGGCCATTTGGGAGAGGTTTTGATGGCCCTGGACTTGCTCCTGGGCCATTTAGAGGGG
AACGTAGTAAAAATAATCCAAATGTGCGTCCTAGGGAGGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGC
AACAGACCCCGCCCTGGAGCTGGTGGAAGTCCTCGAAGAGGCTATGTTGCTCCACCATCCCTGCATTCTCCTCCCAGACGCTTTGCTGCCCACCCAATTGAACGTTCTCC
TGGCAGGAGTCTTAACGGATATAGGTCTCCTCCCCGTGGTTGGGCCAGGGAAGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGCACCTCCGAGGTATGAAAGCAGGTATT
CCGATCATCACCTCCGGAGAGATAGGGTGGATTATCTAGAAGACAGCTTCAGAGGAAGATCCAAGTTCGACAGGCCAATTCCTTCCACGGATTGGGCCCTTAGAGACCAT
GGAAGGGATGACTTCATCACCGAGCGGAAGGGATTTGAAAGAAGGCCACCATCCCCACCACTATCGCTGCTTCCTCAGCGTGGCCGTTGGGGGCGTGATGTGAGAGAGAG
GAGCCGTTCCCCAATCAGAGCTCCAGTTAGATCTCCTTTGAGAGTCTCGCTGCGGTCTCCATTAAGTGGCGGCCTTCCACCAAAAGACTTTCGTAGAGATGTCTTTGTTG
AAAGGGACCGTGACGATAGGCGTGGCCTAGGACGAGATCGCGACGGAGGTCCCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACACCGGTGTGGTGCTAGCCACACCGCCTCCGATGGTCAAGAATTCGAAGGCGTTTCGGGACAAACCAGGAGGAACCGGGGTGCCTAGAGGCGGTAGGGACCAAAC
GGAGCCGGACAAGCTCGGCCCGCGCAAGCGGGCCGAGCAGGGGGTCGGGCCAAAAGCCCGACCCCTTCGGTCTTGGTCCGTCCCACTTTGTCGGTTTCGTCTCTGGGGTC
CATCTCCCAGCCTCAGTTTTGCCCAGTTGTCCTCGTTAGCTCTCTGTACATCGGAGTGGTCCAAAATTACCTATAACAGTTGGAACGACAGAGAATTTAGGGTTTATGAA
TGTACACCCCACATTTTCAACGATTTATCGGCGTTCAGACTTCAGACCGACCAGTTTCGTCGCTGCTCGTCGAAGCAGGCTGAAGAATTAGGAAAACAAGGCGAAAGGAT
GGGTTCGAGGGATAAAGACTCTACGACTCACCACCAGCCGTTATTGAGCAGCCTTGTAGTACGCCCATCGAATAGCGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTG
GCGGCCGCGTTGGTCGTGGAAGCGATTATGAGGCCGGTGAGGTTCCCCGCGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATACAT
GCAGGTTCGGTTTCTCCAACACGTCGTCGGGATGATCACCGATATATTTCTGATTTTGATCATTCTGGTGGTCTCACACGGGGCCGTGAATTTGCTGGTGGGAGGGATCT
TGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGAATAAGTGGTGGCCGGCCATTTGGGAGAGGTTTTGATGGCCCTGGACTTGCTCCTGGGCCATTTAGAGGGG
AACGTAGTAAAAATAATCCAAATGTGCGTCCTAGGGAGGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGC
AACAGACCCCGCCCTGGAGCTGGTGGAAGTCCTCGAAGAGGCTATGTTGCTCCACCATCCCTGCATTCTCCTCCCAGACGCTTTGCTGCCCACCCAATTGAACGTTCTCC
TGGCAGGAGTCTTAACGGATATAGGTCTCCTCCCCGTGGTTGGGCCAGGGAAGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGCACCTCCGAGGTATGAAAGCAGGTATT
CCGATCATCACCTCCGGAGAGATAGGGTGGATTATCTAGAAGACAGCTTCAGAGGAAGATCCAAGTTCGACAGGCCAATTCCTTCCACGGATTGGGCCCTTAGAGACCAT
GGAAGGGATGACTTCATCACCGAGCGGAAGGGATTTGAAAGAAGGCCACCATCCCCACCACTATCGCTGCTTCCTCAGCGTGGCCGTTGGGGGCGTGATGTGAGAGAGAG
GAGCCGTTCCCCAATCAGAGCTCCAGTTAGATCTCCTTTGAGAGTCTCGCTGCGGTCTCCATTAAGTGGCGGCCTTCCACCAAAAGACTTTCGTAGAGATGTCTTTGTTG
AAAGGGACCGTGACGATAGGCGTGGCCTAGGACGAGATCGCGACGGAGGTCCCTTCTAG
Protein sequenceShow/hide protein sequence
MHTGVVLATPPPMVKNSKAFRDKPGGTGVPRGGRDQTEPDKLGPRKRAEQGVGPKARPLRSWSVPLCRFRLWGPSPSLSFAQLSSLALCTSEWSKITYNSWNDREFRVYE
CTPHIFNDLSAFRLQTDQFRRCSSKQAEELGKQGERMGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGYRIH
AGSVSPTRRRDDHRYISDFDHSGGLTRGREFAGGRDLGRYRDTSPHYSRRISGGRPFGRGFDGPGLAPGPFRGERSKNNPNVRPREGDWYCSDPLCDNLNFARREFCNNC
NRPRPGAGGSPRRGYVAPPSLHSPPRRFAAHPIERSPGRSLNGYRSPPRGWAREGPREIAAGGLAPPRYESRYSDHHLRRDRVDYLEDSFRGRSKFDRPIPSTDWALRDH
GRDDFITERKGFERRPPSPPLSLLPQRGRWGRDVRERSRSPIRAPVRSPLRVSLRSPLSGGLPPKDFRRDVFVERDRDDRRGLGRDRDGGPF