CuGenDBv2

Tan0003772 (gene) of Snake gourd v1 genome

Gene ID: Tan0003772
Organism: Trichosanthes anguina (Snake gourd v1)
Description: dapper homolog 3 isoform X1
Genome location: LG09:64946147..64951726
RNA-Seq Expression: Tan0003772
Synteny: Tan0003772
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily
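The genome location in this record has the form `seqid:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the location string can be split and the gene span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG09:64946147..64951726")
print(seqid, span_length(start, end))  # LG09 5580
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` in `span_length` would need to be dropped.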


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589926.1 Zinc finger Ran-binding domain-containing protein 2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.7e-206; identity: 88.75%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGP SLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL
        FI ERKGFERRP S                     PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGL
Subjt:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL

Query:  GRDRDGGPF
        GRDRDGGPF
Subjt:  GRDRDGGPF

XP_022961315.1 uncharacterized protein LOC111461847 isoform X2 [Cucurbita moschata] (e value: 3.2e-207; identity: 89%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL
        FI ERKGFERRP S                     PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGL
Subjt:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL

Query:  GRDRDGGPF
        GRDRDGGPF
Subjt:  GRDRDGGPF

XP_022961316.1 uncharacterized protein LOC111461847 isoform X3 [Cucurbita moschata] (e value: 3.2e-207; identity: 89%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL
        FI ERKGFERRP S                     PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGL
Subjt:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL

Query:  GRDRDGGPF
        GRDRDGGPF
Subjt:  GRDRDGGPF

XP_022961321.1 uncharacterized protein LOC111461847 isoform X6 [Cucurbita moschata] (e value: 1.5e-209; identity: 93.57%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS-PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF
        FI ERKGFERRP S PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  FITERKGFERRPPS-PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_022961322.1 uncharacterized protein LOC111461847 isoform X7 [Cucurbita moschata] (e value: 1.6e-211; identity: 93.81%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPSPPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF
        FI ERKGFERRPP PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  FITERKGFERRPPSPPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF

TrEMBL top hits (e value, % identity, alignment)
A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X7 (e value: 8.0e-212; identity: 93.81%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPSPPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF
        FI ERKGFERRPP PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  FITERKGFERRPPSPPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1HA13 uncharacterized protein LOC111461847 isoform X3 (e value: 1.6e-207; identity: 89%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL
        FI ERKGFERRP S                     PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGL
Subjt:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL

Query:  GRDRDGGPF
        GRDRDGGPF
Subjt:  GRDRDGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X6 (e value: 7.5e-210; identity: 93.57%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS-PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF
        FI ERKGFERRP S PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  FITERKGFERRPPS-PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1HBT6 uncharacterized protein LOC111461847 isoform X2 (e value: 1.6e-207; identity: 89%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL
        FI ERKGFERRP S                     PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFVERERDDRRGL
Subjt:  FITERKGFERRPPS---------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGL

Query:  GRDRDGGPF
        GRDRDGGPF
Subjt:  GRDRDGGPF

A0A6J1HDP1 uncharacterized protein LOC111461847 isoform X1 (e value: 2.2e-206; identity: 86.87%)
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYS DLGYR+HAGS SPTRRRDDHRYISDFDHSSGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRIHAGSASPTRRRDDHRYISDFDHSSGLT

Query:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
        RGREFGGGRDLGRYRDTSPHYSRR+SGGR FGRGFDGPG A GPFRGE RSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG
Subjt:  RGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGG

Query:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD
        YVGPPSLHSPPRRFAAHPIERSPGRT+N YRSPPRGWARDGPREIAAGGL APPRYESR+SDHHLRR+RVDYLEDSFRGRSKFDR  PSDW+LRD+GRDD
Subjt:  YVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDD

Query:  FITERKGFERRPPS-------------------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFV
        FI ERKGFERRP S                               PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GL PKD+RRDVFV
Subjt:  FITERKGFERRPPS-------------------------------PPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFV

Query:  ERERDDRRGLGRDRDGGPF
        ERERDDRRGLGRDRDGGPF
Subjt:  ERERDDRRGLGRDRDGGPF

SwissProt top hits (e value, % identity, alignment)
P35637 RNA-binding protein FUS (e value: 5.1e-06; identity: 32.37%)
Query:  TRGREF--GGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSP
        TR  +F  GGG   G      P       GG S G G  G     G   G+QR+          GDW C +P C+N+NF+ R  CN C  P+  G GG P
Subjt:  TRGREF--GGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSP

Query:  RRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRER
          G ++G    +   RR      +R   R   G R   RG    G R     G   P + +SR      RRER
Subjt:  RRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRER

P56959 RNA-binding protein FUS (e value: 1.5e-05; identity: 32.95%)
Query:  TRGREF--GGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSP
        TR  +F  GGG   G      P       GG S G G  G     G   G+QR+          GDW C +P C+N+NF+ R  CN C  P+  G GG P
Subjt:  TRGREF--GGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSP

Query:  RRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAG---GLVAPPRYESRFSDHHLRRER
          G ++G    +   RR          G    GYR   RG  R G R    G   G   P + +SR      RRER
Subjt:  RRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAG---GLVAPPRYESRFSDHHLRRER

Q28009 RNA-binding protein FUS (e value: 5.1e-06; identity: 32.37%)
Query:  TRGREF--GGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSP
        TR  +F  GGG   G      P       GG S G G  G     G   G+QR+          GDW C +P C+N+NF+ R  CN C  P+  G GG P
Subjt:  TRGREF--GGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGAGGSP

Query:  RRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRER
          G ++G    +   RR      +R   R   G R   RG    G R     G   P + +SR      RRER
Subjt:  RRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRER

Q92804 TATA-binding protein-associated factor 2N (e value: 1.3e-06; identity: 36.79%)
Query:  GREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR
        G+EF G      +    P + R   GG   GR   G     G F+G           P+ GDW C +P C N+NFARR  CN CN PR      +GG  R
Subjt:  GREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR

Query:  RGGYVG
          GY G
Subjt:  RGGYVG

Q94KD0 Transcription initiation factor TFIID subunit 15b (e value: 3.3e-05; identity: 37.27%)
Query:  GREFGGGRDLGRYRDTSPHY-SRRISGGRSFG--RGFDG------PGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRT
        G  +GGG   G Y      Y  R  SGG S+G   G+ G       G   G ++G  R         RDGDW C +P C N+NFARR  CN C    P  
Subjt:  GREFGGGRDLGRYRDTSPHY-SRRISGGRSFG--RGFDG------PGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRT

Query:  GAGGSPRRGG
         + G+  RGG
Subjt:  GAGGSPRRGG

Arabidopsis top hits (e value, % identity, alignment)
AT4G28990.1 RNA-binding protein-related (e value: 3.8e-57; identity: 42.57%)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADL-GYRIHAGSASPTRR-RDDHRYISDFDHSS
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D  G+R  A S+SP RR  +DH++ SD +HS 
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADL-GYRIHAGSASPTRR-RDDHRYISDFDHSS

Query:  GLTRGREFGGGRDL-GRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGS
           RGRE    R+  GR+RD SP  +R  +G R + RG DGP     P   +  S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     S
Subjt:  GLTRGREFGGGRDL-GRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGS

Query:  PRRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRD
        P     + PP  HSP R F             NGYRSPPRGW RD P          PPR++   +     R+R  Y +  +       R I SDWA  +
Subjt:  PRRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRD

Query:  HGRDDFITERKGFERRPPSPPLSLLPQRGRWARDVRERSRS-PIRGPVRSPLRVSLRSPLSSGLAP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF
                 +  ++RRPP  P    P+ GRW R +RERSRS P+R     PLR      L  G  P  +D+RRD   +RE RDD RG GR R G  +
Subjt:  HGRDDFITERKGFERRPPSPPLSLLPQRGRWARDVRERSRS-PIRGPVRSPLRVSLRSPLSSGLAP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF

AT4G28990.2 RNA-binding protein-related (e value: 2.2e-52; identity: 38.2%)
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSAD-------------------------------
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSAD-------------------------------

Query:  ------------------LGYRIHAGSASPTRR-RDDHRYISDFDHSSGLTRGREFGGGRDL-GRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGE
                          LG+R  A S+SP RR  +DH++ SD +HS    RGRE    R+  GR+RD SP  +R  +G R + RG DGP     P   +
Subjt:  ------------------LGYRIHAGSASPTRR-RDDHRYISDFDHSSGLTRGREFGGGRDL-GRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGE

Query:  QRSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAA
          S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP     + PP  HSP R F             NGYRSPPRGW RD P     
Subjt:  QRSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAA

Query:  GGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDDFITERKGFERRPPSPPLSLLPQRGRWARDVRERSRS-PIRGPVRSPL
             PPR++   +     R+R  Y +  +       R I SDWA  +         +  ++RRPP  P    P+ GRW R +RERSRS P+R     PL
Subjt:  GGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWALRDHGRDDFITERKGFERRPPSPPLSLLPQRGRWARDVRERSRS-PIRGPVRSPL

Query:  RVSLRSPLSSGLAP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF
        R      L  G  P  +D+RRD   +RE RDD RG GR R G  +
Subjt:  RVSLRSPLSSGLAP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF

AT5G58470.1 TBP-associated factor 15B (e value: 2.4e-06; identity: 37.27%)
Query:  GREFGGGRDLGRYRDTSPHY-SRRISGGRSFG--RGFDG------PGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRT
        G  +GGG   G Y      Y  R  SGG S+G   G+ G       G   G ++G  R         RDGDW C +P C N+NFARR  CN C    P  
Subjt:  GREFGGGRDLGRYRDTSPHY-SRRISGGRSFG--RGFDG------PGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRT

Query:  GAGGSPRRGG
         + G+  RGG
Subjt:  GAGGSPRRGG

AT5G58470.2 TBP-associated factor 15B (e value: 2.4e-06; identity: 37.27%)
Query:  GREFGGGRDLGRYRDTSPHY-SRRISGGRSFG--RGFDG------PGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRT
        G  +GGG   G Y      Y  R  SGG S+G   G+ G       G   G ++G  R         RDGDW C +P C N+NFARR  CN C    P  
Subjt:  GREFGGGRDLGRYRDTSPHY-SRRISGGRSFG--RGFDG------PGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRT

Query:  GAGGSPRRGG
         + G+  RGG
Subjt:  GAGGSPRRGG
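In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below estimates percent identity for one alignment segment from that middle line, assuming identity is the count of identical columns divided by the segment length including gap columns. The function name `midline_identity` is hypothetical, and the value for a single segment will not in general equal the whole-alignment figure reported for a hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity for one aligned segment, read from the midline.

    `query` and `midline` are the aligned strings with the 'Query:' prefix
    and leading padding already removed.
    """
    if not query:
        raise ValueError("empty alignment segment")
    # Pad or trim the midline so trailing mismatch columns are still counted.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the first GenBank alignment: one mismatch column.
print(midline_identity("MGSRDKDSTT", "MGSR KDSTT"))  # 90.0
```

To score a whole hit, the identical-column counts and segment lengths would be summed across all of its alignment rows before dividing.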


Sequences
CDS sequence
ATGTACACCCCCACCCTTTTCAACGATTTATCGGCGTTCAGACTTCAGACCAACCAGTTTCGTCGCTGCTCATCGAAGCGGGCAGAAGAATTAGGAAAGCAAGGTGAAAG
GATGGGTTCGAGGGATAAAGACTCCACGACTCACCACCAGCCGTTATTGAGCAGCCTGGTAGTACGGCCATCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCA
GTGGCGGCCGCGTTGGTCGTGGAAGCGATTATGAGGCCGGTGAGGTTCCTCGCGACCCTCCACAATATTCTCGATTGGATCGATATTCAGCTGATTTGGGATATAGAATA
CATGCAGGTTCAGCTTCTCCAACACGTCGTCGGGATGATCACCGATATATTTCTGATTTCGATCATTCTAGTGGTCTCACAAGGGGCCGTGAATTTGGTGGCGGGAGGGA
TCTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGCAGAATAAGTGGTGGCAGGTCATTTGGGAGAGGTTTTGATGGCCCTGGACTTGCTTCTGGACCATTTAGAG
GGGAACAACGTAGTAAAAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTGACAACCTAAACTTTGCAAGACGAGAATTTTGTAAC
AACTGCAACAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAGGAGGCTATGTTGGTCCCCCATCCCTGCATTCTCCTCCTAGACGCTTCGCTGCCCACCCAATTGA
ACGTTCTCCTGGCAGGACTCTTAATGGATATAGGTCTCCTCCCCGTGGTTGGGCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGTGGCACCTCCGAGGTATG
AAAGTAGGTTTTCTGATCACCACCTGCGGAGGGAGAGGGTGGATTATCTTGAAGACAGCTTCAGAGGAAGATCCAAGTTTGATAGGCCAATTCCCTCCGATTGGGCCCTT
AGAGACCATGGAAGGGATGACTTCATCACCGAGAGAAAGGGATTTGAAAGAAGGCCTCCATCCCCACCACTGTCGTTACTTCCTCAGCGTGGCCGTTGGGCACGTGACGT
GAGAGAGAGGAGCCGTTCCCCAATCAGAGGTCCAGTTAGATCTCCATTGAGAGTCTCACTTCGGTCTCCATTAAGCAGCGGCCTTGCACCAAAAGACTTTCGTCGAGATG
TATTTGTTGAAAGGGAGCGCGATGACAGGCGTGGCCTAGGACGAGATCGCGATGGAGGTCCATTTTAG
mRNA sequence
CTGAATTCCTCATAACTAGAATTTAGGGTTTATGAATGTACACCCCCACCCTTTTCAACGATTTATCGGCGTTCAGACTTCAGACCAACCAGTTTCGTCGCTGCTCATCG
AAGCGGGCAGAAGAATTAGGAAAGCAAGGTGAAAGGATGGGTTCGAGGGATAAAGACTCCACGACTCACCACCAGCCGTTATTGAGCAGCCTGGTAGTACGGCCATCGAA
TAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTGGCGGCCGCGTTGGTCGTGGAAGCGATTATGAGGCCGGTGAGGTTCCTCGCGACCCTCCACAATATTCTCGAT
TGGATCGATATTCAGCTGATTTGGGATATAGAATACATGCAGGTTCAGCTTCTCCAACACGTCGTCGGGATGATCACCGATATATTTCTGATTTCGATCATTCTAGTGGT
CTCACAAGGGGCCGTGAATTTGGTGGCGGGAGGGATCTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGCAGAATAAGTGGTGGCAGGTCATTTGGGAGAGGTTT
TGATGGCCCTGGACTTGCTTCTGGACCATTTAGAGGGGAACAACGTAGTAAAAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTG
ACAACCTAAACTTTGCAAGACGAGAATTTTGTAACAACTGCAACAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAGGAGGCTATGTTGGTCCCCCATCCCTGCAT
TCTCCTCCTAGACGCTTCGCTGCCCACCCAATTGAACGTTCTCCTGGCAGGACTCTTAATGGATATAGGTCTCCTCCCCGTGGTTGGGCCAGGGATGGTCCTAGGGAGAT
TGCAGCTGGTGGTCTGGTGGCACCTCCGAGGTATGAAAGTAGGTTTTCTGATCACCACCTGCGGAGGGAGAGGGTGGATTATCTTGAAGACAGCTTCAGAGGAAGATCCA
AGTTTGATAGGCCAATTCCCTCCGATTGGGCCCTTAGAGACCATGGAAGGGATGACTTCATCACCGAGAGAAAGGGATTTGAAAGAAGGCCTCCATCCCCACCACTGTCG
TTACTTCCTCAGCGTGGCCGTTGGGCACGTGACGTGAGAGAGAGGAGCCGTTCCCCAATCAGAGGTCCAGTTAGATCTCCATTGAGAGTCTCACTTCGGTCTCCATTAAG
CAGCGGCCTTGCACCAAAAGACTTTCGTCGAGATGTATTTGTTGAAAGGGAGCGCGATGACAGGCGTGGCCTAGGACGAGATCGCGATGGAGGTCCATTTTAGTGTTGAA
AATTGATTTTGATGTGGCTTCCCATGTAATTTTGGTTAGAACAGCACCCTGTAATCCGTGTTTAGATCCTTTGGAAAGAGGATGTTAAGAAGAGCTTGCTTATTTTTTCC
TGACTGGTCCAGTGTGTCGAAACCTGCTATGCTAAATGGGGTCCTTTTTCTGCCTCGTGTGTTTCATGTTTCTTATTATTTTATGCTGTAGTTGGCAGCCAGCAGTAATA
GTGAAGTAAACGCTTTGCATATACGGTGTCTGTAAACTTTGCCATATTTAGTTTTCTGGCTTCTGGGCAATGGTTTTGAACCGTTATTATCCTTCCATAAATTTAGATCT
ATCAGATTTCATGTC
Protein sequence
MYTPTLFNDLSAFRLQTNQFRRCSSKRAEELGKQGERMGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSADLGYRI
HAGSASPTRRRDDHRYISDFDHSSGLTRGREFGGGRDLGRYRDTSPHYSRRISGGRSFGRGFDGPGLASGPFRGEQRSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCN
NCNRPRTGAGGSPRRGGYVGPPSLHSPPRRFAAHPIERSPGRTLNGYRSPPRGWARDGPREIAAGGLVAPPRYESRFSDHHLRRERVDYLEDSFRGRSKFDRPIPSDWAL
RDHGRDDFITERKGFERRPPSPPLSLLPQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSSGLAPKDFRRDVFVERERDDRRGLGRDRDGGPF
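The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code applies (the record does not state which translation table was used), the check below translates the first eight codons of the CDS and compares them with the start of the protein sequence. `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in the input
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 24 nucleotides of the CDS sequence listed above.
print(translate("ATGTACACCCCCACCCTTTTCAAC"))  # MYTPTLFN
```

Passing the full CDS, with its line breaks left in, should reproduce the whole protein sequence if the standard-code assumption holds.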