; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0692 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0692
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiondapper homolog 3 isoform X1
Genome locationMC06:5683629..5689205
RNA-Seq ExpressionMC06g0692
SyntenyMC06g0692
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144869.1 dapper homolog 3 isoform X1 [Momordica charantia]5.09e-292100Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
        RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY

Query:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
        LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
Subjt:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT

Query:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
Subjt:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

XP_022144870.1 dapper homolog 3 isoform X2 [Momordica charantia]1.51e-27094.81Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEA                    GYRIHAGSASPTRRRDDHRYISDLDHSGGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
        RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY

Query:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
        LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
Subjt:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT

Query:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
Subjt:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

XP_022961321.1 uncharacterized protein LOC111461847 isoform X6 [Cucurbita moschata]2.78e-24486.34Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSR KDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGEVPR+PPQ++RLDRYSDDLGYR+HAGS SPTRRRDDHRYISD DHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-
        RGR+FGGGRDLGRYRD+SPHY+RR+S GRPFGRG  GPG A G FRG+RSKNNPNVRPRDGDWYCSDPLC NLNFARRE CNNCNR R G AGGSPRR  
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-

Query:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF
        Y+GPPSLHSPPRRFAAHP+ERSPGRT+  YRSPPR WARDGPREIAAG LAPPRYESRY DH LRRDRVDYLED  RGRSKFDR  P+DW+LRD+GRDDF
Subjt:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF

Query:  LTERKGFERGPPSPP-LPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        + ERKGFER P SPP L LL QRGRWARDVRERSRSPIRGPVRSPLRV LRSPLSGGLPPKD+RRDV+VERERDDRRGLGRDRDGGPF
Subjt:  LTERKGFERGPPSPP-LPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

XP_022961322.1 uncharacterized protein LOC111461847 isoform X7 [Cucurbita moschata]6.88e-24786.56Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSR KDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGEVPR+PPQ++RLDRYSDDLGYR+HAGS SPTRRRDDHRYISD DHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-
        RGR+FGGGRDLGRYRD+SPHY+RR+S GRPFGRG  GPG A G FRG+RSKNNPNVRPRDGDWYCSDPLC NLNFARRE CNNCNR R G AGGSPRR  
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-

Query:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF
        Y+GPPSLHSPPRRFAAHP+ERSPGRT+  YRSPPR WARDGPREIAAG LAPPRYESRY DH LRRDRVDYLED  RGRSKFDR  P+DW+LRD+GRDDF
Subjt:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF

Query:  LTERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        + ERKGFER PP PPL LL QRGRWARDVRERSRSPIRGPVRSPLRV LRSPLSGGLPPKD+RRDV+VERERDDRRGLGRDRDGGPF
Subjt:  LTERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida]4.30e-24587.63Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGE+ R+PPQ++RLDRYSDDLGYRIHAGS SPTRRRD HRYISD DHSG LT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
        RGRDFGGGRDLGRYRD+SPHY+RRIS GRPFGRGV GP  A G FRG+RSKNNPNVRPRDGDWYCSDPLC NLNFARRE CNNCNR R  GA GSPRR Y
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY

Query:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIP-ADWALRDHGRDDF
        +GPPSLHSPPRRFAAHP+ERSPGRTL  YRSPPR+WARDGPREIAAG LAPPRYESRY DH LRRDRVDYL+D  RGRSKFDRP+P ADWALRD+GRDDF
Subjt:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIP-ADWALRDHGRDDF

Query:  LTERKGFERGPPSPP-LPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        ++ERKGFER PPSPP L LL QRGRWARDVRERSRSPIRGPVRSPLRV LRSPLS GLPPKDFRRDV+VERERDDRRGLGRDRDGGPF
Subjt:  LTERKGFERGPPSPP-LPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

TrEMBL top hitse value%identityAlignment
A0A5D3E321 TATA-binding protein-associated factor 2N6.90e-24285.27Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGEVPR+PPQ++RL RYSDDLGYRIHAGS SP RRRD HRYIS+ +HS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
        RGR+FGGGRDL RYRD+SPHY RR+  GRPFGRGV GP LA G FRG+RSKNNPNVRPRDGDWYCSDPLC NLNFARRE CNNCNR R GG GGSPRR Y
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY

Query:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIP-ADWALRDHGRDDF
         GPPSLH+PPRRFAAHP+ERSPGRTL  YRSPPR+WARDG RE+AAG LAPPRYESRY DH LRRDRVDYLED  RGRSKFDRP+P ADWALRD+GRDDF
Subjt:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIP-ADWALRDHGRDDF

Query:  LTERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        +TERKGFER PPSPPLPLL QRGRW+RDVRERSRSPIRGP+RSPLRV LRSPLS GLPPKDFRRDV+ ERERDDRRGLGRDR+GGPF
Subjt:  LTERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

A0A6J1CSV5 dapper homolog 3 isoform X27.30e-27194.81Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEA                    GYRIHAGSASPTRRRDDHRYISDLDHSGGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
        RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY

Query:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
        LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
Subjt:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT

Query:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
Subjt:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

A0A6J1CTI5 dapper homolog 3 isoform X12.47e-292100Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
        RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSY

Query:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
        LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT
Subjt:  LGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLT

Query:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
Subjt:  ERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X73.33e-24786.56Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSR KDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGEVPR+PPQ++RLDRYSDDLGYR+HAGS SPTRRRDDHRYISD DHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-
        RGR+FGGGRDLGRYRD+SPHY+RR+S GRPFGRG  GPG A G FRG+RSKNNPNVRPRDGDWYCSDPLC NLNFARRE CNNCNR R G AGGSPRR  
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-

Query:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF
        Y+GPPSLHSPPRRFAAHP+ERSPGRT+  YRSPPR WARDGPREIAAG LAPPRYESRY DH LRRDRVDYLED  RGRSKFDR  P+DW+LRD+GRDDF
Subjt:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF

Query:  LTERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        + ERKGFER PP PPL LL QRGRWARDVRERSRSPIRGPVRSPLRV LRSPLSGGLPPKD+RRDV+VERERDDRRGLGRDRDGGPF
Subjt:  LTERKGFERGPPSPPLPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X61.35e-24486.34Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT
        MGSR KDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGEVPR+PPQ++RLDRYSDDLGYR+HAGS SPTRRRDDHRYISD DHS GLT
Subjt:  MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLT

Query:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-
        RGR+FGGGRDLGRYRD+SPHY+RR+S GRPFGRG  GPG A G FRG+RSKNNPNVRPRDGDWYCSDPLC NLNFARRE CNNCNR R G AGGSPRR  
Subjt:  RGRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRS-

Query:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF
        Y+GPPSLHSPPRRFAAHP+ERSPGRT+  YRSPPR WARDGPREIAAG LAPPRYESRY DH LRRDRVDYLED  RGRSKFDR  P+DW+LRD+GRDDF
Subjt:  YLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLED-LRGRSKFDRPIPADWALRDHGRDDF

Query:  LTERKGFERGPPSPP-LPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF
        + ERKGFER P SPP L LL QRGRWARDVRERSRSPIRGPVRSPLRV LRSPLSGGLPPKD+RRDV+VERERDDRRGLGRDRDGGPF
Subjt:  LTERKGFERGPPSPP-LPLLSQRGRWARDVRERSRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF

SwissProt top hitse value%identityAlignment
P35637 RNA-binding protein FUS4.5e-0930.86Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGR-------PFGRGVYGPGLAHGSFRG--DRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGA
        G++F G      +      +NR   +GR       P GRG YG G + G  RG           + R GDW C +P C N+NF+ R  CN C   +P G 
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGR-------PFGRGVYGPGLAHGSFRG--DRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGA

Query:  GGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDR
        GG P  S++G    +   RR      +R   R   G R   R     G R    G   P + +SR      RR+R
Subjt:  GGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDR

P56959 RNA-binding protein FUS4.9e-0830.86Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGR-------PFGRGVYGPGLAHGSFRG--DRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGA
        G++F G      +      +NR   +GR       P GRG YG G + G  RG           + R GDW C +P C N+NF+ R  CN C   +P G 
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGR-------PFGRGVYGPGLAHGSFRG--DRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGA

Query:  GGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDR
        GG P  S++G    +   RR      +R   R   G R   R     G R    G   P + +SR      RR+R
Subjt:  GGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDR

Q28009 RNA-binding protein FUS4.5e-0930.86Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGR-------PFGRGVYGPGLAHGSFRG--DRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGA
        G++F G      +      +NR   +GR       P GRG YG G + G  RG           + R GDW C +P C N+NF+ R  CN C   +P G 
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGR-------PFGRGVYGPGLAHGSFRG--DRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGA

Query:  GGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDR
        GG P  S++G    +   RR      +R   R   G R   R     G R    G   P + +SR      RR+R
Subjt:  GGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDR

Q92804 TATA-binding protein-associated factor 2N1.0e-0534.09Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRP
        G++F G      +    P + R        G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN  RP
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRP

Q94KD0 Transcription initiation factor TFIID subunit 15b2.1e-0639Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPF--GRGVYGPGLAHGS--------FRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGG
        G  +GGG   G Y      Y  R +SG     GRG YG G   G+          GDR         RDGDW C +P C N+NFARR  CN C    P G
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPF--GRGVYGPGLAHGS--------FRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGG

Arabidopsis top hitse value%identityAlignment
AT1G50300.1 TBP-associated factor 159.9e-0436.99Show/hide
Query:  ISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPR
        + +GR  GRG    G A G   G +    P       DW C  P+C N+N+A+R  CN CN ++PG   G  R
Subjt:  ISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPR

AT4G28990.1 RNA-binding protein-related6.3e-5943.78Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRY-SDDLGYRIHAGSASPTRR-RDDHRYISDLDHSG
        MGS +K+ TT HH P +SSLVVR S S+           GR   G DYE GEV R+ P F R DRY  D+ G+R  A S+SP RR  +DH++ SDL+HSG
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRY-SDDLGYRIHAGSASPTRR-RDDHRYISDLDHSG

Query:  GLTRGRDFGGGRDL-GRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNN-PNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGS
           RGR+    R+  GR+RD SP   R  +  RP+ RG+ GP   HG  R   S+NN   V+PR+GDWYC DPLCRNLNFARRE C  C R R   A   
Subjt:  GLTRGRDFGGGRDL-GRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNN-PNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGS

Query:  PRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKF-------DRPIPADW
        P    L PP  HSP R F              GYRSPPR W RD P         PPR+     DHP  RDR       R R  +        R I +DW
Subjt:  PRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKF-------DRPIPADW

Query:  ALRDHGRDDFLTERKGFERGPP-SPPLPLLSQRGRWARDVRERSRS-PIRGPVRSPLRVSLRSPLSGGLPP--KDFRRDVYVERE-RDDRRGLGRDRDGG
        A  +         +  ++R PP SPP     + GRW R +RERSRS P+R     PLR      L GG PP  +D+RRD + +RE RDD RG GR R G 
Subjt:  ALRDHGRDDFLTERKGFERGPP-SPPLPLLSQRGRWARDVRERSRS-PIRGPVRSPLRVSLRSPLSGGLPP--KDFRRDVYVERE-RDDRRGLGRDRDGG

Query:  PF
         +
Subjt:  PF

AT4G28990.2 RNA-binding protein-related4.7e-5439.33Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDD-------------------------------
        MGS +K+ TT HH P +SSLVVR S S+           GR   G DYE GEV R+ P F R DRY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDD-------------------------------

Query:  ------------------LGYRIHAGSASPTRR-RDDHRYISDLDHSGGLTRGRDFGGGRDL-GRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGD
                          LG+R  A S+SP RR  +DH++ SDL+HSG   RGR+    R+  GR+RD SP   R  +  RP+ RG+ GP   HG  R  
Subjt:  ------------------LGYRIHAGSASPTRR-RDDHRYISDLDHSGGLTRGRDFGGGRDL-GRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGD

Query:  RSKNN-PNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAG
         S+NN   V+PR+GDWYC DPLCRNLNFARRE C  C R R   A   P    L PP  HSP R F              GYRSPPR W RD P      
Subjt:  RSKNN-PNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSYLGPPSLHSPPRRFAAHPMERSPGRTLAGYRSPPRAWARDGPREIAAG

Query:  ALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKF-------DRPIPADWALRDHGRDDFLTERKGFERGPP-SPPLPLLSQRGRWARDVRERSRS-PIRGP
           PPR+     DHP  RDR       R R  +        R I +DWA  +         +  ++R PP SPP     + GRW R +RERSRS P+R  
Subjt:  ALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKF-------DRPIPADWALRDHGRDDFLTERKGFERGPP-SPPLPLLSQRGRWARDVRERSRS-PIRGP

Query:  VRSPLRVSLRSPLSGGLPP--KDFRRDVYVERE-RDDRRGLGRDRDGGPF
           PLR      L GG PP  +D+RRD + +RE RDD RG GR R G  +
Subjt:  VRSPLRVSLRSPLSGGLPP--KDFRRDVYVERE-RDDRRGLGRDRDGGPF

AT5G58470.1 TBP-associated factor 15B1.5e-0739Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPF--GRGVYGPGLAHGS--------FRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGG
        G  +GGG   G Y      Y  R +SG     GRG YG G   G+          GDR         RDGDW C +P C N+NFARR  CN C    P G
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPF--GRGVYGPGLAHGS--------FRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGG

AT5G58470.2 TBP-associated factor 15B1.5e-0739Show/hide
Query:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPF--GRGVYGPGLAHGS--------FRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGG
        G  +GGG   G Y      Y  R +SG     GRG YG G   G+          GDR         RDGDW C +P C N+NFARR  CN C    P G
Subjt:  GRDFGGGRDLGRYRDSSPHYNRRISSGRPF--GRGVYGPGLAHGS--------FRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCGAGGGATAAAGATTCTACGACTCACCATCAACCGTTATTGAGCAGCCTTGTAGTACGAGCCTCCAATAGTGATGGAGGTGGTGGTGGAGTCGGCGGAACCAG
TGGTGGCCGCGTGGGTCGTGGAAGCGATTATGAGGCCGGTGAGGTTCCCCGCAATCCTCCACAATTTACTCGATTGGATCGATATTCAGATGATTTGGGATATAGAATAC
ATGCAGGTTCAGCTTCTCCAACACGTCGTCGGGATGATCACCGTTATATTTCTGATCTCGATCATTCTGGTGGTCTTACGCGGGGCCGTGACTTCGGTGGGGGAAGGGAT
CTTGGTAGATATCGAGACTCTTCACCTCATTACAATCGAAGAATAAGTAGTGGTAGGCCATTTGGGAGAGGCGTTTATGGCCCTGGACTTGCTCATGGGTCATTTAGAGG
GGATCGTAGTAAGAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTCGCAACCTAAACTTTGCAAGACGAGAACTTTGTAACAACT
GCAACAGATCCCGCCCTGGAGGAGCGGGTGGAAGCCCTCGAAGAAGCTATCTTGGTCCACCATCCCTGCATTCTCCTCCTAGACGTTTTGCTGCCCACCCAATGGAACGT
TCTCCTGGCAGAACTCTGGCAGGATATAGGTCTCCTCCCCGTGCTTGGGCCAGGGATGGTCCTAGAGAGATTGCAGCTGGTGCTCTAGCACCTCCGAGGTATGAAAGCAG
GTATCCCGATCACCCCCTGCGAAGAGATAGGGTGGACTATCTAGAAGACTTGAGAGGAAGATCCAAGTTTGATAGGCCGATTCCTGCTGATTGGGCCCTTCGAGACCATG
GAAGGGATGACTTCCTTACCGAGAGGAAGGGCTTTGAGAGAGGACCACCTTCTCCACCACTACCACTGCTCTCTCAACGTGGCCGTTGGGCGCGCGATGTGCGGGAGAGA
AGTCGTTCCCCAATTAGAGGTCCGGTTAGATCTCCATTGAGAGTCTCGCTTCGATCACCACTAAGTGGAGGCCTTCCGCCAAAAGACTTCCGTAGAGATGTTTATGTTGA
AAGGGAGCGCGACGATAGGCGTGGCCTAGGACGAGATCGCGATGGAGGTCCATTCTAA
mRNA sequenceShow/hide mRNA sequence
GTGAGAAGTGCATTTTACCGGGAAGGGACAGCGGAGGCGCACACCCTACTTCAACGGGGCATACCCTTTTTCGGAATTTCACATGCTGAATTCCTTAACTAGAATTTAGG
GTTTATGAATGTCCACAACCCATCTTTTTCAACGATTTATCGGCGTTCGGACTTCAGACCCACCAGTTTCGTCGCTGCTCAGGCGGAAGCGGAAGAAATAGGAAAAGAGG
ACGAAGGATGGGTTCGAGGGATAAAGATTCTACGACTCACCATCAACCGTTATTGAGCAGCCTTGTAGTACGAGCCTCCAATAGTGATGGAGGTGGTGGTGGAGTCGGCG
GAACCAGTGGTGGCCGCGTGGGTCGTGGAAGCGATTATGAGGCCGGTGAGGTTCCCCGCAATCCTCCACAATTTACTCGATTGGATCGATATTCAGATGATTTGGGATAT
AGAATACATGCAGGTTCAGCTTCTCCAACACGTCGTCGGGATGATCACCGTTATATTTCTGATCTCGATCATTCTGGTGGTCTTACGCGGGGCCGTGACTTCGGTGGGGG
AAGGGATCTTGGTAGATATCGAGACTCTTCACCTCATTACAATCGAAGAATAAGTAGTGGTAGGCCATTTGGGAGAGGCGTTTATGGCCCTGGACTTGCTCATGGGTCAT
TTAGAGGGGATCGTAGTAAGAATAATCCAAATGTGCGTCCTAGGGATGGGGATTGGTATTGCTCAGATCCTTTATGTCGCAACCTAAACTTTGCAAGACGAGAACTTTGT
AACAACTGCAACAGATCCCGCCCTGGAGGAGCGGGTGGAAGCCCTCGAAGAAGCTATCTTGGTCCACCATCCCTGCATTCTCCTCCTAGACGTTTTGCTGCCCACCCAAT
GGAACGTTCTCCTGGCAGAACTCTGGCAGGATATAGGTCTCCTCCCCGTGCTTGGGCCAGGGATGGTCCTAGAGAGATTGCAGCTGGTGCTCTAGCACCTCCGAGGTATG
AAAGCAGGTATCCCGATCACCCCCTGCGAAGAGATAGGGTGGACTATCTAGAAGACTTGAGAGGAAGATCCAAGTTTGATAGGCCGATTCCTGCTGATTGGGCCCTTCGA
GACCATGGAAGGGATGACTTCCTTACCGAGAGGAAGGGCTTTGAGAGAGGACCACCTTCTCCACCACTACCACTGCTCTCTCAACGTGGCCGTTGGGCGCGCGATGTGCG
GGAGAGAAGTCGTTCCCCAATTAGAGGTCCGGTTAGATCTCCATTGAGAGTCTCGCTTCGATCACCACTAAGTGGAGGCCTTCCGCCAAAAGACTTCCGTAGAGATGTTT
ATGTTGAAAGGGAGCGCGACGATAGGCGTGGCCTAGGACGAGATCGCGATGGAGGTCCATTCTAATGTTTGAGACTTGAAATTAACGTTTGGCTTCCCATGTAATTTCGG
TTAGAACAGCATCCTGTAATCTGTTTAGAGCCATGGAAGGGGGGATGATTTTTGGAGCTTGCATGATTTTTCTGACTGATCCAGTATGTCGAAAATTGCCAATGGGGTCC
ATTCTCTGCCTCGTGTGTTTCTGTGCGTCATTATTTGATGCTGTAGTTGGCAGCCAGCAGAATTAGTGCTTGTAAAAACCAATTTTTTTTCTTGTTTCTTTCCCATGTCT
GGTTTGTTGGCTTCTGGGCAATGGTTTTGAATCGTGGTTATCTTTACATCAATTCAGCAGATTTGTGAAGTGTCTTTTGATGTCTGTTAAACTTGTACTGTGGCATGGAT
TTGCTTGGGCTTGGAGAGAGCCACTCCAGTCCTGTCTTAATGGCTGCTTTATTGTTCTTCACTTGTCGGATCTGAGTAACAGAAATTTTGAGGGGATGGTGGCTTGTTTT
TCACGTAGTGTTTTCTAGTACTGTTTTTTGGGCTCATAAATGAGCAATCAAGTTGAAAAAACAAGAATGTCAATAAAAATATCACCTATACGGATATAAGTATTTTAATA
ATGAGATTTATAATTTGTATTCTAGTGAGCGTAAACAAAATAATTTTTATGAATTTTTAGAATATTAGGCTCAGGCTCTGGTATTCGATATTTAATCGTAACCGTCCATT
TCTTCGCATCCTAGTCTGACAGTAGAGAATCTCGCC
Protein sequenceShow/hide protein sequence
MGSRDKDSTTHHQPLLSSLVVRASNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRNPPQFTRLDRYSDDLGYRIHAGSASPTRRRDDHRYISDLDHSGGLTRGRDFGGGRD
LGRYRDSSPHYNRRISSGRPFGRGVYGPGLAHGSFRGDRSKNNPNVRPRDGDWYCSDPLCRNLNFARRELCNNCNRSRPGGAGGSPRRSYLGPPSLHSPPRRFAAHPMER
SPGRTLAGYRSPPRAWARDGPREIAAGALAPPRYESRYPDHPLRRDRVDYLEDLRGRSKFDRPIPADWALRDHGRDDFLTERKGFERGPPSPPLPLLSQRGRWARDVRER
SRSPIRGPVRSPLRVSLRSPLSGGLPPKDFRRDVYVERERDDRRGLGRDRDGGPF