; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G008560 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G008560
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionRanBP2-type domain-containing protein
Genome locationchr06:18013912..18019702
RNA-Seq ExpressionLsi06G008560
SyntenyLsi06G008560
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023595.1 hypothetical protein SDJN02_14621, partial [Cucurbita argyrosperma subsp. argyrosperma]7.3e-21379.38Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGE                                          L VAACK
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
         LDLEEG GCIRENLWRY VVVLR++ L ++LLFLISY+PT FFISGYR+HAGSVSPTRRRD HRYISDFDHS  LTRGREFGGGR+LGRYRDTSPHYSR
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG
        R+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GYVGPPSLHSPPRRF AHPIERSPG
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG

Query:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS----------
        RT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDFI ERKGFERRP S          
Subjt:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS----------

Query:  -----------PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
                   PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  -----------PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_004142516.1 uncharacterized protein LOC101209122 isoform X1 [Cucumis sativus]7.3e-20578.35Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRLDRYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYRIHAGSVSP RRRDVHRY+S+FDHS  LTRGREFGGGR+L RYRDTSPHY R
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR
        R+SGGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY GPPSLHSPPRRF AHPIERSPGR
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR

Query:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW
        TLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPL +LPQRGRW
Subjt:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW

Query:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        SRDVR+RSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_008462733.1 PREDICTED: uncharacterized protein LOC103501026 isoform X1 [Cucumis melo]4.0e-20377.92Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYRIHAGSVSP RRRDVHRYIS+F+HS  L RGREFGGGR+L RYRDTSPHY R
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR
        R+ GGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY GPPSLH+PPRRF AHPIERSPGR
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR

Query:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW
        TLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPL +LPQRGRW
Subjt:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW

Query:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        SRDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_022961322.1 uncharacterized protein LOC111461847 isoform X7 [Cucurbita moschata]8.3e-20178.02Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYR+HAGSVSPTRRRD HRYISDFDHS  LTRGREFGGGR+LGRYRDTSPHYSR
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG
        R+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GYVGPPSLHSPPRRF AHPIERSPG
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG

Query:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRG
        RT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDFI ERKGFERRPP PPLS+LPQRG
Subjt:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRG

Query:  RWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        RW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  RWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida]1.5e-20579.7Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGE+ RDPPQYSRLDRYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYRIHAGSVSPTRRRDVHRYISDFDHSG+LTRGR+FGGGR+LGRYRDTSPHYSR
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR
        RISGGRPFGRGVDGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR GA GSPRRGYVGPPSLHSPPRRF AHPIERSPGR
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR

Query:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS-PPLSMLPQRGR
        TLNEYRSPPRSWARDGPREIAAGGLAP RYESRYSDHLRRDRVDYL+DSFRGRSKFDRPLPSADWALRDNGRDDFI+ERKGFERRPPS PPLS+LPQRGR
Subjt:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS-PPLSMLPQRGR

Query:  WSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        W+RDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
Subjt:  WSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

TrEMBL top hitse value%identityAlignment
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X11.9e-20377.92Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYRIHAGSVSP RRRDVHRYIS+F+HS  L RGREFGGGR+L RYRDTSPHY R
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR
        R+ GGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY GPPSLH+PPRRF AHPIERSPGR
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR

Query:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW
        TLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPL +LPQRGRW
Subjt:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW

Query:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        SRDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A5D3E321 TATA-binding protein-associated factor 2N1.9e-20377.92Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYRIHAGSVSP RRRDVHRYIS+F+HS  L RGREFGGGR+L RYRDTSPHY R
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR
        R+ GGRPFGRGVDGP LAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRRGY GPPSLH+PPRRF AHPIERSPGR
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGR

Query:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW
        TLNEYRSPPRSWARDG RE+AAGGLAP RYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPL +LPQRGRW
Subjt:  TLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRW

Query:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        SRDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  SRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X74.0e-20178.02Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYR+HAGSVSPTRRRD HRYISDFDHS  LTRGREFGGGR+LGRYRDTSPHYSR
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG
        R+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GYVGPPSLHSPPRRF AHPIERSPG
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG

Query:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRG
        RT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDFI ERKGFERRPP PPLS+LPQRG
Subjt:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRG

Query:  RWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        RW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  RWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X63.8e-19977.85Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYR+HAGSVSPTRRRD HRYISDFDHS  LTRGREFGGGR+LGRYRDTSPHYSR
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG
        R+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GYVGPPSLHSPPRRF AHPIERSPG
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG

Query:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS-PPLSMLPQR
        RT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDFI ERKGFERRP S PPLS+LPQR
Subjt:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS-PPLSMLPQR

Query:  GRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
        GRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  GRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

A0A6J1HBT6 uncharacterized protein LOC111461847 isoform X27.9e-19774.64Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDL                               
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACK

Query:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR
                                                      GYR+HAGSVSPTRRRD HRYISDFDHS  LTRGREFGGGR+LGRYRDTSPHYSR
Subjt:  KLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSR

Query:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG
        R+SGGRPFGRG DGPG APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR GYVGPPSLHSPPRRF AHPIERSPG
Subjt:  RISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRR-GYVGPPSLHSPPRRFTAHPIERSPG

Query:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS----------
        RT+NEYRSPPR WARDGPREIAAGGLAP RYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDFI ERKGFERRP S          
Subjt:  RTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPS----------

Query:  -----------PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF
                   PPLS+LPQRGRW+RDVRERSRSPIRGPVRSPLRVPLRSPLS GLPPKD+RRDVFVERERDDRRGLGRDRDGGPF
Subjt:  -----------PPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRDVFVERERDDRRGLGRDRDGGPF

SwissProt top hitse value%identityAlignment
P35637 RNA-binding protein FUS2.1e-0530.46Show/hide
Query:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA
        G+EF G      +      ++R    GR       P GRG  G G + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA

Query:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR
        GG P   ++G    +   RR      +R   R     R   R     G R    GG  P + +SR  +H R+DR
Subjt:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR

Q01844 RNA-binding protein EWS1.6e-0542.5Show/hide
Query:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP
        GR  GRG D  G  P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+         G++ PP
Subjt:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP

Q28009 RNA-binding protein FUS2.1e-0530.46Show/hide
Query:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA
        G+EF G      +      ++R    GR       P GRG  G G + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRNLGRYRDTSPHYSRRISGGR-------PFGRGVDGPGLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR-TGA

Query:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR
        GG P   ++G    +   RR      +R   R     R   R     G R    GG  P + +SR  +H R+DR
Subjt:  GGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDR

Q61545 RNA-binding protein EWS1.6e-0542.5Show/hide
Query:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP
        GR  GRG D  G  P   RG  S+ NP    NV+ R GDW C +P C N NFA R  CN C  P+         G++ PP
Subjt:  GRPFGRGVDGPGLAPGPFRGERSKNNP----NVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPP

Q92804 TATA-binding protein-associated factor 2N8.6e-0736.19Show/hide
Query:  GREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-
        G+EF G      +    P + R        G G  G     G +RG          P+ GDW C +P C N+NFARR  CN CN PR      +GG  R 
Subjt:  GREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR----TGAGGSPR-

Query:  RGYVG
        RGY G
Subjt:  RGYVG

Arabidopsis top hitse value%identityAlignment
AT4G28990.1 RNA-binding protein-related9.3e-4935.53Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAAC
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D G                             
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAAC

Query:  KKLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRR-RDVHRYISDFDHSGSLTRGREFGGGRNL-GRYRDTSPH
                                                       G+R  A S SP RR  + H++ SD +HSG   RGRE    R   GR+RD SP 
Subjt:  KKLDLEEGIGCIRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRR-RDVHRYISDFDHSGSLTRGREFGGGRNL-GRYRDTSPH

Query:  YSRRISGGRPFGRGVDGPGLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIER
         +R  +G RP+ RG+DGP   P   R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP            P  R    P+  
Subjt:  YSRRISGGRPFGRGVDGPGLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIER

Query:  SPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQ
        SP R  N YRSPPR W RD P         P R++        RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+
Subjt:  SPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQ

Query:  RGRWSRDVRERSRS-PIRGPVRSPLRVPLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF
         GRW R +RERSRS P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  RGRWSRDVRERSRS-PIRGPVRSPLRVPLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF

AT4G28990.2 RNA-binding protein-related1.3e-5336.94Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAAC
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R DRY  D G L           ++   +  +ML +   
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAAC

Query:  KKLDLEEGIGCIRENLWRYTVVVLRMI-SLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRR-RDVHRYISDFDHSGSLTRGREFGGGRNL-GRYRDTSP
                   +  N++   ++ + M+  L   ++  + Y        G+R  A S SP RR  + H++ SD +HSG   RGRE    R   GR+RD SP
Subjt:  KKLDLEEGIGCIRENLWRYTVVVLRMI-SLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRR-RDVHRYISDFDHSGSLTRGREFGGGRNL-GRYRDTSP

Query:  HYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIE
          +R  +G RP+ RG+DGP   P   R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP            P  R    P+ 
Subjt:  HYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIE

Query:  RSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLP
         SP R  N YRSPPR W RD P         P R++        RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P
Subjt:  RSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLP

Query:  QRGRWSRDVRERSRS-PIRGPVRSPLRVPLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF
        + GRW R +RERSRS P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  QRGRWSRDVRERSRS-PIRGPVRSPLRVPLRSPLSSGLPP--KDFRRDVFVERE-RDDRRGLGRDRDGGPF

AT5G58470.1 TBP-associated factor 15B1.1e-0437.62Show/hide
Query:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR
        G  +GGG     GR       Y  R   G   GRG  G G   G   G+R         RDGDW C +P C N+NFARR  CN C    P   + G+  R
Subjt:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR

Query:  G
        G
Subjt:  G

AT5G58470.2 TBP-associated factor 15B1.1e-0437.62Show/hide
Query:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR
        G  +GGG     GR       Y  R   G   GRG  G G   G   G+R         RDGDW C +P C N+NFARR  CN C    P   + G+  R
Subjt:  GREFGGG--RNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGAGGSPRR

Query:  G
        G
Subjt:  G


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCCCGAGATAAAGACTCTACGACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGCGTTGGTCGTGGAAGCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGGAACTTGTGGT
GCACCTTTGATTTGAACTCGTTGGTAGACAAATATCCATCTAACATTGATAAGAGGATGCTCTATGTTGCCGCCTGCAAGAAATTAGATCTTGAGGAGGGCATTGGTTGT
ATCAGGGAAAATCTTTGGAGATATACGGTTGTTGTACTTCGAATGATTTCCCTATACAGCCTGTTACTTTTCTTAATATCTTATAAACCCACTACATTCTTTATTTCAGG
ATATAGAATACATGCAGGTTCAGTTTCTCCAACGCGCCGTCGGGATGTTCACCGATATATTTCTGATTTTGATCATTCTGGTAGTCTCACTCGCGGTCGTGAATTTGGTG
GTGGGAGGAATCTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGAATAAGTGGTGGCAGGCCATTTGGGAGAGGTGTTGATGGCCCTGGACTTGCTCCTGGG
CCATTTCGAGGGGAACGCAGTAAAAATAATCCAAATGTGCGTCCAAGAGATGGGGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGAGAGTT
TTGTAACAACTGCAACAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAGGCTATGTTGGTCCACCATCCCTGCATTCTCCTCCTAGACGCTTCACTGCCCACCCAA
TTGAACGTTCTCCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAGTTGGGCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGCACCTTCGAGGTAT
GAAAGCAGGTATTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCTAAGTTCGATAGGCCACTTCCTTCAGCAGATTGGGCCCT
TAGAGACAATGGAAGGGATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAAGGCCACCATCACCACCACTGTCGATGCTTCCTCAGCGTGGGCGCTGGTCGCGTGATG
TGAGAGAGAGGAGCCGTTCCCCAATCAGAGGTCCTGTCAGATCTCCATTAAGAGTCCCGCTACGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTCCGTAGAGAT
GTTTTTGTTGAAAGGGAGCGCGATGATAGGCGTGGCCTAGGGCGAGATCGTGATGGAGGTCCATTTTAG
mRNA sequenceShow/hide mRNA sequence
GTCAGAGGAGGCGTATCCCCTAGCGAAGGAGGCATACCATTTTAGACTTCGGACCGACCAGTTTCGTCGCTGCTCATCGTAGCAGGCAGAAAAATTAGGCAAACAAGGCG
AAAGGATGGGTTCCCGAGATAAAGACTCTACGACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGA
ACCAGTGGCGGCCGCGTTGGTCGTGGAAGCGATTACGAGGCCGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGATCGATATTCAGATGATTTGGGGAACTT
GTGGTGCACCTTTGATTTGAACTCGTTGGTAGACAAATATCCATCTAACATTGATAAGAGGATGCTCTATGTTGCCGCCTGCAAGAAATTAGATCTTGAGGAGGGCATTG
GTTGTATCAGGGAAAATCTTTGGAGATATACGGTTGTTGTACTTCGAATGATTTCCCTATACAGCCTGTTACTTTTCTTAATATCTTATAAACCCACTACATTCTTTATT
TCAGGATATAGAATACATGCAGGTTCAGTTTCTCCAACGCGCCGTCGGGATGTTCACCGATATATTTCTGATTTTGATCATTCTGGTAGTCTCACTCGCGGTCGTGAATT
TGGTGGTGGGAGGAATCTTGGTAGATATCGAGATACTTCACCTCATTACAGTCGAAGAATAAGTGGTGGCAGGCCATTTGGGAGAGGTGTTGATGGCCCTGGACTTGCTC
CTGGGCCATTTCGAGGGGAACGCAGTAAAAATAATCCAAATGTGCGTCCAAGAGATGGGGATTGGTATTGCTCAGATCCTCTATGTGACAACCTAAACTTTGCAAGACGA
GAGTTTTGTAACAACTGCAACAGACCCCGCACTGGAGCTGGTGGAAGTCCTCGAAGAGGCTATGTTGGTCCACCATCCCTGCATTCTCCTCCTAGACGCTTCACTGCCCA
CCCAATTGAACGTTCTCCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAGTTGGGCCAGGGATGGTCCTAGGGAGATTGCAGCTGGTGGTCTGGCACCTTCGA
GGTATGAAAGCAGGTATTCCGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCTAAGTTCGATAGGCCACTTCCTTCAGCAGATTGG
GCCCTTAGAGACAATGGAAGGGATGATTTCATCACAGAGAGGAAGGGATTTGAAAGAAGGCCACCATCACCACCACTGTCGATGCTTCCTCAGCGTGGGCGCTGGTCGCG
TGATGTGAGAGAGAGGAGCCGTTCCCCAATCAGAGGTCCTGTCAGATCTCCATTAAGAGTCCCGCTACGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTCCGTA
GAGATGTTTTTGTTGAAAGGGAGCGCGATGATAGGCGTGGCCTAGGGCGAGATCGTGATGGAGGTCCATTTTAGTGTTGAAACTTTTGAGATTTTATGTCTCTTTCCCAT
GTAATTTTTGGTTAGAACATCAGCACCATGTAATCTGTGTTTAGAGCCGTGGAAAGAGGATGCTAAGATGAGCTTGCTTGATTTTTCCCGACTGGTCCAGTGTGTCGAAA
CTTGCTGATGGGGTTTTATCTGCCTCGTGTGTTGTGTTTCTTATCATTTTATGGTGTAGTTAGCAGCCAAGCAGAAATAGTGAAGTAGTAAATGCTTTTAATATACATTC
TGTGAACTTACTTTTCTGTTTTTGGTTTGTTGGCTTCTGGGCAATGATTTTTAACACATCATTATCCTTTCAAATGTGGTTTGGATTTTGTTTGGACTTGAAAGAGCCCT
TCCAATATTTATATGTGGTCTGCTCTATTATTCTTTTAGAGCTTGTTTGGGAGGTTGAATTGTAATCGAGGAATATTAGAATGTGTGAGGGATCGAAAGTGGAATCAAAA
TAGAATCGTGGAATGAGTTTAGTATTACAAA
Protein sequenceShow/hide protein sequence
MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLDRYSDDLGNLWCTFDLNSLVDKYPSNIDKRMLYVAACKKLDLEEGIGC
IRENLWRYTVVVLRMISLYSLLLFLISYKPTTFFISGYRIHAGSVSPTRRRDVHRYISDFDHSGSLTRGREFGGGRNLGRYRDTSPHYSRRISGGRPFGRGVDGPGLAPG
PFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGAGGSPRRGYVGPPSLHSPPRRFTAHPIERSPGRTLNEYRSPPRSWARDGPREIAAGGLAPSRY
ESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLSMLPQRGRWSRDVRERSRSPIRGPVRSPLRVPLRSPLSSGLPPKDFRRD
VFVERERDDRRGLGRDRDGGPF