; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0051901 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0051901
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTATA-binding protein-associated factor 2N
Genome locationCMiso1.1chr02:18906861..18913436
RNA-Seq ExpressionCmc02g0051901
SyntenyCmc02g0051901
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR034870 - TAF15/EWS/TLS family
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142516.1 uncharacterized protein LOC101209122 isoform X1 [Cucumis sativus]3.2e-22297.4Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEAGEVPRDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVR+RSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_004142517.1 uncharacterized protein LOC101209122 isoform X2 [Cucumis sativus]1.4e-20692.47Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRG+DYEA                    GYRIHAGSVSP RRRDVHRY+SNF+HS+GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRV GGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVR+RSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_008462733.1 PREDICTED: uncharacterized protein LOC103501026 isoform X1 [Cucumis melo]4.3e-227100Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_008462734.1 PREDICTED: uncharacterized protein LOC103501026 isoform X2 [Cucumis melo]1.3e-21094.81Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEA                    GYRIHAGSVSPARRRDVHRYISNFNHSDGLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

XP_038880192.1 uncharacterized protein LOC120071861 isoform X1 [Benincasa hispida]1.9e-20691.71Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVR SNSDGGGGGVGGTSGGRVGRGSDYEAGE+ RDPPQYSRL RYSDDLGYRIHAGSVSP RRRDVHRYIS+F+HS  L 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGR+FGGGRDL RYRDTSPHY RR+ GGRPFGRGVDGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPR G  GSPRRGY 
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLH+PPRRFAAHPIERSPGRTLNEYRSPPRSWARDG RE+AAGGLAPPRYESRYSDHLRRDRVDYL+DSFRGRSKFDRPLPSADWALRDNGRDDFI+
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPS PPL LLPQRGRW+RDVRERSRSPIRGP+RSPLRVPLRSPLSSGLPPKDFRRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

TrEMBL top hitse value%identityAlignment
A0A1S3CHL9 uncharacterized protein LOC103501026 isoform X12.1e-227100Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A1S3CHP1 uncharacterized protein LOC103501026 isoform X26.1e-21194.81Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEA                    GYRIHAGSVSPARRRDVHRYISNFNHSDGLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A5D3E321 TATA-binding protein-associated factor 2N2.1e-227100Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
        RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYA

Query:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
        GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT
Subjt:  GPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFIT

Query:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
Subjt:  ERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A6J1H9W7 uncharacterized protein LOC111461847 isoform X78.0e-20390.44Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDLGYR+HAGSVSP RRRD HRYIS+F+HS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY
        RGREFGGGRDL RYRDTSPHY RRV GGRPFGRG DGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRR GY
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY

Query:  AGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
         GPPSLH+PPRRFAAHPIERSPGRT+NEYRSPPR WARDG RE+AAGGLAPPRYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  AGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        I ERKGFERRPP PPL LLPQRGRW+RDVRERSRSPIRGP+RSPLRVPLRSPLS GLPPKD+RRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

A0A6J1HA20 uncharacterized protein LOC111461847 isoform X67.5e-20190.21Show/hide
Query:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA
        MGSR KDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRL RYSDDLGYR+HAGSVSP RRRD HRYIS+F+HS GL 
Subjt:  MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLA

Query:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY
        RGREFGGGRDL RYRDTSPHY RRV GGRPFGRG DGP  APGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTG GGSPRR GY
Subjt:  RGREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRR-GY

Query:  AGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF
         GPPSLH+PPRRFAAHPIERSPGRT+NEYRSPPR WARDG RE+AAGGLAPPRYESRYSD HLRRDRVDYLEDSFRGRSKFDR LP +DW+LRDNGRDDF
Subjt:  AGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSD-HLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDF

Query:  ITERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF
        I ERKGFERRP S PPL LLPQRGRW+RDVRERSRSPIRGP+RSPLRVPLRSPLS GLPPKD+RRDVF ERERDDRRGLGRDR+GGPF
Subjt:  ITERKGFERRPPS-PPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF

SwissProt top hitse value%identityAlignment
P35637 RNA-binding protein FUS4.3e-0430.46Show/hide
Query:  GREFGGGRDLDRYRDTSPHYGRRVGGGR-------PFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG-
        G+EF G      +      + R  G GR       P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRDLDRYRDTSPHYGRRVGGGR-------PFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG-

Query:  GGSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR
        GG P   + G    +   RR      +R   R     R   R   R G      GG  P + +SR  +H R+DR
Subjt:  GGSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR

P56959 RNA-binding protein FUS1.9e-0432.73Show/hide
Query:  GREFGGGRDLDRYRDTSPHYGRRVGGGR-------PFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG-
        G+EF G      +      + R  G GR       P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRDLDRYRDTSPHYGRRVGGGR-------PFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG-

Query:  GGSPRRGYAG
        GG P   + G
Subjt:  GGSPRRGYAG

Q28009 RNA-binding protein FUS4.3e-0430.46Show/hide
Query:  GREFGGGRDLDRYRDTSPHYGRRVGGGR-------PFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG-
        G+EF G      +      + R  G GR       P GRG  G   + G  RG           + R GDW C +P C+N+NF+ R  CN C  P+  G 
Subjt:  GREFGGGRDLDRYRDTSPHYGRRVGGGR-------PFGRGVDGPRLAPGPFRG--ERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGG-

Query:  GGSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR
        GG P   + G    +   RR      +R   R     R   R   R G      GG  P + +SR  +H R+DR
Subjt:  GGSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDR

Q92804 TATA-binding protein-associated factor 2N8.4e-0836.19Show/hide
Query:  GREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRP-----RTGGGGSPR
        G+EF G      +    P + R        G G  G R   G +RG          P+ GDW C +P C N+NFARR  CN CN P     R  GG    
Subjt:  GREFGGGRDLDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRP-----RTGGGGSPR

Query:  RGYAG
        RGY G
Subjt:  RGYAG

Q94KD0 Transcription initiation factor TFIID subunit 15b2.5e-0434.15Show/hide
Query:  DGLARGREFGGGRDLDRYRDTS--PHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        DG   G  +GGG      R  S    YG R G G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  DGLARGREFGGGRDLDRYRDTS--PHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP

Arabidopsis top hitse value%identityAlignment
AT4G28990.1 RNA-binding protein-related6.1e-5441.31Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRY-SDDLGYRIHAGSVSPARR-RDVHRYISNFNHSD
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R  RY  D+ G+R  A S SP RR  + H++ S+ NHS 
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRY-SDDLGYRIHAGSVSPARR-RDVHRYISNFNHSD

Query:  GLARGREFGGGRDL-DRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP
           RGRE    R+   R+RD SP   R   G RP+ RG+DGP   P   R   S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP
Subjt:  GLARGREFGGGRDL-DRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSP

Query:  RRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR
                    P  R    P+  SP R  N YRSPPR W RD           PPR++        RDR  Y +  +    +      ++DWA  +   
Subjt:  RRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGR

Query:  DDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF
              +  ++RRPP  P    P+ GRW R +RERSRSP   P+R     PLR      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  DDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPLRS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF

AT4G28990.2 RNA-binding protein-related4.5e-4937.08Show/hide
Query:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDD-------------------------------
        MGS +K+ TT HH P +SSLVVRPS S+           GR   G DYE GEV RD P ++R  RY  D                               
Subjt:  MGSRDKDSTT-HHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDD-------------------------------

Query:  ------------------LGYRIHAGSVSPARR-RDVHRYISNFNHSDGLARGREFGGGRDL-DRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGE
                          LG+R  A S SP RR  + H++ S+ NHS    RGRE    R+   R+RD SP   R   G RP+ RG+DGP   P   R  
Subjt:  ------------------LGYRIHAGSVSPARR-RDVHRYISNFNHSDGLARGREFGGGRDL-DRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGE

Query:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGG
         S+NN   V+PR+GDWYC DPLC NLNFARRE C  C R R     SP            P  R    P+  SP R  N YRSPPR W RD         
Subjt:  RSKNN-PNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTLNEYRSPPRSWARDGSREMAAGG

Query:  LAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPL
          PPR++        RDR  Y +  +    +      ++DWA  +         +  ++RRPP  P    P+ GRW R +RERSRSP   P+R     PL
Subjt:  LAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSRDVRERSRSPIRGPIRSPLRVPL

Query:  RS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF
        R      L  G PP  +D+RRD   +RE RDD RG GR R G  +
Subjt:  RS----PLSSGLPP--KDFRRDVFGERE-RDDRRGLGRDREGGPF

AT5G58470.1 TBP-associated factor 15B1.8e-0534.15Show/hide
Query:  DGLARGREFGGGRDLDRYRDTS--PHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        DG   G  +GGG      R  S    YG R G G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  DGLARGREFGGGRDLDRYRDTS--PHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP

AT5G58470.2 TBP-associated factor 15B1.8e-0534.15Show/hide
Query:  DGLARGREFGGGRDLDRYRDTS--PHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG
        DG   G  +GGG      R  S    YG R G G   GRG  G     G   G+R         RDGDW C +P C N+NFARR  CN C    P     
Subjt:  DGLARGREFGGGRDLDRYRDTS--PHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCN--RPRTGGG

Query:  GSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP
        G+  RG  G         R          GR+   + Y    RS    GS   RE  + G APP
Subjt:  GSPRRGYAGPPSLHTPPRRFAAHPIERSPGRTL--NEYRSPPRSWARDGS---REMAAGGLAPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCCAGGGATAAAGATTCTACCACTCACCACCAGCCGTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAG
TGGCGGCCGTGTTGGTCGTGGAAGCGATTACGAGGCTGGTGAGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGGTCGATATTCAGATGATTTGGGATATAGAATAC
ATGCAGGTTCAGTTTCTCCAGCACGCCGTCGGGATGTTCATCGATACATTTCTAATTTTAATCATTCTGATGGTCTCGCCCGAGGTCGTGAATTTGGTGGTGGGAGGGAT
CTTGATAGATATCGTGATACTTCACCTCACTATGGTCGAAGAGTAGGTGGTGGTAGGCCATTTGGGAGAGGTGTTGATGGCCCTAGACTTGCTCCTGGGCCATTTCGAGG
GGAGCGCAGTAAAAATAATCCAAATGTGCGTCCTAGGGACGGGGATTGGTATTGCTCAGATCCTTTATGTGACAATCTAAACTTTGCAAGACGAGAATTCTGTAACAACT
GCAACAGACCTCGCACTGGAGGTGGTGGAAGTCCTCGAAGAGGCTATGCTGGTCCACCATCCCTGCATACTCCTCCTAGACGTTTCGCTGCCCACCCAATTGAACGTTCT
CCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAGCTGGGCGAGGGATGGTTCTAGGGAGATGGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTA
TTCTGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAGACAGCTTCAGAGGAAGATCCAAGTTCGATAGACCACTTCCTTCAGCAGATTGGGCCCTTAGAGACAATG
GAAGGGATGACTTCATCACAGAGAGGAAGGGATTCGAAAGAAGGCCACCATCCCCACCATTGCCGTTGCTTCCTCAGCGTGGACGCTGGTCGCGTGATGTGAGAGAGAGG
AGCCGTTCTCCCATCAGAGGTCCAATCAGGTCTCCACTAAGAGTCCCGCTACGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTTCGTAGAGATGTTTTCGGTGA
AAGGGAGCGTGATGATAGGCGCGGTCTAGGACGAGATCGTGAGGGAGGTCCATTTTAG
mRNA sequenceShow/hide mRNA sequence
TGGTAACCTTTTCAAACAAATATCCCAAATTTTAGGGACTAAAAAGTAAAAAGTATATTTTCCTATTTTATATTTTACCGAAAACGGTCTGGGGAGGCGTACTACCTAGC
GAAGGAGACGTACCCTTTTACGTAATTTCTCGTGCTGAATTCCTCACAACTAGAATTTAGGGTTTATGAATGTGCACCGACCTTTTTCAACCATTCATCGGCGTTCAAAC
TTCGGACCGACCAATTTCGTCGCTGCTCATCGTAGCAGGCTGAAGGATTAAGCAAACAACGCGAAACCAATGGGTTCCAGGGATAAAGATTCTACCACTCACCACCAGCC
GTTATTGAGCAGCCTTGTTGTCCGGCCTTCGAATAGTGACGGAGGTGGTGGCGGAGTCGGTGGAACCAGTGGCGGCCGTGTTGGTCGTGGAAGCGATTACGAGGCTGGTG
AGGTTCCCCGTGACCCTCCACAATATTCTCGATTGGGTCGATATTCAGATGATTTGGGATATAGAATACATGCAGGTTCAGTTTCTCCAGCACGCCGTCGGGATGTTCAT
CGATACATTTCTAATTTTAATCATTCTGATGGTCTCGCCCGAGGTCGTGAATTTGGTGGTGGGAGGGATCTTGATAGATATCGTGATACTTCACCTCACTATGGTCGAAG
AGTAGGTGGTGGTAGGCCATTTGGGAGAGGTGTTGATGGCCCTAGACTTGCTCCTGGGCCATTTCGAGGGGAGCGCAGTAAAAATAATCCAAATGTGCGTCCTAGGGACG
GGGATTGGTATTGCTCAGATCCTTTATGTGACAATCTAAACTTTGCAAGACGAGAATTCTGTAACAACTGCAACAGACCTCGCACTGGAGGTGGTGGAAGTCCTCGAAGA
GGCTATGCTGGTCCACCATCCCTGCATACTCCTCCTAGACGTTTCGCTGCCCACCCAATTGAACGTTCTCCTGGCAGGACTCTTAATGAATATAGGTCTCCTCCCCGTAG
CTGGGCGAGGGATGGTTCTAGGGAGATGGCAGCTGGTGGCCTGGCACCTCCAAGGTATGAAAGCAGGTATTCTGATCACCTGCGGAGAGATAGGGTGGACTATCTAGAAG
ACAGCTTCAGAGGAAGATCCAAGTTCGATAGACCACTTCCTTCAGCAGATTGGGCCCTTAGAGACAATGGAAGGGATGACTTCATCACAGAGAGGAAGGGATTCGAAAGA
AGGCCACCATCCCCACCATTGCCGTTGCTTCCTCAGCGTGGACGCTGGTCGCGTGATGTGAGAGAGAGGAGCCGTTCTCCCATCAGAGGTCCAATCAGGTCTCCACTAAG
AGTCCCGCTACGGTCTCCATTAAGTAGCGGCCTTCCACCAAAAGACTTTCGTAGAGATGTTTTCGGTGAAAGGGAGCGTGATGATAGGCGCGGTCTAGGACGAGATCGTG
AGGGAGGTCCATTTTAGTGTTGAAACTTGGTTACCCATGTAATTTTTGGTTAGACATCATAGGCACCCTGTAATGTGTTTAGAACTGTGGAAGGAGGATGGTAAGAAGAG
GGTACTTCATTTTTCTTGACTGGGTCTAGTGTGTGGAAACTTGCTGATGGGGTGTTTTATCTGTTGCTCGTGTGATTTTCGTGTTTTTACCATTTTATGGTGTAGTTAGC
AGCCAAGCAGAAATAGTTGAGTCAATGCTTTAAAATACAGTATCTGTTTAACTTCATTTGTCATCTTTGGTTTATATTGGCTTCTGGGCAATGGTTTTGAACCAACATTT
TCTTTGCATAATTTAAAATTTTTCAGACGTGGTTTGGATTTTGTTTGGACATTTGGAGATAGCTATTCCTGTTTGGATTCATGGCTGTCTCTGTTCTTTTTCAATGGTCA
GATAATATTTATATTACAGTTAACGTAAATTTGGAGATTTTGAACTTCAAATCTCTTTAGTG
Protein sequenceShow/hide protein sequence
MGSRDKDSTTHHQPLLSSLVVRPSNSDGGGGGVGGTSGGRVGRGSDYEAGEVPRDPPQYSRLGRYSDDLGYRIHAGSVSPARRRDVHRYISNFNHSDGLARGREFGGGRD
LDRYRDTSPHYGRRVGGGRPFGRGVDGPRLAPGPFRGERSKNNPNVRPRDGDWYCSDPLCDNLNFARREFCNNCNRPRTGGGGSPRRGYAGPPSLHTPPRRFAAHPIERS
PGRTLNEYRSPPRSWARDGSREMAAGGLAPPRYESRYSDHLRRDRVDYLEDSFRGRSKFDRPLPSADWALRDNGRDDFITERKGFERRPPSPPLPLLPQRGRWSRDVRER
SRSPIRGPIRSPLRVPLRSPLSSGLPPKDFRRDVFGERERDDRRGLGRDREGGPF