CuGenDBv2

Sgr030522 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030522
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: ENT domain-containing protein
Genome location: tig00154107:1206722..1219401
RNA-Seq Expression: Sgr030522
Synteny: Sgr030522
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains:
IPR005491 - ENT domain
IPR008395 - Agenet-like domain
IPR014002 - Agenet domain, plant type
IPR036142 - ENT domain-like superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6586407.1 hypothetical protein SDJN03_19140, partial [Cucurbita argyrosperma subsp. sororia]; e value: 4.4e-195; % identity: 68.69
Query:  MIPRQLRTVFTGAAVILGGICTLNFASFLTIQTLRLTSEAKRR-----------QGFYICKLCRGNAVIQWSPLSDPIAMNPCVCPTCEGNSLFPSNCWG
        M+PRQLRT+FTGAAVI GGI TL FASFLTIQTLR T+EAKRR           +GFYICKLCRGNAVIQWSPLSDPIAMNPCVCPTCEGN +   +C  
Subjt:  MIPRQLRTVFTGAAVILGGICTLNFASFLTIQTLRLTSEAKRR-----------QGFYICKLCRGNAVIQWSPLSDPIAMNPCVCPTCEGNSLFPSNCWG

Query:  FLVFGVIVTLEVSACQRREEIVLCLVIDLNYIHFHASRMSGAREIEIEPVLLVPPESTLPTVDRLPLNMNMYSLLNCLEKPFDLFSSCESIKETTSLTQR
         L  G   + ++    R+                               V L   ES    V R  L  +  S +N     FD  + C        L   
Subjt:  FLVFGVIVTLEVSACQRREEIVLCLVIDLNYIHFHASRMSGAREIEIEPVLLVPPESTLPTVDRLPLNMNMYSLLNCLEKPFDLFSSCESIKETTSLTQR

Query:  NDRTLPCNVIADSFLWKSVHTSVNMRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGD
         +RT     + DS L KSVH S NMRFRKGSKVEVLSKKE PSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  S EVLENWIPGD
Subjt:  NDRTLPCNVIADSFLWKSVHTSVNMRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGD

Query:  VVEVFNDRSWKMATVSEVLGKNNYLVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGS
        VVEVFNDRSWKMATVSEVLGKNNYLVRLLGSS EFKV KFDIRARRSWQDDKWVL+HK S + GDDSKED NASPRF GLSSQIQK QNV N +T +RGS
Subjt:  VVEVFNDRSWKMATVSEVLGKNNYLVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGS

Query:  NYKYFQAETMARAGLKRRVLKKKRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVG
        NY+  QAETM RAGLK R LKKK+RYH+VVA NPSTLHEHVKS+ IQRGMLG   GG +LDRT I EMN DRKKQMG AYHSF ENFELNDADRATCSVG
Subjt:  NYKYFQAETMARAGLKRRVLKKKRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVG

Query:  SCSVSNSDDHGLPFHVSIGRNEHTDGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRL
        SCSVSNSD HGL   VSIG NE TDGSTSDAESFCHLGY  DNFLL RD+PLEAEIHR+
Subjt:  SCSVSNSDDHGLPFHVSIGRNEHTDGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRL

XP_022143181.1 uncharacterized protein LOC111013114 isoform X1 [Momordica charantia]; e value: 3.0e-196; % identity: 88.92
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSKKEAPSGSW SAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  +QEVLENW+PGDVVEVFNDRSWKMATVSEVLGK N+
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSSCEFKV+KFDIRARRSWQDDKWVLVHK SC+ GDDSKED+NASP F GLSSQIQ CQN SNFE+RKRGSNY+Y QAE+MARA LK RVLKKKR
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYH VVAGNPSTLH HVKS+AIQRGMLGE FGG SLDRTDI+EMN DRKK MG AY S +E  ELNDADRATCSVGSCSVSNSDD GLPFHVSIGRNE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGS SDAESFCHLGYG  NFLL RDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVST IS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

XP_022937876.1 uncharacterized protein LOC111444134 isoform X1 [Cucurbita moschata]; e value: 1.1e-185; % identity: 87.11
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSKKE PSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  S EVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSS EFKV KFDIRARRSWQDDKWVL+HK S + GDDSKED NASPRF GLSSQIQK QNV N +T +RGSNY+  QAETM RAGLK R LKKK+
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYH+VVA NPSTLHEHVKS+ IQRGMLG   GG +LDRT I EMN DRKKQMG AYHSF ENFELNDADRATCSVGSCSVSNSD HGL   VSIG NE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGSTSDAESFCHLGY  DNFLL RD+PLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

XP_022999271.1 uncharacterized protein LOC111493693 [Cucurbita maxima]; e value: 1.8e-185; % identity: 85.05
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSK E PSGSWRSAEIMSGSGHYYTVRYDKFEGG NQTVVER+SRKAIRPC +S EVLENWIPGDVVEVFNDRSWKMATV+EVLGK+NY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSS EFKVSKFDIRAR+SWQDD+ VL+HK S +HGDDSKED+ AS RF GLSSQIQKCQNV N ET KRGSNY+Y QAETM RAGLK RVLKKK+
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYHR VAGNPS LHEHVKS+ IQRGMLGE+ GG S DRT++ E NGDRKKQMG AYHSFEENFELNDADRATCSVGSCS+SNS+ HGLP  V+IG NE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGS+SDAESFCHLG G  N LL RDKPLEA+IH LELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

XP_038891200.1 uncharacterized protein LOC120080569 isoform X1 [Benincasa hispida]; e value: 1.6e-189; % identity: 86.86
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSKKE PSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  SQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        +VRLLGSSCEFKVSKFDIRARRS QDDKWVL+HK S +HGDDSKED+ AS RF GLSSQ+QKCQNV N  T KRG NY+Y QAETM R GLK R+ KKKR
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYH VVAGNPS LHEHVKS+ IQRGMLG   GG  LDRT I EMNGDRKKQMG AYHSFEENF+LN+ADRATCSVGSCS+++SDD GLP HV+IGRNE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGSTSDAESFCHLGYG  NFLL RDKPLEAE+HRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CPH4 uncharacterized protein LOC111013114 isoform X1; e value: 1.5e-196; % identity: 88.92
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSKKEAPSGSW SAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  +QEVLENW+PGDVVEVFNDRSWKMATVSEVLGK N+
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSSCEFKV+KFDIRARRSWQDDKWVLVHK SC+ GDDSKED+NASP F GLSSQIQ CQN SNFE+RKRGSNY+Y QAE+MARA LK RVLKKKR
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYH VVAGNPSTLH HVKS+AIQRGMLGE FGG SLDRTDI+EMN DRKK MG AY S +E  ELNDADRATCSVGSCSVSNSDD GLPFHVSIGRNE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGS SDAESFCHLGYG  NFLL RDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVST IS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

A0A6J1FBL1 uncharacterized protein LOC111444134 isoform X1; e value: 5.2e-186; % identity: 87.11
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSKKE PSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  S EVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSS EFKV KFDIRARRSWQDDKWVL+HK S + GDDSKED NASPRF GLSSQIQK QNV N +T +RGSNY+  QAETM RAGLK R LKKK+
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYH+VVA NPSTLHEHVKS+ IQRGMLG   GG +LDRT I EMN DRKKQMG AYHSF ENFELNDADRATCSVGSCSVSNSD HGL   VSIG NE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGSTSDAESFCHLGY  DNFLL RD+PLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

A0A6J1G3S7 uncharacterized protein LOC111450498; e value: 5.8e-185; % identity: 85.05
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSK E PSGSWRSAEIMSGSGHYYTVRYDKFEGG NQTVVER+SRKAIRPC +S EVLENWIPGDVVEVFNDRSWKMATV+EVLGKNNY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVR+LGSS EFKVSKFDIRARRSWQDD+ VL+ K S +HGDDSKED+ AS RF GLSSQIQKCQNV N ET KRGSNY+Y QAETM RAGLK RVLKKK+
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYHR VAGNPS LHEHVKS+ IQRGMLGE+ GG S DRT++ E NGDRKKQMG AYHSFEENFELNDADRATCSVGSCS+SNS+ HGLP  V+IGRNE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGS+SDAESFCHLG    N LL RDKPLEA+IH LELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

A0A6J1HQ07 uncharacterized protein LOC111465575 isoform X2; e value: 3.5e-182; % identity: 85.57
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSKKE PSGSWRSAEI+SGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPC  S EVLENWI GDVVEVFNDRSWKMA VSEVLGKNNY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSS EFKV KFDIRARRSWQDDKWVL+HK S +HG+DSKED NASPRF GLSSQIQK QNV N +T +RGSNY+  QAETM RAGLK R LKKKR
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYH+VVA +PSTLHEHVKS+ IQRGMLG    G++LDRT I EMN DRKKQMG AYHSF ENFELNDADRATCSVGSCSVSNSD  GL   VSIG NE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGSTSDAESFCHLGY  DNFLL RD+PLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISN+EHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

A0A6J1KCL4 uncharacterized protein LOC111493693; e value: 8.9e-186; % identity: 85.05
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRFRKGSKVEVLSK E PSGSWRSAEIMSGSGHYYTVRYDKFEGG NQTVVER+SRKAIRPC +S EVLENWIPGDVVEVFNDRSWKMATV+EVLGK+NY
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR
        LVRLLGSS EFKVSKFDIRAR+SWQDD+ VL+HK S +HGDDSKED+ AS RF GLSSQIQKCQNV N ET KRGSNY+Y QAETM RAGLK RVLKKK+
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKR

Query:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT
        RYHR VAGNPS LHEHVKS+ IQRGMLGE+ GG S DRT++ E NGDRKKQMG AYHSFEENFELNDADRATCSVGSCS+SNS+ HGLP  V+IG NE T
Subjt:  RYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHT

Query:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
        DGS+SDAESFCHLG G  N LL RDKPLEA+IH LELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS
Subjt:  DGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSIS

SwissProt top hits (e value, % identity, alignment)
Q08A72 Protein EMSY-LIKE 4; e value: 6.7e-05; % identity: 34.88
Query:  LEAEIHRLELHAYRCTMEALYASG-PLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSISRNWVARALAVGYAGGIVIQLTNRA
        +EA+IH++E  AY   + A  A G  +SWEKE ++T+LR  L +SN+EH    + L+    S + + R      +GG+   + N A
Subjt:  LEAEIHRLELHAYRCTMEALYASG-PLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSISRNWVARALAVGYAGGIVIQLTNRA

Q9C7C4 Protein EMSY-LIKE 1; e value: 1.1e-04; % identity: 48.98
Query:  LEAEIHRLELHAYRCTMEALYA-SGPLSWEKELLLTDLRLSLHISNDEH
        +E +IH+LE  AY   + A  A S  +SWEKE L+T+LR  L +S+DEH
Subjt:  LEAEIHRLELHAYRCTMEALYA-SGPLSWEKELLLTDLRLSLHISNDEH
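
The % identity values listed with each hit can be cross-checked against the alignment midline, in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal Python sketch, assuming that identity is the number of identical positions divided by the alignment length (the usual BLAST convention, not stated on this page). The function name `percent_identity` is hypothetical and is not part of CuGenDB.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Assumption: letters mark identical residues; '+' and spaces do not count.
    The midline must be passed without its leading indentation.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(midline), 2)


# Midline of the Q9C7C4 (EMSY-LIKE 1) alignment, indentation removed.
midline = "+E +IH+LE  AY   + A  A S  +SWEKE L+T+LR  L +S+DEH"
print(percent_identity(midline))  # 24 identical positions out of 49 -> 48.98
```

For this alignment the result agrees with the 48.98 reported for the hit. Trailing spaces in a midline are easily lost when copying, so longer alignments should be checked for length against the Query line first.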

Arabidopsis top hits (e value, % identity, alignment)
AT2G25590.1 Plant Tudor-like protein; e value: 8.3e-35; % identity: 54.74
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVF-NDRSWKMATVSEVLGKNN
        MRFR+GS+VEV S KEA  G WRSAEI+SG+GH Y VRY  FE  +N+ V +RV RK IRPC    +V + W  G++VEV  N+ SWK ATV EVL    
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVF-NDRSWKMATVSEVLGKNN

Query:  YLVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSS
        Y+VRLLG+  E  V K  +RAR+SWQD++WV++ K +
Subjt:  YLVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSS

AT4G32440.1 Plant Tudor-like RNA-binding protein; e value: 4.1e-58; % identity: 38.85
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MR RKGS+VEV S KEAP G+WR AEI+SG+GH Y VR+  F+    + V+E+V RK IRPC    +V E W  G++VEV ++ SWK ATV E L  + Y
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQ-NVSNFETRKRGSNYKYFQ-AETMARAGLKRRVLKK
        +VRLLG+  E    K ++RAR+SWQD++WV + K S S    +    +   +     + +   + +V +    KR S Y + + AE+      K R L+K
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQ-NVSNFETRKRGSNYKYFQ-AETMARAGLKRRVLKK

Query:  KRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELND-ADRATCSVGSCSVSNSDDHGLPFHVSIGRN
        + +  +V A +    +   KS  +Q  +   K G   + R         R K        F E+   +D +D   CSVGSCS ++ D+  +P  +  G  
Subjt:  KRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELND-ADRATCSVGSCSVSNSDDHGLPFHVSIGRN

Query:  EHTDGSTSDAESFCHLGY-----------GADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVST
        +  D  +SDAES C LG            GA N   CR           EL++YR T+  L++SGPLSWE+E  LTDLRLSL+IS+DEHLME++NL+ST
Subjt:  EHTDGSTSDAESFCHLGY-----------GADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVST

AT4G32440.2 Plant Tudor-like RNA-binding protein; e value: 5.5e-55; % identity: 38.52
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MR RKGS+VEV S KEAP G+WR AEI+SG+GH Y VR+  F+    + V+E+V RK IRPC    +V E W  G++VEV ++ SWK ATV E L  + Y
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQ-NVSNFETRKRGSNYKYFQ-AETMARAGLKRRVLKK
        +VRLLG+  E    K ++RAR+SWQD++WV + K S S    +    +   +     + +   + +V +    KR S Y + + AE+      K R L+K
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQ-NVSNFETRKRGSNYKYFQ-AETMARAGLKRRVLKK

Query:  KRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELND-ADRATCSVGSCSVSNSDDHGLPFHVSIGRN
        + +  +V A +    +   KS  +Q  +   K G   + R         R K        F E+   +D +D   CSVGSCS ++ D+  +P  +  G  
Subjt:  KRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELND-ADRATCSVGSCSVSNSDDHGLPFHVSIGRN

Query:  EHTDGSTSDAESFCHLGY-----------GADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLME
        +  D  +SDAES C LG            GA N   CR           EL++YR T+  L++SGPLSWE+E  LTDLRLSL+IS+DEHLME
Subjt:  EHTDGSTSDAESFCHLGY-----------GADNFLLCRDKPLEAEIHRLELHAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLME

AT4G32440.3 Plant Tudor-like RNA-binding protein; e value: 3.9e-32; % identity: 40.67
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MR RKGS+VEV S KEAP G+WR AEI+SG+GH Y VR+  F+    + V+E+V RK IRPC    +V E W  G++VEV ++ SWK ATV E L  + Y
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQ-NVSNFETRKRGSNYKYFQ-AETMARAGLKRRVLKK
        +VRLLG+  E    K ++RAR+SWQD++WV + K S S    +    +   +     + +   + +V +    KR S Y + + AE+      K R L+K
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQ-NVSNFETRKRGSNYKYFQ-AETMARAGLKRRVLKK

Query:  KRRYHRVVA
        + +  +V A
Subjt:  KRRYHRVVA

AT5G20030.1 Plant Tudor-like RNA-binding protein; e value: 1.7e-59; % identity: 38.87
Query:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY
        MRF KG+KVEVLSK   PSG+WRSAEI+SG+GHYYTV YD  +G       ERV RK++RP     +VL+ W PGD++EVF   SWKMA VS+VLG   +
Subjt:  MRFRKGSKVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNY

Query:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSS---CSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRV--
        LVRLLGSS +FKV+K DIR R+SWQD++W+++ + +    +     +  +  +P+ D +SS+                S  K  +++     GLK+R   
Subjt:  LVRLLGSSCEFKVSKFDIRARRSWQDDKWVLVHKSS---CSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRV--

Query:  LKKKRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIG
        L +     R +A  P    E VK                                             E  D +    SVGSC +   D  GL    ++ 
Subjt:  LKKKRRYHRVVAGNPSTLHEHVKSVAIQRGMLGEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIG

Query:  RNEHTDGSTSDAESFCHLGYGADNFLLCRDKPLE-AEIHRLELHAYRCTMEALYASGP-LSWEKELLLTDLRLSLHISNDEHLMEIKNLVS
         N    G++SD ES    GYG    L+   K  E A++HRLEL AYR ++E L+ASGP ++WE+E  +T+LRL L+ISN+EHLM+I+NL+S
Subjt:  RNEHTDGSTSDAESFCHLGYGADNFLLCRDKPLE-AEIHRLELHAYRCTMEALYASGP-LSWEKELLLTDLRLSLHISNDEHLMEIKNLVS


Sequences
CDS sequence
ATGATACCGAGGCAATTGCGAACGGTCTTCACCGGCGCTGCCGTTATCCTCGGCGGTATTTGCACTCTCAACTTTGCTTCTTTTCTCACTATCCAAACGCTCCGTCTCAC
CTCTGAAGCCAAACGGAGGCAAGGGTTCTATATATGCAAATTATGCCGAGGAAATGCAGTCATTCAGTGGTCGCCTTTGTCCGATCCTATTGCCATGAACCCCTGCGTGT
GCCCGACTTGTGAGGGAAACAGTTTATTTCCTTCCAATTGTTGGGGTTTCTTAGTTTTTGGAGTTATTGTAACCTTAGAAGTCTCAGCATGCCAAAGGAGAGAAGAGATC
GTTCTGTGTCTCGTAATAGATTTAAATTATATCCATTTCCATGCATCTCGGATGTCAGGTGCGAGGGAGATTGAAATCGAACCTGTTTTGCTGGTTCCACCAGAATCTAC
TCTACCGACTGTGGACCGGCTTCCTTTAAATATGAACATGTATAGCTTGTTAAATTGTTTGGAGAAACCTTTTGATCTGTTCTCTTCTTGTGAAAGCATCAAGGAGACAA
CCAGTTTAACTCAGCGTAACGATCGAACTCTCCCATGTAACGTGATTGCAGATAGTTTTCTCTGGAAATCGGTTCACACATCAGTAAACATGCGATTCAGGAAAGGCAGT
AAAGTGGAAGTGCTAAGCAAAAAGGAGGCGCCTTCGGGCTCCTGGCGTTCTGCTGAGATTATGTCTGGCAGTGGTCACTATTATACTGTTAGATATGACAAATTTGAGGG
TGGCAGTAATCAAACTGTTGTGGAGAGGGTATCGCGCAAGGCTATCAGGCCTTGTCTATCTTCTCAGGAAGTTTTAGAGAATTGGATTCCTGGCGATGTTGTGGAGGTAT
TTAATGACCGTTCTTGGAAAATGGCTACGGTTTCTGAGGTTTTGGGGAAGAACAACTATTTGGTCAGATTACTTGGATCTTCTTGTGAATTTAAGGTCAGCAAATTCGAC
ATTCGGGCAAGAAGGTCTTGGCAAGATGACAAATGGGTTTTGGTGCACAAGAGTTCTTGCAGTCATGGTGATGATAGCAAAGAAGATAAGAATGCAAGCCCTAGATTTGA
TGGTCTGAGCTCCCAAATTCAGAAATGCCAAAATGTTTCTAACTTTGAAACCAGGAAAAGGGGATCAAACTACAAGTATTTTCAAGCTGAAACAATGGCTAGAGCTGGCT
TGAAACGTAGAGTGTTGAAGAAAAAGCGAAGGTATCATAGAGTGGTTGCTGGAAATCCATCCACGTTGCACGAGCATGTAAAATCAGTTGCTATCCAAAGAGGTATGCTG
GGGGAAAAATTTGGAGGCACTTCTTTGGATAGAACCGACATTTTTGAAATGAATGGCGATAGGAAGAAACAGATGGGTGCTGCTTACCATTCTTTTGAAGAAAACTTTGA
ACTAAATGATGCTGATAGAGCTACATGCTCTGTTGGTAGTTGCAGTGTCTCTAATAGTGACGACCATGGGTTACCTTTTCATGTTTCTATCGGTCGCAATGAACATACAG
ATGGTTCTACAAGTGATGCTGAATCTTTTTGTCACTTGGGATATGGAGCAGACAATTTTCTTCTATGCAGAGACAAGCCATTGGAAGCTGAAATCCATAGGTTAGAGTTA
CATGCCTATCGATGCACTATGGAGGCATTATATGCTTCGGGTCCTTTAAGTTGGGAAAAAGAATTATTGCTTACAGATCTTCGTCTGTCCCTCCATATATCAAACGACGA
GCATTTGATGGAAATAAAAAATTTAGTATCCACAAGCATTTCAAGGAATTGGGTTGCCAGAGCCTTAGCTGTAGGCTATGCAGGTGGAATTGTCATCCAGTTAACGAACA
GAGCTCCAAGTTTTGGCAAAAATGAGCAGTGCATTTATGTGGCTCCACGAAATCCTCTCCTGTTGTCTCGTTTGCACCTCTTCATGATTGCTATTCCCATGTGA
mRNA sequence
ATGATACCGAGGCAATTGCGAACGGTCTTCACCGGCGCTGCCGTTATCCTCGGCGGTATTTGCACTCTCAACTTTGCTTCTTTTCTCACTATCCAAACGCTCCGTCTCAC
CTCTGAAGCCAAACGGAGGCAAGGGTTCTATATATGCAAATTATGCCGAGGAAATGCAGTCATTCAGTGGTCGCCTTTGTCCGATCCTATTGCCATGAACCCCTGCGTGT
GCCCGACTTGTGAGGGAAACAGTTTATTTCCTTCCAATTGTTGGGGTTTCTTAGTTTTTGGAGTTATTGTAACCTTAGAAGTCTCAGCATGCCAAAGGAGAGAAGAGATC
GTTCTGTGTCTCGTAATAGATTTAAATTATATCCATTTCCATGCATCTCGGATGTCAGGTGCGAGGGAGATTGAAATCGAACCTGTTTTGCTGGTTCCACCAGAATCTAC
TCTACCGACTGTGGACCGGCTTCCTTTAAATATGAACATGTATAGCTTGTTAAATTGTTTGGAGAAACCTTTTGATCTGTTCTCTTCTTGTGAAAGCATCAAGGAGACAA
CCAGTTTAACTCAGCGTAACGATCGAACTCTCCCATGTAACGTGATTGCAGATAGTTTTCTCTGGAAATCGGTTCACACATCAGTAAACATGCGATTCAGGAAAGGCAGT
AAAGTGGAAGTGCTAAGCAAAAAGGAGGCGCCTTCGGGCTCCTGGCGTTCTGCTGAGATTATGTCTGGCAGTGGTCACTATTATACTGTTAGATATGACAAATTTGAGGG
TGGCAGTAATCAAACTGTTGTGGAGAGGGTATCGCGCAAGGCTATCAGGCCTTGTCTATCTTCTCAGGAAGTTTTAGAGAATTGGATTCCTGGCGATGTTGTGGAGGTAT
TTAATGACCGTTCTTGGAAAATGGCTACGGTTTCTGAGGTTTTGGGGAAGAACAACTATTTGGTCAGATTACTTGGATCTTCTTGTGAATTTAAGGTCAGCAAATTCGAC
ATTCGGGCAAGAAGGTCTTGGCAAGATGACAAATGGGTTTTGGTGCACAAGAGTTCTTGCAGTCATGGTGATGATAGCAAAGAAGATAAGAATGCAAGCCCTAGATTTGA
TGGTCTGAGCTCCCAAATTCAGAAATGCCAAAATGTTTCTAACTTTGAAACCAGGAAAAGGGGATCAAACTACAAGTATTTTCAAGCTGAAACAATGGCTAGAGCTGGCT
TGAAACGTAGAGTGTTGAAGAAAAAGCGAAGGTATCATAGAGTGGTTGCTGGAAATCCATCCACGTTGCACGAGCATGTAAAATCAGTTGCTATCCAAAGAGGTATGCTG
GGGGAAAAATTTGGAGGCACTTCTTTGGATAGAACCGACATTTTTGAAATGAATGGCGATAGGAAGAAACAGATGGGTGCTGCTTACCATTCTTTTGAAGAAAACTTTGA
ACTAAATGATGCTGATAGAGCTACATGCTCTGTTGGTAGTTGCAGTGTCTCTAATAGTGACGACCATGGGTTACCTTTTCATGTTTCTATCGGTCGCAATGAACATACAG
ATGGTTCTACAAGTGATGCTGAATCTTTTTGTCACTTGGGATATGGAGCAGACAATTTTCTTCTATGCAGAGACAAGCCATTGGAAGCTGAAATCCATAGGTTAGAGTTA
CATGCCTATCGATGCACTATGGAGGCATTATATGCTTCGGGTCCTTTAAGTTGGGAAAAAGAATTATTGCTTACAGATCTTCGTCTGTCCCTCCATATATCAAACGACGA
GCATTTGATGGAAATAAAAAATTTAGTATCCACAAGCATTTCAAGGAATTGGGTTGCCAGAGCCTTAGCTGTAGGCTATGCAGGTGGAATTGTCATCCAGTTAACGAACA
GAGCTCCAAGTTTTGGCAAAAATGAGCAGTGCATTTATGTGGCTCCACGAAATCCTCTCCTGTTGTCTCGTTTGCACCTCTTCATGATTGCTATTCCCATGTGA
Protein sequence
MIPRQLRTVFTGAAVILGGICTLNFASFLTIQTLRLTSEAKRRQGFYICKLCRGNAVIQWSPLSDPIAMNPCVCPTCEGNSLFPSNCWGFLVFGVIVTLEVSACQRREEI
VLCLVIDLNYIHFHASRMSGAREIEIEPVLLVPPESTLPTVDRLPLNMNMYSLLNCLEKPFDLFSSCESIKETTSLTQRNDRTLPCNVIADSFLWKSVHTSVNMRFRKGS
KVEVLSKKEAPSGSWRSAEIMSGSGHYYTVRYDKFEGGSNQTVVERVSRKAIRPCLSSQEVLENWIPGDVVEVFNDRSWKMATVSEVLGKNNYLVRLLGSSCEFKVSKFD
IRARRSWQDDKWVLVHKSSCSHGDDSKEDKNASPRFDGLSSQIQKCQNVSNFETRKRGSNYKYFQAETMARAGLKRRVLKKKRRYHRVVAGNPSTLHEHVKSVAIQRGML
GEKFGGTSLDRTDIFEMNGDRKKQMGAAYHSFEENFELNDADRATCSVGSCSVSNSDDHGLPFHVSIGRNEHTDGSTSDAESFCHLGYGADNFLLCRDKPLEAEIHRLEL
HAYRCTMEALYASGPLSWEKELLLTDLRLSLHISNDEHLMEIKNLVSTSISRNWVARALAVGYAGGIVIQLTNRAPSFGKNEQCIYVAPRNPLLLSRLHLFMIAIPM
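
The protein sequence above should correspond to the conceptual translation of the CDS. The sketch below, in Python, illustrates that check on the first and last codons of the CDS only. It assumes the standard genetic code (NCBI translation table 1), which the page does not state, and `translate` is a hypothetical helper name rather than anything provided by CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 15 nt of the CDS sequence listed above.
print(translate("ATGATACCGAGGCAATTGCGAACG"))  # MIPRQLRT
print(translate("GCTATTCCCATGTGA"))           # AIPM
```

Both fragments match the start (`MIPRQLRT`) and end (`AIPM`) of the listed protein. To check the whole record, join the CDS lines into one string before calling `translate`.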