; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024734 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024734
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionOBERON-like protein
Genome locationtig00002486:2335781..2342338
RNA-Seq ExpressionSgr024734
SyntenySgr024734
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR032881 - Oberon, PHD finger domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582301.1 Protein OBERON 1, partial [Cucurbita argyrosperma subsp. sororia]1.4e-18873.64Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVL DING  P+ NKN+L LRPVS     +              +   ++G  + ITG     YLY PR + +  NSSRRG  FAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FPNADVDAFFASFSWKIPAKKSSLAQG R++ + + PLPSKE +ECSASD QID V CKAGNKNCNSLSVAENPS LKSMSCDICCSEPRFCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSKIID+T  S SYIKC+AMVGDGYICGHHAHI CGL+SYMAGTVGG I LDAEYYCRRCDARTDLV+HV+ FLQ CQST+ RDDI +ILS+G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGS K +AKELLRH++L IAKLK+GTCLEEVWKMEED SA CT          GSHD S SIISSEWTM TPFD W ESLKLE+EIDQVLQALKRSQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLL HKNYL NLF QLDKEQ EL HQ+SSTG N FL+NV NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

KAG7018712.1 Protein VERNALIZATION INSENSITIVE 3 [Cucurbita argyrosperma subsp. argyrosperma]2.4e-18873.64Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVL DING  P+ NKN+L LRPVS     +              +   ++G  + ITG     YLY PR + +  NSSRRG  FAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FPNADVDAFFASFSWKIPAKKSSLAQG R++ + + PLPSKE +ECSASD QID V CKAGNKNCNSLSVAENPS LKSMSCDICCSEPRFCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSKIID+T  S SYIKC+AMVGDGYICGHHAHI CGL+SYMAGTVGG I LDAEYYCRRCDARTDLV+HV+ FLQ CQST+ RDDI +ILS+G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGS K +AKELLRH++L IAKLK+GTCLEEVWKMEED SA CT          GSHD S SIISSEWTM TPFD W ESLKLE+EIDQVLQALKRSQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLL HKNYL NLF QLDKEQ EL HQ+SSTG N FL+NV NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

XP_022138271.1 OBERON-like protein [Momordica charantia]2.2e-20578.95Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVLEDINGST RVNK+DL LRPVS     +              +   ++G  + ITG  +  YLYPPR++T+PENS+RRGQGFASKLSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        FIQS FPNADVDAFFASFSWKIPA KSS AQG R + VP  PLPSKEK ECSASDSQID+ GCKAGNK+C+SLS AENP+ LKSMSCDICCSEPRFC DC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSKIID+T+GSYSYIKCEA VGDGYICGHHAHIICGLRSY+AGTVGGSI LDAEYYCRRCDARTDLV+HVQGFLQSCQST+SRDDIEKILSIGVCI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT-----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQE
        LRGS K +AKELLRHIELII+KLKTGTCLEE+WKMEE+ISAICT           GSHDTSGSII+S+WT+STPFD WTESLKLE EIDQVLQALKRSQE
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT-----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQE

Query:  FEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTP
        FEYNLAEEKL LHKNYL NLF QLDKEQTELR QTSSTG N+FLNNVINRVDQ+KREV KLKRM  VA GFGMTP
Subjt:  FEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTP

XP_022979490.1 OBERON-like protein isoform X1 [Cucurbita maxima]5.4e-18873.43Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVL DING  P+ NKNDL LRPVS     +              +   ++G  + ITG     YLY PR + +  NSSRRG GFAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FP+ADVDAFFASFSWKIPAKKSSLAQG R++ + + PLPSKE +ECSASDSQID V CKAGNKNCNSLSVAE PS LKSMSCDICCSEP+FCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSK ID+T  S SYIKC+AMVGDGYICGHHAHI CGL+SYMAGTVGG I LDAEYYCRRCDARTDLV+HV+ FLQ CQST+ RDDI +ILS+G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGS K +AKELLRH +L IAKLKTGTCLEEVWKMEED SA CT          GSHD S SIISSEWT+STPFD W ESLKLE+EIDQVLQALK+SQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLL HKNYL NLF QLDKEQ EL HQ+SSTG N FL+NV NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

XP_023527162.1 OBERON-like protein isoform X1 [Cucurbita pepo subsp. pepo]1.1e-18873.85Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVL DING  P+ NKN+L LRPVS     +              +   ++G  + ITG     YLY PR + +  NSSRRG  FAS+LSV R
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FPNADVDAFFASFSWKIPAKKSSLAQG R++ + + PLPSKE +ECSASDSQID V CKAGNKNCNSLSVAENPS LKSMSCDICCSEPRFCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSKIID+T  S SYIKC+AMVGDGYICGHHAHI CGL+SYMAGTVGG I LDAEYYCRRCDARTDLV+HV+ FLQ CQST+  DDI +ILS+G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGS K +AKELLRH++L IAKLKTGTCLEEVWKMEED SA CT          GSHD S SIISSEWTMSTPFD W ESLKLE EIDQVLQALKRSQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLL HKNYL NLF QLDKEQ EL HQ+SSTG N FL+NV NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

TrEMBL top hitse value%identityAlignment
A0A1S3AWZ1 uncharacterized protein LOC103483705 isoform X21.4e-18671.55Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKACHMLQ------------KIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        M+GDPV+TEVLED NG +  VNKN+L LRPV+     +                  ++G  + ITG  +  YLY PR ++  ENS+R+G  FASKLSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKACHMLQ------------KIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FPNAD+DAFFASFSWKIPAKKSSLAQG+R++ +P  PLPSK+ +ECSAS+SQ D VGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILC KIID+T+ SYSYIKC+ +VGDGYICGHHAHI CGL+SY AGTVGGSI LDAEYYCRRCDARTDLV+HV+ FLQSCQS + RDD+E+IL++G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGSHK +AKELLRHIEL I K+KTG CLEE+WKMEED SA CT           SH+TSGSIISSEWTMSTPFD W ESLKLEDEIDQVL  LKRSQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLLLHKNYL NLF QL+KEQTELRHQT STG NA    V NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

A0A1S4DSZ4 uncharacterized protein LOC103483705 isoform X11.4e-18671.55Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKACHMLQ------------KIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        M+GDPV+TEVLED NG +  VNKN+L LRPV+     +                  ++G  + ITG  +  YLY PR ++  ENS+R+G  FASKLSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKACHMLQ------------KIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FPNAD+DAFFASFSWKIPAKKSSLAQG+R++ +P  PLPSK+ +ECSAS+SQ D VGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILC KIID+T+ SYSYIKC+ +VGDGYICGHHAHI CGL+SY AGTVGGSI LDAEYYCRRCDARTDLV+HV+ FLQSCQS + RDD+E+IL++G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGSHK +AKELLRHIEL I K+KTG CLEE+WKMEED SA CT           SH+TSGSIISSEWTMSTPFD W ESLKLEDEIDQVL  LKRSQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLLLHKNYL NLF QL+KEQTELRHQT STG NA    V NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

A0A6J1C998 OBERON-like protein1.1e-20578.95Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVLEDINGST RVNK+DL LRPVS     +              +   ++G  + ITG  +  YLYPPR++T+PENS+RRGQGFASKLSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        FIQS FPNADVDAFFASFSWKIPA KSS AQG R + VP  PLPSKEK ECSASDSQID+ GCKAGNK+C+SLS AENP+ LKSMSCDICCSEPRFC DC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSKIID+T+GSYSYIKCEA VGDGYICGHHAHIICGLRSY+AGTVGGSI LDAEYYCRRCDARTDLV+HVQGFLQSCQST+SRDDIEKILSIGVCI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT-----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQE
        LRGS K +AKELLRHIELII+KLKTGTCLEE+WKMEE+ISAICT           GSHDTSGSII+S+WT+STPFD WTESLKLE EIDQVLQALKRSQE
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT-----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQE

Query:  FEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTP
        FEYNLAEEKL LHKNYL NLF QLDKEQTELR QTSSTG N+FLNNVINRVDQ+KREV KLKRM  VA GFGMTP
Subjt:  FEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTP

A0A6J1GVC0 protein OBERON 4-like isoform X17.6e-18873.43Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVL DING  P+ NKN+L LRPVS     +              +   ++G  + ITG     YLY PR + +  NSSRRG  FAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FPNADVDAFFASFSWKIPAKKSSLAQG R++ + + PLPSKE +ECSASDSQID V CKAGNKNCNSLSVAENPS LKSMSCDICCSEPRFCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSKIID+T  S S+IKC+AMV DGYICGHHAHI CGL+SYMAGTVGG I LDAEYYCRRCDARTDLV+HV+ FLQ CQST+ RDDI +ILS+G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGS K +AKELLRH++L IAKLK+GTCLEEVWKMEED SA CT          GSHD S SIISSEWTM TPFD W ESLKLE+EIDQVLQALKRSQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLL HKNYL NLF QLDKEQ EL HQ+SSTG N FL+NV NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

A0A6J1ITE5 OBERON-like protein isoform X12.6e-18873.43Show/hide
Query:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER
        MSGDPVETEVL DING  P+ NKNDL LRPVS     +              +   ++G  + ITG     YLY PR + +  NSSRRG GFAS+LSVER
Subjt:  MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKAC------------HMLQKIGPILVITG--VGGYLYPPRSLTIPENSSRRGQGFASKLSVER

Query:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC
        +IQS FP+ADVDAFFASFSWKIPAKKSSLAQG R++ + + PLPSKE +ECSASDSQID V CKAGNKNCNSLSVAE PS LKSMSCDICCSEP+FCRDC
Subjt:  FIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDC

Query:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI
        CCILCSK ID+T  S SYIKC+AMVGDGYICGHHAHI CGL+SYMAGTVGG I LDAEYYCRRCDARTDLV+HV+ FLQ CQST+ RDDI +ILS+G+CI
Subjt:  CCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCI

Query:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF
        LRGS K +AKELLRH +L IAKLKTGTCLEEVWKMEED SA CT          GSHD S SIISSEWT+STPFD W ESLKLE+EIDQVLQALK+SQEF
Subjt:  LRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT----------GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEF

Query:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
        EYNLAEEKLL HKNYL NLF QLDKEQ EL HQ+SSTG N FL+NV NRVDQIKREV +LKRMEKVA+GFGMTPKD L
Subjt:  EYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05410.1 Protein of unknown function (DUF1423)8.5e-9946.68Show/hide
Query:  KIGPILVITG--VGGYLYPPRSLT-IPENSSRRGQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASD
        K+GP +   G  V  YLYPP+ L  +     R+ + F S+LS++R+I+  FP ADV  FFASFSW IP +     QG+  +     P+ S +  E    D
Subjt:  KIGPILVITG--VGGYLYPPRSLT-IPENSSRRGQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPTPPLPSKEKKECSASD

Query:  SQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKL
           D   CKAGN+ C SL       +L +M CDICC E +FC DCCCILC K+I    G YSYIKCEA+V +G+ICGH AH+ C LR+Y+AGT+GGS+ L
Subjt:  SQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICGLRSYMAGTVGGSIKL

Query:  DAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCILRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICTGSHDTSGSIIS
        D EYYCRRCDA+ DL  HV  FL+ CQ+   + D+EKIL++G+CILRG+ +  AKELL  IE  + KLK GT LE++W    D +      +  SG    
Subjt:  DAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCILRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICTGSHDTSGSIIS

Query:  SEWTMS---------TPFDRWTESLKLEDEIDQVLQALKRSQEFEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKRE
        ++   S          PF+   E  KLE+EI +VL+AL+++QEFEY +AE KL   K  L +L+ QL+KE++EL  + S T  N+ + NV+ R+DQI++E
Subjt:  SEWTMS---------TPFDRWTESLKLEDEIDQVLQALKRSQEFEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKRE

Query:  VTKLKRMEKVAEGFGMTPKDKL
        VTKLK ME+VA+GFG TP+  L
Subjt:  VTKLKRMEKVAEGFGMTPKDKL

AT1G05410.2 Protein of unknown function (DUF1423)3.8e-9945.54Show/hide
Query:  VSMMNLVKACH-MLQKIGPILVITG--VGGYLYPPRSLT-IPENSSRRGQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPT
        +   N+ ++ H +L+ +GP +   G  V  YLYPP+ L  +     R+ + F S+LS++R+I+  FP ADV  FFASFSW IP +     QG+  +    
Subjt:  VSMMNLVKACH-MLQKIGPILVITG--VGGYLYPPRSLT-IPENSSRRGQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPAKKSSLAQGLRLRNVPT

Query:  PPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICG
         P+ S +  E    D   D   CKAGN+ C SL       +L +M CDICC E +FC DCCCILC K+I    G YSYIKCEA+V +G+ICGH AH+ C 
Subjt:  PPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHHAHIICG

Query:  LRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCILRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDIS
        LR+Y+AGT+GGS+ LD EYYCRRCDA+ DL  HV  FL+ CQ+   + D+EKIL++G+CILRG+ +  AKELL  IE  + KLK GT LE++W    D +
Subjt:  LRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCILRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDIS

Query:  AICTGSHDTSGSIISSEWTMS---------TPFDRWTESLKLEDEIDQVLQALKRSQEFEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNA
              +  SG    ++   S          PF+   E  KLE+EI +VL+AL+++QEFEY +AE KL   K  L +L+ QL+KE++EL  + S T  N+
Subjt:  AICTGSHDTSGSIISSEWTMS---------TPFDRWTESLKLEDEIDQVLQALKRSQEFEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNA

Query:  FLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL
         + NV+ R+DQI++EVTKLK ME+VA+GFG TP+  L
Subjt:  FLNNVINRVDQIKREVTKLKRMEKVAEGFGMTPKDKL

AT3G22520.1 unknown protein1.7e-0642.62Show/hide
Query:  ITGVGGYLYPPRSLTIPENSSRR--GQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPA
        +T +G   +  R L +P+   ++   + FASK  + R+++S FP  D DAFFASFSWK+PA
Subjt:  ITGVGGYLYPPRSLTIPENSSRR--GQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPA

AT4G14840.1 unknown protein3.2e-0544Show/hide
Query:  RSLTIPENSSRRG--QGFASKLSVERFIQSVFPNADVDAFFASFSWKIPA
        R L +PE    +   + FASK ++ R++++ FP+ D +AFFASF+W IPA
Subjt:  RSLTIPENSSRRG--QGFASKLSVERFIQSVFPNADVDAFFASFSWKIPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGGATCCTGTGGAGACTGAAGTTCTTGAGGATATAAATGGGAGCACACCTAGGGTAAATAAAAATGATCTGGCCCTTAGGCCAGTTTCCATGATGAATCTGGT
GAAGGCTTGCCATATGCTCCAGAAAATTGGCCCAATCCTGGTGATAACTGGAGTTGGAGGGTACCTTTATCCTCCTCGCAGTCTTACTATTCCTGAGAACTCATCTCGTA
GAGGACAAGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATTTATCCAGTCTGTGTTCCCTAATGCAGACGTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCA
AAAAAGTCATCTTTGGCACAAGGTCTTCGACTCAGAAATGTTCCAACCCCTCCTCTACCGTCGAAAGAGAAGAAAGAATGCTCAGCATCTGATTCCCAGATTGATATGGT
GGGTTGCAAGGCTGGAAATAAGAACTGTAATAGTTTATCTGTTGCAGAAAACCCATCTTCATTAAAATCCATGTCCTGTGATATTTGCTGCAGCGAACCTCGGTTTTGCC
GTGATTGCTGCTGTATACTTTGCAGTAAGATTATAGATTCAACCAGTGGAAGTTATAGCTACATAAAATGTGAAGCAATGGTGGGTGATGGTTATATTTGTGGACATCAT
GCTCATATTATATGTGGTCTTAGATCTTATATGGCTGGGACAGTTGGAGGAAGTATTAAATTGGATGCTGAGTATTATTGTCGACGTTGTGATGCTAGAACGGATTTGGT
AACACATGTTCAAGGATTTTTGCAGTCATGTCAATCAACCAATTCTCGTGATGATATTGAGAAGATCTTAAGCATTGGTGTCTGCATTTTGCGTGGTTCGCACAAATCAA
AAGCAAAGGAGTTGTTAAGACATATTGAATTGATCATTGCGAAGCTTAAAACTGGGACTTGCTTAGAAGAGGTTTGGAAGATGGAGGAAGACATATCAGCTATTTGCACT
GGTTCTCATGACACTTCAGGCTCCATTATAAGCTCGGAGTGGACTATGTCCACCCCTTTTGATCGCTGGACTGAGTCCCTAAAACTTGAAGATGAGATTGATCAGGTTCT
GCAGGCACTGAAAAGATCACAAGAGTTCGAGTATAATTTAGCAGAAGAAAAGCTTCTATTGCATAAAAATTATCTACAGAATCTATTTCACCAACTTGACAAGGAGCAAA
CTGAACTCAGACATCAAACATCATCAACTGGACATAATGCCTTCCTGAATAATGTAATAAATAGAGTGGATCAAATAAAACGAGAAGTAACGAAACTGAAGAGAATGGAA
AAGGTGGCTGAAGGATTTGGAATGACTCCAAAAGATAAACTTCCTCACCCCAATTGGTCCATTTACTGCCTGACGGGTTGCTTTTTATTGGCTTCAGTGAAGCTCAGTCT
GCCGGGAACAGGCTTTCTATCAGTACACGCAGAAGAGAACAAGCCGACCTCCCATGCCATCTCTGAAACTCCCGCTCCCATCTCCCCAGATATCTCCTCCACCATCTTCA
GAGTTCTCCTCCTCGTTCCCCTTCGGAACAGTAACTATAAGCTCTCCATCAACGAACGCCGCACTTGCAAGCTCTGGCCGCGTCGTCTCCGGATGGATTTCCACGGCATG
AGCCCTTACTCCATCGCTCATGCTGCCGTCAGTTTCGGCAATGAATCGGAAACAATCGGGATTTTCCTCAACCAAAACATCGGCATCAGATCGAAAAGGAAGCTCAAGGA
CCCGACTGAAGATATGAGGATTCCTCGAATTGGGGTTATACCGAACGGCGATATTGCGCTTCCTCGGCAATGGGTGAACCTTCATGGCGGTGGTGTTTTTGATTTCCAAC
TTCTTCCTCACACTGGCAATAGGAATCAAAACCTTGATGGGATTGTGGTTCAGAGGACGGAAAGGGAGAATCATAAAGGAATCAAACGAGTTGGAAATTATGTCACAGAG
GAATTTCAACAAATTGGGGAATTGAATCGAAACGAAGAAAGGAGAAAGGCATTCGAGGAAGAGAATGCAGGGAGAGCC
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGGATCCTGTGGAGACTGAAGTTCTTGAGGATATAAATGGGAGCACACCTAGGGTAAATAAAAATGATCTGGCCCTTAGGCCAGTTTCCATGATGAATCTGGT
GAAGGCTTGCCATATGCTCCAGAAAATTGGCCCAATCCTGGTGATAACTGGAGTTGGAGGGTACCTTTATCCTCCTCGCAGTCTTACTATTCCTGAGAACTCATCTCGTA
GAGGACAAGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATTTATCCAGTCTGTGTTCCCTAATGCAGACGTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCA
AAAAAGTCATCTTTGGCACAAGGTCTTCGACTCAGAAATGTTCCAACCCCTCCTCTACCGTCGAAAGAGAAGAAAGAATGCTCAGCATCTGATTCCCAGATTGATATGGT
GGGTTGCAAGGCTGGAAATAAGAACTGTAATAGTTTATCTGTTGCAGAAAACCCATCTTCATTAAAATCCATGTCCTGTGATATTTGCTGCAGCGAACCTCGGTTTTGCC
GTGATTGCTGCTGTATACTTTGCAGTAAGATTATAGATTCAACCAGTGGAAGTTATAGCTACATAAAATGTGAAGCAATGGTGGGTGATGGTTATATTTGTGGACATCAT
GCTCATATTATATGTGGTCTTAGATCTTATATGGCTGGGACAGTTGGAGGAAGTATTAAATTGGATGCTGAGTATTATTGTCGACGTTGTGATGCTAGAACGGATTTGGT
AACACATGTTCAAGGATTTTTGCAGTCATGTCAATCAACCAATTCTCGTGATGATATTGAGAAGATCTTAAGCATTGGTGTCTGCATTTTGCGTGGTTCGCACAAATCAA
AAGCAAAGGAGTTGTTAAGACATATTGAATTGATCATTGCGAAGCTTAAAACTGGGACTTGCTTAGAAGAGGTTTGGAAGATGGAGGAAGACATATCAGCTATTTGCACT
GGTTCTCATGACACTTCAGGCTCCATTATAAGCTCGGAGTGGACTATGTCCACCCCTTTTGATCGCTGGACTGAGTCCCTAAAACTTGAAGATGAGATTGATCAGGTTCT
GCAGGCACTGAAAAGATCACAAGAGTTCGAGTATAATTTAGCAGAAGAAAAGCTTCTATTGCATAAAAATTATCTACAGAATCTATTTCACCAACTTGACAAGGAGCAAA
CTGAACTCAGACATCAAACATCATCAACTGGACATAATGCCTTCCTGAATAATGTAATAAATAGAGTGGATCAAATAAAACGAGAAGTAACGAAACTGAAGAGAATGGAA
AAGGTGGCTGAAGGATTTGGAATGACTCCAAAAGATAAACTTCCTCACCCCAATTGGTCCATTTACTGCCTGACGGGTTGCTTTTTATTGGCTTCAGTGAAGCTCAGTCT
GCCGGGAACAGGCTTTCTATCAGTACACGCAGAAGAGAACAAGCCGACCTCCCATGCCATCTCTGAAACTCCCGCTCCCATCTCCCCAGATATCTCCTCCACCATCTTCA
GAGTTCTCCTCCTCGTTCCCCTTCGGAACAGTAACTATAAGCTCTCCATCAACGAACGCCGCACTTGCAAGCTCTGGCCGCGTCGTCTCCGGATGGATTTCCACGGCATG
AGCCCTTACTCCATCGCTCATGCTGCCGTCAGTTTCGGCAATGAATCGGAAACAATCGGGATTTTCCTCAACCAAAACATCGGCATCAGATCGAAAAGGAAGCTCAAGGA
CCCGACTGAAGATATGAGGATTCCTCGAATTGGGGTTATACCGAACGGCGATATTGCGCTTCCTCGGCAATGGGTGAACCTTCATGGCGGTGGTGTTTTTGATTTCCAAC
TTCTTCCTCACACTGGCAATAGGAATCAAAACCTTGATGGGATTGTGGTTCAGAGGACGGAAAGGGAGAATCATAAAGGAATCAAACGAGTTGGAAATTATGTCACAGAG
GAATTTCAACAAATTGGGGAATTGAATCGAAACGAAGAAAGGAGAAAGGCATTCGAGGAAGAGAATGCAGGGAGAGCC
Protein sequenceShow/hide protein sequence
MSGDPVETEVLEDINGSTPRVNKNDLALRPVSMMNLVKACHMLQKIGPILVITGVGGYLYPPRSLTIPENSSRRGQGFASKLSVERFIQSVFPNADVDAFFASFSWKIPA
KKSSLAQGLRLRNVPTPPLPSKEKKECSASDSQIDMVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDSTSGSYSYIKCEAMVGDGYICGHH
AHIICGLRSYMAGTVGGSIKLDAEYYCRRCDARTDLVTHVQGFLQSCQSTNSRDDIEKILSIGVCILRGSHKSKAKELLRHIELIIAKLKTGTCLEEVWKMEEDISAICT
GSHDTSGSIISSEWTMSTPFDRWTESLKLEDEIDQVLQALKRSQEFEYNLAEEKLLLHKNYLQNLFHQLDKEQTELRHQTSSTGHNAFLNNVINRVDQIKREVTKLKRME
KVAEGFGMTPKDKLPHPNWSIYCLTGCFLLASVKLSLPGTGFLSVHAEENKPTSHAISETPAPISPDISSTIFRVLLLVPLRNSNYKLSINERRTCKLWPRRLRMDFHGM
SPYSIAHAAVSFGNESETIGIFLNQNIGIRSKRKLKDPTEDMRIPRIGVIPNGDIALPRQWVNLHGGGVFDFQLLPHTGNRNQNLDGIVVQRTERENHKGIKRVGNYVTE
EFQQIGELNRNEERRKAFEEENAGRA