; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G084860 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G084860
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionOBERON-like protein isoform X1
Genome locationCiama_Chr05:5159583..5167391
RNA-Seq ExpressionCaUC05G084860
SyntenyCaUC05G084860
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR032881 - Oberon, PHD finger domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049358.1 protein OBERON 1-like isoform X2 [Cucumis melo var. makuwa]7.2e-25691.08Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        M+GDPV+T+VLEDTNGC+   NKNELILRPVSQDESGEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD+DAFFASFSWKIPAKKSSLAQGI++KQI CPLPSK++EECSASESQ DRVGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTTTESYSYIKCK VVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQSCQSADCRDD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+G CLEEIWKMEEDSSANCTDAPD ADSTE SH+TS SIISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNAV+NRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

XP_008438665.1 PREDICTED: uncharacterized protein LOC103483705 isoform X1 [Cucumis melo]7.2e-25690.87Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        M+GDPV+T+VLEDTNGC+   NKNELILRPV+QDESGEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD+DAFFASFSWKIPAKKSSLAQGI++KQI CPLPSK++EECSASESQ DRVGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTTTESYSYIKCK VVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQSCQSADCRDD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+G CLEEIWKMEEDSSANCTDAPD ADSTE SH+TS SIISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNAV+NRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

XP_008438666.1 PREDICTED: uncharacterized protein LOC103483705 isoform X2 [Cucumis melo]7.2e-25690.87Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        M+GDPV+T+VLEDTNGC+   NKNELILRPV+QDESGEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD+DAFFASFSWKIPAKKSSLAQGI++KQI CPLPSK++EECSASESQ DRVGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTTTESYSYIKCK VVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQSCQSADCRDD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+G CLEEIWKMEEDSSANCTDAPD ADSTE SH+TS SIISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNAV+NRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

XP_038897111.1 protein OBERON 4-like isoform X1 [Benincasa hispida]2.8e-25592.74Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        MSGDPVE  VLEDTNG  PRA+KNEL LRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIG SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNADIDAFFASFSWKIPAKKSSLAQG ++KQI  PLPSKE+EECSASESQ  RVGCKAGNKNC SLSVA+NPSS KSMSCDICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILCSKIIDTT ESYSYIKCKA+VGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQS QSADCRDDIEEILSLGFCIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEKLKSGTCLEEI KME DSSAN TDAPDNA STEGSHD SDS ISSEWTMST FDHWIESLKLEDEIDQVLQGLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEE LLLHKNYLHNLFQQL+KEQTELRHQTSSTGQNA+TNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

XP_038897148.1 protein OBERON 2-like isoform X2 [Benincasa hispida]1.2e-25392.53Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        MSGDPVE  VLEDTNG  PRA+KNEL LRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIG SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNADIDAFFASFSWKIPAKKSSLAQG ++KQI  PLPSKE+EECSASESQ  RVGCKAGNKNC SLSVA+NPSS KSMSCDICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILCSKIIDTT ESYSYIKCKA+VGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQS QSADCRDDIEEILSLGFCIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEKLKSGTCLEEI KME DSSAN  DAPDNA STEGSHD SDS ISSEWTMST FDHWIESLKLEDEIDQVLQGLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEE LLLHKNYLHNLFQQL+KEQTELRHQTSSTGQNA+TNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

TrEMBL top hitse value%identityAlignment
A0A0A0L5I4 PHD_Oberon domain-containing protein3.2e-24988.38Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        MSG+P +T+VLEDTNGC+   NKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKG  FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD++AFFASFSWKIPAKKSS AQG ++K I C LPSK++EECSAS SQ D+VGCKAGNKNC SLSV+ENPSS KSMSC ICCSE RFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTT ESYSYIKCKAVVGDGYICGH +HIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADC+DD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+GTCLE+IWKMEEDSSANCTDAPD ADSTE SH+TSDS+ISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQ  STGQNAV+NRVDQIKREVKRLKRMEK+ADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

A0A1S3AWZ1 uncharacterized protein LOC103483705 isoform X23.5e-25690.87Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        M+GDPV+T+VLEDTNGC+   NKNELILRPV+QDESGEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD+DAFFASFSWKIPAKKSSLAQGI++KQI CPLPSK++EECSASESQ DRVGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTTTESYSYIKCK VVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQSCQSADCRDD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+G CLEEIWKMEEDSSANCTDAPD ADSTE SH+TS SIISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNAV+NRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

A0A1S4DSZ4 uncharacterized protein LOC103483705 isoform X13.5e-25690.87Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        M+GDPV+T+VLEDTNGC+   NKNELILRPV+QDESGEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD+DAFFASFSWKIPAKKSSLAQGI++KQI CPLPSK++EECSASESQ DRVGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTTTESYSYIKCK VVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQSCQSADCRDD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+G CLEEIWKMEEDSSANCTDAPD ADSTE SH+TS SIISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNAV+NRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

A0A5D3D0Q3 Protein OBERON 1-like isoform X23.5e-25691.08Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        M+GDPV+T+VLEDTNGC+   NKNELILRPVSQDESGEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLYSPRGI  SENSARKGH FASKLSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFPNAD+DAFFASFSWKIPAKKSSLAQGI++KQI CPLPSK++EECSASESQ DRVGCKAGNKNC+SLSV+ENPSS KSMSC ICCSEPRFCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILC KIIDTTTESYSYIKCK VVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQSCQSADCRDD+EEIL+LG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGSHKMRAKEL R+IEL IEK+K+G CLEEIWKMEEDSSANCTDAPD ADSTE SH+TS SIISSEWTMST FDHWIESLKLEDEIDQVL GLKRSQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNAV+NRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAVTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

A0A6J1ITE5 OBERON-like protein isoform X12.5e-24687.86Show/hide
Query:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER
        MSGDPVET+VL D NGC P+ NKN+LILRPVSQDESGEGLPYAPENWPN GDNWSWRVG+RVAITGHF DRYLYSPRGIG S NS+R+GHGFAS+LSVER
Subjt:  MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVER

Query:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC
        YIQSEFP+AD+DAFFASFSWKIPAKKSSLAQG +IKQISCPLPSKE EECSAS+SQIDRV CKAGNKNCNSLSVAE PS LKSMSCDICCSEP+FCRDCC
Subjt:  YIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCC

Query:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL
        CILCSK IDTT ES SYIKCKA+VGDGYICGHHAHIKCGLKSY AGTVGG IGLDAEYYCRRCDARTDLVSHVERFLQ CQS DCRDDI EILSLG CIL
Subjt:  CILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCIL

Query:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE
        RGS KMRAKEL R+ +L I KLK+GTCLEE+WKMEEDSSANCTDAPDNADSTEGSHD SDSIISSEWT+ST FDHWIESLKLE+EIDQVLQ LK+SQEFE
Subjt:  RGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFE

Query:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNA----VTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        YNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQN     VTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
Subjt:  YNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNA----VTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05410.1 Protein of unknown function (DUF1423)1.8e-11948.92Show/hide
Query:  LILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGI-GTSENSARKGHGFASKLSVERYIQSEFPNADIDAFFASFSWKIPA
        L+LRPVS  ESGEGLPYAPENWPNPGD W W+VG R++  G+F+DRYLY P+ + G      RK   F S+LS++RYI+  FP AD+  FFASFSW IP 
Subjt:  LILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGI-GTSENSARKGHGFASKLSVERYIQSEFPNADIDAFFASFSWKIPA

Query:  KKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDTTTESYSYIKCKAVV
        +     QG+ + Q    LP    +E    +   D   CKAGN+ C SL       +L +M CDICC E +FC DCCCILC K+I      YSYIKC+AVV
Subjt:  KKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDTTTESYSYIKCKAVV

Query:  GDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCILRGSHKMRAKELFRNIELKIEKLKS
         +G+ICGH AH+ C L++Y AGT+GGS+GLD EYYCRRCDA+ DL  HV +FL+ CQ+ + + D+E+IL+LG CILRG+ +  AKEL   IE  + KLK 
Subjt:  GDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCILRGSHKMRAKELFRNIELKIEKLKS

Query:  GTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFEYNLAEEKLLLHKNYLHNLFQQLDK
        GT LE++W   +D+    +D  D+ ++ E  +DT  S+          F+H  E  KLE+EI +VL+ L+++QEFEY +AE KL   K  L +L++QL+K
Subjt:  GTCLEEIWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFEYNLAEEKLLLHKNYLHNLFQQLDK

Query:  EQTELRHQTSSTGQNA----VTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE
        E++EL  + S T  N+    V  R+DQI++EV +LK ME+VA GFG TP+ +L+E F L++E
Subjt:  EQTELRHQTSSTGQNA----VTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE

AT1G05410.2 Protein of unknown function (DUF1423)2.9e-10146.51Show/hide
Query:  VGKRVAITGHFLDRYLYSPRGI-GTSENSARKGHGFASKLSVERYIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQ
        VG R++  G+F+DRYLY P+ + G      RK   F S+LS++RYI+  FP AD+  FFASFSW IP +     QG+ + Q    LP    +E    +  
Subjt:  VGKRVAITGHFLDRYLYSPRGI-GTSENSARKGHGFASKLSVERYIQSEFPNADIDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQ

Query:  IDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDA
         D   CKAGN+ C SL       +L +M CDICC E +FC DCCCILC K+I      YSYIKC+AVV +G+ICGH AH+ C L++Y AGT+GGS+GLD 
Subjt:  IDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDTTTESYSYIKCKAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDA

Query:  EYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCILRGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSH
        EYYCRRCDA+ DL  HV +FL+ CQ+ + + D+E+IL+LG CILRG+ +  AKEL   IE  + KLK GT LE++W   +D+    +D  D+ ++ E  +
Subjt:  EYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCILRGSHKMRAKELFRNIELKIEKLKSGTCLEEIWKMEEDSSANCTDAPDNADSTEGSH

Query:  DTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNA----VTNRVDQIKREV
        DT  S+          F+H  E  KLE+EI +VL+ L+++QEFEY +AE KL   K  L +L++QL+KE++EL  + S T  N+    V  R+DQI++EV
Subjt:  DTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNA----VTNRVDQIKREV

Query:  KRLKRMEKVADGFGMTPKDILKEDFDLDVE
         +LK ME+VA GFG TP+ +L+E F L++E
Subjt:  KRLKRMEKVADGFGMTPKDILKEDFDLDVE

AT3G22520.1 unknown protein2.0e-2251.58Show/hide
Query:  PVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVERYIQSEFPNADIDAFFASFSWKIPA
        PVS   +G+GLPYAP +WP+PGD W+WRVG+RV   G+  DR+L  P+ +            FASK  + RY++S+FP  D DAFFASFSWK+PA
Subjt:  PVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVERYIQSEFPNADIDAFFASFSWKIPA

AT4G14840.1 unknown protein8.0e-1941.8Show/hide
Query:  GDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVERYI
        GD  + KV  D +  T   + N+L   P +   SG+GLP+AP ++P+PGD W+WRVG+RV   G   DR L  P  +            FASK ++ RY+
Subjt:  GDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVERYI

Query:  QSEFPNADIDAFFASFSWKIPA
        ++ FP+ D +AFFASF+W IPA
Subjt:  QSEFPNADIDAFFASFSWKIPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGGGATCCTGTGGAGACTAAAGTTCTTGAGGATACAAATGGCTGCACACCTAGGGCGAATAAAAATGAACTGATCCTTAGGCCAGTTTCTCAAGATGAATCTGG
GGAGGGCTTGCCATATGCTCCTGAAAATTGGCCCAATCCTGGTGATAACTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGGTACCTTT
ATTCTCCTCGTGGTATTGGCACTTCTGAGAACTCAGCTCGTAAAGGGCATGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATATATCCAGTCTGAGTTTCCCAATGCAGAC
ATTGATGCATTTTTTGCCTCATTCAGTTGGAAGATACCAGCAAAAAAGTCATCTTTAGCGCAAGGTATTCAAATAAAACAAATTTCATGCCCTCTACCTTCAAAAGAGAT
GGAAGAATGCTCTGCATCTGAGTCCCAGATTGATAGAGTGGGTTGCAAGGCTGGAAATAAGAACTGTAATAGTTTATCTGTTGCAGAGAATCCATCTTCATTAAAATCCA
TGTCCTGTGATATTTGCTGCAGCGAACCTCGGTTTTGCCGTGATTGCTGCTGTATACTATGCAGCAAGATTATAGACACGACCACAGAAAGTTATAGCTACATAAAATGT
AAAGCAGTAGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATATACAGCTGGGACAGTTGGAGGAAGCATTGGATTGGATGCTGA
GTATTATTGTAGACGTTGTGATGCCAGAACGGATTTGGTATCACATGTAGAAAGATTTTTGCAGTCATGTCAATCAGCTGACTGTCGAGATGATATTGAGGAGATCTTAA
GCCTTGGTTTCTGCATTTTGCGTGGTTCACACAAAATGAGAGCAAAGGAGTTGTTTAGAAATATTGAATTGAAGATTGAAAAGCTTAAATCTGGGACTTGCTTGGAAGAG
ATTTGGAAGATGGAGGAAGACAGCTCAGCGAATTGCACTGATGCACCTGATAATGCTGATTCTACAGAAGGTTCTCATGACACTTCAGACTCCATTATAAGCTCAGAATG
GACTATGTCCACCCATTTTGATCATTGGATTGAGTCCCTAAAACTTGAAGACGAGATTGATCAGGTTTTGCAGGGACTGAAAAGATCACAAGAGTTCGAGTATAATTTGG
CAGAAGAAAAGCTTCTGTTACATAAAAATTATCTACATAATCTCTTTCAGCAACTTGACAAGGAGCAAACTGAACTCAGACATCAAACATCATCGACGGGACAAAATGCC
GTAACAAACAGAGTGGACCAAATAAAACGAGAAGTAAAGAGACTCAAGAGAATGGAAAAGGTTGCTGATGGATTTGGAATGACTCCAAAAGATATCCTCAAGGAGGACTT
CGATTTAGATGTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGGGATCCTGTGGAGACTAAAGTTCTTGAGGATACAAATGGCTGCACACCTAGGGCGAATAAAAATGAACTGATCCTTAGGCCAGTTTCTCAAGATGAATCTGG
GGAGGGCTTGCCATATGCTCCTGAAAATTGGCCCAATCCTGGTGATAACTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGGTACCTTT
ATTCTCCTCGTGGTATTGGCACTTCTGAGAACTCAGCTCGTAAAGGGCATGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATATATCCAGTCTGAGTTTCCCAATGCAGAC
ATTGATGCATTTTTTGCCTCATTCAGTTGGAAGATACCAGCAAAAAAGTCATCTTTAGCGCAAGGTATTCAAATAAAACAAATTTCATGCCCTCTACCTTCAAAAGAGAT
GGAAGAATGCTCTGCATCTGAGTCCCAGATTGATAGAGTGGGTTGCAAGGCTGGAAATAAGAACTGTAATAGTTTATCTGTTGCAGAGAATCCATCTTCATTAAAATCCA
TGTCCTGTGATATTTGCTGCAGCGAACCTCGGTTTTGCCGTGATTGCTGCTGTATACTATGCAGCAAGATTATAGACACGACCACAGAAAGTTATAGCTACATAAAATGT
AAAGCAGTAGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATATACAGCTGGGACAGTTGGAGGAAGCATTGGATTGGATGCTGA
GTATTATTGTAGACGTTGTGATGCCAGAACGGATTTGGTATCACATGTAGAAAGATTTTTGCAGTCATGTCAATCAGCTGACTGTCGAGATGATATTGAGGAGATCTTAA
GCCTTGGTTTCTGCATTTTGCGTGGTTCACACAAAATGAGAGCAAAGGAGTTGTTTAGAAATATTGAATTGAAGATTGAAAAGCTTAAATCTGGGACTTGCTTGGAAGAG
ATTTGGAAGATGGAGGAAGACAGCTCAGCGAATTGCACTGATGCACCTGATAATGCTGATTCTACAGAAGGTTCTCATGACACTTCAGACTCCATTATAAGCTCAGAATG
GACTATGTCCACCCATTTTGATCATTGGATTGAGTCCCTAAAACTTGAAGACGAGATTGATCAGGTTTTGCAGGGACTGAAAAGATCACAAGAGTTCGAGTATAATTTGG
CAGAAGAAAAGCTTCTGTTACATAAAAATTATCTACATAATCTCTTTCAGCAACTTGACAAGGAGCAAACTGAACTCAGACATCAAACATCATCGACGGGACAAAATGCC
GTAACAAACAGAGTGGACCAAATAAAACGAGAAGTAAAGAGACTCAAGAGAATGGAAAAGGTTGCTGATGGATTTGGAATGACTCCAAAAGATATCCTCAAGGAGGACTT
CGATTTAGATGTTGAGTAGAGACACGAGCACAAACATATGATGTCTCACAAAATTTCACTGAATTTTGTTGGTTCATTTAGCCTTATATGGGTTTTTGTAGTTTTTACTG
TGTATCATATGGCCTCACATGAGGCTGAGTGTATGATATCTGAGGTTTTAATTGGTTTATGAAGATAGCTGGATTCAGGAAGCAAAATACTGAACCTATGCAATTGCGGA
GCTGCCAGAAGGAATATAGAGTAGAGAAATTTTGGTCCATTGGAAACTAACCCTCTGCAATGAGTTTTCACGATAAACTTCCTCGTCCCCATTGGTCGCTTGACGGATTG
CTTTTTATGGCTTCTGTGGAGCTCAGTAGGGGATCCAGTCTGTCTGGTTCAGACTTCTTATCACTAAATGCAGAAGAGAGGATGATGATGATGATTGTTGCACCTACTGT
TGACGTGTGATTCTGTTCACGACAAAAAAGATATCTCCTCCACCATCTTCAGAATTCTCTTCCTCATTCCCCTTTGGAACAGTAACTATAAGCTCTCCATCAACAAACGC
CGCACTCGCAAGCTCCGGTCGCGTCGTCTCCGGTAGCCGAAACCTCCACATGTCCAATTCGAGCTCATCCATTGACATTTCCAACGATTCATTCTCACGGACAACGATCT
TAATAACCCCAGGATGGATTTCCACAGCATGAGCTCTTACTCCATCGCTAATGTTACCGTCAGTTTCAGCAATGAATCGGAAACAATCGGGATTTTCCTCCACCAAAACA
TCCGCATCAGATCGAAACGGAAGCTCAAGGACCCGACTGAAGATATGAGGTAATCTCCTGAGTTTCTTGTGGTTGTTCAGAAGGGATTGATCTTCAAGAGAGTTTCTCGA
AGTGGGGTTATTCCTGACGGCGATATTGCGCTTCCTCGGCAATGGGTGGACCTTCATGGCGGCGGTGGTGTTGATTTCGAGCTCGTTCTTCACAGTCGCAACAGGTTTTG
AAGGGGCGGGAGAAACAACAACAACAACAAAGGTTAAGACCCAAATCAAATGGAAAATCAAAGATGCCGTAGTGAAAAACGACGGACGAATTTCAAATGGAATCGAAATC
TTGATGGGATTGGTGTTCAGAGGACGGAAAGGGGAAATCATAAAGGAATCAAACGAATTGTTTCGTGTAAAAACAAACAGAAATCGAATGAGAACCCTAATATTCGAAAA
TGGGGACTGAAAAAGGGAAATTAATTGAATGACGGATTTGAGAGAATTGAAAAAATGAATCGAAAGGAAGGAAGGAGAAAGACATTCATTGGAGGAAGAGAATGCAGGAA
GAGCCGTGAGCAAAGACCAGAGAGGAAGAATCGCTCAGCAGATAAAAAAAGGAAGGGAAATATGTTATTTCTAGATTCAACCTTTAATTTTGATGAAATTACAAAGTCGG
TCATGTTATTATTATTATTATTTTTTCTGAAAAAAAAATTACATGTGGGGACTTTTGGATAAAATTTAGGTTAAGAAGCATCAATTCGCTTTGAGCTTCCTTACATTCGA
GGATTTTACATGTAATGATGTAAGTAATGATTTAGTCG
Protein sequenceShow/hide protein sequence
MSGDPVETKVLEDTNGCTPRANKNELILRPVSQDESGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYSPRGIGTSENSARKGHGFASKLSVERYIQSEFPNAD
IDAFFASFSWKIPAKKSSLAQGIQIKQISCPLPSKEMEECSASESQIDRVGCKAGNKNCNSLSVAENPSSLKSMSCDICCSEPRFCRDCCCILCSKIIDTTTESYSYIKC
KAVVGDGYICGHHAHIKCGLKSYTAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQSCQSADCRDDIEEILSLGFCILRGSHKMRAKELFRNIELKIEKLKSGTCLEE
IWKMEEDSSANCTDAPDNADSTEGSHDTSDSIISSEWTMSTHFDHWIESLKLEDEIDQVLQGLKRSQEFEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNA
VTNRVDQIKREVKRLKRMEKVADGFGMTPKDILKEDFDLDVE