CuGenDBv2

Tan0007967 (gene) of Snake gourd v1 genome

Gene ID: Tan0007967
Organism: Trichosanthes anguina (Snake gourd v1)
Description: PHD_Oberon domain-containing protein
Genome location: LG10:6941566..6948267
RNA-Seq Expression: Tan0007967
Synteny: Tan0007967
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR032881 - Oberon, PHD finger domain


Homology

GenBank top hits (e-value, % identity, alignment)

KAA0049358.1 protein OBERON 1-like isoform X2 [Cucumis melo var. makuwa] (e-value: 2.3e-233; identity: 81.95%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        M+GDP  V+ EVLEDTNG + GVNKN+LILRPVSQDE GEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLY PRGI   ENS R+G  FASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNADLDAFFASFSWKIPAKKSSLAQG+RVKQ+PCPLPSK  +ECSAS+ Q D +GCKAGNKNC+SLS++E PS  KSMSC ICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IEL+I K+   KTG C+EEIWKMEED SA+CTDAPD+A+S E SH+TSGS ISSEWTMSTPFDHW ESLKLEDEIDQVL  LKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE
        SQEFEY+LAEEKLLLHKNYLHNLFQQL+KEQTELRHQT + GQNA    V+NRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+IE
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE

KAG7018712.1 Protein VERNALIZATION INSENSITIVE 3 [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 9.7e-232; identity: 82.28%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        MSGDP  VE EVL D NG     NKN+LILRPVSQDE GEGLPYAPENWPN GDNWSWRVG+RVAITGHF DRYLY PRGIGV  NS+RRG  FAS+LSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNAD+DAFFASFSWKIPAKKSSLAQG R+KQ+ CPLPSK T+ECSASD Q+D + CKAGNKNCNSLS+AE PSLLKSMSCDICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILC KIIDTT ES SYIKC+A VGDGYICGHHAHIKCGLKSYMAGTVGG IGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGS K+RAKELLR+++LNIAK   LK+GTC+EE+WKMEED SA+CTDAPD+A+S EGSHD S S ISSEWTM TPFDHW ESLKLE+EIDQVLQALKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD
        SQEFEY+LAEEKLL HKNYLHNLFQQLDKEQ EL HQ+S+ GQN F DNVTNRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD

XP_008438665.1 PREDICTED: uncharacterized protein LOC103483705 isoform X1 [Cucumis melo] (e-value: 2.3e-233; identity: 81.74%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        M+GDP  V+ EVLEDTNG + GVNKN+LILRPV+QDE GEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLY PRGI   ENS R+G  FASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNADLDAFFASFSWKIPAKKSSLAQG+RVKQ+PCPLPSK  +ECSAS+ Q D +GCKAGNKNC+SLS++E PS  KSMSC ICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IEL+I K+   KTG C+EEIWKMEED SA+CTDAPD+A+S E SH+TSGS ISSEWTMSTPFDHW ESLKLEDEIDQVL  LKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE
        SQEFEY+LAEEKLLLHKNYLHNLFQQL+KEQTELRHQT + GQNA    V+NRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+IE
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE

XP_008438666.1 PREDICTED: uncharacterized protein LOC103483705 isoform X2 [Cucumis melo] (e-value: 2.3e-233; identity: 81.74%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        M+GDP  V+ EVLEDTNG + GVNKN+LILRPV+QDE GEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLY PRGI   ENS R+G  FASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNADLDAFFASFSWKIPAKKSSLAQG+RVKQ+PCPLPSK  +ECSAS+ Q D +GCKAGNKNC+SLS++E PS  KSMSC ICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IEL+I K+   KTG C+EEIWKMEED SA+CTDAPD+A+S E SH+TSGS ISSEWTMSTPFDHW ESLKLEDEIDQVL  LKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE
        SQEFEY+LAEEKLLLHKNYLHNLFQQL+KEQTELRHQT + GQNA    V+NRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+IE
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE

XP_022979490.1 OBERON-like protein isoform X1 [Cucurbita maxima] (e-value: 2.6e-232; identity: 82.28%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        MSGDP  VE EVL D NG     NKNDLILRPVSQDE GEGLPYAPENWPN GDNWSWRVG+RVAITGHF DRYLY PRGIGV  NS+RRG GFAS+LSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FP+AD+DAFFASFSWKIPAKKSSLAQG R+KQ+ CPLPSK T+ECSASD Q+D + CKAGNKNCNSLS+AETPSLLKSMSCDICCSE +FCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILC K IDTT ES SYIKC+A VGDGYICGHHAHIKCGLKSYMAGTVGG IGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGS K+RAKELLR+ +LNIAK   LKTGTC+EE+WKMEED SA+CTDAPD+A+S EGSHD S S ISSEWT+STPFDHW ESLKLE+EIDQVLQALK+
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD
        SQEFEY+LAEEKLL HKNYLHNLFQQLDKEQ EL HQ+S+ GQN F DNVTNRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD

TrEMBL top hits (e-value, % identity, alignment)

A0A1S3AWZ1 uncharacterized protein LOC103483705 isoform X2 (e-value: 1.1e-233; identity: 81.74%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        M+GDP  V+ EVLEDTNG + GVNKN+LILRPV+QDE GEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLY PRGI   ENS R+G  FASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNADLDAFFASFSWKIPAKKSSLAQG+RVKQ+PCPLPSK  +ECSAS+ Q D +GCKAGNKNC+SLS++E PS  KSMSC ICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IEL+I K+   KTG C+EEIWKMEED SA+CTDAPD+A+S E SH+TSGS ISSEWTMSTPFDHW ESLKLEDEIDQVL  LKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE
        SQEFEY+LAEEKLLLHKNYLHNLFQQL+KEQTELRHQT + GQNA    V+NRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+IE
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE

A0A1S4DSZ4 uncharacterized protein LOC103483705 isoform X1 (e-value: 1.1e-233; identity: 81.74%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        M+GDP  V+ EVLEDTNG + GVNKN+LILRPV+QDE GEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLY PRGI   ENS R+G  FASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNADLDAFFASFSWKIPAKKSSLAQG+RVKQ+PCPLPSK  +ECSAS+ Q D +GCKAGNKNC+SLS++E PS  KSMSC ICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IEL+I K+   KTG C+EEIWKMEED SA+CTDAPD+A+S E SH+TSGS ISSEWTMSTPFDHW ESLKLEDEIDQVL  LKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE
        SQEFEY+LAEEKLLLHKNYLHNLFQQL+KEQTELRHQT + GQNA    V+NRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+IE
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE

A0A5D3D0Q3 Protein OBERON 1-like isoform X2 (e-value: 1.1e-233; identity: 81.95%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        M+GDP  V+ EVLEDTNG + GVNKN+LILRPVSQDE GEGLPYAPENWPNPGD WSWRVGKRVAITGHFLDRYLY PRGI   ENS R+G  FASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FPNADLDAFFASFSWKIPAKKSSLAQG+RVKQ+PCPLPSK  +ECSAS+ Q D +GCKAGNKNC+SLS++E PS  KSMSC ICCSE RFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSIGLDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IEL+I K+   KTG C+EEIWKMEED SA+CTDAPD+A+S E SH+TSGS ISSEWTMSTPFDHW ESLKLEDEIDQVL  LKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE
        SQEFEY+LAEEKLLLHKNYLHNLFQQL+KEQTELRHQT + GQNA    V+NRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+IE
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE

A0A6J1IE19 OBERON-like protein isoform X1 (e-value: 9.8e-230; identity: 83.1%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        MSGDP  VEPEVLE+ N  T  +NKNDLILRPVS+DEGGEGLPYAPENWPNPGDNW WRVGKRVAITGHFLDRYLYPPR + VPEN++ RG+  ASKLSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQSV PN ++DAFFASF WKIPAKKS+ AQG+R +Q PCPLPSK TK+C+ASD Q  I+GCKAGN  CNSLSL E PSLLKSMSCDICCSESRFCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILC KIID TR+SYSYIKCEA VGDG ICGHHAHIKCGLKSYMAGTV G IGLDAEYYCRRCDARTDLVSHVE FLQLC STDC DDIEEFLGI F 
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGSHK+RAKELLR+IELNIAKV   KTGTC+E+IWKM+EDISA+CTDA  SANS E SH TS SFISSEWTMSTPFDHWTESLKLEDEI+QVLQALKR
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD
        SQEFEYSLAEEKLLLHKNYLHNLF QL KEQTEL H+TS+  Q AFQ+NVTNRVDQIKREVKKLKR+EKVADGFG TPKDILK+DFGF+VD
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD

A0A6J1ITE5 OBERON-like protein isoform X1 (e-value: 1.2e-232; identity: 82.28%)
Query:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV
        MSGDP  VE EVL D NG     NKNDLILRPVSQDE GEGLPYAPENWPN GDNWSWRVG+RVAITGHF DRYLY PRGIGV  NS+RRG GFAS+LSV
Subjt:  MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSV

Query:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD
        ERYIQS FP+AD+DAFFASFSWKIPAKKSSLAQG R+KQ+ CPLPSK T+ECSASD Q+D + CKAGNKNCNSLS+AETPSLLKSMSCDICCSE +FCRD
Subjt:  ERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRD

Query:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC
        CCCILC K IDTT ES SYIKC+A VGDGYICGHHAHIKCGLKSYMAGTVGG IGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E L +  C
Subjt:  CCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFC

Query:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR
        ILRGS K+RAKELLR+ +LNIAK   LKTGTC+EE+WKMEED SA+CTDAPD+A+S EGSHD S S ISSEWT+STPFDHW ESLKLE+EIDQVLQALK+
Subjt:  ILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKR

Query:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD
        SQEFEY+LAEEKLL HKNYLHNLFQQLDKEQ EL HQ+S+ GQN F DNVTNRVDQIKREVK+LKRMEKVADGFG+TPKDILK+DF  DV+
Subjt:  SQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD

SwissProt top hits (e-value, % identity, alignment)

No hits found

Arabidopsis top hits (e-value, % identity, alignment)

AT1G05410.1 Protein of unknown function (DUF1423) (e-value: 1.7e-120; identity: 48.39%)
Query:  LILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGI-GVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPA
        L+LRPVS  E GEGLPYAPENWPNPGD W W+VG R++  G+F+DRYLYPP+ + G+     R+ + F S+LS++RYI+  FP AD+  FFASFSW IP 
Subjt:  LILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGI-GVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPA

Query:  KKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRDCCCILCCKIIDTTRESYSYIKCEATV
        +     QG+ + Q    LP   + E    D   D   CKAGN+ C SL        L +M CDICC E +FC DCCCILCCK+I      YSYIKCEA V
Subjt:  KKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRDCCCILCCKIIDTTRESYSYIKCEATV

Query:  GDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFCILRGSHKIRAKELLRNIELNIAKVKL
         +G+ICGH AH+ C L++Y+AGT+GGS+GLD EYYCRRCDA+ DL  HV +FL++CQ+ + + D+E+ L +  CILRG+ +  AKELL  IE  + K   
Subjt:  GDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFCILRGSHKIRAKELLRNIELNIAKVKL

Query:  LKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEFEYSLAEEKLLLHKNYLHNLFQQ
        LK GT +E++W   +D     +D  DS  + E  +DT  S          PF+H  E  KLE+EI +VL+AL+++QEFEY +AE KL   K  L +L++Q
Subjt:  LKTGTCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEFEYSLAEEKLLLHKNYLHNLFQQ

Query:  LDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD
        L+KE++EL  + S    N+   NV  R+DQI++EV KLK ME+VA GFG TP+ +L++ F  +++
Subjt:  LDKEQTELRHQTSTPGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVD

AT1G05410.2 Protein of unknown function (DUF1423) (e-value: 9.1e-103; identity: 46.19%)
Query:  VGKRVAITGHFLDRYLYPPRGI-GVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQ
        VG R++  G+F+DRYLYPP+ + G+     R+ + F S+LS++RYI+  FP AD+  FFASFSW IP +     QG+ + Q    LP   + E    D  
Subjt:  VGKRVAITGHFLDRYLYPPRGI-GVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQ

Query:  VDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRDCCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDA
         D   CKAGN+ C SL        L +M CDICC E +FC DCCCILCCK+I      YSYIKCEA V +G+ICGH AH+ C L++Y+AGT+GGS+GLD 
Subjt:  VDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRDCCCILCCKIIDTTRESYSYIKCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDA

Query:  EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFCILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAE
        EYYCRRCDA+ DL  HV +FL++CQ+ + + D+E+ L +  CILRG+ +  AKELL  IE  + K   LK GT +E++W   +D     +D  DS  + E
Subjt:  EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFCILRGSHKIRAKELLRNIELNIAKVKLLKTGTCVEEIWKMEEDISADCTDAPDSANSAE

Query:  GSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIK
          +DT  S          PF+H  E  KLE+EI +VL+AL+++QEFEY +AE KL   K  L +L++QL+KE++EL  + S    N+   NV  R+DQI+
Subjt:  GSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSTPGQNAFQDNVTNRVDQIK

Query:  REVKKLKRMEKVADGFGLTPKDILKKDFGFDVD
        +EV KLK ME+VA GFG TP+ +L++ F  +++
Subjt:  REVKKLKRMEKVADGFGLTPKDILKKDFGFDVD

AT3G22520.1 unknown protein (e-value: 4.6e-22; identity: 52.27%)
Query:  GEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPA
        G+GLPYAP +WP+PGD W+WRVG+RV   G+  DR+L  P+ +          + FASK  + RY++S FP  D DAFFASFSWK+PA
Subjt:  GEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPA

AT4G14840.1 unknown protein (e-value: 1.4e-18; identity: 48.35%)
Query:  GEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPP---RGIGVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPA
        G+GLP+AP ++P+PGD W+WRVG+RV   G   DR L  P   +G  VP++       FASK ++ RY+++ FP+ D +AFFASF+W IPA
Subjt:  GEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPP---RGIGVPENSTRRGQGFASKLSVERYIQSVFPNADLDAFFASFSWKIPA


Sequences

CDS sequence
ATGTCTGGGGATCCTGTTGCTGTGGAGCCTGAAGTTCTTGAGGATACAAATGGCAGCACATCTGGGGTAAATAAAAATGATTTGATCCTTAGGCCAGTTTCTCAAGATGA
AGGTGGGGAGGGTTTGCCATATGCTCCTGAAAATTGGCCCAATCCCGGTGATAACTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGGT
ACCTTTATCCTCCTCGTGGTATTGGTGTTCCAGAAAACTCAACTCGTAGAGGGCAGGGTTTTGCAAGCAAGCTTTCTGTTGAAAGATATATCCAGTCTGTGTTCCCTAAT
GCAGACCTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCAAAAAAGTCATCTTTAGCACAAGGTATGCGAGTAAAGCAAGTTCCATGCCCTCTACCCTCAAA
AGTGACGAAAGAATGCTCAGCATCTGATCCCCAGGTTGATATACTGGGTTGCAAGGCTGGAAATAAGAACTGTAATAGTTTATCTCTAGCAGAAACCCCATCTTTATTAA
AATCCATGTCCTGTGATATTTGCTGCAGCGAATCTCGGTTTTGCCGTGATTGCTGCTGTATACTTTGCTGCAAGATTATAGACACGACCAGGGAAAGTTATAGCTACATA
AAATGTGAAGCAACGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATATATGGCTGGGACAGTTGGAGGAAGCATTGGATTGGA
TGCTGAGTATTATTGTCGACGTTGTGATGCTAGAACCGATTTGGTATCACATGTTGAAAGATTTTTGCAGTTATGCCAATCAACCGATTGTCGTGATGATATTGAAGAGT
TCTTAGGCATTAGTTTTTGCATTTTGCGGGGTTCACACAAAATAAGAGCAAAGGAGTTGTTAAGAAATATTGAATTGAACATTGCAAAGGTAAAACTGCTTAAAACTGGG
ACTTGCGTGGAAGAGATCTGGAAGATGGAGGAAGACATCTCAGCGGATTGCACTGATGCACCTGATAGCGCTAATTCTGCAGAGGGTTCTCATGACACTTCAGGTTCTTT
TATAAGCTCAGAATGGACTATGTCCACCCCTTTTGATCATTGGACTGAATCTCTAAAACTGGAAGATGAGATCGATCAGGTTCTGCAGGCACTAAAAAGATCACAAGAGT
TCGAGTATAGTTTAGCAGAAGAAAAGCTTCTATTACATAAAAATTATCTACATAATCTATTTCAGCAACTTGACAAGGAACAAACTGAACTCAGACATCAAACATCAACA
CCTGGACAAAATGCCTTCCAAGATAATGTAACAAACAGAGTGGATCAAATAAAACGAGAAGTAAAGAAACTCAAAAGAATGGAAAAGGTCGCTGATGGATTTGGATTGAC
TCCTAAAGATATCCTCAAGAAGGACTTCGGTTTCGATGTTGACATTGAGTAG

mRNA sequence
AGCCATTCGCCCCTCTTTGTCTCTCACACCGTCGTGCGGCCGTTCCTATGCTTCGTGCTGTGTTCTTCTACTTCATTTTCTTATTCCCGCGAAAATTAATTCGCTGCTTA
ATCTCTCTGTTTTCTCCTTCTCTCTCTCCAATATGAAGAAATGACTTCTGGATTGAACAAACCCTAAAGTTCTCTTGATGTGTAATTGAATTCCTCAAGAATTCGAGGCA
GAGATTTATTTAGGGCCGTTTTGATAGTTTGTGTTTGTTGTTCCTGATTTCGAAGAACGAAAAACAGGAACGTGTTTGGTGTGTTCCTCAACCATTGTAGATGTCTGGGG
ATCCTGTTGCTGTGGAGCCTGAAGTTCTTGAGGATACAAATGGCAGCACATCTGGGGTAAATAAAAATGATTTGATCCTTAGGCCAGTTTCTCAAGATGAAGGTGGGGAG
GGTTTGCCATATGCTCCTGAAAATTGGCCCAATCCCGGTGATAACTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGGTACCTTTATCC
TCCTCGTGGTATTGGTGTTCCAGAAAACTCAACTCGTAGAGGGCAGGGTTTTGCAAGCAAGCTTTCTGTTGAAAGATATATCCAGTCTGTGTTCCCTAATGCAGACCTTG
ATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCAAAAAAGTCATCTTTAGCACAAGGTATGCGAGTAAAGCAAGTTCCATGCCCTCTACCCTCAAAAGTGACGAAA
GAATGCTCAGCATCTGATCCCCAGGTTGATATACTGGGTTGCAAGGCTGGAAATAAGAACTGTAATAGTTTATCTCTAGCAGAAACCCCATCTTTATTAAAATCCATGTC
CTGTGATATTTGCTGCAGCGAATCTCGGTTTTGCCGTGATTGCTGCTGTATACTTTGCTGCAAGATTATAGACACGACCAGGGAAAGTTATAGCTACATAAAATGTGAAG
CAACGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATATATGGCTGGGACAGTTGGAGGAAGCATTGGATTGGATGCTGAGTAT
TATTGTCGACGTTGTGATGCTAGAACCGATTTGGTATCACATGTTGAAAGATTTTTGCAGTTATGCCAATCAACCGATTGTCGTGATGATATTGAAGAGTTCTTAGGCAT
TAGTTTTTGCATTTTGCGGGGTTCACACAAAATAAGAGCAAAGGAGTTGTTAAGAAATATTGAATTGAACATTGCAAAGGTAAAACTGCTTAAAACTGGGACTTGCGTGG
AAGAGATCTGGAAGATGGAGGAAGACATCTCAGCGGATTGCACTGATGCACCTGATAGCGCTAATTCTGCAGAGGGTTCTCATGACACTTCAGGTTCTTTTATAAGCTCA
GAATGGACTATGTCCACCCCTTTTGATCATTGGACTGAATCTCTAAAACTGGAAGATGAGATCGATCAGGTTCTGCAGGCACTAAAAAGATCACAAGAGTTCGAGTATAG
TTTAGCAGAAGAAAAGCTTCTATTACATAAAAATTATCTACATAATCTATTTCAGCAACTTGACAAGGAACAAACTGAACTCAGACATCAAACATCAACACCTGGACAAA
ATGCCTTCCAAGATAATGTAACAAACAGAGTGGATCAAATAAAACGAGAAGTAAAGAAACTCAAAAGAATGGAAAAGGTCGCTGATGGATTTGGATTGACTCCTAAAGAT
ATCCTCAAGAAGGACTTCGGTTTCGATGTTGACATTGAGTAGAGACATGGGTGCCTTAGGCAATTACGATGTCTCACAAAAGTTCATTGAACTCTGTTGATTCATTTAGC
CTTATGTAGGCTTTAATAGTGTTTACCGTGTATCATATTCATATGGCTTCACATGAAGCCAGTGTACGATATCGAACAGGCATCTCAACTTCTCAAGTAGATTTTCATTT
TACTTGGTTGGTTTTGTTAAGATAGCTGGATTCAGCAGGCAAAATACTGAAGCTATGCAGTTATGGAGTTGCCAGAAGAGTGTAGAGTAGAGATATTTTGGTGCATTGGA
AGCTAACCCTCTGCAACGAGTTTTCACGATAAACTTCGTCGTCCCCAATTGGTTGCTTGACGGATTGGTTTTCATTGGCTTCTGTGGAGCTGTGTCTGTCTGGTTCAGGC
TTCTTTATCACTACACGCAGAAGAGGAATTGAGGATGATGATGAGTTTTTGTTCCATACTGTTGACGTGTGATTGACTGTTTGTGTTCACAGACAAAAAAAAGATGTGAA
ATTGGGAGTTGGAGCTTTACTCGCCCATTTGAGGGCATCATGCCCACTCCAAAGTCCAAGGGAGCCATCCAAGCTCACCTGAGGGTTTGAATCTGTGACCTCTACGTTGC
TCGATCCTTTACCAATGAAGTTGTCCTTTGGAGGCGAGTGAAATCTAGTCTTCTACTATACAATAAAGAATAAAGAGATGAGCCAAATATAACATAGCTCAACTGGTATC
TCAAATATATTAACGACTATGAGGTATGTGGTTTGAATCCTCCAACCCCCTATTGTATTCTAGAAAAAATAAATAAAATAAAGAGATGAATTTGGCTATCGAATTCTATT
ATATTATAAAACAAATTAAAATCGTTCTGGTAAAGAAAGATTATTCTAATGGAATTGTATCAATGTCATTTTGTTTCAAAATCGTTCGGTTCACATGTTAGATTTTTATA
GCCCAAGTCTGTGAG

Protein sequence
MSGDPVAVEPEVLEDTNGSTSGVNKNDLILRPVSQDEGGEGLPYAPENWPNPGDNWSWRVGKRVAITGHFLDRYLYPPRGIGVPENSTRRGQGFASKLSVERYIQSVFPN
ADLDAFFASFSWKIPAKKSSLAQGMRVKQVPCPLPSKVTKECSASDPQVDILGCKAGNKNCNSLSLAETPSLLKSMSCDICCSESRFCRDCCCILCCKIIDTTRESYSYI
KCEATVGDGYICGHHAHIKCGLKSYMAGTVGGSIGLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLGISFCILRGSHKIRAKELLRNIELNIAKVKLLKTG
TCVEEIWKMEEDISADCTDAPDSANSAEGSHDTSGSFISSEWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEFEYSLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTST
PGQNAFQDNVTNRVDQIKREVKKLKRMEKVADGFGLTPKDILKKDFGFDVDIE