; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021257 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021257
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPHD_Oberon domain-containing protein
Genome locationchr7:5928877..5931498
RNA-Seq ExpressionLag0021257
SyntenyLag0021257
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR032881 - Oberon, PHD finger domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049358.1 protein OBERON 1-like isoform X2 [Cucumis melo var. makuwa]1.2e-23784.05Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLEDTNG +  VNKN LILRPVSQDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC   VGDGYICGHHAHIKCGLKSY AGTVGGSI LDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLE+IWKMEED SANCTDAPD A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQN     V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

KAG7018712.1 Protein VERNALIZATION INSENSITIVE 3 [Cucurbita argyrosperma subsp. argyrosperma]3.8e-23684.05Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL D NG  P+ NKN LILRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G  FAS+LSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNADVDAFFASFSWKIPAKKSSLAQG R+KQ+  PLPSKE +ECSAS+ Q D V CKAGNKNC+SLSVAENPSLLKSMSCD+CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC KIIDTT ES SYIKC A VGDGYICGHHAHIKCGLKSYMAGTVGG I LDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH++LNIAK   LK+GTCLE++WKMEED SANCTDAPDNA+S EGSHD S S+ISS+WTM TPFDHW ESLKLE+EIDQVLQALKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQNVF +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

XP_008438665.1 PREDICTED: uncharacterized protein LOC103483705 isoform X1 [Cucumis melo]1.2e-23783.84Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLEDTNG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC   VGDGYICGHHAHIKCGLKSY AGTVGGSI LDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLE+IWKMEED SANCTDAPD A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQN     V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

XP_008438666.1 PREDICTED: uncharacterized protein LOC103483705 isoform X2 [Cucumis melo]1.2e-23783.84Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLEDTNG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC   VGDGYICGHHAHIKCGLKSY AGTVGGSI LDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLE+IWKMEED SANCTDAPD A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQN     V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

XP_023527162.1 OBERON-like protein isoform X1 [Cucurbita pepo subsp. pepo]1.7e-23684.25Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL D NG  P+ NKN LILRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G  FAS+LSV R
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNADVDAFFASFSWKIPAKKSSLAQG R+KQ+  PLPSKE +ECSAS+SQ D V CKAGNKNC+SLSVAENPSLLKSMSCD+CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC KIIDTT ES SYIKC A VGDGYICGHHAHIKCGLKSYMAGTVGG I LDAEYYCRRCDARTDLVSHVERFLQLCQSTDC DDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH++LNIAK   LKTGTCLE++WKMEED SANCTDAPDNA+S EGSHD S S+ISS+WTMSTPFDHW ESLKLE EIDQVLQALKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQNVF +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

TrEMBL top hitse value%identityAlignment
A0A1S3AWZ1 uncharacterized protein LOC103483705 isoform X25.7e-23883.84Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLEDTNG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC   VGDGYICGHHAHIKCGLKSY AGTVGGSI LDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLE+IWKMEED SANCTDAPD A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQN     V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A1S4DSZ4 uncharacterized protein LOC103483705 isoform X15.7e-23883.84Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLEDTNG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC   VGDGYICGHHAHIKCGLKSY AGTVGGSI LDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLE+IWKMEED SANCTDAPD A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQN     V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A5D3D0Q3 Protein OBERON 1-like isoform X25.7e-23884.05Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLEDTNG +  VNKN LILRPVSQDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC   VGDGYICGHHAHIKCGLKSY AGTVGGSI LDAEYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLE+IWKMEED SANCTDAPD A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQN     V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A6J1GVC0 protein OBERON 4-like isoform X12.9e-23483.23Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL D NG  P+ NKN L LRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G  FAS+LSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNADVDAFFASFSWKIPAKKSSLAQG R++Q+  PLPSKE +ECSAS+SQ D V CKAGNKNC+SLSVAENPSLLKSMSCD+CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC KIIDTT ES S+IKC A V DGYICGHHAHIKCGLKSYMAGTVGG I LDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH++LNIAK   LK+GTCLE++WKMEED SANCTDAPDNA+S +GSHD S S+ISS+WTM TPFDHW ESLKLE+EIDQVLQALKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQNVF +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A6J1ITE5 OBERON-like protein isoform X12.0e-23583.64Show/hide
Query:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL D NG  P+ NKN LILRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G GFAS+LSVER
Subjt:  MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FP+ADVDAFFASFSWKIPAKKSSLAQG R+KQ+  PLPSKE +ECSAS+SQ D V CKAGNKNC+SLSVAE PSLLKSMSCD+CCSE +FCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC K IDTT ES SYIKC A VGDGYICGHHAHIKCGLKSYMAGTVGG I LDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH +LNIAK   LKTGTCLE++WKMEED SANCTDAPDNA+S EGSHD S S+ISS+WT+STPFDHW ESLKLE+EIDQVLQALK+SQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQNVF +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05410.1 Protein of unknown function (DUF1423)2.8e-12048.6Show/hide
Query:  LILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA
        L+LRPVS  E+GEGLPYAPENWPNPGD W W+VG R++  G+F+DRYLY P+    +     RK + F S+LS++RYI+  FP ADV  FFASFSW IP 
Subjt:  LILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA

Query:  KKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCVAEV
        +     QG+ + Q    LP   + E    +  +DT  CKAGN+ C SL        L +M CD+CC E +FC DCCCILCCK+I      YSYIKC A V
Subjt:  KKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCVAEV

Query:  GDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKL
         +G+ICGH AH+ C L++Y+AGT+GGS+ LD EYYCRRCDA+ DL  HV +FL++CQ+ + + D+E+ L++G CILRG+ +  AKELL  IE  + K   
Subjt:  GDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKL

Query:  LKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQ
        LK GT LED+W   +D     +D  D+  + E  +DT  S+         PF+H  E  KLE+EI +VL+AL+++QE+EY +AE KL   K  L +L++Q
Subjt:  LKTGTCLEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQ

Query:  LDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        L+KE++EL  + S T  N    NV  R+DQI++EV KLK ME+VA GFG TP+ +L+E F  ++E
Subjt:  LDKEQTELRHQTSSTGQNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

AT1G05410.2 Protein of unknown function (DUF1423)4.5e-10246.42Show/hide
Query:  VGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQ
        VG R++  G+F+DRYLY P+    +     RK + F S+LS++RYI+  FP ADV  FFASFSW IP +     QG+ + Q    LP   + E    +  
Subjt:  VGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQ

Query:  TDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDA
        +DT  CKAGN+ C SL        L +M CD+CC E +FC DCCCILCCK+I      YSYIKC A V +G+ICGH AH+ C L++Y+AGT+GGS+ LD 
Subjt:  TDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCVAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDA

Query:  EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAE
        EYYCRRCDA+ DL  HV +FL++CQ+ + + D+E+ L++G CILRG+ +  AKELL  IE  + K   LK GT LED+W   +D     +D  D+  + E
Subjt:  EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKLLKTGTCLEDIWKMEEDLSANCTDAPDNANSAE

Query:  GSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIK
          +DT  S+         PF+H  E  KLE+EI +VL+AL+++QE+EY +AE KL   K  L +L++QL+KE++EL  + S T  N    NV  R+DQI+
Subjt:  GSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNVFQNNVTNRVDQIK

Query:  QEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        +EV KLK ME+VA GFG TP+ +L+E F  ++E
Subjt:  QEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

AT3G22520.1 unknown protein8.4e-2452.63Show/hide
Query:  PVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA
        PVS    G+GLPYAP +WP+PGD+W+WRVG+RV   G+  DR+L LP+            + FASK  + RY++S FP  D DAFFASFSWK+PA
Subjt:  PVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA

AT4G14840.1 unknown protein1.5e-2048.91Show/hide
Query:  AGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLP---RGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA
        +G+GLP+AP ++P+PGD+W+WRVG+RV   G   DR L LP   +G +VP++       FASK ++ RY+++ FP+ D +AFFASF+W IPA
Subjt:  AGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLP---RGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGGATCCTGTGGAACCTGAAGTTCTTGAGGATACAAATGGCAGCACACCTAGGGTAAATAAAAATGGTCTGATCCTTAGGCCAGTTTCTCAAGATGAAGCTGG
GGAGGGTTTGCCATATGCTCCTGAAAATTGGCCCAATCCCGGTGATATCTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGATACCTTT
ATCTTCCTCGTGGTTTTAGTGTTCCTGAGAACTCATCTCGTAAAGGGCAGGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATATATCCAGTCTGTGTTCCCTAATGCAGAC
GTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCAAAAAAGTCATCTTTAGCACAAGGTATTCGAGTAAAACAAGTTCCATACCCTCTACCCTCAAAAGAGGC
GAAAGAATGCTCAGCATCTAATTCCCAGACTGATACAGTGGGTTGCAAGGCTGGAAATAAGAACTGTCATAGTTTATCTGTAGCAGAAAACCCATCTCTATTAAAATCCA
TGTCCTGTGATGTTTGCTGCAGCGAATCTCGGTTTTGCCGTGATTGCTGCTGTATACTTTGCTGCAAGATTATAGACACGACCAAGGAAAGTTATAGCTATATAAAATGT
GTAGCAGAGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATACATGGCTGGGACAGTTGGAGGGAGCATTAGATTGGATGCTGA
GTATTATTGTCGACGATGTGATGCTAGAACAGATTTGGTATCTCACGTCGAAAGATTTTTGCAGTTATGTCAATCAACTGATTGTCGTGATGATATTGAGGAGTTCTTAA
GCATTGGTTTCTGCATTTTGCGTGGTTCGCACAAAATGAGAGCAAAGGAATTGTTAAGACATATTGAATTGAACATTGCAAAGGTAAAACTGCTTAAAACTGGGACTTGC
TTGGAAGATATTTGGAAGATGGAGGAAGACCTCTCAGCGAATTGCACTGATGCACCTGATAATGCTAATTCCGCAGAAGGTTCTCATGATACTTCAGGCTCCGTTATAAG
CTCAGATTGGACTATGTCCACTCCTTTTGATCATTGGACTGAGTCCCTAAAACTTGAAGATGAGATTGATCAGGTACTGCAGGCACTGAAAAGATCACAAGAGTACGAGT
ATAATTTAGCAGAAGAAAAGCTTCTATTACATAAGAATTATCTACACAATCTATTTCAGCAACTTGACAAGGAGCAAACTGAACTCAGACATCAAACATCGTCAACTGGA
CAAAATGTCTTCCAGAATAATGTAACAAACAGAGTGGATCAAATAAAGCAAGAAGTAAAGAAACTCAAGAGAATGGAGAAGGTTGCTGATGGATTTGGAATGACTCCAAA
GGATATCCTCAAGGAGGACTTCGATTTCGATGTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGGATCCTGTGGAACCTGAAGTTCTTGAGGATACAAATGGCAGCACACCTAGGGTAAATAAAAATGGTCTGATCCTTAGGCCAGTTTCTCAAGATGAAGCTGG
GGAGGGTTTGCCATATGCTCCTGAAAATTGGCCCAATCCCGGTGATATCTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGATACCTTT
ATCTTCCTCGTGGTTTTAGTGTTCCTGAGAACTCATCTCGTAAAGGGCAGGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATATATCCAGTCTGTGTTCCCTAATGCAGAC
GTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCAAAAAAGTCATCTTTAGCACAAGGTATTCGAGTAAAACAAGTTCCATACCCTCTACCCTCAAAAGAGGC
GAAAGAATGCTCAGCATCTAATTCCCAGACTGATACAGTGGGTTGCAAGGCTGGAAATAAGAACTGTCATAGTTTATCTGTAGCAGAAAACCCATCTCTATTAAAATCCA
TGTCCTGTGATGTTTGCTGCAGCGAATCTCGGTTTTGCCGTGATTGCTGCTGTATACTTTGCTGCAAGATTATAGACACGACCAAGGAAAGTTATAGCTATATAAAATGT
GTAGCAGAGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATACATGGCTGGGACAGTTGGAGGGAGCATTAGATTGGATGCTGA
GTATTATTGTCGACGATGTGATGCTAGAACAGATTTGGTATCTCACGTCGAAAGATTTTTGCAGTTATGTCAATCAACTGATTGTCGTGATGATATTGAGGAGTTCTTAA
GCATTGGTTTCTGCATTTTGCGTGGTTCGCACAAAATGAGAGCAAAGGAATTGTTAAGACATATTGAATTGAACATTGCAAAGGTAAAACTGCTTAAAACTGGGACTTGC
TTGGAAGATATTTGGAAGATGGAGGAAGACCTCTCAGCGAATTGCACTGATGCACCTGATAATGCTAATTCCGCAGAAGGTTCTCATGATACTTCAGGCTCCGTTATAAG
CTCAGATTGGACTATGTCCACTCCTTTTGATCATTGGACTGAGTCCCTAAAACTTGAAGATGAGATTGATCAGGTACTGCAGGCACTGAAAAGATCACAAGAGTACGAGT
ATAATTTAGCAGAAGAAAAGCTTCTATTACATAAGAATTATCTACACAATCTATTTCAGCAACTTGACAAGGAGCAAACTGAACTCAGACATCAAACATCGTCAACTGGA
CAAAATGTCTTCCAGAATAATGTAACAAACAGAGTGGATCAAATAAAGCAAGAAGTAAAGAAACTCAAGAGAATGGAGAAGGTTGCTGATGGATTTGGAATGACTCCAAA
GGATATCCTCAAGGAGGACTTCGATTTCGATGTTGAGTAG
Protein sequenceShow/hide protein sequence
MSGDPVEPEVLEDTNGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVERYIQSVFPNAD
VDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKC
VAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDAEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKLLKTGTC
LEDIWKMEEDLSANCTDAPDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTG
QNVFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE