; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032086 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032086
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPHD_Oberon domain-containing protein
Genome locationscaffold11:39344309..39350564
RNA-Seq ExpressionSpg032086
SyntenySpg032086
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR032881 - Oberon, PHD finger domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049358.1 protein OBERON 1-like isoform X2 [Cucumis melo var. makuwa]1.3e-23683.64Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLED NG +  VNKN LILRPVSQDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSI LD+EYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLEE WKMEED SANCTDA D A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNA    V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

KAG7018712.1 Protein VERNALIZATION INSENSITIVE 3 [Cucurbita argyrosperma subsp. argyrosperma]2.5e-23583.84Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL DING  P+ NKN LILRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G  FAS+LSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNADVDAFFASFSWKIPAKKSSLAQG R+KQ+  PLPSKE +ECSAS+ Q D V CKAGNKNC+SLSVAENPSLLKSMSCD+CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC KIIDTT ES SYIKC+A VGDGYICGHHAHIKCGLKSYMAGTVGG I LD+EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH++LNIAK   LK+GTCLEE WKMEED SANCTDA DNA+S EGSHD S S+ISS+WTM TPFDHW ESLKLE+EIDQVLQALKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQN F +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

XP_008438665.1 PREDICTED: uncharacterized protein LOC103483705 isoform X1 [Cucumis melo]1.3e-23683.44Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLED NG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSI LD+EYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLEE WKMEED SANCTDA D A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNA    V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

XP_008438666.1 PREDICTED: uncharacterized protein LOC103483705 isoform X2 [Cucumis melo]1.3e-23683.44Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLED NG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSI LD+EYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLEE WKMEED SANCTDA D A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNA    V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

XP_023527162.1 OBERON-like protein isoform X1 [Cucurbita pepo subsp. pepo]1.1e-23584.05Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL DING  P+ NKN LILRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G  FAS+LSV R
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNADVDAFFASFSWKIPAKKSSLAQG R+KQ+  PLPSKE +ECSAS+SQ D V CKAGNKNC+SLSVAENPSLLKSMSCD+CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC KIIDTT ES SYIKC+A VGDGYICGHHAHIKCGLKSYMAGTVGG I LD+EYYCRRCDARTDLVSHVERFLQLCQSTDC DDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH++LNIAK   LKTGTCLEE WKMEED SANCTDA DNA+S EGSHD S S+ISS+WTMSTPFDHW ESLKLE EIDQVLQALKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQN F +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

TrEMBL top hitse value%identityAlignment
A0A1S3AWZ1 uncharacterized protein LOC103483705 isoform X26.3e-23783.44Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLED NG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSI LD+EYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLEE WKMEED SANCTDA D A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNA    V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A1S4DSZ4 uncharacterized protein LOC103483705 isoform X16.3e-23783.44Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLED NG +  VNKN LILRPV+QDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSI LD+EYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLEE WKMEED SANCTDA D A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNA    V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A5D3D0Q3 Protein OBERON 1-like isoform X26.3e-23783.64Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        M+GDPV+ EVLED NG +  VNKN LILRPVSQDE+GEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLY PRG S  ENS+RKG  FASKLSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNAD+DAFFASFSWKIPAKKSSLAQGIRVKQ+P PLPSK+ +ECSAS SQ D VGCKAGNKNC SLSV+ENPS  KSMSC +CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILCCKIIDTT ESYSYIKC+  VGDGYICGHHAHIKCGLKSY AGTVGGSI LD+EYYCRRCDARTDLVSHVE FLQ CQS DCRDD+EE L++G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGSHKMRAKELLRHIEL+I K+   KTG CLEE WKMEED SANCTDA D A+S E SH+TSGS+ISS+WTMSTPFDHW ESLKLEDEIDQVL  LKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLLLHKNYLHNLFQQL+KEQTELRHQT STGQNA    V+NRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A6J1GVC0 protein OBERON 4-like isoform X11.9e-23383.03Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL DING  P+ NKN L LRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G  FAS+LSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FPNADVDAFFASFSWKIPAKKSSLAQG R++Q+  PLPSKE +ECSAS+SQ D V CKAGNKNC+SLSVAENPSLLKSMSCD+CCSE RFCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC KIIDTT ES S+IKC+A V DGYICGHHAHIKCGLKSYMAGTVGG I LD+EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH++LNIAK   LK+GTCLEE WKMEED SANCTDA DNA+S +GSHD S S+ISS+WTM TPFDHW ESLKLE+EIDQVLQALKRSQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQN F +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

A0A6J1ITE5 OBERON-like protein isoform X11.3e-23483.44Show/hide
Query:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER
        MSGDPVE EVL DING  P+ NKN LILRPVSQDE+GEGLPYAPENWPN GD WSWRVG+RVAITGHF DRYLY PRG  V  NSSR+G GFAS+LSVER
Subjt:  MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVER

Query:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC
        YIQS FP+ADVDAFFASFSWKIPAKKSSLAQG R+KQ+  PLPSKE +ECSAS+SQ D V CKAGNKNC+SLSVAE PSLLKSMSCD+CCSE +FCRDCC
Subjt:  YIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCC

Query:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL
        CILC K IDTT ES SYIKC+A VGDGYICGHHAHIKCGLKSYMAGTVGG I LD+EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDI E LS+G CIL
Subjt:  CILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCIL

Query:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ
        RGS KMRAKELLRH +LNIAK   LKTGTCLEE WKMEED SANCTDA DNA+S EGSHD S S+ISS+WT+STPFDHW ESLKLE+EIDQVLQALK+SQ
Subjt:  RGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQ

Query:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        E+EYNLAEEKLL HKNYLHNLFQQLDKEQ EL HQ+SSTGQN F +NVTNRVDQIK+EVK+LKRMEKVADGFGMTPKDILKEDFD DVE
Subjt:  EYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05410.1 Protein of unknown function (DUF1423)5.6e-12148.6Show/hide
Query:  LILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA
        L+LRPVS  E+GEGLPYAPENWPNPGD W W+VG R++  G+F+DRYLY P+    +     RK + F S+LS++RYI+  FP ADV  FFASFSW IP 
Subjt:  LILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA

Query:  KKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCEAEV
        +     QG+ + Q    LP   + E    +  +DT  CKAGN+ C SL        L +M CD+CC E +FC DCCCILCCK+I      YSYIKCEA V
Subjt:  KKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCEAEV

Query:  GDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKL
         +G+ICGH AH+ C L++Y+AGT+GGS+ LD+EYYCRRCDA+ DL  HV +FL++CQ+ + + D+E+ L++G CILRG+ +  AKELL  IE  + K   
Subjt:  GDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKL

Query:  LKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQ
        LK GT LE+ W   +D     +D  D+  + E  +DT  S+         PF+H  E  KLE+EI +VL+AL+++QE+EY +AE KL   K  L +L++Q
Subjt:  LKTGTCLEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQ

Query:  LDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        L+KE++EL  + S T  N+   NV  R+DQI++EV KLK ME+VA GFG TP+ +L+E F  ++E
Subjt:  LDKEQTELRHQTSSTGQNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

AT1G05410.2 Protein of unknown function (DUF1423)6.9e-10346.42Show/hide
Query:  VGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQ
        VG R++  G+F+DRYLY P+    +     RK + F S+LS++RYI+  FP ADV  FFASFSW IP +     QG+ + Q    LP   + E    +  
Subjt:  VGKRVAITGHFLDRYLYLPRGF-SVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQ

Query:  TDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDS
        +DT  CKAGN+ C SL        L +M CD+CC E +FC DCCCILCCK+I      YSYIKCEA V +G+ICGH AH+ C L++Y+AGT+GGS+ LD+
Subjt:  TDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKCEAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDS

Query:  EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAE
        EYYCRRCDA+ DL  HV +FL++CQ+ + + D+E+ L++G CILRG+ +  AKELL  IE  + K   LK GT LE+ W   +D     +D  D+  + E
Subjt:  EYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKLLKTGTCLEEFWKMEEDLSANCTDARDNANSAE

Query:  GSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIK
          +DT  S+         PF+H  E  KLE+EI +VL+AL+++QE+EY +AE KL   K  L +L++QL+KE++EL  + S T  N+   NV  R+DQI+
Subjt:  GSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTGQNAFQNNVTNRVDQIK

Query:  QEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE
        +EV KLK ME+VA GFG TP+ +L+E F  ++E
Subjt:  QEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE

AT3G22520.1 unknown protein8.4e-2452.63Show/hide
Query:  PVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA
        PVS    G+GLPYAP +WP+PGD+W+WRVG+RV   G+  DR+L LP+            + FASK  + RY++S FP  D DAFFASFSWK+PA
Subjt:  PVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA

AT4G14840.1 unknown protein1.5e-2048.91Show/hide
Query:  AGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLP---RGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA
        +G+GLP+AP ++P+PGD+W+WRVG+RV   G   DR L LP   +G +VP++       FASK ++ RY+++ FP+ D +AFFASF+W IPA
Subjt:  AGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLP---RGFSVPENSSRKGQGFASKLSVERYIQSVFPNADVDAFFASFSWKIPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGGATCCTGTGGAACCTGAAGTTCTTGAGGATATAAATGGCAGCACACCTAGGGTAAATAAAAATGGTCTGATCCTTAGGCCAGTTTCTCAAGATGAAGCTGG
GGAGGGTTTGCCATATGCTCCTGAAAATTGGCCCAATCCCGGTGATATCTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGATACCTTT
ATCTTCCTCGTGGTTTTAGTGTTCCTGAGAACTCATCTCGTAAAGGGCAGGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATATATCCAGTCTGTGTTCCCTAATGCAGAC
GTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCAAAAAAGTCATCCTTAGCACAAGGTATTCGAGTAAAACAAGTTCCATACCCTCTACCCTCAAAAGAGGC
GAAAGAATGCTCAGCATCTAATTCGCAGACTGATACAGTGGGTTGCAAGGCTGGAAATAAGAACTGTCATAGTTTATCTGTAGCAGAAAACCCATCTCTATTAAAATCCA
TGTCCTGTGATGTTTGCTGCAGCGAATCTCGGTTTTGCCGTGACTGCTGCTGTATACTTTGCTGCAAGATTATAGACACGACCAAGGAAAGTTATAGCTATATAAAATGT
GAAGCAGAGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATACATGGCTGGGACAGTTGGAGGGAGCATTAGATTGGATTCTGA
GTATTATTGTCGACGATGTGATGCTAGAACAGATTTGGTATCTCACGTCGAAAGATTTTTGCAGTTATGTCAATCAACTGATTGTCGTGATGATATTGAGGAGTTCTTAA
GCATTGGTTTCTGCATTTTGCGTGGTTCGCACAAAATGAGAGCAAAGGAATTGTTAAGACATATTGAATTGAACATTGCAAAGGTAAAACTGCTTAAAACTGGGACTTGC
TTGGAAGAGTTTTGGAAGATGGAGGAAGACCTCTCAGCGAATTGCACTGATGCACGTGATAATGCTAATTCTGCTGAAGGTTCTCATGACACTTCAGGCTCCGTTATAAG
CTCAGATTGGACTATGTCCACCCCTTTTGATCATTGGACTGAGTCCCTAAAACTTGAAGATGAGATTGATCAGGTTCTGCAGGCACTGAAAAGATCACAAGAGTACGAGT
ATAATTTAGCAGAAGAAAAGCTTCTATTACATAAGAATTATCTACACAATCTATTTCAGCAACTTGACAAGGAGCAAACTGAACTCAGACATCAAACATCGTCAACTGGA
CAAAATGCCTTCCAGAATAATGTAACAAACAGAGTGGATCAAATAAAGCAAGAAGTAAAGAAACTCAAGAGAATGGAGAAGGTTGCTGATGGATTTGGAATGACTCCAAA
GGATATCCTCAAGGAGGACTTCGATTTCGATGTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGGATCCTGTGGAACCTGAAGTTCTTGAGGATATAAATGGCAGCACACCTAGGGTAAATAAAAATGGTCTGATCCTTAGGCCAGTTTCTCAAGATGAAGCTGG
GGAGGGTTTGCCATATGCTCCTGAAAATTGGCCCAATCCCGGTGATATCTGGAGTTGGAGGGTGGGGAAGAGAGTTGCTATAACTGGCCATTTTCTGGATAGATACCTTT
ATCTTCCTCGTGGTTTTAGTGTTCCTGAGAACTCATCTCGTAAAGGGCAGGGTTTTGCAAGCAAGCTTTCAGTTGAAAGATATATCCAGTCTGTGTTCCCTAATGCAGAC
GTTGATGCATTTTTTGCCTCATTCAGCTGGAAGATACCAGCAAAAAAGTCATCCTTAGCACAAGGTATTCGAGTAAAACAAGTTCCATACCCTCTACCCTCAAAAGAGGC
GAAAGAATGCTCAGCATCTAATTCGCAGACTGATACAGTGGGTTGCAAGGCTGGAAATAAGAACTGTCATAGTTTATCTGTAGCAGAAAACCCATCTCTATTAAAATCCA
TGTCCTGTGATGTTTGCTGCAGCGAATCTCGGTTTTGCCGTGACTGCTGCTGTATACTTTGCTGCAAGATTATAGACACGACCAAGGAAAGTTATAGCTATATAAAATGT
GAAGCAGAGGTGGGTGATGGTTATATTTGTGGACATCATGCTCATATAAAATGTGGTCTTAAATCATACATGGCTGGGACAGTTGGAGGGAGCATTAGATTGGATTCTGA
GTATTATTGTCGACGATGTGATGCTAGAACAGATTTGGTATCTCACGTCGAAAGATTTTTGCAGTTATGTCAATCAACTGATTGTCGTGATGATATTGAGGAGTTCTTAA
GCATTGGTTTCTGCATTTTGCGTGGTTCGCACAAAATGAGAGCAAAGGAATTGTTAAGACATATTGAATTGAACATTGCAAAGGTAAAACTGCTTAAAACTGGGACTTGC
TTGGAAGAGTTTTGGAAGATGGAGGAAGACCTCTCAGCGAATTGCACTGATGCACGTGATAATGCTAATTCTGCTGAAGGTTCTCATGACACTTCAGGCTCCGTTATAAG
CTCAGATTGGACTATGTCCACCCCTTTTGATCATTGGACTGAGTCCCTAAAACTTGAAGATGAGATTGATCAGGTTCTGCAGGCACTGAAAAGATCACAAGAGTACGAGT
ATAATTTAGCAGAAGAAAAGCTTCTATTACATAAGAATTATCTACACAATCTATTTCAGCAACTTGACAAGGAGCAAACTGAACTCAGACATCAAACATCGTCAACTGGA
CAAAATGCCTTCCAGAATAATGTAACAAACAGAGTGGATCAAATAAAGCAAGAAGTAAAGAAACTCAAGAGAATGGAGAAGGTTGCTGATGGATTTGGAATGACTCCAAA
GGATATCCTCAAGGAGGACTTCGATTTCGATGTTGAGTAG
Protein sequenceShow/hide protein sequence
MSGDPVEPEVLEDINGSTPRVNKNGLILRPVSQDEAGEGLPYAPENWPNPGDIWSWRVGKRVAITGHFLDRYLYLPRGFSVPENSSRKGQGFASKLSVERYIQSVFPNAD
VDAFFASFSWKIPAKKSSLAQGIRVKQVPYPLPSKEAKECSASNSQTDTVGCKAGNKNCHSLSVAENPSLLKSMSCDVCCSESRFCRDCCCILCCKIIDTTKESYSYIKC
EAEVGDGYICGHHAHIKCGLKSYMAGTVGGSIRLDSEYYCRRCDARTDLVSHVERFLQLCQSTDCRDDIEEFLSIGFCILRGSHKMRAKELLRHIELNIAKVKLLKTGTC
LEEFWKMEEDLSANCTDARDNANSAEGSHDTSGSVISSDWTMSTPFDHWTESLKLEDEIDQVLQALKRSQEYEYNLAEEKLLLHKNYLHNLFQQLDKEQTELRHQTSSTG
QNAFQNNVTNRVDQIKQEVKKLKRMEKVADGFGMTPKDILKEDFDFDVE